
🟢 Comprehensive Project Collection

The approach to building digital humans can be divided into three steps:

  • Flesh and Bones: Create the digital human's appearance.
  • Senses Creation: Convert text to audio; combine audio with appearance to make the digital human "speak."
  • Soul Infusion: Input domain knowledge to facilitate intelligent dialogues.
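
The three steps above can be read as a simple pipeline: a dialogue stage produces reply text, a speech stage turns it into audio, and an animation stage combines that audio with the appearance image. The following is a minimal sketch under that assumption. `DigitalHuman`, `answer`, `synthesize`, `animate`, and the stub functions are all hypothetical names for illustration, not the API of any project listed on this page.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class DigitalHuman:
    """Hypothetical container wiring the three steps together."""
    portrait: str                        # Flesh and Bones: path to the appearance image
    answer: Callable[[str], str]         # Soul Infusion: question -> reply text
    synthesize: Callable[[str], str]     # Senses Creation: reply text -> audio file path
    animate: Callable[[str, str], str]   # Senses Creation: (portrait, audio) -> video path

    def respond(self, question: str) -> str:
        """Run one dialogue turn and return the path of the speaking video."""
        reply_text = self.answer(question)
        audio_path = self.synthesize(reply_text)
        return self.animate(self.portrait, audio_path)


# Stub stages (assumptions) so the sketch runs without any model installed.
def stub_answer(question: str) -> str:
    return f"Answer to: {question}"


def stub_synthesize(text: str) -> str:
    return f"speech_{len(text)}.wav"


def stub_animate(portrait: str, audio: str) -> str:
    return f"{portrait}+{audio}.mp4"


if __name__ == "__main__":
    human = DigitalHuman("face.png", stub_answer, stub_synthesize, stub_animate)
    print(human.respond("What is a digital human?"))
```

In practice, each stub would be replaced by a call into whichever tool you choose for that step, so the stages can be swapped independently.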

For each of these steps, we have compiled notable projects, grouped by domain. There is likely one that suits your needs.

| Domain | Function | Name | Link | Notes | Updated |
| --- | --- | --- | --- | --- | --- |
| Appearance | Input real photos, generate digital human photos | MINISTER AI | https://mst.xyz/home | Free, stable | 23/08/07 |
| Appearance | Generate images from commands | Midjourney | https://discord.com/invite/midjourney | Free trial available | 23/08/07 |
| Audio | Voice cloning, e.g., generate cover songs | so-vits-svc | https://github.com/svc-develop-team/so-vits-svc | Open source | 23/08/07 |
| Audio | Text-to-speech with support for music and simple sound effects | bark | https://huggingface.co/spaces/suno/bark | Open source | 23/08/07 |
| Audio | AI cover song voice transformation | DDSP-SVC | https://github.com/yxlllc/DDSP-SVC | Open source, suitable for low-spec computers | 23/08/07 |
| Video | Input images, text/audio to generate digital human speaking videos | DID | https://bittly.cc/studioDI | Free trial available. More info: https://www.learnprompt.pro/docs/Images/start | 23/08/07 |
| Video | 1. Input photo and text to generate digital human videos; 2. Input real video to produce digital human videos | HeyGen | https://app.heygen.com/ | Subscription required | 23/08/07 |
| Video | Input audio and SDR space video to make the person in the original video speak the target content | Video Retalking | https://github.com/OpenTalker/video-retalking | Open source | 23/08/07 |
| Video | Input audio and SDR space video to make the person in the original video speak the target content | Wav2Lip | https://github.com/Rudrabha/Wav2Lip | Open source | 23/08/07 |
| Video | Input audio and image to generate digital human speaking videos | SadTalker | https://github.com/OpenTalker/SadTalker | Open source, also supports direct Windows app installation: https://www.bilibili.com/video/BV1gW4y1o7FC/ | 23/08/07 |
| Video | Input text, select a character template to generate talking head videos | kreadoai | https://www.kreadoai.com/ | Free. The website also supports AI image segmentation and other functions | 23/08/07 |
| Video | Change a photo to a video with face replacement | Roop | https://github.com/s0md3v/roop | Open source, try it on: https://colab.research.google.com/drive/157RluIDQnvjQy9UBFXL8U5Q-UwgZPqAK | 23/08/07 |
| Digital Human Applications | Flexible combinations for different use cases: virtual anchors, live sales, product guides, voice assistants, remote voice assistants, digital human interactions, digital human interviewers and psychological assessments, Jarvis, Her | Fay | https://github.com/TheRamU/Fay | Open source | 23/08/07 |

Next, we will select some projects to demonstrate how to build your own digital human.