🟢 Comprehensive Project Collection

The approach to building digital humans can be divided into three steps:

Flesh and Bones — Create the digital human's appearance.
Senses Creation — Convert text to audio; combine audio with appearance to make the digital human "speak."
Soul Infusion — Input domain knowledge to facilitate intelligent dialogues.

Hence, we have compiled numerous outstanding projects tailored to different domains. There's definitely one suitable for you!

Domain	Function	Name	Link	Notes	Updated
Appearance	Input real photos, generate digital human photos	MINISTER AI	https://mst.xyz/home	Free stable	23/08/07
Appearance	Generate images from commands	Midjourney	https://discord.com/invite/midjourney	Free trial available	23/08/07
Audio	Voice cloning, e.g., generate cover songs	so-vits-svc	https://github.com/svc-develop-team/so-vits-svc	Open source	23/08/07
Audio	Text-to-speech with support for music and simple sound effects	bark	https://huggingface.co/spaces/suno/bark	Open source	23/08/07
Audio	AI cover song voice transformation	DDSP-SVC	https://github.com/yxlllc/DDSP-SVC	Open source, suitable for low-spec computers	23/08/07
Video	Input images, text/audio to generate digital human speaking videos	DID	https://bittly.cc/studioDI	Free trial available. More info: https://www.learnprompt.pro/docs/Images/start	23/08/07
Video	1. Input photo and text to generate digital human videos; 2. Input real video to produce digital human videos	HeyGen	https://app.heygen.com/	Subscription required	23/08/07
Video	Input audio and SDR space video to make the person in the original video speak the target content	Video Retalking	https://github.com/OpenTalker/video-retalking	Open source	23/08/07
Video	Input audio and SDR space video to make the person in the original video speak the target content	Wav2Lip	https://github.com/Rudrabha/Wav2Lip	Open source	23/08/07
Video	Input audio and image to generate digital human speaking videos	SadTalker	https://github.com/OpenTalker/SadTalker	Open source, also supports direct Windows app installation: https://www.bilibili.com/video/BV1gW4y1o7FC/	23/08/07
Video	Input text, select a character template to generate talking head videos	kreadoai	https://www.kreadoai.com/	Free. The website also supports AI image segmentation and other functions	23/08/07
Video	Change a photo to a video with face replacement	Roop	https://github.com/s0md3v/roop	Open source, try it on: https://colab.research.google.com/drive/157RluIDQnvjQy9UBFXL8U5Q-UwgZPqAK	23/08/07
Digital Human Applications	Flexible combinations for different use cases: virtual anchors, live sales, product guides, voice assistants, remote voice assistants, digital human interactions, digital human interviewers and psychological assessments, Jarvis, Her	Fay	https://github.com/TheRamU/Fay	Open source	23/08/07

Next, we will select some projects to demonstrate how to build your own digital human.