SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

Extract the target voice from mixture speech given an enrollment speech.

Learn more about 🎯SoloSpeech on the SoloSpeech Repo.

Tip: To extract sound effects or music from audio, try using SoloAudio.

Select Test Demo