An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Updated Aug 14, 2025 - Python
Audio super-resolution with an ECA-enhanced encoder-decoder CNN. Upsamples low-rate speech (2/4/8 kHz → 16 kHz) to improve ASR performance, trained on LibriSpeech with Wav2Vec2-based perceptual loss.
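The description above does not show how the ECA block is built. As a rough illustration only, the sketch below assumes "ECA" refers to Efficient Channel Attention (per-channel global average pooling, a small 1D convolution across channels, and a sigmoid gate) and uses the adaptive kernel-size rule from the ECA-Net paper with its default `gamma=2`, `b=1`. The function names `eca_kernel_size` and `eca_gate`, the pure-Python list representation, and the fixed `weights` argument are all hypothetical; the repository's actual layer is presumably a trainable module in a deep learning framework and may differ.

```python
import math


def eca_kernel_size(channels, gamma=2, b=1):
    """Adaptive 1D kernel size: log2(C)/gamma + b/gamma, truncated, then made odd."""
    t = int(abs(math.log2(channels) / gamma + b / gamma))
    return t if t % 2 == 1 else t + 1


def eca_gate(features, weights):
    """Apply a channel-attention gate to a feature map.

    features: list of C channels, each a list of T samples (hypothetical layout).
    weights:  odd-length 1D kernel shared across channels (learned in a real model).
    """
    if len(weights) % 2 == 0:
        raise ValueError("kernel length must be odd")
    pad = len(weights) // 2

    # Squeeze: one descriptor per channel via global average pooling over time.
    means = [sum(ch) / len(ch) for ch in features]

    # Local cross-channel interaction: zero-padded 1D convolution over the descriptors.
    padded = [0.0] * pad + means + [0.0] * pad
    gates = []
    for c in range(len(means)):
        z = sum(w * padded[c + j] for j, w in enumerate(weights))
        gates.append(1.0 / (1.0 + math.exp(-z)))  # sigmoid

    # Excite: rescale every sample in a channel by that channel's gate.
    return [[g * x for x in ch] for g, ch in zip(gates, features)]


if __name__ == "__main__":
    feats = [[0.2, 0.4, 0.6], [1.0, 1.0, 1.0], [-0.5, 0.0, 0.5], [0.1, 0.1, 0.1]]
    k = eca_kernel_size(len(feats))
    out = eca_gate(feats, [1.0 / k] * k)  # uniform kernel, purely for illustration
    print(k, out)
```

The gate keeps the feature map's shape and adds only `k` parameters per block, which is the usual argument for placing it inside an encoder-decoder without noticeably increasing model size.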