mayavoz

Pytorch based speech enhancement toolkit.

audio-enhancement deep-learning denoiser pretrained-models python pytorch speech-enhancement

Go to file

Shahul ES 915574bd30 Merge pull request #31 from shahules786/dev update readme		2022-12-02 12:48:37 +05:30
.github/workflows	Update ci.yaml	2022-11-24 15:41:09 +05:30
mayavoz	overlap-add	2022-12-01 10:16:10 +05:30
notebooks	notebooks	2022-11-24 11:10:05 +05:30
recipes	add MS-SNSD recipes	2022-11-24 12:15:19 +05:30
tests	test pretrained	2022-11-24 15:56:42 +05:30
.flake8	rename package	2022-11-14 11:37:26 +05:30
.gitattributes	Update .gitattributes	2022-11-23 19:19:43 +05:30
.gitignore	gitignore	2022-11-15 21:39:35 +05:30
.pre-commit-config.yaml	rename package	2022-11-14 11:37:26 +05:30
LICENSE	rename owner	2022-12-01 09:37:43 +05:30
MANIFEST.in	include files	2022-11-23 17:09:49 +05:30
README.md	update readme	2022-12-02 12:33:28 +05:30
environment.yml	rename package	2022-11-14 11:37:26 +05:30
pyproject.toml	toml	2022-10-05 20:42:51 +05:30
requirements.txt	downgrade mlflow	2022-11-23 17:44:28 +05:30
setup.cfg	setup	2022-11-23 17:10:20 +05:30
setup.py	rename package	2022-11-14 11:37:26 +05:30
version.txt	version	2022-10-06 17:24:55 +05:30

README.md

mayavoz is a Pytorch-based opensource toolkit for speech enhancement. It is designed to save time for audio practioners & researchers. It provides easy to use pretrained speech enhancement models and facilitates highly customisable model training.

Key features 🔑

Various pretrained models nicely integrated with huggingface hub 🤗 that users can select and use without any hastle.
📦 Ability to train and validate your own custom speech enhancement models with just under 10 lines of code!
🪄 A command line tool that facilitates training of highly customisable speech enhacement models from the terminal itself!
⚡ Supports multi-gpu training integrated with Pytorch Lightning.
🛡️ data augmentations integrated using torch-augmentations

Demo

Noisy speech followed by enhanced version.

https://user-images.githubusercontent.com/25312635/203756185-737557f4-6e21-4146-aa2c-95da69d0de4c.mp4

Quick Start 🔥

from mayavoz.models import Mayamodel

model = Mayamodel.from_pretrained("shahules786/mayavoz-waveunet-valentini-28spk")
model.enhance("noisy_audio.wav")

Recipes

Model	Dataset	STOI	PESQ	URL
WaveUnet	Valentini-28spk	0.836	2.78	shahules786/mayavoz-waveunet-valentini-28spk
Demucs	Valentini-28spk	0.961	2.56	shahules786/mayavoz-demucs-valentini-28spk
DCCRN	Valentini-28spk	0.724	2.55	shahules786/mayavoz-dccrn-valentini-28spk
Demucs	MS-SNSD-20hrs	0.56	1.26	shahules786/mayavoz-demucs-ms-snsd-20

Test scores are based on respective test set associated with train dataset.

See tutorials to train your custom model

Installation

Only Python 3.8+ is officially supported (though it might work with Python 3.7)

With Pypi

pip install mayavoz

With conda

conda env create -f environment.yml
conda activate mayavoz

From source code

git clone url
cd mayavoz
pip install -e .

Support

For commercial enquiries and scientific consulting, please contact me.

Acknowledgements

Sincere gratitude to AMPLYFI for supporting this project.