mayavoz is a PyTorch-based open-source toolkit for speech enhancement. It is designed to save time for audio researchers. It provides easy-to-use pretrained audio enhancement models and facilitates highly customisable model training.
Quick Start | Installation | Tutorials | Available Recipes | Demo
Key features 🔑
- 🤗 Various pretrained models, integrated with Hugging Face, that users can select and use without any hassle.
- 📦 Ability to train and validate your own custom speech enhancement models in under 10 lines of code!
- 🪄 A command line tool that facilitates training of highly customisable speech enhancement models from the terminal itself!
- ⚡ Supports multi-GPU training, integrated with PyTorch Lightning.
Quick Start 🔥
from mayavoz import Mayamodel
model = Mayamodel.from_pretrained("shahules786/mayavoz-waveunet-valentini-28spk")
model("noisy_audio.wav")
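The same two calls extend naturally to several files. The sketch below is a minimal helper, assuming (as in the snippet above) that a loaded model can be called directly with a file path. `enhance_files` is a hypothetical name and not part of mayavoz, and since the return value of the call is not documented here, the helper simply collects it unchanged.

```python
from pathlib import Path
from typing import Any, Callable, Dict, Iterable, Union

PathLike = Union[str, Path]


def enhance_files(model: Callable[[str], Any], paths: Iterable[PathLike]) -> Dict[str, Any]:
    """Call `model` on each path and return {path: result}.

    Hypothetical helper, not part of mayavoz. It assumes only that `model`
    is callable with a file path, as in the Quick Start.
    """
    results: Dict[str, Any] = {}
    for path in paths:
        key = str(path)
        results[key] = model(key)  # whatever the model returns is kept as-is
    return results


# Usage (requires mayavoz; the "noisy" directory is an assumed example path):
#   from mayavoz import Mayamodel
#   model = Mayamodel.from_pretrained("shahules786/mayavoz-waveunet-valentini-28spk")
#   outputs = enhance_files(model, sorted(Path("noisy").glob("*.wav")))
```

Passing the model in as an argument keeps the helper independent of which pretrained checkpoint is loaded.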
Recipes
| Model | Dataset | STOI | PESQ | URL |
|---|---|---|---|---|
| WaveUnet | Vctk-28spk | 0.836 | 2.78 | shahules786/mayavoz-waveunet-valentini-28spk |
| Demucs | Vctk-28spk | 0.961 | 2.56 | shahules786/mayavoz-demucs-valentini-28spk |
| DCCRN | Vctk-28spk | 0.724 | 2.55 | shahules786/mayavoz-dccrn-valentini-28spk |
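Higher is better for both STOI and PESQ, and no single model in the table wins on both. As a small illustration, the sketch below copies the scores from the table and picks the checkpoint with the best value for a chosen metric. `RECIPES` and `best_recipe` are hypothetical names used only for this example; they are not part of mayavoz.

```python
# Scores copied from the Recipes table (dataset: Vctk-28spk).
RECIPES = [
    {"model": "WaveUnet", "stoi": 0.836, "pesq": 2.78,
     "url": "shahules786/mayavoz-waveunet-valentini-28spk"},
    {"model": "Demucs", "stoi": 0.961, "pesq": 2.56,
     "url": "shahules786/mayavoz-demucs-valentini-28spk"},
    {"model": "DCCRN", "stoi": 0.724, "pesq": 2.55,
     "url": "shahules786/mayavoz-dccrn-valentini-28spk"},
]


def best_recipe(metric: str) -> dict:
    """Return the table row with the highest value for `metric`."""
    if metric not in ("stoi", "pesq"):
        raise ValueError(f"unknown metric: {metric!r}")
    return max(RECIPES, key=lambda row: row[metric])


print(best_recipe("stoi")["url"])
print(best_recipe("pesq")["url"])
```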
Installation
Only Python 3.8+ is officially supported, though it may work with Python 3.7.
- With PyPI
pip install mayavoz
- With conda
conda env create -f environment.yml
conda activate mayavoz
- From source code
git clone url
cd mayavoz
pip install -e .
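After any of the install routes above, a quick sanity check can confirm that the interpreter meets the supported version and that the package is importable. This is a minimal sketch using only the standard library; `check_environment` is a hypothetical helper, not something mayavoz ships.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 8)  # officially supported minimum, per the note above


def check_environment(version_info=sys.version_info, package="mayavoz"):
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if tuple(version_info[:2]) < MIN_PYTHON:
        found = ".".join(str(part) for part in version_info[:2])
        problems.append(f"Python 3.8+ is officially supported, found {found}")
    # find_spec returns None when a top-level package cannot be located.
    if importlib.util.find_spec(package) is None:
        problems.append(f"{package} is not importable in this environment")
    return problems


for problem in check_environment():
    print("warning:", problem)
```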