mayavoz/README.md

2.1 KiB

[![codecov][codecov-badge]][codecov] GitHub issues GitHub Repo stars

mayavoz is a Pytorch-based opensource toolkit for speech enhancement. It is designed to save time for audio researchers. Is provides easy to use pretrained audio enhancement models and facilitates highly customisable model training.

| Quick Start | Installation | Tutorials | Available Recipes | Demo

Key features 🔑

  • Various pretrained models nicely integrated with huggingface 🤗 that users can select and use without any hastle.
  • 📦 Ability to train and validation your own custom speech enhancement models with just under 10 lines of code!
  • 🪄 A command line tool that facilitates training of highly customisable speech enhacement models from the terminal itself!
  • Supports multi-gpu training integrated with Pytorch Lightning.

Quick Start 🔥

from mayavoz import Mayamodel

model = Mayamodel.from_pretrained("mayavoz/waveunet")
model("noisy_audio.wav")

Recipes

Model Dataset STOI PESQ URL
WaveUnet Vctk-28spk mayavoz/waveunet-vctk28
Demucs DNS-2020 (30hrs) mayavoz/demucs-vctk28
DCCRN DNS-2020 (30hrs) mayavoz/dccrn-vctk28

Installation

Only Python 3.8+ is officially supported (though it might work with Python 3.7)

  • With Pypi
pip install mayavoz
  • With conda
conda env create -f environment.yml
conda activate mayavoz
  • From source code
git clone url
cd mayavoz
pip install -e .