79 lines
		
	
	
		
			3.1 KiB
		
	
	
	
		
			Markdown
		
	
	
	
			
		
		
	
	
			79 lines
		
	
	
		
			3.1 KiB
		
	
	
	
		
			Markdown
		
	
	
	
| <p align="center">
 | |
|   <img src="https://user-images.githubusercontent.com/25312635/195514652-e4526cd1-1177-48e9-a80d-c8bfdb95d35f.png" />
 | |
| </p>
 | |
| 
 | |
| 
 | |
| 
 | |
| 
 | |
| 
 | |
| 
 | |
| mayavoz is a Pytorch-based opensource toolkit for speech enhancement. It is designed to save time for audio practioners & researchers. It provides easy to use pretrained speech enhancement models and facilitates highly customisable model training.
 | |
| 
 | |
| | **[Quick Start](#quick-start-fire)** | **[Installation](#installation)** | **[Tutorials](https://github.com/shahules786/enhancer/tree/main/notebooks)** | **[Available Recipes](#recipes)** | **[Demo](#demo)**
 | |
| ## Key features :key:
 | |
| 
 | |
| * Various pretrained models nicely integrated with [huggingface hub](https://huggingface.co/docs/hub/index) :hugs: that users can select and use without any hastle.
 | |
| * :package: Ability to train and validate your own custom speech enhancement models with just under 10 lines of code!
 | |
| * :magic_wand: A command line tool that facilitates training of highly customisable speech enhacement models from the terminal itself!
 | |
| * :zap: Supports multi-gpu training integrated with [Pytorch Lightning](https://pytorchlightning.ai/).
 | |
| * :shield: data augmentations integrated using [torch-augmentations](https://github.com/asteroid-team/torch-audiomentations)
 | |
| 
 | |
| 
 | |
| ## Demo
 | |
| 
 | |
| Noisy speech followed by enhanced version.
 | |
| 
 | |
| https://user-images.githubusercontent.com/25312635/203756185-737557f4-6e21-4146-aa2c-95da69d0de4c.mp4
 | |
| 
 | |
| 
 | |
| 
 | |
| ## Quick Start :fire:
 | |
| ``` python
 | |
| from mayavoz.models import Mayamodel
 | |
| 
 | |
| model = Mayamodel.from_pretrained("shahules786/mayavoz-waveunet-valentini-28spk")
 | |
| model.enhance("noisy_audio.wav")
 | |
| ```
 | |
| 
 | |
| ## Recipes
 | |
| 
 | |
| | Model     | Dataset      | STOI    | PESQ  | URL                           |
 | |
| | :---:     |  :---:       | :---:   | :---: | :---:                         |
 | |
| | WaveUnet  | Valentini-28spk   | 0.836   | 2.78  |  shahules786/mayavoz-waveunet-valentini-28spk      |
 | |
| | Demucs    | Valentini-28spk   | 0.961   | 2.56  |  shahules786/mayavoz-demucs-valentini-28spk       |
 | |
| | DCCRN     | Valentini-28spk   | 0.724   | 2.55  |  shahules786/mayavoz-dccrn-valentini-28spk         |
 | |
| | Demucs     | MS-SNSD-20hrs  | 0.56 | 1.26  | shahules786/mayavoz-demucs-ms-snsd-20       |
 | |
| 
 | |
| Test scores are based on respective test set associated with train dataset.
 | |
| 
 | |
| **See [tutorials](/notebooks/) to train your custom model**
 | |
| 
 | |
| ## Installation
 | |
| Only Python 3.8+ is officially supported (though it might work with Python 3.7)
 | |
| 
 | |
| - With Pypi
 | |
| ```
 | |
| pip install mayavoz
 | |
| ```
 | |
| 
 | |
| - With conda
 | |
| 
 | |
| ```
 | |
| conda env create -f environment.yml
 | |
| conda activate mayavoz
 | |
| ```
 | |
| 
 | |
| - From source code
 | |
| ```
 | |
| git clone url
 | |
| cd mayavoz
 | |
| pip install -e .
 | |
| ```
 | |
| 
 | |
| ## Support
 | |
| 
 | |
| For commercial enquiries and scientific consulting, please [contact me](https://shahules786.github.io/).
 | |
| 
 | |
| ### Acknowledgements
 | |
| Sincere gratitude to [AMPLYFI](https://amplyfi.com/) for supporting this project.
 |