Fork of StarGANv2-VC

This fork uses StarGANv2-VC for speech translation, applying voice conversion so that the translated speech preserves the original speaker's characteristics.

Below are the original authors and the description of StarGANv2-VC from the original repo: https://github.com/yl4579/StarGANv2-VC

Yinghao Aaron Li, Ali Zare, Nima Mesgarani

We present an unsupervised non-parallel many-to-many voice conversion (VC) method using a generative adversarial network (GAN) called StarGAN v2. Using a combination of adversarial source classifier loss and perceptual loss, our model significantly outperforms previous VC models. Although our model is trained only with 20 English speakers, it generalizes to a variety of voice conversion tasks, such as any-to-many, cross-lingual, and singing conversion. Using a style encoder, our framework can also convert plain reading speech into stylistic speech, such as emotional and falsetto speech. Subjective and objective evaluation experiments on a non-parallel many-to-many voice conversion task revealed that our model produces natural sounding voices, close to the sound quality of state-of-the-art text-to-speech (TTS) based voice conversion methods without the need for text labels. Moreover, our model is completely convolutional and with a faster-than-real-time vocoder such as Parallel WaveGAN can perform real-time voice conversion.

Paper: https://arxiv.org/abs/2107.10394

Audio samples: https://starganv2-vc.github.io/

Base Setup

To set up the project with the data and training configurations from the original paper, run:

make base_setup

Training

To start training, run:

make train

Further Information

See the original repo: https://github.com/yl4579/StarGANv2-VC

Acknowledgement

The authors of StarGANv2-VC: Yinghao Aaron Li, Ali Zare, Nima Mesgarani
