GitPedia

Voice Cloning App

A Python/PyTorch app for easily synthesising human voices

From voice-cloning-app · Updated June 12, 2026

**Voice Cloning App** is a Python/PyTorch app for easily synthesising human voices. The project is written primarily in Python, is distributed under the BSD 3-Clause "New" or "Revised" License, and was first published in 2021. It has 1,437 stars and 239 forks on GitHub. Key topics include: deep-learning, python, pytorch, tacotron2, text-to-speech.

Latest release: v1.1.1 (February 7, 2022)

Documentation

  • Discord Server
  • Video guide
  • Voice Sharing Hub
  • FAQs

System Requirements

  • Windows 10 or Ubuntu 20.04+ operating system
  • 5GB+ Disk space
  • NVIDIA GPU with at least 4GB of memory & driver version 456.38+ (optional)
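
The requirements above can be checked with a short script before installing. The following is a minimal sketch, not part of the app itself: the function names are hypothetical, and it assumes the GPU memory and driver version are supplied by the reader (for example, as reported by `nvidia-smi`). It also assumes driver versions compare component by component as integers.

```python
import shutil

# Thresholds taken from the System Requirements list.
MIN_DISK_GB = 5          # "5GB+ Disk space"
MIN_GPU_MEMORY_GB = 4    # "at least 4GB of memory"
MIN_DRIVER = "456.38"    # "driver version 456.38+"


def parse_version(version: str) -> tuple:
    """Turn a dotted version string such as '456.38' into (456, 38)."""
    return tuple(int(part) for part in version.strip().split("."))


def driver_ok(driver_version: str) -> bool:
    """Return True if the driver version meets the stated minimum."""
    return parse_version(driver_version) >= parse_version(MIN_DRIVER)


def disk_ok(path: str = ".") -> bool:
    """Return True if the drive holding `path` has enough free space."""
    free_gb = shutil.disk_usage(path).free / 1024**3
    return free_gb >= MIN_DISK_GB


def gpu_ok(memory_gb: float, driver_version: str) -> bool:
    """Return True if a GPU meets both the memory and driver minimums."""
    return memory_gb >= MIN_GPU_MEMORY_GB and driver_ok(driver_version)


if __name__ == "__main__":
    print("Disk space sufficient:", disk_ok("."))
    # Example values only; substitute the figures for your own GPU.
    print("GPU sufficient:", gpu_ok(memory_gb=6, driver_version="456.71"))
```

Because the GPU is listed as optional, a failed `gpu_ok` check does not by itself rule out running the app.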

Key features

  • Automatic dataset generation (with support for subtitles and audiobooks)
  • Additional language support
  • Local & remote training
  • Easy train start/stop
  • Data importing/exporting
  • Multi GPU support

Manual Guides

Future Improvements

  • Add support for Talknet
  • Add GTA alignment for Hifi-gan
  • Improved batch size estimation
  • AMD GPU support

Other resources

Acknowledgements

This project uses a reworked version of Tacotron2. All rights to it belong to NVIDIA, and its use follows the requirements of their BSD-3 licence.

Additionally, the project uses DSAlign, Silero, DeepSpeech & hifi-gan.

Thank you to Dr. John Bustard at Queen's University Belfast for his support throughout the project.

Supported by uberduck.ai; reach out to them for live model hosting.

Also a big thanks to the members of the VocalSynthesis subreddit for their feedback.

Finally, thank you to everyone raising issues and contributing to the project.

This article is auto-generated from voice-cloning-app/Voice-Cloning-App via the GitHub API. Last fetched: 6/13/2026