Coqui

Pricing model
Paid
Upvote 0
Coqui Studio is a platform powered by AI for voice direction, enabling users to create, replicate, and manage AI voices for video games, post-production, dubbing, and other applications. It includes features such as voice cloning, generative AI voices, sophisticated editors, project management, and timeline editors to enhance workflow efficiency. Coqui Studio also provides 30 minutes of free synthesis time.

Similar neural networks:

Free
Upvote 0
OpenAI.fm, introduced in 2025, is an interactive platform featuring OpenAI's cutting-edge text-to-speech technology. It enables users to transform text into highly customizable audio with an array of pre-configured voice characters and adaptable speaking styles. This tool is tailored for developers, content creators, businesses, and anyone keen on exploring AI-driven speech. OpenAI.fm could be the choice for those looking to swiftly prototype voice applications, craft personalized voice content, or produce natural-sounding voiceovers for diverse media projects, all without the need for extensive coding.
Open Source
Upvote 0
KittenTTS is an ultra-lightweight open-source text-to-speech model that converts written text into natural-sounding speech with impressive quality, all while requiring minimal computational resources. Unlike most speech conversion AI models that demand powerful hardware, KittenTTS operates efficiently on almost any device, including older computers, Raspberry Pi, and even browsers, thanks to its tiny size of 25 MB and design with 15 million parameters. This AI model provides several realistic voices in real-time without needing an internet connection or GPUs, making it ideal for developers creating privacy-focused applications, edge computing projects, accessibility tools, or any scenarios where resource efficiency is vital. Combining high output quality, incredible speed on CPU-only systems, and an open-source Apache 2.0 license, KittenTTS represents a breakthrough in AI-powered voice conversion where larger models simply cannot function.
Paid
Upvote 0
AudioBot is an online platform powered by artificial intelligence that transforms written text into realistic audio across various languages and accents. With more than 500 voices to select from, users can download the audio as an mp3 file. This service is ideal for producing voiceovers for videos, presentations, and radio programs, and is offered in Spanish and Portuguese. A free trial is available, allowing 500 characters, and users can subscribe for additional features. The tool is owned by AudioBot, and the generated audio remains under the user's copyright.