KittenTTS

Pricing model
Open Source
Upvote 0
KittenTTS is an ultra-lightweight open-source text-to-speech model that converts written text into natural-sounding speech with impressive quality, all while requiring minimal computational resources. Unlike most speech conversion AI models that demand powerful hardware, KittenTTS operates efficiently on almost any device, including older computers, Raspberry Pi, and even browsers, thanks to its tiny size of 25 MB and design with 15 million parameters. This AI model provides several realistic voices in real-time without needing an internet connection or GPUs, making it ideal for developers creating privacy-focused applications, edge computing projects, accessibility tools, or any scenarios where resource efficiency is vital. Combining high output quality, incredible speed on CPU-only systems, and an open-source Apache 2.0 license, KittenTTS represents a breakthrough in AI-powered voice conversion where larger models simply cannot function.

Similar neural networks:

Freemium
Upvote 0
Letterly is a mobile application driven by AI that transforms spoken words into organized text, providing different rewriting options and supporting several languages. It caters to professionals, authors, students, and anyone who favors speech over typing. The app allows users to save time, record thoughts on the move, produce refined content from spoken notes, and craft diverse writing forms more effectively. Equipped with offline recording, cross-device synchronization, and privacy safeguards, Letterly seeks to simplify the writing process and enhance user productivity.
Paid
Upvote 0
DeepZen is a platform specializing in digital voice solutions that transform text into high-quality, emotionally resonant audio. It offers digital voice services for various applications, including audiobooks, advertisements, marketing, brand voices, and other voice content like podcasts, gaming, and virtual assistants. By utilizing licensed voice replicas of talented narrators and actors, along with skilled audio editors who expertly manage the full emotional range of the vocal output, it delivers a final product that seamlessly mimics traditional narration. DeepZen caters to publishers, authors, agencies, marketers, production companies, content creators, voice actors, game developers, and educators.
Freemium
Upvote 0
Listnr is an online AI-powered voice generator and text-to-speech tool enabling users to produce lifelike voiceovers from text, featuring over 900 voices across more than 142 languages. It allows users to create perfectly timed, human-like voiceovers for ads, e-learning, product demonstrations, presentations, audiobooks, and YouTube videos. Furthermore, Listnr offers developers straightforward and dependable APIs, and allows users to create a podcast from text, publish it on a customized page, and share it on all major platforms.