Skip to content

KittenML/KittenTTS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Kitten TTS 😻

Kitten TTS is an open-source realistic text-to-speech model with just 15 million parameters, designed for lightweight deployment and high-quality voice synthesis.

Currently in developer preview

Join our discord

For custom support - fill this form

Email the creators with any questions : [email protected]

✨ Features

  • Ultra-lightweight: Model size less than 25MB
  • CPU-optimized: Runs without GPU on any device
  • High-quality voices: Several premium voice options available
  • Fast inference: Optimized for real-time speech synthesis

🚀 Quick Start

Installation

pip install https://github.com/KittenML/KittenTTS/releases/download/0.1/kittentts-0.1.0-py3-none-any.whl

Basic Usage

from kittentts import KittenTTS
m = KittenTTS("KittenML/kitten-tts-nano-0.1")

audio = m.generate("This high quality TTS model works without a GPU", voice='expr-voice-2-f' )

# available_voices : [  'expr-voice-2-m', 'expr-voice-2-f', 'expr-voice-3-m', 'expr-voice-3-f',  'expr-voice-4-m', 'expr-voice-4-f', 'expr-voice-5-m', 'expr-voice-5-f' ]

# Save the audio
import soundfile as sf
sf.write('output.wav', audio, 24000)

💻 System Requirements

Works literally everywhere

Checklist

  • Release a preview model
  • Release the fully trained model weights
  • Release mobile SDK
  • Release web version

About

State-of-the-art TTS model under 25MB 😻

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages