🎤 Chương trình chuyển đổi text thành giọng nói.
🔊 Sample Voice
Drop Audio Here
- or -
Click to Upload
📝 Text
⚡ Speed
↺
0.3
2
🔊 Volume
↺
0.1
2
⏸ Pause between sentences (seconds)
↺
0
3
🔥 Generate Voice
🎧 Generated Audio
📊 Spectrogram
❗ Model Limitations
1. This model may not perform well with numerical characters, dates, special characters, etc. => A text normalization module is needed. 2. The rhythm of some generated audios may be inconsistent or choppy => It is recommended to select clearly pronounced sample audios with minimal pauses for better synthesis quality. 3. Default, reference audio text uses the pho-whisper-medium model, which may not always accurately recognize Vietnamese, resulting in poor voice synthesis quality. 4. Inference with overly long paragraphs may produce poor results.