We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HF: https://huggingface.co/SparkAudio/Spark-TTS-0.5B
A new TTS model that can be supported by llama.cpp.
I might be wrong here but it seems like SparkTTS has a simlar architecture as OuteTTS and Orpheus TTS (#12476) but it uses Qwen2.5-0.5B.
They are using their own audio decoder called BiCodec. Sample python implementation: https://github.com/SparkAudio/Spark-TTS/blob/main/sparktts/models/bicodec.py
Similar model support (OuteTTS): #10784 Can be used as a reference how to implement this.
The text was updated successfully, but these errors were encountered:
No branches or pull requests
Prerequisites
Feature Description
HF: https://huggingface.co/SparkAudio/Spark-TTS-0.5B
Motivation
A new TTS model that can be supported by llama.cpp.
Possible Implementation
I might be wrong here but it seems like SparkTTS has a simlar architecture as OuteTTS and Orpheus TTS (#12476) but it uses Qwen2.5-0.5B.
They are using their own audio decoder called BiCodec. Sample python implementation: https://github.com/SparkAudio/Spark-TTS/blob/main/sparktts/models/bicodec.py
Similar model support (OuteTTS): #10784
Can be used as a reference how to implement this.
The text was updated successfully, but these errors were encountered: