Text-to-Speech
Transformers
Safetensors
higgs_multimodal_qwen3
text-generation
speech-generation
voice-agent
expressive-speech
controllable-tts
multilingual-tts
Instructions to use bosonai/higgs-audio-v3-tts-4b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use bosonai/higgs-audio-v3-tts-4b with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("text-to-speech", model="bosonai/higgs-audio-v3-tts-4b")

    # Load model directly
    from transformers import AutoModelForSeq2SeqLM

    model = AutoModelForSeq2SeqLM.from_pretrained("bosonai/higgs-audio-v3-tts-4b", dtype="auto")
- Notebooks
- Google Colab
- Kaggle
- ComfyUI Ready 🎉 (#4, opened about 8 hours ago by drbaph)
- VRAM use (#3, opened about 11 hours ago by JacobR22; 3 comments)
- How to handle mixed Mandarin and Cantonese text when the model does not support per-word language selection? (#2, opened about 21 hours ago by sgxtj; 1 comment)
- Space Errors when reference audio clip is provided (#1, opened about 23 hours ago by weightsnweights; 1 comment)