Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features
-
Updated
Aug 16, 2024 - Python
Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features
Run XTTS with Docker/Podman for voice fine-tuning in Gradio's Web UI
Saya Voice Assistant for Discord AI voice bot: listens, detects keywords, chats via LM Studio, and replies with TTS or voice cloning.
Automatic video translator and dubber using Whisper, XTTS v2 for voice cloning, and Ollama for local LLM translation. Supports 100+ languages.
Dubbing english videos into russian.
XTTS fine-tuning via CLI
A Streamlit web app for AI-powered voice cloning using Coqui XTTS v2. Record or upload reference voices, clone speech in multiple languages, and generate natural audio outputs.
This program is designed to provide a graphical user interface for the xtts_api_server project: https://github.com/daswer123/xtts-api-server
This project aims to find a solution to make the xtts v2 model accessible via an API.
🎙️ Build high-quality, self-hosted Text-to-Speech applications with voice cloning and multi-language support using the XTTS-v2 API.
Add a description, image, and links to the xtts-v2 topic page so that developers can more easily learn about it.
To associate your repository with the xtts-v2 topic, visit your repo's landing page and select "manage topics."