Build software better, together

DrewThomasson / ebook2audiobook

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

multilingual windows linux docker mac tts english epub chinese gradio audiobooks voice-cloning xtts

Updated Dec 18, 2024
Python

daswer123 / xtts-webui

Star

Webui for using XTTS and for finetuning it

tts finetuning xtts xttsv2 cocqui

Updated Oct 17, 2024
Python

daswer123 / xtts-api-server

Star

A simple FastAPI Server to run XTTSv2

tts tts-api realtime-tts sillytavern xtts xttsv2

Updated Jul 21, 2024
Python

voxos-ai / bolna

Sponsor

Star

End-to-end platform for building voice first multimodal agents

Updated Oct 28, 2024
Python

Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.

Updated Nov 12, 2024
Python

nsourlos / voice_cloning_tools

Star

Various tools to clone a voice

mp3 tts bark opus tortoise deepfake voice-cloning coqui-ai voice-clone coqui-tts xtts xttsv2

Updated Feb 24, 2024
Jupyter Notebook

merekat / children-stories

Star

OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.

data-science ai text-generation tts story neural-networks image-generation llama lora lux audio-generation large-language-models stable-diffusion xtts

Updated Sep 24, 2024
Jupyter Notebook

pbanuru / xtts2-ui

Star

A User Interface for XTTS-2 Text-Based Voice Cloning with 10 seconds

text-to-speech gui voice tts easy-to-use gradio zero-shot zero-shot-learning voice-cloning gradio-interface xtts xttsv2

Updated Jul 1, 2024
Python

omenius / epub2mp3

Star

Converts epub e-book files to mp3 audiobook files.

python text-to-speech deep-learning speech audiobook tts speech-synthesis epub audiobooks xtts xttsv2

Updated Mar 2, 2024
Python

lukaszliniewicz / easy_xtts_trainer

Star

A command line utility to easily finetune XTTS models in a fully automated way. Developed for Pandrator.

tts fine-tuning tts-engine xtts xttsv2 pandrator

Updated Nov 13, 2024
Python

DrewThomasson / doc2interview

Star

This is an interface that will offline convert anything pdf document you give it into an interview between two people discussing it.

pdf tts generative-ai ollama xtts

Updated Dec 8, 2024
Python

Work-Nobu / OhanashiGPT

Star

OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.

data-science ai text-generation image-generation lora audio-generation large-language-models stable-diffusion llamacpp low-rank-adaptation xtts llama3

Updated Sep 13, 2024
Jupyter Notebook

KoppAlexander / Ohanashi-ChildGPT

Star

OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates these stories using an AI-generated voice that mimics a parent, trained on their audio samples. The app also creates illustrations to accompany each story, providing a unique and engaging experience for children.

flux data-science ai text-generation tts story neural-networks image-generation llama lora audio-generation large-language-models stable-diffusion generative-ai xtts

Updated Aug 27, 2024
Jupyter Notebook

bilelouahmed / vocal-assistant

Star

Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.

python text-to-speech neo4j chatbot speech-recognition transcription whisper rag llm mistral-7b xtts

Updated Apr 17, 2024
Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

xtts

Here are 14 public repositories matching this topic...

DrewThomasson / ebook2audiobook

daswer123 / xtts-webui

daswer123 / xtts-api-server

voxos-ai / bolna

lukaszliniewicz / Pandrator

nsourlos / voice_cloning_tools

merekat / children-stories

pbanuru / xtts2-ui

omenius / epub2mp3

lukaszliniewicz / easy_xtts_trainer

DrewThomasson / doc2interview

Work-Nobu / OhanashiGPT

KoppAlexander / Ohanashi-ChildGPT

bilelouahmed / vocal-assistant

Improve this page

Add this topic to your repo