[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models

Python 61 3 Updated Mar 31, 2024

trallnag / prometheus-fastapi-instrumentator

Instrument your FastAPI with Prometheus metrics.

Python 942 84 Updated Jun 17, 2024

KdaiP / StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Python 347 39 Updated Sep 13, 2024

jasonppy / VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,512 739 Updated Jun 24, 2024

unrealspeech / unrealspeech

Python 7 Updated Mar 11, 2024

thuhcsi / DiffVar

Python 30 5 Updated Aug 12, 2023

X-LANCE / StoryTTS

[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations

HTML 132 4 Updated Apr 27, 2024

leminhnguyen / useful_snippets

This repository provides some useful snippets that you may need in some situations.

Shell 10 Updated Jan 16, 2024

huggingface / setfit

Efficient few-shot learning with Sentence Transformers

Jupyter Notebook 2,173 219 Updated Sep 19, 2024

sh-lee-prml / HierSpeechpp-demo

HTML 3 Updated Dec 5, 2023

Xrehman / StyleTTS

Forked from yl4579/StyleTTS

Official Implementation of StyleTTS

Jupyter Notebook 1 Updated Nov 3, 2023

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 7,844 1,110 Updated Sep 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bao-Sinh Nguyen sinhprous1

Block or report sinhprous1

Stars

aishwaryanr / awesome-generative-ai-guide

csteinmetz1 / ai-audio-startups

thinh-vu / vnstock

zhenye234 / xcodec

zhenye234 / FlashSpeech

jishengpeng / TextrolSpeech

acherstyx / AutoTransition

cwx-worst-one / EAT

maum-ai / univnet

VSydorskyy / BirdCLEF_2023_1st_place

flinkerlab / neural_speech_decoding

gitmylo / bark-voice-cloning-HuBERT-quantizer

metavoiceio / metavoice-src

haoheliu / versatile_audio_super_resolution

huggingface / parler-tts

seastar105 / pflow-encodec

ga642381 / speech-trident

shivammehta25 / Matcha-TTS

XiangLi2022 / CM-TTS