interspeech

Here are 19 public repositories matching this topic...

gabrielmittag / NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

text-to-speech deep-learning pytorch tts speech-synthesis voice-conversion icassp speech-quality quality-of-experience interspeech

Updated Mar 8, 2024
Python

DmitryRyumin / INTERSPEECH-2023-24-Papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

Updated Aug 9, 2024

soham97 / awesome-sound_event_detection

Reading list for research topics in Sound AI

representation-learning audio-processing zero-shot-learning icassp sound-event-detection interspeech acoustic-scene-classification audio-captioning audio-generation audio-retrieval

Updated Aug 8, 2024

DmitryRyumin / NewEraAI-Papers

The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!

natural-language-processing computer-vision deep-learning text-classification signal-processing image-processing artificial-intelligence video-processing neural-networks emnlp cvpr iccv icassp ismir interspeech mashine-learning

Updated May 18, 2024
Python

BakerBunker / FreeV

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

speech speech-synthesis vocoder interspeech

Updated Jul 4, 2024
Python

FrenchKrab / IS2023-powerset-diarization

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

speaker-diarization interspeech pyannote

Updated Oct 18, 2023
Jupyter Notebook

hechmik / voxceleb_enrichment_age_gender

Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021

machine-learning deep-learning sound gender-recognition age age-regression age-prediction interspeech voxceleb asru2021 voxceleb-enrichment

Updated Dec 18, 2021
Jupyter Notebook

pika-online / AESRC2020

A deep accent recognition network

keras resnet speaker-recognition asr ctc mtl crnn arcface netvlad interspeech cosface ghostvlad circle-loss accent-recognition

Updated Aug 25, 2021
Python

ronggong / interspeech2018_submission01

Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions

hmm keras cnn forced-alignment hsmm beijing-opera singing-voice interspeech

Updated Aug 8, 2018
Python

doerlbh / MiniVox

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

paper speaker-recognition online-learning speaker-diarization contextual-bandits bandit-algorithms interspeech self-supervised-learning acml interspeech2020 online-speaker-diarization

Updated Sep 20, 2021
Cuda

Lhx94As / PHO-LID

PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification

pytorch interspeech spoken-language-identification

Updated Aug 24, 2023
Python

doheejin / SB_loss_PA

This repository is the implementation of the paper "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).

nlp apa language-learning pronunciation assessment loss-functions scoring-functions interspeech pronunciation-scoring balanced-loss interspeech2023 score-balanced-loss automatic-pronunciation-assessment

Updated Apr 29, 2024
Python

cmu-mlsp / learning_from_weak_labels

[Interspeech 2022] Tutorial - Learning from Weak Labels

interspeech weak-label

Updated Oct 23, 2024
MATLAB

whydinkov / interspeech-2019

Interspeech 2019 experiments

nlp sklearn keras audio-processing interspeech

Updated Aug 28, 2019
Python

jlinear / ReMASC_Exp

Baseline Experiments for ReMASC dataset.

vcs replay-attack interspeech remasc

Updated Mar 14, 2020
C

allyoushawn / timit_gas

The implementation code for the paper "Gate Activation Signal Analysis for Gated Recurrent Neural Networks and Its Correlation with Phoneme Boundaries"

deep-learning tensorflow rnn speech-processing interspeech interspeech2017