Starred repositories

"Cyberpunk style" for matplotlib plots

Python 1,694 72 Updated Sep 3, 2024

Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS

Python 730 106 Updated Oct 2, 2024

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,087 253 Updated Nov 15, 2024

Generating Talking Face Landmarks from Speech

Python 156 43 Updated Dec 22, 2022

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

Python 257 56 Updated May 23, 2023

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,442 385 Updated Apr 3, 2024

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python 2,553 296 Updated Oct 18, 2024

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,047 156 Updated Nov 4, 2024

Flower: A Friendly Federated AI Framework

Python 5,130 881 Updated Nov 18, 2024

Source code for LCN submission for ADReSS-M challenge (formerly called MADReSS).

Python 10 Updated Jun 1, 2023

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,196 2,551 Updated Nov 9, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 12,494 1,313 Updated Aug 21, 2024

😸 💬 A module to compute textual lexical richness (aka lexical diversity).

Python 92 19 Updated Aug 27, 2023

A Python wrapper for the high-quality vocoder "World"

Cython 725 122 Updated Oct 23, 2023

Jupyter Notebook 981 219 Updated Mar 20, 2024

A PyTorch implementation of the paper "StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks"

Python 514 93 Updated Oct 11, 2019

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

Python 2,788 491 Updated Mar 31, 2023

Single shot neural network pruning before training the model, based on connection sensitivity

Jupyter Notebook 11 2 Updated Aug 7, 2019

Low-level Python library used to interact with a Substra network

Python 271 33 Updated Oct 14, 2024

Papr Readr Bot

Python 6 Updated Apr 11, 2023

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,373 428 Updated Nov 13, 2024

an editor for spoken-word audio with automatic transcription

TypeScript 1,689 40 Updated Oct 11, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 71,411 8,481 Updated Nov 13, 2024

A collection of utilities for handling IPA phones.

Python 24 2 Updated Sep 24, 2023

Automatic classification of the Big-Five personality traits from texts using embeddings and Long short-term memory network.

Jupyter Notebook 1 Updated Jun 9, 2020

Dr.VOT is a software package for automatic measurement of voice onset time (VOT).

Python 27 9 Updated Jul 25, 2023

Add statistical significance annotations to seaborn plots. A further development of statannot, with bug fixes, new features, and a different API.

Python 673 75 Updated Jul 31, 2024

Compute Sentence Embeddings Fast!

Jupyter Notebook 618 83 Updated Mar 2, 2023

A Dataset of German Legal Documents for Named Entity Recognition

Python 160 32 Updated Oct 19, 2022