-
Music and Audio Research Group(MARG)
- Seoul, Republic of Korea.
-
10:52
(UTC +09:00) - https://www.linkedin.com/in/jonghochoi/
- @_jonghochoi
Stars
🔊 Text-Prompted Generative Audio Model
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
ONSETS&VELOCITIES real-time piano detection - Python demo [EUSIPCO2023]
PiCoGen (Piano Cover Generation) is an academic project aimed at developing an automatic piano cover generation system.
BeatNet is state-of-the-art (Real-Time) and Offline joint music beat, downbeat, tempo, and meter tracking system using CRNN and particle filtering. (ISMIR 2021's paper implementation).
🦜🔗 Build context-aware reasoning applications
Schedule-Free Optimization in PyTorch
Simple GUI for ByteDance's Piano Transcription with Pedals
Learning audio concepts from natural language supervision
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Pytorch project accompanying the paper "Stabilizing Training with Soft Dynamic Time Warping: A Case Study for Pitch Class Estimation with Weakly Aligned Targets", ISMIR 2023
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
This is the official repository for M2UGen
Instant voice cloning by MIT and MyShell.
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
so-vits-svc fork with realtime support, improved interface and more features.
Easily train a good VC model with voice data <= 10 mins!
Free App for Music, Meditation and Podcasts 🎸
Read, write and manipulate GP3, GP4 and GP5 files.