- SF Bay Area
- UTC -08:00
- https://bryanyzhu.github.io/
Stars
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalitie…
Dead simple FLUX LoRA training UI with LOW VRAM support
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
Whispering Tiger: OpenAI's Whisper (and other models) with OSC and WebSocket support, allowing live transcription and translation in VRChat and overlays in most streaming applications.
An automated pipeline for evaluating LLMs for role-playing.
Instant voice cloning by MIT and MyShell.
Officially recommended ChatTTS resource collection project, compiling related resources from across the web and frequently asked questions.
High-quality datasets, tools, and concepts for LLM fine-tuning.
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…
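Streamer-Sales (销冠, "top seller"): a sales-streamer LLM 🛒🎁 that, given a product's features, presents the product in a way meant to spark the user's desire to buy. 🚀⭐ Includes a detailed data generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, ASR (speech-to-text) 🎙️, a front end built on the Vue ecosystem 🍍, FastAPI …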
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
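Algorithms, papers, datasets, and performance comparisons for Document AI. Continuously updated.
A vocabulary book generated by GPT4 📚, with analyses of more than 8,000 words covering meanings, example sentences, roots and affixes, inflections, cultural background, memory techniques, and short stories.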
Open Source framework for voice and multimodal conversational AI
A simple tool for visually comparing two PDF files
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Recurrent neural network audio manipulation tool to mute "laugh track" audio segments found commonly in sitcoms.
WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.
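Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO and PaddlePaddle.
Extracts WeChat chat history and exports it to HTML, Word, and Excel documents for permanent storage, analyzes the chat history to generate an annual chat report, and uses the chat data to train a personal AI chat assistant.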