- Salesforce
- Los Angeles, California
- http://khuangaf.github.io/
- @steeve__huang
Stars
CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments
Code for Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World
Code for the research paper "Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate"
Accelerating the development of large multimodal models (LMMs) with lmms-eval
Multilingual safety benchmark for Large Language Models
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning
An open source implementation of CLIP.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
A very simple Salesforce.com REST API client for Python
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
A guidance language for controlling large language models.
Paper list of misinformation research using (multi-modal) large language models, i.e., (M)LLMs.
Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models (ACL-Findings 2024)
Repo for paper: https://arxiv.org/abs/2404.06479
AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation (NAACL 2024)
A curated list of papers and resources based on "Large Language Models on Graphs: A Comprehensive Survey"
Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.