Release v0.2.34
What's Changed
- Fix tokenizer.pad_token attribute error by @wangshuai09 in #2710
- Support stable-vicuna model by @hi-jin in #2696
- ExLlama 8-bit cache by @mjkaye in #2719
- Add Yi support by @infwinston in #2723
- Add Hermes 2.5 [fixed] by @152334H in #2725
- Fix Hermes2Adapter by @lewtun in #2727
- Fix YiAdapter by @Jingsong-Yan in #2730
- Add trust_remote_code argument by @wangshuai09 in #2715
- Add revision arg to MT Bench answer generation by @lewtun in #2728
- Fix MPS backend 'index out of range' error by @suquark in #2737
- Add Starling support by @infwinston in #2738
- Add DeepSeek chat by @BabyChouSr in #2760
- Add a convenient script for spinning up the API with model workers by @ckgresla in #2790
- Prevent returning partial stop string in vLLM worker by @pandada8 in #2780
- Update UI and new models by @infwinston in #2762
- Support MetaMath by @iojw in #2748
- Use common logging code in the OpenAI API server by @geekoftheweek in #2758
- Show how to turn on experiment tracking for fine-tuning by @morganmcg1 in #2742
- Support xDAN-L1-Chat Model by @xiechengmude in #2732
- Update the version to 0.2.34 by @merrymercy in #2793
New Contributors
- @mjkaye made their first contribution in #2719
- @152334H made their first contribution in #2725
- @Jingsong-Yan made their first contribution in #2730
- @ckgresla made their first contribution in #2790
- @pandada8 made their first contribution in #2780
- @iojw made their first contribution in #2748
- @geekoftheweek made their first contribution in #2758
- @morganmcg1 made their first contribution in #2742
- @xiechengmude made their first contribution in #2732
Full Changelog: v0.2.33...v0.2.34