Release v0.2.34

@wangshuai09

What's Changed

fix tokenizer.pad_token attribute error by @wangshuai09 in #2710
support stable-vicuna model by @hi-jin in #2696
Exllama cache 8bit by @mjkaye in #2719
Add Yi support by @infwinston in #2723
Add Hermes 2.5 [fixed] by @152334H in #2725
Fix Hermes2Adapter by @lewtun in #2727
Fix YiAdapter by @Jingsong-Yan in #2730
add trust_remote_code argument by @wangshuai09 in #2715
Add revision arg to MT Bench answer generation by @lewtun in #2728
Fix MPS backend 'index out of range' error by @suquark in #2737
add starling support by @infwinston in #2738
Add deepseek chat by @BabyChouSr in #2760
a convenient script for spinning up the API with Model Workers by @ckgresla in #2790
Prevent returning partial stop string in vllm worker by @pandada8 in #2780
Update UI and new models by @infwinston in #2762
Support MetaMath by @iojw in #2748
Use common logging code in the OpenAI API server by @geekoftheweek in #2758
Show how to turn on experiment tracking for fine-tuning by @morganmcg1 in #2742
Support xDAN-L1-Chat Model by @xiechengmude in #2732
Update the version to 0.2.34 by @merrymercy in #2793

Full Changelog: v0.2.33...v0.2.34