Code repository for training Taiwan-ELM models, including data preprocessing, tokenizer development, and model fine-tuning.
nlp
taiwan
transformer
traditional-chinese
llama
apache2
chinese-dataset
large-language-models
llm
instruction-tuning
large-language-model
twllm
openelm
-
Updated
Aug 11, 2024 - Jupyter Notebook