Zalo AI Challenge 2023 - Elementary Maths Solving

A Solution of top-5 private leaderboard

Problem

Problem Statement can be found at https://challenge.zalo.ai/portal/elementary-maths-solving

Overview

Our inference pipeline

Our contributions as follow: 😍

Creating rule-base algorithm to pass basic testcases
Collecting and augmenting Vietnamese elementary mathematics from internet
Training a LLM
Applying RAG technique
Implementing re-evaluation method for inference

Details

1. Rule-base algorithm

We were able to calculate and derive outcomes directly by using regex and numexprto infer results without relying solely on LLM.

2. Dataset

The given dataset contains approx. 1200 training examples, half of them include explanation field. So we decided to collect more multiple choice data from VietJack. Furthermore, we augmented data by calling GPT-4 API to fill missing explanation samples. We also created dataset programmatically for some types of math problem (including basic calculation). To diversify our dataset, we translated famous public datasets from Huggingface 🤗. Note that our dataset not only contains samples in multiple choice format, but also in question-answering format.

3. LLM

We conducted experiments using publicly available 7B and 13B models from WizardLM, meta-math, FelixChao, and EleutherAI. For efficient training, we employed LoRA, deepspeed and used hyperparameter tuning techniques to identify the optimal model configuration.

4. RAG

We utilized RAG to enhance accuracy of LLM. Our model encountered frequent failures in certain problem types, and this is where RAG shined. We appended RAG knowledges into the input prompt to provide LLM with additional information, hence improving its reasoning abilities. In total, we had 10-20 knowledges and employed a simple keyword-based matching algorithm for retrieval.

5. Re-evaluation

Although applying advanced techniques, we observed that LLM still encountered challenges in certain problems due to limitations in calculation abilities, despite their correct reasoning capabilities. To address this, we implemented a big loop where we re-evaluated calculation results using numexpr each time a equation appeared in output. Note that, in order to reduce the complexity of our solution, only basic arithmetic equations would be considered.

Authors

Hope you guys love our solution ! 🥰 🥰 🥰

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Zalo AI Challenge 2023 - Elementary Maths Solving

A Solution of top-5 private leaderboard

Problem

Overview

Details

1. Rule-base algorithm

2. Dataset

3. LLM

4. RAG

5. Re-evaluation

Authors

About

Releases

Packages

duongkstn/LLM-elementary-maths-solving-pipeline

Folders and files

Latest commit

History

Repository files navigation

Zalo AI Challenge 2023 - Elementary Maths Solving

A Solution of top-5 private leaderboard

Problem

Overview

Details

1. Rule-base algorithm

2. Dataset

3. LLM

4. RAG

5. Re-evaluation

Authors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages