wenhu commited on
Commit
a37ce77
1 Parent(s): cc28ad1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -4
README.md CHANGED
@@ -37,10 +37,12 @@ The models are fine-tuned with the MathInstruct dataset using the original Llama
37
  The models are evaluated using open-ended and multiple-choice math problems from several datasets. Here are the results:
38
 
39
 
40
- | **Model** | **Decoding** | **GSM** | **MATH** | **AQuA** | **NumG** | **SVA** | **Mat** | **Sim** | **SAT** | **MMLU** | **AVG** |
41
- |---------------------------|---------------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|
42
- | **MAmmoTH-7B-Mistral** | CoT | 50.5 | 10.4 | 43.7 | 44.0 | 47.3 | 9.2 | 18.9 | 32.7 | 39.9 | 33.0 |
43
- | | PoT | 51.6 | 28.7 | 43.3 | 52.3 | 65.1 | 41.9 | 48.2 | 39.1 | 44.6 | 46.1 |
 
 
44
 
45
  ## Usage
46
  You can use the models through Huggingface's Transformers library. Use the pipeline function to create a text-generation pipeline with the model of your choice, then feed in a math problem to get the solution.
 
37
  The models are evaluated using open-ended and multiple-choice math problems from several datasets. Here are the results:
38
 
39
 
40
+ | **Model** | **Decoding** | **GSM** | **MATH** | **MMLU** |
41
+ |---------------------------|---------------|-----------|-----------|-----------|
42
+ | **MAmmoTH-7B-Mistral** | CoT | | | |
43
+ | | PoT | | | |
44
+ | | Hybrid | 75.0 | 40.0 | |
45
+
46
 
47
  ## Usage
48
  You can use the models through Huggingface's Transformers library. Use the pipeline function to create a text-generation pipeline with the model of your choice, then feed in a math problem to get the solution.