Date: 2024-03-19-16-31-03
Model: gpt-3.5-azure
Test on 1000 samples:
Accuracy: 0.056
Date: 2024-03-19-17-42-24
Model: gpt-3.5-azure
Test on 1000 samples:
Accuracy: 0.036
Date: 2024-03-19-18-44-04
Model: gpt-3.5-azure
Test on 1000 samples:
Accuracy: 0.032
Date: 2024-03-19-19-45-57
Model: gpt-3.5-azure
Test on 1000 samples:
Accuracy: 0.034
Date: 2024-03-20-21-19-40
Model: gpt-3.5-azure
Test on 1 samples:
Accuracy: 0.0
Date: 2024-03-20-21-20-38
Model: gpt-3.5-azure
Test on 10 samples:
Accuracy: 0.1
Date: 2024-03-20-21-24-00
Model: gpt-3.5-azure
Test on 10 samples:
Accuracy: 0.0
Date: 2024-03-20-21-24-53
Model: gpt-3.5-azure
Test on 10 samples:
Accuracy: 0.6
Date: 2024-03-20-21-26-48
Model: gpt-3.5-azure
Test on 10 samples:
Accuracy: 0.1
Date: 2024-03-20-21-29-20
Model: gpt-3.5-azure
Test on 10 samples:
Accuracy: 0.2
Date: 2024-03-20-21-31-48
Model: gpt-3.5-azure
Test on 10 samples:
Accuracy: 0.2
Date: 2024-03-20-21-35-57
Model: gpt-3.5-azure
Test on 10 samples:
Accuracy: 0.3
Date: 2024-03-20-21-40-04
Model: gpt-3.5-azure
Test on 10 samples:
Accuracy: 0.3
Date: 2024-03-20-21-40-38
Model: gpt-3.5-azure
Test on 10 samples:
Accuracy: 0.3
Date: 2024-04-09-20-17-35
Model: gpt-3.5-azure
Test on 1 samples:
Accuracy: 0.0
Date: 2024-04-09-20-18-29
Model: gpt-3.5-azure
Test on 1 samples:
Accuracy: 1.0
Date: 2024-04-15-09-00-42
Split-value: 1Model: gpt-3.5-azure
Test on 1000 samples:
Accuracy: 0.502
Generation type: propose
Date: 2024-04-16-06-52-19
Split-value: 1Model: gpt-3.5-azure
Test on 1000 samples:
Accuracy: 0.523
Generation type: propose
Date: 2024-04-17-01-22-34
Split-value: 1Model: gpt-3.5-azure
Test on 1000 samples:
Accuracy: 0.478
Generation type: sample
Date: 2024-04-20-02-11-30
Split-value: 1Model: gpt-3.5-azure
Test on 1000 samples:
Accuracy: 0.475
Generation type: sample
Date: 2024-04-22-22-03-52
Split-value: 1Model: gpt-3.5-azure
Test on 1000 samples:
Accuracy: 0.498
Generation type: propose
Date: 2024-04-26-07-41-11
Split-value: 1Model: gpt-3.5-azure
Test on 1000 samples:
Accuracy: 0.508
Generation type: propose
Date: 2024-05-12-23-39-17
Split-value: 1Model: mixtral-7B
Test on 1000 samples:
Accuracy: 0.463
Generation type: sample
Date: 2024-05-27-02-59-26
Split-value: 1Model: llama-3-8B-groq
Test on 1000 samples:
Accuracy: 0.48
Generation type: propose
Date: 2024-06-05-11-08-37
Split-value: 1Model: llama-3-70B
Test on 1000 samples:
Accuracy: 0.544
Generation type: sample
