0000000000000000000000000000000000000000 2a187daf512e2ea2614661cbc444dec5bb716a2f Luca Benedetto <luca.benedetto93@gmail.com> 1698835054 +0000	clone: from github.com:lucabenedetto/roleplaying-qa-llms.git
2a187daf512e2ea2614661cbc444dec5bb716a2f f75b41d10326c658987bfe7826becfd84eff42af Luca Benedetto <luca.benedetto93@gmail.com> 1698835390 +0000	commit: add first version of the .py files
f75b41d10326c658987bfe7826becfd84eff42af 8bf0fa882c41409ab393c445cc496cd46cda8e67 Luca Benedetto <luca.benedetto93@gmail.com> 1698835430 +0000	commit: add gitignore
8bf0fa882c41409ab393c445cc496cd46cda8e67 118ce88d60837aa79e03a8b4fbbbaf78f934620f Luca Benedetto <luca.benedetto93@gmail.com> 1698835555 +0000	commit: add first version of requirements file
118ce88d60837aa79e03a8b4fbbbaf78f934620f 0f6be2c0d6dacdf05b858fea840d17e4eb0606fd Luca Benedetto <luca.benedetto93@gmail.com> 1698850112 +0000	commit: add levels and system message for prompt id 46
0f6be2c0d6dacdf05b858fea840d17e4eb0606fd 4689f029b31bc2f9da95bd7c74135c518b18ee77 Luca Benedetto <luca.benedetto93@gmail.com> 1698853438 +0000	commit: added "best" prompts for reading comprehensino questions
4689f029b31bc2f9da95bd7c74135c518b18ee77 e889776516199a6a9db5c7079956461b043c2531 Luca Benedetto <luca.benedetto93@gmail.com> 1698857689 +0000	commit: rename file
e889776516199a6a9db5c7079956461b043c2531 71266ed22496e2d692067b71972385184eadc324 Luca Benedetto <luca.benedetto93@gmail.com> 1698859438 +0000	commit: add requirements
71266ed22496e2d692067b71972385184eadc324 539c17ab291484aa9280de2b09ac0ae12578f065 Luca Benedetto <luca.benedetto93@gmail.com> 1698859450 +0000	commit: add first version to test inference with llama
539c17ab291484aa9280de2b09ac0ae12578f065 3b426220b1e43005324ae960b23451af61e48f84 Luca Benedetto <luca.benedetto93@gmail.com> 1698943210 +0000	commit: add accelerate. Had an error with llama (ImportError: Using `low_cpu_mem_usage=True` or a `device_map` requires Accelerate: `pip install accelerate`)
3b426220b1e43005324ae960b23451af61e48f84 8e8d58cf469666537935113b14b86fdff530c0c6 Luca Benedetto <luca.benedetto93@gmail.com> 1698944542 +0000	commit: first test with arc data
8e8d58cf469666537935113b14b86fdff530c0c6 acffed1e798efe738ffe7af8941b3159dc74b4f1 Luca Benedetto <luca.benedetto93@gmail.com> 1698945016 +0000	commit: add first test of prompt from ARC for llama2
acffed1e798efe738ffe7af8941b3159dc74b4f1 f75438d706e1ea4f8c79cea7ae16cde2e17af2d1 Luca Benedetto <luca.benedetto93@gmail.com> 1698945387 +0000	commit: new tests on max_length
f75438d706e1ea4f8c79cea7ae16cde2e17af2d1 8b1c19a0e036db76790d14ad45c12d7242c820e8 Luca Benedetto <luca.benedetto93@gmail.com> 1698945787 +0000	commit: new tests on max_length
8b1c19a0e036db76790d14ad45c12d7242c820e8 a5aaf034ac63e3731b5609ad87a33732d3e2b74d Luca Benedetto <luca.benedetto93@gmail.com> 1698946290 +0000	commit: new tests on max_length
a5aaf034ac63e3731b5609ad87a33732d3e2b74d 4eb285ad9a10b6f5317a0065eb675e0aa3966412 Luca Benedetto <luca.benedetto93@gmail.com> 1699006767 +0000	commit: add venv to gitignore
4eb285ad9a10b6f5317a0065eb675e0aa3966412 4eb285ad9a10b6f5317a0065eb675e0aa3966412 Luca Benedetto <luca.benedetto93@gmail.com> 1699006808 +0000	checkout: moving from main to 231103-refactor-methods-to-work-with-both-llama-and-gpt
4eb285ad9a10b6f5317a0065eb675e0aa3966412 47057418c78e065877f79bd0050e9379a613e806 Luca Benedetto <luca.benedetto93@gmail.com> 1699008599 +0000	commit: refactor constants and add new ones
47057418c78e065877f79bd0050e9379a613e806 4fdb67a8d882a2132dc3c443397041462b43c0a2 Luca Benedetto <luca.benedetto93@gmail.com> 1699008647 +0000	commit: moved shared utils to the utils.py file and add new prompt to list of system settings
4fdb67a8d882a2132dc3c443397041462b43c0a2 32d26c15afa6e2b103e2fd58e006b453b42b8df0 Luca Benedetto <luca.benedetto93@gmail.com> 1699008682 +0000	commit: refactor script to use updates methods and constants
32d26c15afa6e2b103e2fd58e006b453b42b8df0 6f6cf470042f14136e58d9cd19a96ecc66a992d6 Luca Benedetto <luca.benedetto93@gmail.com> 1699008701 +0000	commit: add first version of script that uses llama
6f6cf470042f14136e58d9cd19a96ecc66a992d6 d586b6c599bf85655880dc69626aa5f20ba929e8 Luca Benedetto <luca.benedetto93@gmail.com> 1699008976 +0000	commit: rename constants for data paths
d586b6c599bf85655880dc69626aa5f20ba929e8 eb52df86be901e08b45a525b90dd1b80679dacb1 Luca Benedetto <luca.benedetto93@gmail.com> 1699009761 +0000	commit: fix the if else to select prompts and user levels
eb52df86be901e08b45a525b90dd1b80679dacb1 9dcd1113ee1e61e00386996e19776c8a94414dd8 Luca Benedetto <luca.benedetto93@gmail.com> 1699020074 +0000	commit: add datasets to requirements
9dcd1113ee1e61e00386996e19776c8a94414dd8 4bf7d51b0c5ba104b0d32f1cb419ef3b91ea41c2 Luca Benedetto <luca.benedetto93@gmail.com> 1699020496 +0000	commit: add datasets to requirements
4bf7d51b0c5ba104b0d32f1cb419ef3b91ea41c2 7be22d37196db557cbb7cb6e96dd883e0b6aedf1 Luca Benedetto <luca.benedetto93@gmail.com> 1699020510 +0000	commit: refactor script for llama
7be22d37196db557cbb7cb6e96dd883e0b6aedf1 0c2bbd616d770b90769d61b7ac9dfc388d76c805 Luca Benedetto <luca.benedetto93@gmail.com> 1699020621 +0000	commit: fix access to column input_prompt
0c2bbd616d770b90769d61b7ac9dfc388d76c805 a3e5d8b9b82cd8e93f1752ff855e9a00be454c6a Luca Benedetto <luca.benedetto93@gmail.com> 1699020669 +0000	commit: fix to_list
a3e5d8b9b82cd8e93f1752ff855e9a00be454c6a 54371ea374f610d01969c6ad4601a63beab8af8e Luca Benedetto <luca.benedetto93@gmail.com> 1699021321 +0000	commit: update code for dataset preparation
54371ea374f610d01969c6ad4601a63beab8af8e 87851bd79dccb96699b11bb173e1f5bf2dab2831 Luca Benedetto <luca.benedetto93@gmail.com> 1699021447 +0000	commit: update code for dataset preparation
87851bd79dccb96699b11bb173e1f5bf2dab2831 822315da8ea86fd98ea451300c2038d3bef21fa8 Luca Benedetto <luca.benedetto93@gmail.com> 1699021497 +0000	commit: update code for dataset preparation
822315da8ea86fd98ea451300c2038d3bef21fa8 9c417f6f4d9c873d1ffe474b97d76b66f66ef142 Luca Benedetto <luca.benedetto93@gmail.com> 1699021655 +0000	commit: update code for dataset preparation
9c417f6f4d9c873d1ffe474b97d76b66f66ef142 441c4a63d942e7b3937621349197e4fb91900056 Luca Benedetto <luca.benedetto93@gmail.com> 1699021880 +0000	commit: update code for dataset preparation
441c4a63d942e7b3937621349197e4fb91900056 7a6fef244a8a53362857018c093da8f45139b669 Luca Benedetto <luca.benedetto93@gmail.com> 1699022092 +0000	commit: update code for dataset preparation
7a6fef244a8a53362857018c093da8f45139b669 a6bb5dc15c2c1bb86552b3a36d871fa767ec4805 Luca Benedetto <luca.benedetto93@gmail.com> 1699022327 +0000	commit: update code for dataset preparation
a6bb5dc15c2c1bb86552b3a36d871fa767ec4805 5e3eff821a2656ae5d93c13cba66916806e46e9e Luca Benedetto <luca.benedetto93@gmail.com> 1699022573 +0000	commit: update code for dataset preparation
5e3eff821a2656ae5d93c13cba66916806e46e9e a05da7e08020315d2b5a0e005b90faed85351f54 Luca Benedetto <luca.benedetto93@gmail.com> 1699022874 +0000	commit: update code for dataset preparation
a05da7e08020315d2b5a0e005b90faed85351f54 fccffa54c16ee5dcf7113d4379f4e446142fc4d3 Luca Benedetto <luca.benedetto93@gmail.com> 1699023099 +0000	commit: update code for dataset preparation
fccffa54c16ee5dcf7113d4379f4e446142fc4d3 b454173c25bd26544fe4a258a4cd15e4bb7e46c4 Luca Benedetto <luca.benedetto93@gmail.com> 1699023474 +0000	commit: update code for dataset preparation
b454173c25bd26544fe4a258a4cd15e4bb7e46c4 44f03cbd36fc2b3f8c812d34f1b1d2f918787309 Luca Benedetto <luca.benedetto93@gmail.com> 1699024235 +0000	commit: refactor preprocessing of llama answer
44f03cbd36fc2b3f8c812d34f1b1d2f918787309 316a81c1f85ab1c17c5af9bb74776ca3fc6fd155 Luca Benedetto <luca.benedetto93@gmail.com> 1699024387 +0000	commit: refactor utils methods into a utils_llama file
316a81c1f85ab1c17c5af9bb74776ca3fc6fd155 da6661d333ce102fea23727ad3be79612f322920 Luca Benedetto <luca.benedetto93@gmail.com> 1699024481 +0000	commit: fix postprocessing of llama responses
da6661d333ce102fea23727ad3be79612f322920 4b42147e7183ffe4745c301afb108a89aa487b51 Luca Benedetto <luca.benedetto93@gmail.com> 1699024615 +0000	commit: add data and pycache to gitignore
4b42147e7183ffe4745c301afb108a89aa487b51 f077c690c57b830a1358c10ddff246f82cf68de0 Luca Benedetto <luca.benedetto93@gmail.com> 1699025161 +0000	commit: remove tmp line
f077c690c57b830a1358c10ddff246f82cf68de0 0a36971c8b1e10f1ec5afd08827bc45528e96d56 Luca Benedetto <luca.benedetto93@gmail.com> 1699026741 +0000	commit: remove old file
0a36971c8b1e10f1ec5afd08827bc45528e96d56 7955f46069516e0b7207112947896905c5e96d31 Luca Benedetto <luca.benedetto93@gmail.com> 1699026802 +0000	commit: remove datasets from requirements
7955f46069516e0b7207112947896905c5e96d31 4eb285ad9a10b6f5317a0065eb675e0aa3966412 Luca Benedetto <luca.benedetto93@gmail.com> 1699026957 +0000	checkout: moving from 231103-refactor-methods-to-work-with-both-llama-and-gpt to main
4eb285ad9a10b6f5317a0065eb675e0aa3966412 001a4ba1d2fa1bc2154a6edd2e36b5375b7c949a Luca Benedetto <luca.benedetto93@gmail.com> 1699026962 +0000	pull: Fast-forward
001a4ba1d2fa1bc2154a6edd2e36b5375b7c949a 001a4ba1d2fa1bc2154a6edd2e36b5375b7c949a Luca Benedetto <luca.benedetto93@gmail.com> 1699028280 +0000	checkout: moving from main to 231103-add-new-prompts
001a4ba1d2fa1bc2154a6edd2e36b5375b7c949a 001a4ba1d2fa1bc2154a6edd2e36b5375b7c949a Luca Benedetto <luca.benedetto93@gmail.com> 1699264096 +0000	checkout: moving from 231103-add-new-prompts to main
001a4ba1d2fa1bc2154a6edd2e36b5375b7c949a 001a4ba1d2fa1bc2154a6edd2e36b5375b7c949a Luca Benedetto <luca.benedetto93@gmail.com> 1699264111 +0000	checkout: moving from main to 231106-add-script-eval
001a4ba1d2fa1bc2154a6edd2e36b5375b7c949a 138ac6080cd2bf1da7b2ea445954e1d74dd530bf Luca Benedetto <luca.benedetto93@gmail.com> 1699264246 +0000	commit: add tmp to gitignore
138ac6080cd2bf1da7b2ea445954e1d74dd530bf 7e4fc4442b41dca0814a75f18a28d9699653366d Luca Benedetto <luca.benedetto93@gmail.com> 1699265165 +0000	commit: add method to keep only questions answered by all role-played levels
7e4fc4442b41dca0814a75f18a28d9699653366d e39ea53b01cf2d8bb4a0ed0292d2b77508a2f05e Luca Benedetto <luca.benedetto93@gmail.com> 1699265830 +0000	commit: add numpy to requirements
e39ea53b01cf2d8bb4a0ed0292d2b77508a2f05e 33aafd8bcb941e9865cf277679e5a26dc2dc314a Luca Benedetto <luca.benedetto93@gmail.com> 1699267752 +0000	commit: add difficulty levels for the two datasets to the constants file
33aafd8bcb941e9865cf277679e5a26dc2dc314a 870345987898aa6be166563c56b053de327c1163 Luca Benedetto <luca.benedetto93@gmail.com> 1699267982 +0000	commit: add method get_original_dataset to get the whole test datasets
870345987898aa6be166563c56b053de327c1163 e10102d111e7bed64b2231236fa01960eda5c32f Luca Benedetto <luca.benedetto93@gmail.com> 1699268117 +0000	commit: add method to get dict that maps from qid to correct answer
e10102d111e7bed64b2231236fa01960eda5c32f 916bf5dc356f83624ecc44e6f63ef9eef3710785 Luca Benedetto <luca.benedetto93@gmail.com> 1699268239 +0000	commit: add method that returns dict that maps from qid to true difficulty
916bf5dc356f83624ecc44e6f63ef9eef3710785 f9fbad4126ff30523c71d5b052328e6a64ac1ee8 Luca Benedetto <luca.benedetto93@gmail.com> 1699268504 +0000	commit: add methods that returns the mapping from "true" difficulty ot the set of questions with that difficulty
f9fbad4126ff30523c71d5b052328e6a64ac1ee8 10f79ef43e786faee03227dc011cd5e7914afbf6 Luca Benedetto <luca.benedetto93@gmail.com> 1699269344 +0000	commit: add method that returns the avg accuracy per role-played level (both overall and per difficulty level)
10f79ef43e786faee03227dc011cd5e7914afbf6 8839ca4311ba306417edadc70f178040cb5a4e38 Luca Benedetto <luca.benedetto93@gmail.com> 1699269915 +0000	commit: add method get_response_correctness_per_model to utils
8839ca4311ba306417edadc70f178040cb5a4e38 5801297e9aeacf2b3b065841f29da312bfd94278 Luca Benedetto <luca.benedetto93@gmail.com> 1699270393 +0000	commit: add matplotlib to requirements
5801297e9aeacf2b3b065841f29da312bfd94278 9e930d9925e93bbf234ff109bbb324a3ae530903 Luca Benedetto <luca.benedetto93@gmail.com> 1699271650 +0000	commit: fix method get_response_correctness_per_model
9e930d9925e93bbf234ff109bbb324a3ae530903 b9b071a68c28cd989d0b0ae2027a5d3db770443c Luca Benedetto <luca.benedetto93@gmail.com> 1699271871 +0000	commit: add first version methods for plot_accuracy_per_model and plot_accuracy_per_difficulty_per_model
b9b071a68c28cd989d0b0ae2027a5d3db770443c b6cef248c0e8c583b840c65e329574b77d57841b Luca Benedetto <luca.benedetto93@gmail.com> 1699272342 +0000	commit: add formatting of plot for method plot_accuracy_per_model
b6cef248c0e8c583b840c65e329574b77d57841b bcf6b788f0753eeaae7856987533cdaebea5477e Luca Benedetto <luca.benedetto93@gmail.com> 1699272575 +0000	commit: clean formatting of method plot_accuracy_per_difficulty_per_model
bcf6b788f0753eeaae7856987533cdaebea5477e 75ce32679470f4bc868ccbdc9988af0ba8bb040a Luca Benedetto <luca.benedetto93@gmail.com> 1699273205 +0000	commit: add first version of the script to eval the LLM responses
75ce32679470f4bc868ccbdc9988af0ba8bb040a e953b609f857369731a25a44ec3456115662e1f4 Luca Benedetto <luca.benedetto93@gmail.com> 1699273758 +0000	commit: add method plot_accuracy_per_difficulty_for_different_role_played_levels
e953b609f857369731a25a44ec3456115662e1f4 931701ed8de492ad2338ce19819bbbf834410f81 Luca Benedetto <luca.benedetto93@gmail.com> 1699275056 +0000	commit: add seaborn to requirements
931701ed8de492ad2338ce19819bbbf834410f81 445c964becbbe75fc8c802dcb4589ad510e5490a Luca Benedetto <luca.benedetto93@gmail.com> 1699275255 +0000	commit: add plot to study correlation between QA accuracy and true difficulty
445c964becbbe75fc8c802dcb4589ad510e5490a 269476893960f566b5d58e25802767be3290a482 Luca Benedetto <luca.benedetto93@gmail.com> 1699275370 +0000	commit: change definition of folder_name
269476893960f566b5d58e25802767be3290a482 743a48d3bd03abf2cac32a721bc08d338e910bbf Luca Benedetto <luca.benedetto93@gmail.com> 1699275390 +0000	commit: add first complete version of the script to plot the analysis of the results
743a48d3bd03abf2cac32a721bc08d338e910bbf 001a4ba1d2fa1bc2154a6edd2e36b5375b7c949a Luca Benedetto <luca.benedetto93@gmail.com> 1699275512 +0000	checkout: moving from 231106-add-script-eval to main
001a4ba1d2fa1bc2154a6edd2e36b5375b7c949a e47794ff2186da2a91eef4c4754ade04855f4035 Luca Benedetto <luca.benedetto93@gmail.com> 1699275516 +0000	pull: Fast-forward
e47794ff2186da2a91eef4c4754ade04855f4035 e47794ff2186da2a91eef4c4754ade04855f4035 Luca Benedetto <luca.benedetto93@gmail.com> 1699276302 +0000	checkout: moving from main to 231106-fix-student-level-idx-in-output-file
e47794ff2186da2a91eef4c4754ade04855f4035 aa3fe23f94f6a3c2cd6661d1e219203f314fbd62 Luca Benedetto <luca.benedetto93@gmail.com> 1699276745 +0000	commit: fix idx to access output files
aa3fe23f94f6a3c2cd6661d1e219203f314fbd62 e47794ff2186da2a91eef4c4754ade04855f4035 Luca Benedetto <luca.benedetto93@gmail.com> 1699276887 +0000	checkout: moving from 231106-fix-student-level-idx-in-output-file to main
e47794ff2186da2a91eef4c4754ade04855f4035 2d738079de79229c01a23864c50d41d6475d89ed Luca Benedetto <luca.benedetto93@gmail.com> 1699276891 +0000	pull: Fast-forward
2d738079de79229c01a23864c50d41d6475d89ed aada3d79cbac58b560067ed50d40d5c1aa1422b9 Luca Benedetto <luca.benedetto93@gmail.com> 1699277167 +0000	commit: fix definition of five_levels_int
aada3d79cbac58b560067ed50d40d5c1aa1422b9 aada3d79cbac58b560067ed50d40d5c1aa1422b9 Luca Benedetto <luca.benedetto93@gmail.com> 1699348997 +0000	checkout: moving from main to 231107-analyse-gpt-responses
aada3d79cbac58b560067ed50d40d5c1aa1422b9 6e79d0cc00da88685f4363cf075c972a58f2bf69 Luca Benedetto <luca.benedetto93@gmail.com> 1699349037 +0000	commit: add output_figures to gitignore
6e79d0cc00da88685f4363cf075c972a58f2bf69 bebc506d63f071bf1e7fcade5e30fd29b2385822 Luca Benedetto <luca.benedetto93@gmail.com> 1699349268 +0000	commit: add output_filepath as optional argument
bebc506d63f071bf1e7fcade5e30fd29b2385822 42897ecccbe3af030a9eaeec142c260a24e7d655 Luca Benedetto <luca.benedetto93@gmail.com> 1699349365 +0000	commit: add if-else to plot or save the figures
42897ecccbe3af030a9eaeec142c260a24e7d655 240df4767274c0430ee71aff57ceae2483f51ed1 Luca Benedetto <luca.benedetto93@gmail.com> 1699349794 +0000	commit: add figsize as optional argument
240df4767274c0430ee71aff57ceae2483f51ed1 251ccf8a3c164164a0603b173233aad301bf9051 Luca Benedetto <luca.benedetto93@gmail.com> 1699350949 +0000	commit: add arg for saving plots instead of showing them
251ccf8a3c164164a0603b173233aad301bf9051 aada3d79cbac58b560067ed50d40d5c1aa1422b9 Luca Benedetto <luca.benedetto93@gmail.com> 1699457616 +0000	checkout: moving from 231107-analyse-gpt-responses to main
aada3d79cbac58b560067ed50d40d5c1aa1422b9 f491aed7f228d7b7101f77069470d9047eddc66a Luca Benedetto <luca.benedetto93@gmail.com> 1699457620 +0000	pull: Fast-forward
f491aed7f228d7b7101f77069470d9047eddc66a f491aed7f228d7b7101f77069470d9047eddc66a Luca Benedetto <luca.benedetto93@gmail.com> 1699457640 +0000	checkout: moving from main to 231108-add-new-prompts
f491aed7f228d7b7101f77069470d9047eddc66a a2f02374fbf1dee550598915691dd4fdafd0b1d6 Luca Benedetto <luca.benedetto93@gmail.com> 1699458123 +0000	commit: add prompt 48
a2f02374fbf1dee550598915691dd4fdafd0b1d6 67c58482c12529e742e9d9849b02e96e6b754242 Luca Benedetto <luca.benedetto93@gmail.com> 1699458438 +0000	commit: add prompt 49
67c58482c12529e742e9d9849b02e96e6b754242 73f47870ba705c21e2d998991b7b9c2cdc375e53 Luca Benedetto <luca.benedetto93@gmail.com> 1699471004 +0000	commit: add print at the end of the script (prompt idx and dataset name)
73f47870ba705c21e2d998991b7b9c2cdc375e53 70a5c66fa9e77f6887fda871588add8294b3dc64 Luca Benedetto <luca.benedetto93@gmail.com> 1699481135 +0000	commit: add prompt 50
70a5c66fa9e77f6887fda871588add8294b3dc64 3f551fdc40f2de2b33ba9573b20829caea7294f2 Luca Benedetto <luca.benedetto93@gmail.com> 1699481512 +0000	commit: add prompt 52
3f551fdc40f2de2b33ba9573b20829caea7294f2 220908dbc3839aed840755db54b4a52f27552533 Luca Benedetto <luca.benedetto93@gmail.com> 1699481696 +0000	commit: add prompt 53
220908dbc3839aed840755db54b4a52f27552533 946c8e5305db5c7a332c1cc54bcf23fab8b998c8 Luca Benedetto <luca.benedetto93@gmail.com> 1699481998 +0000	commit: add prompt 51
946c8e5305db5c7a332c1cc54bcf23fab8b998c8 f491aed7f228d7b7101f77069470d9047eddc66a Luca Benedetto <luca.benedetto93@gmail.com> 1699531993 +0000	checkout: moving from 231108-add-new-prompts to main
f491aed7f228d7b7101f77069470d9047eddc66a 3a2f4c89324af53a7d6e4309ed32374110d21adb Luca Benedetto <luca.benedetto93@gmail.com> 1699531998 +0000	pull: Fast-forward
3a2f4c89324af53a7d6e4309ed32374110d21adb 3a2f4c89324af53a7d6e4309ed32374110d21adb Luca Benedetto <luca.benedetto93@gmail.com> 1699537606 +0000	checkout: moving from main to 231109-add-new-prompts
3a2f4c89324af53a7d6e4309ed32374110d21adb 9a14546cabb0ea26dbaebf0514f70273307520a0 Luca Benedetto <luca.benedetto93@gmail.com> 1699537675 +0000	commit: add prompt 54
9a14546cabb0ea26dbaebf0514f70273307520a0 87e014e10676b6552d4f9e134daa2e5002be8d8d Luca Benedetto <luca.benedetto93@gmail.com> 1699537722 +0000	commit: fix single "}" in new prompt
87e014e10676b6552d4f9e134daa2e5002be8d8d 3a2f4c89324af53a7d6e4309ed32374110d21adb Luca Benedetto <luca.benedetto93@gmail.com> 1699542403 +0000	checkout: moving from 231109-add-new-prompts to main
3a2f4c89324af53a7d6e4309ed32374110d21adb aa2defde5093a48903476a8b37b5aaff3d2c5e16 Luca Benedetto <luca.benedetto93@gmail.com> 1699542407 +0000	pull: Fast-forward
aa2defde5093a48903476a8b37b5aaff3d2c5e16 aa2defde5093a48903476a8b37b5aaff3d2c5e16 Luca Benedetto <luca.benedetto93@gmail.com> 1699542424 +0000	checkout: moving from main to 231109-refactor-scripts
aa2defde5093a48903476a8b37b5aaff3d2c5e16 f217d3125faf58d8e9b189e1443cd94775e7c0b1 Luca Benedetto <luca.benedetto93@gmail.com> 1699542683 +0000	commit: add final print to scripts
f217d3125faf58d8e9b189e1443cd94775e7c0b1 39f1e0f6bf85dca293d5bb8f5ab2144349e7423a Luca Benedetto <luca.benedetto93@gmail.com> 1699542983 +0000	commit: add the if name == main
39f1e0f6bf85dca293d5bb8f5ab2144349e7423a 873048af3da8fb6abf960ae05385f926b58c4369 Luca Benedetto <luca.benedetto93@gmail.com> 1699543004 +0000	commit: add the if name == main
873048af3da8fb6abf960ae05385f926b58c4369 7937e2d0effdaa6dea44545169ae84ab036f1871 Luca Benedetto <luca.benedetto93@gmail.com> 1699543045 +0000	commit: change how the folder name is selected
7937e2d0effdaa6dea44545169ae84ab036f1871 6339e92ccf4a6791f3d3a6733fc31f613eb59788 Luca Benedetto <luca.benedetto93@gmail.com> 1699543224 +0000	commit: minor refactor - change line order in initial definitions to be consistent
6339e92ccf4a6791f3d3a6733fc31f613eb59788 337b06382d817bf84b8b2f5466646dc41dcc72d0 Luca Benedetto <luca.benedetto93@gmail.com> 1699543596 +0000	commit: add the if name == main
337b06382d817bf84b8b2f5466646dc41dcc72d0 aa2defde5093a48903476a8b37b5aaff3d2c5e16 Luca Benedetto <luca.benedetto93@gmail.com> 1699543761 +0000	checkout: moving from 231109-refactor-scripts to main
aa2defde5093a48903476a8b37b5aaff3d2c5e16 70b8fb683c3aaeee8f6338562b9e7fc14f6f009a Luca Benedetto <luca.benedetto93@gmail.com> 1699543765 +0000	pull: Fast-forward
70b8fb683c3aaeee8f6338562b9e7fc14f6f009a 70b8fb683c3aaeee8f6338562b9e7fc14f6f009a Luca Benedetto <luca.benedetto93@gmail.com> 1699550078 +0000	checkout: moving from main to 23-11-09-add-new-scripts
70b8fb683c3aaeee8f6338562b9e7fc14f6f009a 787b1db81b4f6503723e72ada7c9ba59b374a4f4 Luca Benedetto <luca.benedetto93@gmail.com> 1699550330 +0000	commit: add prompt 55
787b1db81b4f6503723e72ada7c9ba59b374a4f4 f6f8755e18b057539a01c17d36b8d3629f7ab79a Luca Benedetto <luca.benedetto93@gmail.com> 1699550479 +0000	commit: add prompt 56
f6f8755e18b057539a01c17d36b8d3629f7ab79a 70b8fb683c3aaeee8f6338562b9e7fc14f6f009a Luca Benedetto <luca.benedetto93@gmail.com> 1699554182 +0000	checkout: moving from 23-11-09-add-new-scripts to main
70b8fb683c3aaeee8f6338562b9e7fc14f6f009a 38943636cce53f41693cca2b58fb05f291ed8e32 Luca Benedetto <luca.benedetto93@gmail.com> 1699554187 +0000	pull: Fast-forward
38943636cce53f41693cca2b58fb05f291ed8e32 d0f559c95d0b5581c0bca35446948a8ed5ab38f9 Luca Benedetto <luca.benedetto93@gmail.com> 1699611369 +0000	commit: remove print
d0f559c95d0b5581c0bca35446948a8ed5ab38f9 d0f559c95d0b5581c0bca35446948a8ed5ab38f9 Luca Benedetto <luca.benedetto93@gmail.com> 1699617539 +0000	checkout: moving from main to 23-11-10-refactor-utils-llama
d0f559c95d0b5581c0bca35446948a8ed5ab38f9 d3d3feeb1551cb993b32230bae7dad90f990b4ef Luca Benedetto <luca.benedetto93@gmail.com> 1699617742 +0000	commit: add new args to method
d3d3feeb1551cb993b32230bae7dad90f990b4ef 4a8f047bdac6217b2a3d37cae00214dbb98222dd Luca Benedetto <luca.benedetto93@gmail.com> 1699617759 +0000	commit: use new args and move constants out of function
4a8f047bdac6217b2a3d37cae00214dbb98222dd c640a77bb0375175ff56817f5e77edb39d2a9777 Luca Benedetto <luca.benedetto93@gmail.com> 1699617781 +0000	commit: move constants out of function def
c640a77bb0375175ff56817f5e77edb39d2a9777 d0f559c95d0b5581c0bca35446948a8ed5ab38f9 Luca Benedetto <luca.benedetto93@gmail.com> 1699617865 +0000	checkout: moving from 23-11-10-refactor-utils-llama to main
d0f559c95d0b5581c0bca35446948a8ed5ab38f9 2afc06c6841051d0626abf713480ebf13eaf6abe Luca Benedetto <luca.benedetto93@gmail.com> 1699617869 +0000	pull: Fast-forward
2afc06c6841051d0626abf713480ebf13eaf6abe 2afc06c6841051d0626abf713480ebf13eaf6abe Luca Benedetto <luca.benedetto93@gmail.com> 1699876742 +0000	checkout: moving from main to 23-11-13-new-prompt
2afc06c6841051d0626abf713480ebf13eaf6abe ccd924d5a8a0a34f9ab2dcbd420123601260d492 Luca Benedetto <luca.benedetto93@gmail.com> 1699876854 +0000	commit: add prompt 57
ccd924d5a8a0a34f9ab2dcbd420123601260d492 2afc06c6841051d0626abf713480ebf13eaf6abe Luca Benedetto <luca.benedetto93@gmail.com> 1699879414 +0000	checkout: moving from 23-11-13-new-prompt to main
2afc06c6841051d0626abf713480ebf13eaf6abe f25c6f631fb6a852db0e8fd1ebfac99a5abcdcc6 Luca Benedetto <luca.benedetto93@gmail.com> 1699879420 +0000	pull: Fast-forward
f25c6f631fb6a852db0e8fd1ebfac99a5abcdcc6 f25c6f631fb6a852db0e8fd1ebfac99a5abcdcc6 Luca Benedetto <luca.benedetto93@gmail.com> 1699953463 +0000	checkout: moving from main to 23-11-14-refactor-output-folders-for-llama-and-add-cupa-results
f25c6f631fb6a852db0e8fd1ebfac99a5abcdcc6 ba4c1f3c6ad34f51b324419d12c0b03037aefad9 Luca Benedetto <luca.benedetto93@gmail.com> 1699953770 +0000	commit: add method to get model for hf from name
ba4c1f3c6ad34f51b324419d12c0b03037aefad9 4b6b78762d4fae958b1db649267ad9680cb4089f Luca Benedetto <luca.benedetto93@gmail.com> 1699953785 +0000	commit: add constants for llama2 model names
4b6b78762d4fae958b1db649267ad9680cb4089f 66c0ac1ba1e82afca8c5c5f86022333f406a774e Luca Benedetto <luca.benedetto93@gmail.com> 1699953812 +0000	commit: use new methods in the script and change output dir
66c0ac1ba1e82afca8c5c5f86022333f406a774e 50f843bbdb92931f7753dda31d351110ed44d78b Luca Benedetto <luca.benedetto93@gmail.com> 1699976230 +0000	commit: add model name for gpt 3.5
50f843bbdb92931f7753dda31d351110ed44d78b 5fba8e9ed16e9531fa77fe751df1af460d1d59a3 Luca Benedetto <luca.benedetto93@gmail.com> 1699976244 +0000	commit: add constants for CUPA data
5fba8e9ed16e9531fa77fe751df1af460d1d59a3 35d1b5bef569f7e94da4e72c5caf35edf2325671 Luca Benedetto <luca.benedetto93@gmail.com> 1699976255 +0000	commit: add model name for gpt 3.5
35d1b5bef569f7e94da4e72c5caf35edf2325671 9d0424ece411ff877a984f2dfa8bb6231787079a Luca Benedetto <luca.benedetto93@gmail.com> 1699976564 +0000	commit: refactor eval script to account for model name
9d0424ece411ff877a984f2dfa8bb6231787079a 48dd0c91cd5e025bc1420b1ff9b4c61e9b7172f3 Luca Benedetto <luca.benedetto93@gmail.com> 1699978832 +0000	commit: add code for evaluating llama and cupa data
48dd0c91cd5e025bc1420b1ff9b4c61e9b7172f3 f25c6f631fb6a852db0e8fd1ebfac99a5abcdcc6 Luca Benedetto <luca.benedetto93@gmail.com> 1699978875 +0000	checkout: moving from 23-11-14-refactor-output-folders-for-llama-and-add-cupa-results to main
f25c6f631fb6a852db0e8fd1ebfac99a5abcdcc6 73cf0f4b4901376ebe41d79452b4a41b72847a2c Luca Benedetto <luca.benedetto93@gmail.com> 1699978880 +0000	pull: Fast-forward
73cf0f4b4901376ebe41d79452b4a41b72847a2c dd95ff2faa76e85f3da5ddd223bccfac7acee6a5 Luca Benedetto <luca.benedetto93@gmail.com> 1699979203 +0000	commit: add param for llama 70b
dd95ff2faa76e85f3da5ddd223bccfac7acee6a5 3184ff13e2f562eb87de1971420a082488f3dbc0 Luca Benedetto <luca.benedetto93@gmail.com> 1699980940 +0000	commit: test with 4bit
3184ff13e2f562eb87de1971420a082488f3dbc0 3184ff13e2f562eb87de1971420a082488f3dbc0 Luca Benedetto <luca.benedetto93@gmail.com> 1700062960 +0000	checkout: moving from main to 23-11-15-fix-llama-files-removing-70b
3184ff13e2f562eb87de1971420a082488f3dbc0 285eeffe0014cb3683f104fcbab2a7fd1667cb75 Luca Benedetto <luca.benedetto93@gmail.com> 1700063171 +0000	commit: remove llama 70b constant
285eeffe0014cb3683f104fcbab2a7fd1667cb75 3b07cddfdee2f487d71663aae65d813790d3a7b4 Luca Benedetto <luca.benedetto93@gmail.com> 1700063204 +0000	commit: remove 70b and fix if-elif-else
3b07cddfdee2f487d71663aae65d813790d3a7b4 c190c02ed9fdf494bc480ab83f89eeb8f0cde2f3 Luca Benedetto <luca.benedetto93@gmail.com> 1700063246 +0000	commit: remove 70b and tmp line
c190c02ed9fdf494bc480ab83f89eeb8f0cde2f3 3184ff13e2f562eb87de1971420a082488f3dbc0 Luca Benedetto <luca.benedetto93@gmail.com> 1700133192 +0000	checkout: moving from 23-11-15-fix-llama-files-removing-70b to main
3184ff13e2f562eb87de1971420a082488f3dbc0 c5233b757b21e3a2a0b30bf6e0cf71261d483e37 Luca Benedetto <luca.benedetto93@gmail.com> 1700133256 +0000	commit: fix gpt version
c5233b757b21e3a2a0b30bf6e0cf71261d483e37 c190c02ed9fdf494bc480ab83f89eeb8f0cde2f3 Luca Benedetto <luca.benedetto93@gmail.com> 1700133270 +0000	checkout: moving from main to 23-11-15-fix-llama-files-removing-70b
c190c02ed9fdf494bc480ab83f89eeb8f0cde2f3 cde8796cb5b5300b9595da91a50195b746f05e8c Luca Benedetto <luca.benedetto93@gmail.com> 1700214997 +0000	commit: rename llama chat constants
cde8796cb5b5300b9595da91a50195b746f05e8c f43585b4d8986960109d68a595d1fe03b8493fa8 Luca Benedetto <luca.benedetto93@gmail.com> 1700215220 +0000	commit: add constant for llama 13b (no chat)
f43585b4d8986960109d68a595d1fe03b8493fa8 c5233b757b21e3a2a0b30bf6e0cf71261d483e37 Luca Benedetto <luca.benedetto93@gmail.com> 1700215228 +0000	checkout: moving from 23-11-15-fix-llama-files-removing-70b to main
c5233b757b21e3a2a0b30bf6e0cf71261d483e37 b61a09a47648475039dac33f81aba8a686ce2a65 Luca Benedetto <luca.benedetto93@gmail.com> 1700215362 +0000	pull: Fast-forward
b61a09a47648475039dac33f81aba8a686ce2a65 b61a09a47648475039dac33f81aba8a686ce2a65 Luca Benedetto <luca.benedetto93@gmail.com> 1700228211 +0000	checkout: moving from main to 23-11-17-test-newer-gpt3-5
b61a09a47648475039dac33f81aba8a686ce2a65 8fe3186ae8a8cac1e5ebd51be3749a0735cd2665 Luca Benedetto <luca.benedetto93@gmail.com> 1700228752 +0000	commit: define constant for gpt-3.5-turbo-1106 and rename get_model method for llama
8fe3186ae8a8cac1e5ebd51be3749a0735cd2665 3b3cd69630db895ebb3a7fb7447053b51a2ebd2e Luca Benedetto <luca.benedetto93@gmail.com> 1700229265 +0000	commit: refactor validate_answer for gpt
3b3cd69630db895ebb3a7fb7447053b51a2ebd2e f2ca6afc735c1e7f98ca13fb9c89557226300348 Luca Benedetto <luca.benedetto93@gmail.com> 1700229307 +0000	commit: saving both raw answers and validated answers
f2ca6afc735c1e7f98ca13fb9c89557226300348 64bcb0a79200ebecede1af4ee7f28aa101087843 Luca Benedetto <luca.benedetto93@gmail.com> 1700242542 +0000	commit: add vicuna model to constants
64bcb0a79200ebecede1af4ee7f28aa101087843 e5af14b090e7b5e470f07accb3732a984428f570 Luca Benedetto <luca.benedetto93@gmail.com> 1700242566 +0000	commit: now using vicuna
e5af14b090e7b5e470f07accb3732a984428f570 007d19bf47e2a33d1018402f5b0bb6eabac076e5 Luca Benedetto <luca.benedetto93@gmail.com> 1700242594 +0000	commit: change return index values for errors to identify different errors
007d19bf47e2a33d1018402f5b0bb6eabac076e5 7cf07875afbea1db8b99df8268885d84fad2c99c Luca Benedetto <luca.benedetto93@gmail.com> 1700242615 +0000	commit: add vicuna model
7cf07875afbea1db8b99df8268885d84fad2c99c 0b30bc54a75a131d5c02696baeffb6fdab89925b Luca Benedetto <luca.benedetto93@gmail.com> 1700242636 +0000	commit: add vicuna model
0b30bc54a75a131d5c02696baeffb6fdab89925b 253d40f2d076a4c46843cca6bb92b2cdd9499d50 Luca Benedetto <luca.benedetto93@gmail.com> 1700242697 +0000	commit: run on race prompt 40
253d40f2d076a4c46843cca6bb92b2cdd9499d50 cd34f1eaa97527e7c73026c0a65db1aa9f9f5f48 Luca Benedetto <luca.benedetto93@gmail.com> 1700253918 +0000	commit: rename utils file
cd34f1eaa97527e7c73026c0a65db1aa9f9f5f48 f056f4b5da1564eeb2980e95d3489860b32acd5e Luca Benedetto <luca.benedetto93@gmail.com> 1700253956 +0000	commit: move back check on output
f056f4b5da1564eeb2980e95d3489860b32acd5e d14078baa4f482e79b01bd510909561f97952c61 Luca Benedetto <luca.benedetto93@gmail.com> 1700253983 +0000	commit: change method to consider multiple hf models
d14078baa4f482e79b01bd510909561f97952c61 3ab61ae504ece7370943d8a18aa23eebe9c8ea68 Luca Benedetto <luca.benedetto93@gmail.com> 1700254005 +0000	commit: add check on differen error codes in index
3ab61ae504ece7370943d8a18aa23eebe9c8ea68 77d9164d19d39c3cfaa13a22c0fe1ade4a3575a9 Luca Benedetto <luca.benedetto93@gmail.com> 1700254030 +0000	commit: add method for getting vicuna prompt
77d9164d19d39c3cfaa13a22c0fe1ade4a3575a9 fed390717b1a9a5905a2bc6ab7c2bff295f02176 Luca Benedetto <luca.benedetto93@gmail.com> 1700254071 +0000	commit: rename script
fed390717b1a9a5905a2bc6ab7c2bff295f02176 095f048a515427542ec2e7998313e7445ea4ec54 Luca Benedetto <luca.benedetto93@gmail.com> 1700604960 +0000	commit: add explicit indexes for vicuna model
095f048a515427542ec2e7998313e7445ea4ec54 8b19246d689e180a64855c810d6f660257b03e2b Luca Benedetto <luca.benedetto93@gmail.com> 1700606217 +0000	commit: fix enumerate in for loop
8b19246d689e180a64855c810d6f660257b03e2b a5aefc4cb05655aa44ae44805708bb30736e24e0 Luca Benedetto <luca.benedetto93@gmail.com> 1700906270 +0000	commit: add prompt 58
a5aefc4cb05655aa44ae44805708bb30736e24e0 b61a09a47648475039dac33f81aba8a686ce2a65 Luca Benedetto <luca.benedetto93@gmail.com> 1700906541 +0000	checkout: moving from 23-11-17-test-newer-gpt3-5 to main
b61a09a47648475039dac33f81aba8a686ce2a65 03a4d12453f58c7ea974bb2ac7c4e7bd54a4100d Luca Benedetto <luca.benedetto93@gmail.com> 1700906545 +0000	pull: Fast-forward
03a4d12453f58c7ea974bb2ac7c4e7bd54a4100d 03a4d12453f58c7ea974bb2ac7c4e7bd54a4100d Luca Benedetto <luca.benedetto93@gmail.com> 1700906615 +0000	checkout: moving from main to 23-11-25-save-gpt-raw-outputs
03a4d12453f58c7ea974bb2ac7c4e7bd54a4100d 8d32ce8917022a6a9447fa32827b855f409f2091 Luca Benedetto <luca.benedetto93@gmail.com> 1700907758 +0000	commit: add prompt 59
8d32ce8917022a6a9447fa32827b855f409f2091 c4231186ec75fd0eb1cc34181c32e3cd677910f9 Luca Benedetto <luca.benedetto93@gmail.com> 1700907796 +0000	commit: remove validate_answer from method to get gpt response, to save raw answer
c4231186ec75fd0eb1cc34181c32e3cd677910f9 9c1278483ff70ad067c882006e24f21ad92c07f9 Luca Benedetto <luca.benedetto93@gmail.com> 1700907846 +0000	commit: now saving raw answer
9c1278483ff70ad067c882006e24f21ad92c07f9 fe8d7f4974ba09d7dafd0b02d7307582341506a6 Luca Benedetto <luca.benedetto93@gmail.com> 1700911072 +0000	commit: fix import name
fe8d7f4974ba09d7dafd0b02d7307582341506a6 cedae46b880942780d2e84b27574a6780112de25 Luca Benedetto <luca.benedetto93@gmail.com> 1701081659 +0000	commit: add prompt 60, which uses cefr levels with explanation from council of europe
cedae46b880942780d2e84b27574a6780112de25 f5a67cf0ac9051b3e988e3cfb565b1c0178e6517 Luca Benedetto <luca.benedetto93@gmail.com> 1701081811 +0000	commit: add evaluation for newer gpt-3.5
f5a67cf0ac9051b3e988e3cfb565b1c0178e6517 03a4d12453f58c7ea974bb2ac7c4e7bd54a4100d Luca Benedetto <luca.benedetto93@gmail.com> 1701082353 +0000	checkout: moving from 23-11-25-save-gpt-raw-outputs to main
03a4d12453f58c7ea974bb2ac7c4e7bd54a4100d 2954289392c2b9988553a13534b606fcc4d1e077 Luca Benedetto <luca.benedetto93@gmail.com> 1701082357 +0000	pull: Fast-forward
2954289392c2b9988553a13534b606fcc4d1e077 2954289392c2b9988553a13534b606fcc4d1e077 Luca Benedetto <luca.benedetto93@gmail.com> 1701099847 +0000	checkout: moving from main to 23-11-17-new-version-vicuna-methods
2954289392c2b9988553a13534b606fcc4d1e077 15e2299ae93576a580839fe7d6dd30017132bc33 Luca Benedetto <luca.benedetto93@gmail.com> 1701100160 +0000	commit: update code for running vicuna
15e2299ae93576a580839fe7d6dd30017132bc33 0d2a3da4929a248cfd06052f3353fca15921efa7 Luca Benedetto <luca.benedetto93@gmail.com> 1701100405 +0000	commit: update code for running vicuna
0d2a3da4929a248cfd06052f3353fca15921efa7 bbd3948e1dfdc5be45e964e3a6669dbaaf8fca6d Luca Benedetto <luca.benedetto93@gmail.com> 1701101900 +0000	commit: update code for running vicuna
bbd3948e1dfdc5be45e964e3a6669dbaaf8fca6d 6cac12ec3b7b4acd49f510473cb93da0869e1ada Luca Benedetto <luca.benedetto93@gmail.com> 1701101911 +0000	commit: update code for running vicuna
6cac12ec3b7b4acd49f510473cb93da0869e1ada c9af8bc8bab8a1c3ec9881510e5f68e042d42c98 Luca Benedetto <luca.benedetto93@gmail.com> 1701102080 +0000	commit: update code for running vicuna
c9af8bc8bab8a1c3ec9881510e5f68e042d42c98 201ee4ecf2b586ca94e0f4f491ecfea4ec1ed3ff Luca Benedetto <luca.benedetto93@gmail.com> 1701102251 +0000	commit: change params to fix warnings
201ee4ecf2b586ca94e0f4f491ecfea4ec1ed3ff c102afb0982b8a38b84808199309d55b3a7052e2 Luca Benedetto <luca.benedetto93@gmail.com> 1701103689 +0000	commit: remove explicit indexes
c102afb0982b8a38b84808199309d55b3a7052e2 770412d424c7554377592ed681ebe70635569f89 Luca Benedetto <luca.benedetto93@gmail.com> 1701256779 +0000	commit: add eval for vicuna model
770412d424c7554377592ed681ebe70635569f89 79216ec95698154b4e2b4b741de9c9cd742339cb Luca Benedetto <luca.benedetto93@gmail.com> 1701257148 +0000	commit: add code to specify response format
79216ec95698154b4e2b4b741de9c9cd742339cb edd16d341ce1461e282444e51f5b92ba38c58288 Luca Benedetto <luca.benedetto93@gmail.com> 1701257171 +0000	commit: update methods to consider response format param for GPT
edd16d341ce1461e282444e51f5b92ba38c58288 334b6f27c78e04ff47029a20a82264350675d1b8 Luca Benedetto <luca.benedetto93@gmail.com> 1701257306 +0000	commit: add type hinting
334b6f27c78e04ff47029a20a82264350675d1b8 2954289392c2b9988553a13534b606fcc4d1e077 Luca Benedetto <luca.benedetto93@gmail.com> 1701261969 +0000	checkout: moving from 23-11-17-new-version-vicuna-methods to main
2954289392c2b9988553a13534b606fcc4d1e077 f213911f3642cfd08600ac1c972fc2d93d10a3f7 Luca Benedetto <luca.benedetto93@gmail.com> 1701261973 +0000	pull: Fast-forward
f213911f3642cfd08600ac1c972fc2d93d10a3f7 f213911f3642cfd08600ac1c972fc2d93d10a3f7 Luca Benedetto <luca.benedetto93@gmail.com> 1701340911 +0000	checkout: moving from main to 23-11-30-update-plot-methods
f213911f3642cfd08600ac1c972fc2d93d10a3f7 1bff12fc057ef86fdb7e6c1f066b63edcbb80e95 Luca Benedetto <luca.benedetto93@gmail.com> 1701341037 +0000	commit: add extension as argument
1bff12fc057ef86fdb7e6c1f066b63edcbb80e95 c127ae2802ca2cdc35045764d5234d0d4bf6f759 Luca Benedetto <luca.benedetto93@gmail.com> 1701341111 +0000	commit: reorder colors
c127ae2802ca2cdc35045764d5234d0d4bf6f759 32b35d7da716acef5c6c7e56333abe9ce114c074 Luca Benedetto <luca.benedetto93@gmail.com> 1701341337 +0000	commit: update plot QA accuracy, now with line plot instead of barplot
32b35d7da716acef5c6c7e56333abe9ce114c074 889844283ea3efecb2f52937e764ac5dda2eb844 Luca Benedetto <luca.benedetto93@gmail.com> 1701446705 +0000	commit: add cupa to utils file
889844283ea3efecb2f52937e764ac5dda2eb844 1916c7b54188b31f1e6393cd6f5607f26439d5b6 Luca Benedetto <luca.benedetto93@gmail.com> 1701446730 +0000	commit: add new method that gets all the variables needed for the plots
1916c7b54188b31f1e6393cd6f5607f26439d5b6 8aa0e695532e5b37e7a5e66ee162efb4ab122d15 Luca Benedetto <luca.benedetto93@gmail.com> 1701446752 +0000	commit: add first part of the plots for the paper
8aa0e695532e5b37e7a5e66ee162efb4ab122d15 f213911f3642cfd08600ac1c972fc2d93d10a3f7 Luca Benedetto <luca.benedetto93@gmail.com> 1701446809 +0000	checkout: moving from 23-11-30-update-plot-methods to main
f213911f3642cfd08600ac1c972fc2d93d10a3f7 f1dafebc42953bd931636682ad5885d12f684e04 Luca Benedetto <luca.benedetto93@gmail.com> 1701446813 +0000	pull: Fast-forward
f1dafebc42953bd931636682ad5885d12f684e04 f1dafebc42953bd931636682ad5885d12f684e04 Luca Benedetto <luca.benedetto93@gmail.com> 1701446828 +0000	checkout: moving from main to 23-12-01-add-plots-for-paper
f1dafebc42953bd931636682ad5885d12f684e04 97ae94799c33cfaf181f5b86d83904e95617c02b Luca Benedetto <luca.benedetto93@gmail.com> 1701447905 +0000	commit: add plot for mcqa analysis on exam marks
97ae94799c33cfaf181f5b86d83904e95617c02b 83566fa6fed6f730c4cfd032fdcd90d25f381f80 Luca Benedetto <luca.benedetto93@gmail.com> 1701448242 +0000	commit: add cupa to is reading question dict
83566fa6fed6f730c4cfd032fdcd90d25f381f80 30757cf79eb1c5e4da5f86862cdb66f533ca1710 Luca Benedetto <luca.benedetto93@gmail.com> 1701448256 +0000	commit: add cupa to get dataset method
30757cf79eb1c5e4da5f86862cdb66f533ca1710 a130665692947e009a465a94af7bee38faf03df6 Luca Benedetto <luca.benedetto93@gmail.com> 1701449109 +0000	commit: add cupa import
a130665692947e009a465a94af7bee38faf03df6 6c2a2d8d626170dffe43fbcd3b2d49c18f04dd2f Luca Benedetto <luca.benedetto93@gmail.com> 1701453441 +0000	commit: update plot for gpt3.5 vs. gpt3.5 1106
6c2a2d8d626170dffe43fbcd3b2d49c18f04dd2f 35acb48cc688e52a1107102778f241dad5e1c1a0 Luca Benedetto <luca.benedetto93@gmail.com> 1701453577 +0000	commit: comment unused lines
35acb48cc688e52a1107102778f241dad5e1c1a0 b321e668c9343f1af2592cec3124d980e6d7ad78 Luca Benedetto <luca.benedetto93@gmail.com> 1701454043 +0000	commit: add cupa to plot mcqa by exam grade
b321e668c9343f1af2592cec3124d980e6d7ad78 1aa59382d0c615619e26e3dee348930e8e37d249 Luca Benedetto <luca.benedetto93@gmail.com> 1701454357 +0000	commit: add except for KeyError
1aa59382d0c615619e26e3dee348930e8e37d249 ee10e932f13371d4069bc2907c46169953dbd795 Luca Benedetto <luca.benedetto93@gmail.com> 1701470030 +0000	commit: add plot for analysis language proficiency scales on CUPA
ee10e932f13371d4069bc2907c46169953dbd795 fb127a8e5545cf8d634626bf8fd3d91ca7999420 Luca Benedetto <luca.benedetto93@gmail.com> 1701515149 +0000	commit: remove difficulty levels as param and add difficulty_column, now infer the levels from the df column
fb127a8e5545cf8d634626bf8fd3d91ca7999420 d1b4f335f58e7844623ba629f7be08a08cf34d65 Luca Benedetto <luca.benedetto93@gmail.com> 1701515275 +0000	commit: add params to plot and or save figure
d1b4f335f58e7844623ba629f7be08a08cf34d65 530612703494ef6adb67c5485e4130e7ac91000d Luca Benedetto <luca.benedetto93@gmail.com> 1701515336 +0000	commit: comment unused constant
530612703494ef6adb67c5485e4130e7ac91000d 03cbca0809e76d8b32bb4eb8526b068a807e355b Luca Benedetto <luca.benedetto93@gmail.com> 1701515348 +0000	commit: comment unused constant
03cbca0809e76d8b32bb4eb8526b068a807e355b 66202312522d48c7ec35fbae945eaf6e1e80960f Luca Benedetto <luca.benedetto93@gmail.com> 1701515847 +0000	commit: add image about MCQA by role played level, separately for different target levels -- CUPA
66202312522d48c7ec35fbae945eaf6e1e80960f 0c199eb42bae86d87c2bb814c78de8eea382d949 Luca Benedetto <luca.benedetto93@gmail.com> 1701518679 +0000	commit: update get_difficulty_dict_from_df method, now can pass the column name
0c199eb42bae86d87c2bb814c78de8eea382d949 df2e5aaf19e10287d9d468515c516b9f9745a8b0 Luca Benedetto <luca.benedetto93@gmail.com> 1701518692 +0000	commit: add methods for analysis virtual pretesting
df2e5aaf19e10287d9d468515c516b9f9745a8b0 57e0a3abd308f4ab00c1e6d4b563df04eca13935 Luca Benedetto <luca.benedetto93@gmail.com> 1701521026 +0000	commit: fix plot to show difficulty instead of accuracy
57e0a3abd308f4ab00c1e6d4b563df04eca13935 f1dafebc42953bd931636682ad5885d12f684e04 Luca Benedetto <luca.benedetto93@gmail.com> 1701684179 +0000	checkout: moving from 23-12-01-add-plots-for-paper to main
f1dafebc42953bd931636682ad5885d12f684e04 de36b70316ba32d83c029c0cc57b5d85795978a0 Luca Benedetto <luca.benedetto93@gmail.com> 1701684184 +0000	pull: Fast-forward
de36b70316ba32d83c029c0cc57b5d85795978a0 de36b70316ba32d83c029c0cc57b5d85795978a0 Luca Benedetto <luca.benedetto93@gmail.com> 1701684201 +0000	checkout: moving from main to 23-12-04-refactor-data-collection
de36b70316ba32d83c029c0cc57b5d85795978a0 49ccac412795f7997349901baaf1f634b8633baa Luca Benedetto <luca.benedetto93@gmail.com> 1701684262 +0000	commit: rename param and add arg for split
49ccac412795f7997349901baaf1f634b8633baa c93eb2ad443df84feb436f7bf593917a70faf227 Luca Benedetto <luca.benedetto93@gmail.com> 1701684337 +0000	commit: refactor call to get_dataset method
c93eb2ad443df84feb436f7bf593917a70faf227 c191e04576beca6dff5d276ac8d434bcbb427686 Luca Benedetto <luca.benedetto93@gmail.com> 1701684483 +0000	commit: refactor method to store gpt answers
c191e04576beca6dff5d276ac8d434bcbb427686 e4be088f0f806ec3365266203b09f91d2ad5e002 Luca Benedetto <luca.benedetto93@gmail.com> 1701684537 +0000	commit: refactor method to store hf answers
e4be088f0f806ec3365266203b09f91d2ad5e002 790c94b24d3f9cca453555f90f72a708920cb112 Luca Benedetto <luca.benedetto93@gmail.com> 1701685221 +0000	commit: add retry to requirements
790c94b24d3f9cca453555f90f72a708920cb112 a4958fd66c09d1e059608a43f56f44718610bdcc Luca Benedetto <luca.benedetto93@gmail.com> 1701685339 +0000	commit: add retry to reduce max timeouts
a4958fd66c09d1e059608a43f56f44718610bdcc 650dafa4d4ed4bfeb06f57928421e2e1e6f5851f Luca Benedetto <luca.benedetto93@gmail.com> 1701685693 +0000	commit: add prompt 38 to release code
650dafa4d4ed4bfeb06f57928421e2e1e6f5851f 55d0d9941f648f35b8b9650691c617374a4b7634 Luca Benedetto <luca.benedetto93@gmail.com> 1701685827 +0000	commit: add prompt 37 to release code
55d0d9941f648f35b8b9650691c617374a4b7634 8ecb967fefd372c49f79ab0ab4678f0682c1e3dc Luca Benedetto <luca.benedetto93@gmail.com> 1701686040 +0000	commit: add prompt 36 to release code
8ecb967fefd372c49f79ab0ab4678f0682c1e3dc 6d84cc2fc6256ce104eccf4986f19624f699b653 Luca Benedetto <luca.benedetto93@gmail.com> 1701696522 +0000	commit: add split to output folder for figures
6d84cc2fc6256ce104eccf4986f19624f699b653 937fe17d9ddf729f14518b5c1015e75f24e41f09 Luca Benedetto <luca.benedetto93@gmail.com> 1701703113 +0000	commit: add info about split in variable names and get all info method
937fe17d9ddf729f14518b5c1015e75f24e41f09 a9d4cb8ab8c8eb7e44af6c8c41671cf7577d590f Luca Benedetto <luca.benedetto93@gmail.com> 1701703177 +0000	commit: add dev set to arc analysis
a9d4cb8ab8c8eb7e44af6c8c41671cf7577d590f 259df403068206a19e3b84cbfb9763d537b5a5e5 Luca Benedetto <luca.benedetto93@gmail.com> 1701703272 +0000	commit: add dev set to arc analysis
259df403068206a19e3b84cbfb9763d537b5a5e5 8232d74c6940e8b350d678353f510d0fa8e760d5 Luca Benedetto <luca.benedetto93@gmail.com> 1701703969 +0000	commit: change format of first two figures
8232d74c6940e8b350d678353f510d0fa8e760d5 018ea62720a783ca0b67d3990e8ccd2a35f09864 Luca Benedetto <luca.benedetto93@gmail.com> 1701704464 +0000	commit: redo plot for MCQA RACE on different levels
018ea62720a783ca0b67d3990e8ccd2a35f09864 28ed269456031ff1cd3f9d030a3fcd7707ff33ce Luca Benedetto <luca.benedetto93@gmail.com> 1701705351 +0000	commit: redo plot for MCQA on arc per level
28ed269456031ff1cd3f9d030a3fcd7707ff33ce 2b9e8b533bdec3201ca318b4fffd2569682478a8 Luca Benedetto <luca.benedetto93@gmail.com> 1701705433 +0000	commit: fix labels
2b9e8b533bdec3201ca318b4fffd2569682478a8 459d70941c7faf93696aba397facc8c5232c5e67 Luca Benedetto <luca.benedetto93@gmail.com> 1701708086 +0000	commit: add analysis gpt4
459d70941c7faf93696aba397facc8c5232c5e67 592da5a7485790152135b43b43d6c7abe795a200 Luca Benedetto <luca.benedetto93@gmail.com> 1701793695 +0000	commit: add test and dev split names to constants
592da5a7485790152135b43b43d6c7abe795a200 5714c45cc9ace37d8f539a595297ecee4a46d271 Luca Benedetto <luca.benedetto93@gmail.com> 1701793715 +0000	commit: use constant for split name
5714c45cc9ace37d8f539a595297ecee4a46d271 9c9f1f5ad8b003cb0d1bf4f26104ace5634a5f09 Luca Benedetto <luca.benedetto93@gmail.com> 1701793741 +0000	commit: add GPT4 to list of models managed with this code
9c9f1f5ad8b003cb0d1bf4f26104ace5634a5f09 7ef7ea90fbcc07764b2f096cef7f99d277ada3c4 Luca Benedetto <luca.benedetto93@gmail.com> 1701793783 +0000	commit: use constants for split name
7ef7ea90fbcc07764b2f096cef7f99d277ada3c4 de36b70316ba32d83c029c0cc57b5d85795978a0 Luca Benedetto <luca.benedetto93@gmail.com> 1701793862 +0000	checkout: moving from 23-12-04-refactor-data-collection to main
de36b70316ba32d83c029c0cc57b5d85795978a0 12418be34d8c72c72a272ae0f6b42dbc018c4310 Luca Benedetto <luca.benedetto93@gmail.com> 1701793869 +0000	pull: Fast-forward
12418be34d8c72c72a272ae0f6b42dbc018c4310 12418be34d8c72c72a272ae0f6b42dbc018c4310 Luca Benedetto <luca.benedetto93@gmail.com> 1701869349 +0000	checkout: moving from main to 23-12-06-update-plots-for-paper
12418be34d8c72c72a272ae0f6b42dbc018c4310 b15bd4e1abcd6fef80cde7cdc3f7134cab3edf95 Luca Benedetto <luca.benedetto93@gmail.com> 1701869631 +0000	commit: add plot for gpt4 analysis on RACE
b15bd4e1abcd6fef80cde7cdc3f7134cab3edf95 d290f7dd207aa44faa3cb88b82dcc3ee3ce91885 Luca Benedetto <luca.benedetto93@gmail.com> 1702245486 +0000	commit: add plot for additional analysis on ARC for the appendix -- MCQA accuracy on different (even) grade)
d290f7dd207aa44faa3cb88b82dcc3ee3ce91885 d7a4e7136e3175cef07b3eaa79302683e8e65789 Luca Benedetto <luca.benedetto93@gmail.com> 1702247374 +0000	commit: add plot for additional analysis language proficiency scales
d7a4e7136e3175cef07b3eaa79302683e8e65789 8f68c3977ff08e309f5419645e8faa1053570285 Luca Benedetto <luca.benedetto93@gmail.com> 1702250085 +0000	commit: reorder variable definitions
8f68c3977ff08e309f5419645e8faa1053570285 07478086c3e2682e83cc1266c5578aaf20babf0a Luca Benedetto <luca.benedetto93@gmail.com> 1702288029 +0000	commit: remove unused import
07478086c3e2682e83cc1266c5578aaf20babf0a 12418be34d8c72c72a272ae0f6b42dbc018c4310 Luca Benedetto <luca.benedetto93@gmail.com> 1702292846 +0000	checkout: moving from 23-12-06-update-plots-for-paper to main
12418be34d8c72c72a272ae0f6b42dbc018c4310 8e389dbf1d07b72a5f734a2fda1bf772f0f2279b Luca Benedetto <luca.benedetto93@gmail.com> 1702292851 +0000	pull: Fast-forward
8e389dbf1d07b72a5f734a2fda1bf772f0f2279b 8e389dbf1d07b72a5f734a2fda1bf772f0f2279b Luca Benedetto <luca.benedetto93@gmail.com> 1702293051 +0000	checkout: moving from main to 23-12-11-repeat-prompts-on-dev-set
8e389dbf1d07b72a5f734a2fda1bf772f0f2279b 8e389dbf1d07b72a5f734a2fda1bf772f0f2279b Luca Benedetto <luca.benedetto93@gmail.com> 1702293151 +0000	checkout: moving from 23-12-11-repeat-prompts-on-dev-set to main
8e389dbf1d07b72a5f734a2fda1bf772f0f2279b a6deb22561f75ead6a2410ac9f39d884ecdc3239 Luca Benedetto <luca.benedetto93@gmail.com> 1702293171 +0000	commit: fix typo
a6deb22561f75ead6a2410ac9f39d884ecdc3239 8e389dbf1d07b72a5f734a2fda1bf772f0f2279b Luca Benedetto <luca.benedetto93@gmail.com> 1702293178 +0000	checkout: moving from main to 23-12-11-repeat-prompts-on-dev-set
8e389dbf1d07b72a5f734a2fda1bf772f0f2279b a6deb22561f75ead6a2410ac9f39d884ecdc3239 Luca Benedetto <luca.benedetto93@gmail.com> 1702293186 +0000	merge main: Fast-forward
a6deb22561f75ead6a2410ac9f39d884ecdc3239 2af2d371ffa7cb4d8193395f7e5ec655c4a8da6b Luca Benedetto <luca.benedetto93@gmail.com> 1702293361 +0000	commit: add prompt 35
2af2d371ffa7cb4d8193395f7e5ec655c4a8da6b 5a8c323bb3f046039ecd4ff6fe83a7ef66348b9c Luca Benedetto <luca.benedetto93@gmail.com> 1702293427 +0000	commit: add prompt 32
5a8c323bb3f046039ecd4ff6fe83a7ef66348b9c c5c2383a962ffd4d1e6603c2315a2f82c1b5781e Luca Benedetto <luca.benedetto93@gmail.com> 1702293554 +0000	commit: add prompt 31
c5c2383a962ffd4d1e6603c2315a2f82c1b5781e 4e236026be085c1e304df153b59f6b7b3bc5b4f7 Luca Benedetto <luca.benedetto93@gmail.com> 1702294224 +0000	commit: add prompt 28
4e236026be085c1e304df153b59f6b7b3bc5b4f7 a6deb22561f75ead6a2410ac9f39d884ecdc3239 Luca Benedetto <luca.benedetto93@gmail.com> 1702294345 +0000	checkout: moving from 23-12-11-repeat-prompts-on-dev-set to main
a6deb22561f75ead6a2410ac9f39d884ecdc3239 eaf974a615da16a97a53e7efe6f1058aa5bbbcae Luca Benedetto <luca.benedetto93@gmail.com> 1702294349 +0000	pull: Fast-forward
eaf974a615da16a97a53e7efe6f1058aa5bbbcae eaf974a615da16a97a53e7efe6f1058aa5bbbcae Luca Benedetto <luca.benedetto93@gmail.com> 1702294360 +0000	checkout: moving from main to 23-12-11-remove-huggingface-code
eaf974a615da16a97a53e7efe6f1058aa5bbbcae 94408d101fcd7451e79501a7dae328f1f1e18bc9 Luca Benedetto <luca.benedetto93@gmail.com> 1702294401 +0000	commit: remove huggingface requirements
94408d101fcd7451e79501a7dae328f1f1e18bc9 ea09da27e66fb761f98c4c7e8c5d14b4910b41d6 Luca Benedetto <luca.benedetto93@gmail.com> 1702294441 +0000	commit: remove script for running huggingface models
ea09da27e66fb761f98c4c7e8c5d14b4910b41d6 ccaeecbd838af55fdee9686e31f2dab5a4f20559 Luca Benedetto <luca.benedetto93@gmail.com> 1702294470 +0000	commit: remove utils file for huggingface models
ccaeecbd838af55fdee9686e31f2dab5a4f20559 2d356686d4b76a6dd5972da18f52bb4dbfec3cdc Luca Benedetto <luca.benedetto93@gmail.com> 1702294515 +0000	commit: remove constants for huggingface models
2d356686d4b76a6dd5972da18f52bb4dbfec3cdc b8e2d7f02437e35f7079df2fb952baa21f261169 Luca Benedetto <luca.benedetto93@gmail.com> 1702294570 +0000	commit: remove script for "generic" plots
b8e2d7f02437e35f7079df2fb952baa21f261169 eaf974a615da16a97a53e7efe6f1058aa5bbbcae Luca Benedetto <luca.benedetto93@gmail.com> 1702294911 +0000	checkout: moving from 23-12-11-remove-huggingface-code to main
eaf974a615da16a97a53e7efe6f1058aa5bbbcae 9c8b06e6a32280907b89c1da66953d44613b88f6 Luca Benedetto <luca.benedetto93@gmail.com> 1702294916 +0000	pull: Fast-forward
9c8b06e6a32280907b89c1da66953d44613b88f6 9c8b06e6a32280907b89c1da66953d44613b88f6 Luca Benedetto <luca.benedetto93@gmail.com> 1702302147 +0000	checkout: moving from main to 23-12-11-add-new-plots-to-paper
9c8b06e6a32280907b89c1da66953d44613b88f6 debee625f7783f58ba4e54357ccb9ba7a5e11618 Luca Benedetto <luca.benedetto93@gmail.com> 1702303958 +0000	commit: add plot for copmarison prompts on dev set arc
debee625f7783f58ba4e54357ccb9ba7a5e11618 ab42559ed8aaf3f4939e189e826f402e05ae75ae Luca Benedetto <luca.benedetto93@gmail.com> 1702304081 +0000	commit: fix size of virtual pretesting plot
ab42559ed8aaf3f4939e189e826f402e05ae75ae e0350990eb82347151a4f05a312c7b54de7b5587 Luca Benedetto <luca.benedetto93@gmail.com> 1702458674 +0000	commit: add analysis GPT-4 on CUPA
e0350990eb82347151a4f05a312c7b54de7b5587 0e8f5998f91ba9d79abdf4a063d455fb2dd4d13e Luca Benedetto <luca.benedetto93@gmail.com> 1702464786 +0000	commit: add plot for analysis qualitative scale - beginner intermediate advanced
0e8f5998f91ba9d79abdf4a063d455fb2dd4d13e a62e8593ae02b91194cd9ca485f53e39922ff924 Luca Benedetto <luca.benedetto93@gmail.com> 1702553431 +0000	commit: fix plot comparison two versions gpt-3.5
a62e8593ae02b91194cd9ca485f53e39922ff924 7ff1c5d290460a6ac694753dcca1ef4e8e0dd0c7 Luca Benedetto <luca.benedetto93@gmail.com> 1702553472 +0000	commit: fix comment
7ff1c5d290460a6ac694753dcca1ef4e8e0dd0c7 3dd1f04b9c2d35add3a318c0158623897a2981de Luca Benedetto <luca.benedetto93@gmail.com> 1702553577 +0000	commit: refactor plot analysis language proficiency scales
3dd1f04b9c2d35add3a318c0158623897a2981de 1a13922f8ad2327f57fad442ff550f6ed570dcef Luca Benedetto <luca.benedetto93@gmail.com> 1702566238 +0000	commit: update scatter plot for virtual pretesting cupa
1a13922f8ad2327f57fad442ff550f6ed570dcef ebe036b4cc1de9824c2362bdd88885683f2dd4d6 Luca Benedetto <luca.benedetto93@gmail.com> 1702567036 +0000	commit: add scipy to compute correlation
ebe036b4cc1de9824c2362bdd88885683f2dd4d6 9df48b9035a429e6e644b7c30a214917627d42e0 Luca Benedetto <luca.benedetto93@gmail.com> 1702568014 +0000	commit: add scipy to compute correlation
9df48b9035a429e6e644b7c30a214917627d42e0 b3619a096ef0b345b7ba94c03dec9284e93af857 Luca Benedetto <luca.benedetto93@gmail.com> 1702568123 +0000	commit: uncomment line to save plot
b3619a096ef0b345b7ba94c03dec9284e93af857 3fbc4e272c54f747074b39d3b4eefa46863be2a1 Luca Benedetto <luca.benedetto93@gmail.com> 1702568156 +0000	commit: move code for computing correlation
3fbc4e272c54f747074b39d3b4eefa46863be2a1 7ec802ca090456b80014f671b7641099b7778833 Luca Benedetto <luca.benedetto93@gmail.com> 1702570863 +0000	commit: add simulation with IRT
7ec802ca090456b80014f671b7641099b7778833 9c8b06e6a32280907b89c1da66953d44613b88f6 Luca Benedetto <luca.benedetto93@gmail.com> 1702570872 +0000	checkout: moving from 23-12-11-add-new-plots-to-paper to main
9c8b06e6a32280907b89c1da66953d44613b88f6 4406abbdb3087250b632276e0261e4f727f63c99 Luca Benedetto <luca.benedetto93@gmail.com> 1702570915 +0000	pull: Fast-forward
4406abbdb3087250b632276e0261e4f727f63c99 a571fecf7f76e650f0470c366a2ad0e94bf0b8fb Luca Benedetto <luca.benedetto93@gmail.com> 1702571246 +0000	commit: fix folder path
a571fecf7f76e650f0470c366a2ad0e94bf0b8fb 88c681514f316645dd7bfbbb6f320f58d13c22ec Luca Benedetto <luca.benedetto93@gmail.com> 1702571261 +0000	commit: fix param
88c681514f316645dd7bfbbb6f320f58d13c22ec 7ec802ca090456b80014f671b7641099b7778833 Luca Benedetto <luca.benedetto93@gmail.com> 1702572851 +0000	checkout: moving from main to 23-12-11-add-new-plots-to-paper
7ec802ca090456b80014f671b7641099b7778833 88c681514f316645dd7bfbbb6f320f58d13c22ec Luca Benedetto <luca.benedetto93@gmail.com> 1702572965 +0000	checkout: moving from 23-12-11-add-new-plots-to-paper to main
88c681514f316645dd7bfbbb6f320f58d13c22ec f86bc81569b68b2fad30773d9abd660a7d83c498 Luca Benedetto <luca.benedetto93@gmail.com> 1702572977 +0000	checkout: moving from main to gpt-answers
f86bc81569b68b2fad30773d9abd660a7d83c498 ee583c51ae51c0aa73da714e558fc4014e54ef9a Luca Benedetto <luca.benedetto93@gmail.com> 1702572984 +0000	pull: Fast-forward
ee583c51ae51c0aa73da714e558fc4014e54ef9a 2ffbae3c62d4b077e9926f8e047eff9717d24c78 Luca Benedetto <luca.benedetto93@gmail.com> 1702573103 +0000	commit: fix conflicts
2ffbae3c62d4b077e9926f8e047eff9717d24c78 88c681514f316645dd7bfbbb6f320f58d13c22ec Luca Benedetto <luca.benedetto93@gmail.com> 1702573364 +0000	checkout: moving from gpt-answers to main
88c681514f316645dd7bfbbb6f320f58d13c22ec 208e2aa5706c52543c3468c48a6d0627541d76cc Luca Benedetto <luca.benedetto93@gmail.com> 1702573368 +0000	pull: Fast-forward
208e2aa5706c52543c3468c48a6d0627541d76cc 7ec802ca090456b80014f671b7641099b7778833 Luca Benedetto <luca.benedetto93@gmail.com> 1702573371 +0000	checkout: moving from main to 23-12-11-add-new-plots-to-paper
7ec802ca090456b80014f671b7641099b7778833 24b7cf2d5b1ba54b3cf73efc7384f3e1b89b3894 Luca Benedetto <luca.benedetto93@gmail.com> 1702573404 +0000	commit: remove commented lines
24b7cf2d5b1ba54b3cf73efc7384f3e1b89b3894 208e2aa5706c52543c3468c48a6d0627541d76cc Luca Benedetto <luca.benedetto93@gmail.com> 1702573471 +0000	checkout: moving from 23-12-11-add-new-plots-to-paper to main
208e2aa5706c52543c3468c48a6d0627541d76cc ed4144cb97cace05bb83f05b3d0401165f2b121a Luca Benedetto <luca.benedetto93@gmail.com> 1702573475 +0000	pull: Fast-forward
ed4144cb97cace05bb83f05b3d0401165f2b121a ed4144cb97cace05bb83f05b3d0401165f2b121a Luca Benedetto <luca.benedetto93@gmail.com> 1702638249 +0000	checkout: moving from main to 23-12-15-add-prompts-gpt-no-simulated-level
ed4144cb97cace05bb83f05b3d0401165f2b121a 27dab50ce74e6acb404d385e70c8f9064c6a1ea7 Luca Benedetto <luca.benedetto93@gmail.com> 1702638553 +0000	commit: added new prompts to the utils file
27dab50ce74e6acb404d385e70c8f9064c6a1ea7 f20761d397bd5216fa6f92a26aaea451c85026f7 Luca Benedetto <luca.benedetto93@gmail.com> 1702638795 +0000	commit: remove comments
f20761d397bd5216fa6f92a26aaea451c85026f7 a1e31a772edf4e3d7d860f3f8e0431cd3e33b4c2 Luca Benedetto <luca.benedetto93@gmail.com> 1702639266 +0000	commit: remove unused method
a1e31a772edf4e3d7d860f3f8e0431cd3e33b4c2 ed4144cb97cace05bb83f05b3d0401165f2b121a Luca Benedetto <luca.benedetto93@gmail.com> 1702653180 +0000	checkout: moving from 23-12-15-add-prompts-gpt-no-simulated-level to main
ed4144cb97cace05bb83f05b3d0401165f2b121a faeaa3c29c80a986f8d1bdb12ac5c31dd73f6b3f Luca Benedetto <luca.benedetto93@gmail.com> 1702653185 +0000	pull: Fast-forward
faeaa3c29c80a986f8d1bdb12ac5c31dd73f6b3f faeaa3c29c80a986f8d1bdb12ac5c31dd73f6b3f Luca Benedetto <luca.benedetto93@gmail.com> 1702653212 +0000	checkout: moving from main to 23-12-15-last-edits-to-share-code
faeaa3c29c80a986f8d1bdb12ac5c31dd73f6b3f 4f329cb0fa96beda73f4b866fb8be2ac90861726 Luca Benedetto <luca.benedetto93@gmail.com> 1702653283 +0000	commit: add script to perform the general evaluation of the LLM responses
4f329cb0fa96beda73f4b866fb8be2ac90861726 7cc4d6effe0c9f967ab9ec3886edcee597ac86d3 Luca Benedetto <luca.benedetto93@gmail.com> 1702653323 +0000	commit: remove commented line
7cc4d6effe0c9f967ab9ec3886edcee597ac86d3 737cf4bacbe47ad183067063aa6e787c60dad29d Luca Benedetto <luca.benedetto93@gmail.com> 1702653346 +0000	commit: remove unused variable
737cf4bacbe47ad183067063aa6e787c60dad29d 94d55a2fff72ea76574efe6efdeee1878a94006e Luca Benedetto <luca.benedetto93@gmail.com> 1702653363 +0000	commit: remove unused import
94d55a2fff72ea76574efe6efdeee1878a94006e 6e8ec275dbbfb1b09eccadf8c6e6a6ee214e9a5d Luca Benedetto <luca.benedetto93@gmail.com> 1702653457 +0000	commit: remove comments and unused variables
6e8ec275dbbfb1b09eccadf8c6e6a6ee214e9a5d 6474a84bb0518774d4c571beb1e6602fbd8860a0 Luca Benedetto <luca.benedetto93@gmail.com> 1702656340 +0000	commit: change import api key
6474a84bb0518774d4c571beb1e6602fbd8860a0 16177236a46543af0757d0016544fc7d8c7228f1 Luca Benedetto <luca.benedetto93@gmail.com> 1702656472 +0000	commit: add first version readme
16177236a46543af0757d0016544fc7d8c7228f1 faeaa3c29c80a986f8d1bdb12ac5c31dd73f6b3f Luca Benedetto <luca.benedetto93@gmail.com> 1702656679 +0000	checkout: moving from 23-12-15-last-edits-to-share-code to main
faeaa3c29c80a986f8d1bdb12ac5c31dd73f6b3f 38b932534bf9a4acaeecff857991bded082909e6 Luca Benedetto <luca.benedetto93@gmail.com> 1702656683 +0000	pull: Fast-forward
38b932534bf9a4acaeecff857991bded082909e6 38b932534bf9a4acaeecff857991bded082909e6 Luca Benedetto <luca.benedetto93@gmail.com> 1709562190 +0000	checkout: moving from main to 24_03_04_experiments_for_cae_paper
38b932534bf9a4acaeecff857991bded082909e6 8e1cdd675a8d8260287b00fabb8a95d3ea156a73 Luca Benedetto <luca.benedetto93@gmail.com> 1709562559 +0000	commit: add prompts for additional experiments
8e1cdd675a8d8260287b00fabb8a95d3ea156a73 c0647284279afb5ea07314d3c6a3bd848ad3e4ae Luca Benedetto <luca.benedetto93@gmail.com> 1709563318 +0000	commit: commented unfinished prompt
c0647284279afb5ea07314d3c6a3bd848ad3e4ae 5b5924b421c8c1c04738786dde6c98bedb68db05 Luca Benedetto <luca.benedetto93@gmail.com> 1709564037 +0000	commit: add explicit name of CEFR
5b5924b421c8c1c04738786dde6c98bedb68db05 126450b6a6246ea7c39b49efefdccf9c4934443d Luca Benedetto <luca.benedetto93@gmail.com> 1709631843 +0000	commit: add student levels for new prompts
126450b6a6246ea7c39b49efefdccf9c4934443d b220fc1fe82d595191396d2a99abb91db2da831c Luca Benedetto <luca.benedetto93@gmail.com> 1709631859 +0000	commit: add prompts for classification reading passage
b220fc1fe82d595191396d2a99abb91db2da831c 8f87f470abe847412da6f73184c1e6a5389cc1df Luca Benedetto <luca.benedetto93@gmail.com> 1709631909 +0000	commit: add mock student level for new prompts
8f87f470abe847412da6f73184c1e6a5389cc1df d039ba190963c84e0a971d99735aac58d1fc537a Luca Benedetto <luca.benedetto93@gmail.com> 1709632226 +0000	commit: add list of acceptable values
d039ba190963c84e0a971d99735aac58d1fc537a a4e0c6626c6781eba56a4de2f03dd5a8b7d9c67a Luca Benedetto <luca.benedetto93@gmail.com> 1709730813 +0000	commit: add tmp code for classification of reading passages levels
a4e0c6626c6781eba56a4de2f03dd5a8b7d9c67a f827c8278d15701c14852a46e997c3db57c4656f Luca Benedetto <luca.benedetto93@gmail.com> 1709730898 +0000	commit: add tmp code for using azure cupa key
f827c8278d15701c14852a46e997c3db57c4656f 38b932534bf9a4acaeecff857991bded082909e6 Luca Benedetto <luca.benedetto93@gmail.com> 1709730905 +0000	checkout: moving from 24_03_04_experiments_for_cae_paper to main
38b932534bf9a4acaeecff857991bded082909e6 38b932534bf9a4acaeecff857991bded082909e6 Luca Benedetto <luca.benedetto93@gmail.com> 1709730931 +0000	checkout: moving from main to 23_03_analysis_explanation
38b932534bf9a4acaeecff857991bded082909e6 aefbbf3e9c34359695d00295c0f549542692b5ea Luca Benedetto <luca.benedetto93@gmail.com> 1709809518 +0000	commit: add script to analyse explanation, first version
aefbbf3e9c34359695d00295c0f549542692b5ea 38b932534bf9a4acaeecff857991bded082909e6 Luca Benedetto <luca.benedetto93@gmail.com> 1709809520 +0000	checkout: moving from 23_03_analysis_explanation to main
38b932534bf9a4acaeecff857991bded082909e6 f827c8278d15701c14852a46e997c3db57c4656f Luca Benedetto <luca.benedetto93@gmail.com> 1709809534 +0000	checkout: moving from main to 24_03_04_experiments_for_cae_paper
f827c8278d15701c14852a46e997c3db57c4656f bb9ec438a5b1f5767ec3d3ec9ea4c2858bcd9486 Luca Benedetto <luca.benedetto93@gmail.com> 1710755781 +0000	commit: add prompts for automated scoring
bb9ec438a5b1f5767ec3d3ec9ea4c2858bcd9486 38b932534bf9a4acaeecff857991bded082909e6 Luca Benedetto <luca.benedetto93@gmail.com> 1710755840 +0000	checkout: moving from 24_03_04_experiments_for_cae_paper to main
38b932534bf9a4acaeecff857991bded082909e6 aefbbf3e9c34359695d00295c0f549542692b5ea Luca Benedetto <luca.benedetto93@gmail.com> 1710756878 +0000	checkout: moving from main to 23_03_analysis_explanation
aefbbf3e9c34359695d00295c0f549542692b5ea 0000000000000000000000000000000000000000 Luca Benedetto <luca.benedetto93@gmail.com> 1710756928 +0000	Branch: renamed refs/heads/23_03_analysis_explanation to refs/heads/24_03_analysis_explanation
0000000000000000000000000000000000000000 aefbbf3e9c34359695d00295c0f549542692b5ea Luca Benedetto <luca.benedetto93@gmail.com> 1710756928 +0000	Branch: renamed refs/heads/23_03_analysis_explanation to refs/heads/24_03_analysis_explanation
aefbbf3e9c34359695d00295c0f549542692b5ea bb9ec438a5b1f5767ec3d3ec9ea4c2858bcd9486 Luca Benedetto <luca.benedetto93@gmail.com> 1711449398 +0000	checkout: moving from 24_03_analysis_explanation to 24_03_04_experiments_for_cae_paper
bb9ec438a5b1f5767ec3d3ec9ea4c2858bcd9486 bcad4b4edd6d780687519e0229ca93fca88299c9 Luca Benedetto <luca.benedetto93@gmail.com> 1711453502 +0000	commit: add code for cefr level classification with hf
bcad4b4edd6d780687519e0229ca93fca88299c9 dcd219e371b6d2faadfb838758633bb0ce63d128 Luca Benedetto <luca.benedetto93@gmail.com> 1711453891 +0000	commit: fix data path
dcd219e371b6d2faadfb838758633bb0ce63d128 3e7b250ff7aa90a7ef38691a76793e47828eb614 Luca Benedetto <luca.benedetto93@gmail.com> 1711460761 +0000	commit: fix column name
3e7b250ff7aa90a7ef38691a76793e47828eb614 af36b394805a1129deb66bde832d252553d475fe Luca Benedetto <luca.benedetto93@gmail.com> 1711464099 +0000	commit: update to fix output name and update model
af36b394805a1129deb66bde832d252553d475fe 0fb50ceb52e14932a73f0e4685654c9cf15724a3 Luca Benedetto <luca.benedetto93@gmail.com> 1711474492 +0000	commit: add code for converting the cerd dataset
0fb50ceb52e14932a73f0e4685654c9cf15724a3 6731a166038feb60abae2f3153072bf4c24bfcd5 Luca Benedetto <luca.benedetto93@gmail.com> 1711530327 +0000	commit: update script to use Gemma 2B and the whole CERD dataset
6731a166038feb60abae2f3153072bf4c24bfcd5 83ee28b29f5f5bd6c930b61712485d7574c24984 Luca Benedetto <luca.benedetto93@gmail.com> 1711530500 +0000	commit: update script to use Gemma 2B and the whole CERD dataset
83ee28b29f5f5bd6c930b61712485d7574c24984 03a74fb2c6f6fb29e50c7a52d2476984515df65e Luca Benedetto <luca.benedetto93@gmail.com> 1711536923 +0000	commit: update script to use Gemma 2B and the whole CERD dataset
03a74fb2c6f6fb29e50c7a52d2476984515df65e aa863d6df0cd240e87819752e995c83395bfb603 Luca Benedetto <luca.benedetto93@gmail.com> 1711617711 +0000	commit: fix prompt 2003
aa863d6df0cd240e87819752e995c83395bfb603 4cef58834ba1ecc1bdc5a2c09ca6f8fd004d679f Luca Benedetto <luca.benedetto93@gmail.com> 1711622156 +0000	commit: add script for automated scoring
4cef58834ba1ecc1bdc5a2c09ca6f8fd004d679f a5763b993c9bc856c6fd11f56c9446d58e46e433 Luca Benedetto <luca.benedetto93@gmail.com> 1711622231 +0000	commit: remove gemma 7b
a5763b993c9bc856c6fd11f56c9446d58e46e433 e54ded6cd80bd8697cf0f16a4ead7546c587733a Luca Benedetto <luca.benedetto93@gmail.com> 1711622825 +0000	commit: add param to choose dataset, now working only with fce_clc
e54ded6cd80bd8697cf0f16a4ead7546c587733a d45c948b15705b874fc6623b1145b54c24531571 Luca Benedetto <luca.benedetto93@gmail.com> 1711625987 +0000	commit: update prompt
d45c948b15705b874fc6623b1145b54c24531571 4264e9a2ed79579ec18d25509c8107aa99663ede Luca Benedetto <luca.benedetto93@gmail.com> 1712149893 +0100	commit: update having prompt id as param
4264e9a2ed79579ec18d25509c8107aa99663ede b0a0e4027c9442f93323a06d4df20cfff722ac5e Luca Benedetto <luca.benedetto93@gmail.com> 1712149918 +0100	commit: update prompts
b0a0e4027c9442f93323a06d4df20cfff722ac5e 7fd60105d4aa2c2a861bb720a04f70b111092c8e Luca Benedetto <luca.benedetto93@gmail.com> 1712149942 +0100	commit: update for experiments
7fd60105d4aa2c2a861bb720a04f70b111092c8e 38b932534bf9a4acaeecff857991bded082909e6 Luca Benedetto <luca.benedetto93@gmail.com> 1712149948 +0100	checkout: moving from 24_03_04_experiments_for_cae_paper to main
38b932534bf9a4acaeecff857991bded082909e6 aefbbf3e9c34359695d00295c0f549542692b5ea Luca Benedetto <luca.benedetto93@gmail.com> 1712149955 +0100	checkout: moving from main to 24_03_analysis_explanation
aefbbf3e9c34359695d00295c0f549542692b5ea 38b932534bf9a4acaeecff857991bded082909e6 Luca Benedetto <luca.benedetto93@gmail.com> 1713296374 +0100	checkout: moving from 24_03_analysis_explanation to main
38b932534bf9a4acaeecff857991bded082909e6 5567110d237beea9e9a0c14de6cfdc9a04a15e6b Luca Benedetto <luca.benedetto93@gmail.com> 1713299567 +0100	commit: add plot for analysis generalisation on the three dataset
5567110d237beea9e9a0c14de6cfdc9a04a15e6b 4793794e5d6aa6892420c18b3d0c3de937168d49 Luca Benedetto <luca.benedetto93@gmail.com> 1713299584 +0100	commit: remove comment
4793794e5d6aa6892420c18b3d0c3de937168d49 992eb6186db3f5c9f450b358ef8534963bb45d5a Luca Benedetto <luca.benedetto93@gmail.com> 1713300056 +0100	commit: update plot gpt4 vs. gpt3.5 to improve readability
992eb6186db3f5c9f450b358ef8534963bb45d5a 2671928e22121a685d4f68ebb913b54eab3fa75d Luca Benedetto <luca.benedetto93@gmail.com> 1715675317 +0100	commit: update figure for analysis generalisation to new datasets
2671928e22121a685d4f68ebb913b54eab3fa75d fbcd71b04417432e462d2487686d5ea1d2b0429a Luca Benedetto <luca.benedetto93@gmail.com> 1715675642 +0100	commit: update plot comparison with reference prompt
fbcd71b04417432e462d2487686d5ea1d2b0429a ea82df35b4c947f46339288851b1b7b04aa4cfa1 Luca Benedetto <luca.benedetto93@gmail.com> 1715675662 +0100	commit: fix commented line
ea82df35b4c947f46339288851b1b7b04aa4cfa1 d88e15f33b6b6c7eddb8e5bf322c9ff2b2eb0953 Luca Benedetto <luca.benedetto93@gmail.com> 1715690832 +0100	commit: add plot CUPA MCQA accuracy per difficulty level
d88e15f33b6b6c7eddb8e5bf322c9ff2b2eb0953 1cf2385522053468f8e68433227b9f98b32cf10d Luca Benedetto <luca.benedetto93@gmail.com> 1715691014 +0100	commit: update size of plots MCQA accuracy per difficulty level
1cf2385522053468f8e68433227b9f98b32cf10d 72c99e251bc6f1ce3696d2c2b3ccb9c8b9d7b7c4 Luca Benedetto <luca.benedetto93@gmail.com> 1715693565 +0100	commit: add new version of plots for comparison different llms
72c99e251bc6f1ce3696d2c2b3ccb9c8b9d7b7c4 0a9d4bfcda5769043581b875ac0edc3268e36b7c Luca Benedetto <luca.benedetto93@gmail.com> 1715701333 +0100	commit: add plt.tight_layout()
0a9d4bfcda5769043581b875ac0edc3268e36b7c b09e1da8fce07290c78b5211b3f7f3fe6866de7a Luca Benedetto <luca.benedetto93@gmail.com> 1715701375 +0100	commit: add plt.tight_layout()
b09e1da8fce07290c78b5211b3f7f3fe6866de7a b09e1da8fce07290c78b5211b3f7f3fe6866de7a Luca Benedetto <luca.benedetto93@gmail.com> 1715760271 +0100	checkout: moving from main to 2024-05-15-difficulty
b09e1da8fce07290c78b5211b3f7f3fe6866de7a 22ab5322f2d493835531578f633431d75a08f970 Luca Benedetto <luca.benedetto93@gmail.com> 1715763576 +0100	commit: add first version of the script to analyse the difficulty level given by the llms
22ab5322f2d493835531578f633431d75a08f970 b09e1da8fce07290c78b5211b3f7f3fe6866de7a Luca Benedetto <luca.benedetto93@gmail.com> 1715765928 +0100	checkout: moving from 2024-05-15-difficulty to main
b09e1da8fce07290c78b5211b3f7f3fe6866de7a b09e1da8fce07290c78b5211b3f7f3fe6866de7a Luca Benedetto <luca.benedetto93@gmail.com> 1715765964 +0100	checkout: moving from main to 2024-05-15-analysis-virtual-pretesting
b09e1da8fce07290c78b5211b3f7f3fe6866de7a 22ab5322f2d493835531578f633431d75a08f970 Luca Benedetto <luca.benedetto93@gmail.com> 1715765978 +0100	checkout: moving from 2024-05-15-analysis-virtual-pretesting to 2024-05-15-difficulty
22ab5322f2d493835531578f633431d75a08f970 b09e1da8fce07290c78b5211b3f7f3fe6866de7a Luca Benedetto <luca.benedetto93@gmail.com> 1715765993 +0100	checkout: moving from 2024-05-15-difficulty to 2024-05-15-analysis-virtual-pretesting
b09e1da8fce07290c78b5211b3f7f3fe6866de7a 4bb32daecf74cd2b0e95f9ce849e92b568ac19e5 Luca Benedetto <luca.benedetto93@gmail.com> 1715766620 +0100	commit: added analysis with Mae for simulated students
4bb32daecf74cd2b0e95f9ce849e92b568ac19e5 b09e1da8fce07290c78b5211b3f7f3fe6866de7a Luca Benedetto <luca.benedetto93@gmail.com> 1715766909 +0100	checkout: moving from 2024-05-15-analysis-virtual-pretesting to main
b09e1da8fce07290c78b5211b3f7f3fe6866de7a 2267f12a2cf52fd7fcf2dea755e9a125518315f8 Luca Benedetto <luca.benedetto93@gmail.com> 1715766912 +0100	pull: Fast-forward
2267f12a2cf52fd7fcf2dea755e9a125518315f8 22ab5322f2d493835531578f633431d75a08f970 Luca Benedetto <luca.benedetto93@gmail.com> 1715766919 +0100	checkout: moving from main to 2024-05-15-difficulty
22ab5322f2d493835531578f633431d75a08f970 d34ee7a82130fdf4aec95548abf42f1e472c093e Luca Benedetto <luca.benedetto93@gmail.com> 1715775957 +0100	commit (merge): add MAPE as metric
d34ee7a82130fdf4aec95548abf42f1e472c093e cd619afe392bb94d53aee2f48eddc5360d314b3a Luca Benedetto <luca.benedetto93@gmail.com> 1715776035 +0100	commit: fix formatting
cd619afe392bb94d53aee2f48eddc5360d314b3a 03deff0d3f13533ce71cf89f380e9baf39ae52c2 Luca Benedetto <luca.benedetto93@gmail.com> 1715776148 +0100	commit: restore changes
03deff0d3f13533ce71cf89f380e9baf39ae52c2 7150409a936e789bcfca95ae29f0bcba091faf78 Luca Benedetto <luca.benedetto93@gmail.com> 1715776230 +0100	commit: add MAPE
7150409a936e789bcfca95ae29f0bcba091faf78 2267f12a2cf52fd7fcf2dea755e9a125518315f8 Luca Benedetto <luca.benedetto93@gmail.com> 1715777835 +0100	checkout: moving from 2024-05-15-difficulty to main
2267f12a2cf52fd7fcf2dea755e9a125518315f8 5428276f19fee1a7f6130caa95b33e33e9a95756 Luca Benedetto <luca.benedetto93@gmail.com> 1715777839 +0100	pull: Fast-forward
5428276f19fee1a7f6130caa95b33e33e9a95756 5428276f19fee1a7f6130caa95b33e33e9a95756 Luca Benedetto <luca.benedetto93@gmail.com> 1716198751 +0100	checkout: moving from main to 24-05-20-analysis-monotonicity
5428276f19fee1a7f6130caa95b33e33e9a95756 3bb459c87292a86a22ae600274e39c6d83d45d3b Luca Benedetto <luca.benedetto93@gmail.com> 1716988317 +0100	commit: add eval monotonicity
3bb459c87292a86a22ae600274e39c6d83d45d3b b1b507a04e4db26778fd7078c14d9fe9eb11e476 Luca Benedetto <luca.benedetto93@gmail.com> 1716993331 +0100	commit: add analysis with eval metric for monotonicity for other datasets and models as well
b1b507a04e4db26778fd7078c14d9fe9eb11e476 5428276f19fee1a7f6130caa95b33e33e9a95756 Luca Benedetto <luca.benedetto93@gmail.com> 1716993409 +0100	checkout: moving from 24-05-20-analysis-monotonicity to main
5428276f19fee1a7f6130caa95b33e33e9a95756 56647a8af75ca6607f36621d5b8c80c9a130a2e3 Luca Benedetto <luca.benedetto93@gmail.com> 1716993413 +0100	pull: Fast-forward
56647a8af75ca6607f36621d5b8c80c9a130a2e3 7fd60105d4aa2c2a861bb720a04f70b111092c8e Luca Benedetto <luca.benedetto93@gmail.com> 1716993884 +0100	checkout: moving from main to 24_03_04_experiments_for_cae_paper
7fd60105d4aa2c2a861bb720a04f70b111092c8e 56647a8af75ca6607f36621d5b8c80c9a130a2e3 Luca Benedetto <luca.benedetto93@gmail.com> 1717512294 +0100	checkout: moving from 24_03_04_experiments_for_cae_paper to main
56647a8af75ca6607f36621d5b8c80c9a130a2e3 8d1f48e55c89ed8344f879eb63873d9d12655291 Luca Benedetto <luca.benedetto93@gmail.com> 1717747023 +0100	commit: refactor eval metric monotonicity
8d1f48e55c89ed8344f879eb63873d9d12655291 043b6179e05403061d7f06399ec99085084d581f Luca Benedetto <luca.benedetto93@gmail.com> 1717747071 +0100	commit: add plot_metrics_difficulty_analysis
043b6179e05403061d7f06399ec99085084d581f 104a17f183bb5565063822e6d8e5440cfe2bd07e Luca Benedetto <luca.benedetto93@gmail.com> 1717749059 +0100	commit: add script for plotting the confusion matrices of the explanation analysis, the values are taken from another script
