Task 4 Results
Team | Correct | Accuracy |
---|---|---|
KIS3 | 66 | 0.9041 |
KIS1 | 64 | 0.8767 |
*LUONG01 | 63 | 0.8630 |
*UIRunCoT | 62 | 0.8493 |
KIS2 | 62 | 0.8493 |
CAPTAIN2 | 60 | 0.8219 |
UIRunLang | 60 | 0.8219 |
JNLP002 | 59 | 0.8082 |
JNLP003 | 59 | 0.8082 |
CAPTAIN1 | 58 | 0.7945 |
CAPTAIN3 | 58 | 0.7945 |
UA2 | 57 | 0.7808 |
UA3 | 57 | 0.7808 |
JNLP001 | 56 | 0.7671 |
KLAP.H2 | 56 | 0.7671 |
UA1 | 55 | 0.7534 |
NOWJ.run1 | 54 | 0.7397 |
NOWJ.run2 | 54 | 0.7397 |
NOWJ.run3 | 54 | 0.7397 |
OVGU1 | 54 | 0.7397 |
KLAP.H1 | 48 | 0.6575 |
RUG_V1 | 48 | 0.6575 |
OVGU3 | 46 | 0.6301 |
RUG_V3 | 46 | 0.6301 |
RUG_V2 | 45 | 0.6164 |
AIIRLLaMA | 44 | 0.6027 |
UIRunFTune | 44 | 0.6027 |
OVGU2 | 44 | 0.6027 |
AIIRMistral | 41 | 0.5616 |
BaseLine | 37 | 0.5068 |
Total Teams: | 11 | |
Total submissions: | 30 | |
Total Test Cases: | 73 |
Results of *LUONG01 and *UIRunCoT are unofficial submissions that do not follow the rules https://coliee.org/rules (opens in a new tab).
Note: One question was removed from the final evaluation due to a mistranslation issue from Japanese to English, reducing the total test cases from 74 to 73.
Last updated on