Task 4 Results
| Team | Correct | Accuracy |
|---|---|---|
| KIS3 | 66 | 0.9041 |
| KIS1 | 64 | 0.8767 |
| *LUONG01 | 63 | 0.8630 |
| *UIRunCoT | 62 | 0.8493 |
| KIS2 | 62 | 0.8493 |
| CAPTAIN2 | 60 | 0.8219 |
| UIRunLang | 60 | 0.8219 |
| JNLP002 | 59 | 0.8082 |
| JNLP003 | 59 | 0.8082 |
| CAPTAIN1 | 58 | 0.7945 |
| CAPTAIN3 | 58 | 0.7945 |
| UA2 | 57 | 0.7808 |
| UA3 | 57 | 0.7808 |
| JNLP001 | 56 | 0.7671 |
| KLAP.H2 | 56 | 0.7671 |
| UA1 | 55 | 0.7534 |
| NOWJ.run1 | 54 | 0.7397 |
| NOWJ.run2 | 54 | 0.7397 |
| NOWJ.run3 | 54 | 0.7397 |
| OVGU1 | 54 | 0.7397 |
| KLAP.H1 | 48 | 0.6575 |
| RUG_V1 | 48 | 0.6575 |
| OVGU3 | 46 | 0.6301 |
| RUG_V3 | 46 | 0.6301 |
| RUG_V2 | 45 | 0.6164 |
| AIIRLLaMA | 44 | 0.6027 |
| UIRunFTune | 44 | 0.6027 |
| OVGU2 | 44 | 0.6027 |
| AIIRMistral | 41 | 0.5616 |
| BaseLine | 37 | 0.5068 |
| Total Teams: | 11 | |
| Total submissions: | 30 | |
| Total Test Cases: | 73 | |
Entries marked with an asterisk (*LUONG01 and *UIRunCoT) are unofficial submissions that do not follow the rules: https://coliee.org/rules
Note: One question was removed from the final evaluation due to a mistranslation issue from Japanese to English, reducing the total test cases from 74 to 73.
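
The Accuracy column appears to be the Correct count divided by the 73 evaluated test cases, rounded to four decimal places. The sketch below reproduces that arithmetic for a few rows of the table; the function name `accuracy` and the constant `TOTAL_TEST_CASES` are illustrative names, not part of any official evaluation script.

```python
# Assumption: accuracy = correct answers / evaluated test cases,
# reported to four decimal places as in the table above.
TOTAL_TEST_CASES = 73  # 74 original questions minus the one removed


def accuracy(correct: int, total: int = TOTAL_TEST_CASES) -> float:
    """Return the fraction of correctly answered test cases, rounded to 4 places."""
    if not 0 <= correct <= total:
        raise ValueError("correct must be between 0 and total")
    return round(correct / total, 4)


# A few rows taken from the table: (team, correct answers)
rows = [("KIS3", 66), ("NOWJ.run1", 54), ("BaseLine", 37)]
for team, correct in rows:
    print(f"{team}: {accuracy(correct):.4f}")
# KIS3: 0.9041
# NOWJ.run1: 0.7397
# BaseLine: 0.5068
```

With 73 test cases, each additional correct answer changes the accuracy by about 0.0137, which is why teams with the same Correct count share an identical Accuracy value.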