BAM: Benchmarking Argument Mining

Evaluation Results

Filename                                                 S      B      C      R      Timestamp
output-R(Llama-3.2-3B-Instruct-zero_shot+context-0.2)    -      -      -      0.077  20241018T114719
output-R(Llama-3.1-70B-Instruct-few_shot+context-0.2)    -      -      -      0.149  20241018T114624
output-R(Llama-3.2-3B-Instruct-few_shot-0.2)             -      -      -      0.106  20241018T114553
output-R(Llama-3.2-3B-Instruct-few_shot+context-0.2)     -      -      -      0.088  20241018T114524
output-B(Llama-3.2-3B-Instruct-zero_shot+context-0.2)    0.523  0.11   -      -      20241016T104536
output-S(Llama-3.1-8B-Instruct-few_shot-0.2)             0.667  -      -      -      20241016T104502
output-B(Llama-3.2-3B-Instruct-few_shot-0.2)             0.516  0.105  -      -      20241016T101909
output-S(Llama-3.2-3B-Instruct-few_shot+context-0.2)     0.457  -      -      -      20241014T155036
output-S(Llama-3.2-3B-Instruct-few_shot-0.2)             0.459  -      -      -      20241014T155022
output-S(Llama-3.2-3B-Instruct-zero_shot+context-0.2)    0.457  -      -      -      20241014T155010
output-S(Llama-3.2-3B-Instruct-zero_shot)                0.458  -      -      -      20241014T154959
output-B(Llama-3.1-8B-Instruct-zero_shot+context-0.2)    0.567  0.17   -      -      20241014T111539
output-S(Llama-3.1-8B-Instruct-few_shot+context-0.2)     0.661  -      -      -      20241011T003247
output-S(Llama-3.1-8B-Instruct-few_shot-0.2)             0.666  -      -      -      20241011T003236
output-S(Llama-3.1-8B-Instruct-zero_shot+context-0.2)    0.689  -      -      -      20241011T003211
output-C(Llama-3.1-8B-Instruct-zero_shot-0.2)            0.579  0.166  0.024  -      20241010T135330
output-B(Llama-3.1-8B-Instruct-zero_shot-0.2)            0.586  0.171  -      -      20241010T112415
output-S(Llama-3.1-8B-Instruct-zero_shot-0.2)            0.646  -      -      -      20241009T142715
output-S(Llama-3.1-8B-Instruct-zero_shot)                0.646  -      -      -      20241009T140517
output-S(Llama-3.2-3B-Instruct-zero_shot)                0.458  -      -      -      20241009T124837
output-C(Meta-Llama-3.1-70B-Instruct-few_shot+context)   0.644  0.284  0.366  -      20241004T105621
output-C(Meta-Llama-3.1-70B-Instruct-zero_shot+context)  0.635  0.253  0.195  -      20241004T105133
output-C(Meta-Llama-3.1-70B-Instruct-few_shot)           0.631  0.237  0.292  -      20241004T104700
output-C(Meta-Llama-3.1-70B-Instruct-zero_shot)          0.633  0.232  0.141  -      20241004T104204
output-B(Meta-Llama-3.1-70B-Instruct-few_shot)           0.654  0.19   -      -      20241004T100607
output-B(Meta-Llama-3.1-70B-Instruct-few_shot+context)   0.658  0.252  -      -      20241003T135849
output-B(Meta-Llama-3.1-70B-Instruct-zero_shot+context)  0.677  0.19   -      -      20241003T135214
output-B(Meta-Llama-3.1-70B-Instruct-zero_shot)          0.464  0.007  -      -      20241002T114424
output-S(Meta-Llama-3.1-70B-Instruct-few_shot+context)   0.702  -      -      -      20241002T114406
output-S(Meta-Llama-3.1-70B-Instruct-zero_shot+context)  0.674  -      -      -      20241002T114357
output-S(Meta-Llama-3.1-70B-Instruct-zero_shot)          0.633  -      -      -      20241002T114345
output-S(Meta-Llama-3.1-70B-Instruct-few_shot)           0.704  -      -      -      20241002T114336
TARGER_components                                        0.611  0.486  0.644  -      20240823T140402
TRABAM_components                                        0.832  0.506  0.662  -      20240823T140148
DIAM_S_components                                        0.532  0.096  0.117  -      20240823T135816
DIAM_L_boundaries                                        0.579  0.133  -      -      20240823T135810
MARGOT_components                                        0.454  0.097  0.133  -      20240823T135751
AURC_components                                          0.805  0.445  -      -      20240823T135708
ARGUMINSCI_components                                    0.6    0.115  0.09   -      20240823T135637
TRABAM_relations                                         -      -      -      0.248  20240823T135627
DIAM_S_relations                                         -      -      -      0.076  20240823T135619
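
The rows above can be read programmatically. The following is a minimal sketch, not part of BAM itself: the names Result, parse_score, parse_row and best_by are hypothetical, and it assumes that "-" means no score was reported for that column and that timestamps follow the YYYYMMDDTHHMMSS pattern they appear to use.

```python
from datetime import datetime
from typing import List, NamedTuple, Optional


class Result(NamedTuple):
    """One table row; field names mirror the column letters (hypothetical type)."""
    filename: str
    s: Optional[float]
    b: Optional[float]
    c: Optional[float]
    r: Optional[float]
    timestamp: datetime


def parse_score(field: str) -> Optional[float]:
    """Return None for the '-' placeholder (assumed to mean 'no score')."""
    return None if field == "-" else float(field)


def parse_row(line: str) -> Result:
    """Parse one whitespace-separated row: Filename S B C R Timestamp."""
    # Split from the right so the five trailing fields are always isolated.
    filename, s, b, c, r, ts = line.rsplit(None, 5)
    return Result(
        filename.strip(),
        parse_score(s),
        parse_score(b),
        parse_score(c),
        parse_score(r),
        # Timestamp format is an assumption based on how the values look.
        datetime.strptime(ts, "%Y%m%dT%H%M%S"),
    )


def best_by(rows: List[Result], column: str) -> Result:
    """Return the row with the highest score in `column`, skipping rows without one."""
    scored = [row for row in rows if getattr(row, column) is not None]
    return max(scored, key=lambda row: getattr(row, column))


# Three rows copied from the table above.
SAMPLE = """\
TARGER_components 0.611 0.486 0.644 - 20240823T140402
TRABAM_components 0.832 0.506 0.662 - 20240823T140148
TRABAM_relations - - - 0.248 20240823T135627
"""

rows = [parse_row(line) for line in SAMPLE.splitlines() if line.strip()]
top = best_by(rows, "s")
print(top.filename, top.s)  # prints: TRABAM_components 0.832
```

Because missing scores become None rather than 0, rows that were never evaluated on a column are left out of the comparison for that column instead of being ranked last.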