BMKC: A dataset for BioMedical Knowledge Comprehension

  To be updated


Download


Statistics

Dataset  Split  # Queries  Max # options  Avg # options  Avg # tokens  Vocab size
BMKC T   train  463,981    93             25.6           291           876,621
         valid  5,278      66             25.4           291
         test   3,868      74             25.7           289
BMKC LS  train  362,439    90             25.3           270           714,751
         valid  4,136      57             25.1           269
         test   3,205      74             25.4           271

    (Last updated on February 14, 2017)


Contact - for bugs, comments and questions

Seongsoon Kim: seongkim_at_korea_dot_ac_dot_kr
Donghyeon Park: parkdh_at_korea_dot_ac_dot_kr
Yonghwa Choi: yonghwachoi_at_korea_dot_ac_dot_kr
Jaewoo Kang: kangj_at_korea_dot_ac_dot_kr


