3C Shared task: A Kaggle Competition for Citation Context Classification

Researchers from CORE are organizing a new shared task: the ‘3C’ Citation Context Classification Task, as part of the International Workshop on Mining Scientific Publications, WOSP 2020 (https://wosp.core.ac.uk/jcdl2020/index.html). The new task will be hosted on Kaggle (https://www.kaggle.com/c/about/inclass), which is a popular Machine Learning/Data Science competition hosting platform. The competition uses a portion of the  Academic Citation Typing (ACT) dataset  (http://oro.open.ac.uk/60670/), which is the largest dataset of its type in existence, which is also the only dataset of citations annotated by authors and the only truly multidisciplinary dataset. Using this dataset, the shared task aims at classifying the citation context in research publications based on their influence and purpose. There will be two subtasks associated with this shared task. The subtask A is a multi-class classification problem, where citations are categorized into six different classes based on the purpose. The second subtask B is a binary classification task, based on the citation influence. read more...