Multi-hop inference is the task of combining more than one piece of information to solve an inference problem, such as question answering.
- 2020-03-06: Training data release
- 2020-04-06: Test data release; Evaluation start
- 2020-09-21: Evaluation end
- 2020-10-02: System description paper deadline
- 2020-10-18: Deadline for reviews of system description papers
- 2020-10-25: Author notifications
- 2020-11-01: Camera-ready description paper deadline
- 2020-12-13: TextGraphs-14 workshop
Dates are specified in the ISO 8601 format.
The shared task data distribution includes a baseline that uses a term frequency–inverse document frequency (tf.idf) model to rank how likely table row sentences are to be part of a given explanation. On the development partition, this baseline achieves a MAP of 0.255.
$ make dataset
$ ./baseline_tfidf.py tables questions.dev.tsv > predict.txt
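The official baseline_tfidf.py ships with the data distribution and is not reproduced here. Purely as an illustration of the approach, a minimal tf.idf ranking sketch (assuming scikit-learn, which the official script need not use) could look like this:

```python
# Minimal sketch of a tf.idf ranking baseline (illustrative, not the official baseline_tfidf.py).
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def rank_rows(question_text, row_ids, row_texts):
    """Return table row ids sorted by tf.idf cosine similarity to the question."""
    vectorizer = TfidfVectorizer()
    row_matrix = vectorizer.fit_transform(row_texts)          # one tf.idf vector per table row
    question_vector = vectorizer.transform([question_text])   # same vocabulary as the rows
    scores = cosine_similarity(question_vector, row_matrix)[0]
    ranked = sorted(zip(row_ids, scores), key=lambda pair: pair[1], reverse=True)
    return [row_id for row_id, _ in ranked]

# Hypothetical usage producing the predict.txt format, one ranked row per line:
# for question_id, question_text in questions:
#     for row_id in rank_rows(question_text, row_ids, row_texts):
#         print(f"{question_id}\t{row_id}")
```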
The format of the predict.txt file is questionID<TAB>explanationID without a header; the order of the rows matters. When tqdm is installed, baseline_tfidf.py will show a nice-looking progress bar.
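For illustration only, the first lines of a predict.txt could look as follows (the question and explanation identifiers below are hypothetical placeholders, not real IDs from the dataset):

```
Q_0001<TAB>ROW_00a1
Q_0001<TAB>ROW_03f7
Q_0002<TAB>ROW_0042
```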
$ ./evaluate.py questions.dev.tsv predict.txt
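The baseline score above is reported as Mean Average Precision (MAP). Assuming evaluate.py follows the usual definition of that metric, the computation is roughly as in the sketch below (not the official implementation):

```python
# Rough sketch of Mean Average Precision (MAP) over ranked explanation rows.
# Not the official evaluate.py; shown only to make the metric concrete.
def average_precision(ranked_ids, gold_ids):
    """Average precision of one ranked list against the set of gold explanation rows."""
    hits, precision_sum = 0, 0.0
    for rank, row_id in enumerate(ranked_ids, start=1):
        if row_id in gold_ids:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(gold_ids) if gold_ids else 0.0

def mean_average_precision(predictions, gold):
    """predictions: question id -> ranked row ids; gold: question id -> set of gold row ids."""
    return sum(average_precision(predictions[q], gold[q]) for q in gold) / len(gold)
```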
In order to prepare a submission file for CodaLab, create a ZIP file containing your predict.txt for the test dataset, cf. make predict-tfidf-test.zip.
Please submit your solutions via CodaLab: https://competitions.codalab.org/competitions/23615.
This shared task is organized within the 14th workshop on graph-based natural language processing, TextGraphs-14: https://sites.google.com/view/textgraphs2020.
We welcome questions and answers on the shared task on CodaLab Forums: https://competitions.codalab.org/forums/20311/.
To contact the task organizers directly, please send an email to textgraphsoc@gmail.com.
By submitting results to this competition, you consent to the public release of your scores at the TextGraphs-14 workshop and in the associated proceedings, at the task organizers' discretion. Scores may include, but are not limited to, automatic and manual quantitative judgements, qualitative judgements, and such other metrics as the task organizers see fit. You accept that the ultimate decision of metric choice and score value is that of the task organizers.
You further agree that the task organizers are under no obligation to release scores and that scores may be withheld if it is the task organizers' judgement that the submission was incomplete, erroneous, deceptive, or violated the letter or spirit of the competition's rules. Inclusion of a submission's scores is not an endorsement of a team or individual's submission, system, or science.
You further agree that your system may be named according to the team name provided at the time of submission, or to a suitable shorthand as determined by the task organizers.
You agree not to use or redistribute the shared task data except in the manner prescribed by its licence.
To encourage transparency and replicability, all teams must publish their code, tuning procedures, and instructions for running their models with their submission of shared task papers.
- Jansen, P., Ustalov, D.: TextGraphs 2020 Shared Task on Multi-Hop Inference for Explanation Regeneration. In: Proceedings of the Graph-based Methods for Natural Language Processing (TextGraphs). pp. 85–97. Association for Computational Linguistics, Barcelona, Spain (Online) (2020).
@inproceedings{Jansen:20,
author = {Jansen, Peter and Ustalov, Dmitry},
title = {{TextGraphs~2020 Shared Task on Multi-Hop Inference for Explanation Regeneration}},
year = {2020},
booktitle = {Proceedings of the Graph-based Methods for Natural Language Processing (TextGraphs)},
pages = {85--97},
address = {Barcelona, Spain (Online)},
publisher = {Association for Computational Linguistics},
isbn = {978-1-952148-42-2},
url = {https://www.aclweb.org/anthology/2020.textgraphs-1.10},
language = {english},
}