Modeling Context in Answer Sentence Selection Systems on a Latency Budget

Rujun Han; Luca Soldaini; Alessandro Moschitti

Track: Information Retrieval, Search and Question Answering (Short paper)

Gather-2A: Apr 22 (13:00-15:00 UTC)

Abstract: Answer Sentence Selection (AS2) is an efficient approach for the design of open-domain Question Answering (QA) systems. In order to achieve low latency, traditional AS2 models score question-answer pairs individually, ignoring any information from the document each potential answer was extracted from. In contrast, more computationally expensive models designed for machine reading comprehension tasks typically receive one or more passages as input, which often results in better accuracy. In this work, we present an approach to efficiently incorporate contextual information in AS2 models. For each answer candidate, we first use unsupervised similarity techniques to extract relevant sentences from its source document, which we then feed into an efficient transformer architecture fine-tuned for AS2. Our best approach, which leverages a multi-way attention architecture to efficiently encode context, improves 6% to 11% over non-contextual state of the art in AS2 with minimal impact on system latency. All experiments in this work were conducted in English.
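The context-extraction step described in the abstract can be illustrated with a minimal sketch. The abstract does not name the unsupervised similarity technique, so the bag-of-words cosine similarity below is an assumption chosen for simplicity. The function names (`select_context`, `build_model_input`), the parameter `k`, and the `[SEP]`-joined input string are all hypothetical; in particular, the flat concatenation only stands in for the model input and does not reproduce the multi-way attention architecture.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_context(question, candidate, document_sentences, k=2):
    """Pick the k document sentences most similar to question + candidate.

    Assumption: similarity is plain bag-of-words cosine; the source only
    says "unsupervised similarity techniques".
    """
    query = Counter(tokenize(question + " " + candidate))
    scored = [
        (cosine(query, Counter(tokenize(sentence))), index, sentence)
        for index, sentence in enumerate(document_sentences)
        if sentence != candidate  # the candidate is not its own context
    ]
    # Highest similarity first; ties are broken by document position.
    scored.sort(key=lambda item: (-item[0], item[1]))
    # Return the selected sentences in their original document order.
    top = sorted(scored[:k], key=lambda item: item[1])
    return [sentence for _, _, sentence in top]


def build_model_input(question, candidate, context, sep=" [SEP] "):
    """Hypothetical flat input: question, candidate, then selected context."""
    return sep.join([question, candidate, " ".join(context)])


if __name__ == "__main__":
    document = [
        "The Eiffel Tower is in Paris.",
        "It was completed in 1889.",
        "Bananas are rich in potassium.",
        "The tower was designed by Gustave Eiffel's company.",
    ]
    question = "When was the Eiffel Tower completed?"
    candidate = "It was completed in 1889."
    context = select_context(question, candidate, document, k=2)
    print(build_model_input(question, candidate, context))
```

Because the selection step uses only token counts, it runs before the transformer and adds little cost per candidate, which is consistent with the latency goal stated in the abstract. A fine-tuned AS2 model would then score each resulting input.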

Venue: EACL 2021