THE FIRST SET CONTAINS FOR EACH QUESTION ALLWORK IS THAT WE DO NOT...

1. The first set contains for each question all

work is that we do not assume this similarity. In

sentences in the top 100 paragraphs returned

our approach valid answer sentences are allowed

by Lucene when using simple queries made

to have grammatical structures that are very dif-

up from the question’s key words. It cannot

ferent from the question and also very different

be guaranteed that answers to every question

from each other. Thus it is natural to compare our

are present in this test set.

approach against a baseline that compares can-