THAT CONTAIN THE ANSWER TO THE GIVENIN THIS PAPER WE DESCRIBE AN...

2002) that contain the answer to the given

In this paper we describe an algorithm that

TREC question.

learns possible syntactic answer sentence formu-

lations for syntactic question classes from a set of

In total, the data available to us for our experi-

example question/answer sentence pairs. Unlike

ments consists of 8,830 question/answer sentence

the related work described above, it acknowledges

pairs. This data is publicly available, see (Kaisser

that a) a valid answer sentence’s syntax might

and Lowe, 2008). The algorithm described in this

be very different for the question’s syntax and b)

paper has three main steps:

several valid answer sentence structures, which

Phrase alignment Key phrases from the ques-

might be completely independent from each other,

tion are paired with phrases from the answer

can exist for one and the same question.

sentences.

To illustrate this consider the question “When

was Alaska purchased?” The following four sen-

Pattern creation The dependency structures of

queries and answer sentences are analyzed

tences all answer the given question, but only the

first sentence is a straightforward reformulation of

and patterns are extracted.

the question:

Pattern evaluation The patterns discovered in

the last step are evaluated and a confidence