MANY OF THE MEASURES RETURN RESULTS, EVEN IFWHEN[1]+WAS[2]+NP[3]+VE...
2. Many of the measures return results, even if
When[1]+was[2]+NP[3]+VERB[4], which
only a weak semantic relationship exists. For
together list 382 answer sentences, and thus 382
our purposes however, it is beneficial to only
potentially different answer sentence structures
take strong semantic relations into account.
from which patterns can be gained. As a result,
the amount of training examples we have avail-
5 Pattern Creation
able, is sufficient to achieve the performance de-
scribed in Section 7. The algorithm described in
Figure 1 details our algorithm in its five key steps.
this paper can of course also be used for more
In step 1 and 2 key phrases from the question are
complicated NLQs, although in such a scenario a
aligned to the corresponding phrases in the an-
significantly larger amount of training data would
swer sentence, see Section 4 of this paper. Step
have to be used.
3 is concerned with retrieving the parse tree for
the answer sentence. In our implementation all
6 Pattern Evaluation
answer sentences in the training set have for per-
For each created pattern, at least one match-
formance reasons been parsed beforehand with
the Stanford Parser (Klein and Manning, 2003b;
ing example must exists: the sentence that was
used to create it in the first place. However, we
n
do not know how precise each pattern is. To
X
score(ac) =
score(p
i
) (2)
this end, an additional processing step between
i=1
pattern creation and application is needed: pat-
where
tern evaluation. Similar approaches to ours have
(
correct
been described in the relevant literature, many
i
+1
correct
i
+incorrect
i
+2
if match
score(p
i
) =
(3)
of them concerned with bootstrapping, starting
0
no match