Document Feature Set (DF) is a feature set extracted only from a document. Using only DF corresponds to unbiased Term Extraction (TE).

For each word $w_i$, the following features are extracted:

dw-k, ..., dw+0, ..., dw+k: the k preceding and following words of the word $w_i$, e.g., {dw-1: $w_{i-1}$, dw+0: $w_i$, dw+1: $w_{i+1}$

3.2 Training and Execution

The training phase estimates a probabilistic model from training data $(x^{(1)}, y^{(1)}), \ldots, (x^{(n)}, y^{(n)})$ generated from the CRL QA Data. The execution phase evaluates the probability of $y'^{(i)}$ given input $x'^{(i)}$ using the probabilistic model.

Training Phase