1 SCREENING FILTERA NOUN PHRASE. ALL THE PROBABILITIES OF RULES ARES...
3.1 Screening filter
a noun phrase. All the probabilities of rules are
Screening was performed by removing recognition
stochastically estimated based on data. Probabilities
errors using a confidence measure as a threshold and
for frequently used rules become greater, and those
then summarizing it within an 80% to 100% com-
for rarely used rules become smaller. Even though
paction ratio. In this summarization technique, the
transcription results given by a speech recognizer are
word significance and linguistic score for summa-
ill-formed, the dependency structure can be robustly
rization were calculated using text from Mainichi
estimated by our SDCFG.
newspapers published from 1994 to 2001, compris-
ing 13.6M sentences with 232M words. The SD-
The generality score is defined as
CFG for the word concatenation score was calcu-
AG
(Pn
) =w∈P
n
:w=
contlogP(w),