3.2 Knowledge-based Measure

Mihalcea et al. (2006) proposed several knowledge-based methods for measuring the semantic level similarity of texts to solve the lexical chasm problem between short texts. These knowledge-based similarity measures were derived from word semantic similarity by making use of WordNet. The evaluation on a paraphrase recognition task showed that knowledge-based measures outperform the simpler lexical level approach.

We follow the definition in (Mihalcea et al., 2006) to derive a text-to-text similarity metric mcs for two given texts $D_i$ and $D_j$:

$$
mcs(D_i, D_j) = \frac{\sum_{w \in D_i} maxSim(w, D_j) \ast idf(w)}{\sum_{w \in D_i} idf(w)} + \frac{\sum_{w \in D_j} maxSim(w, D_i) \ast idf(w)}{\sum_{w \in D_j} idf(w)}
$$

Sampling can be used to estimate the corresponding expected posterior probabilities $P(z|D) = \hat{\theta}$ and $P(w|z) = \hat{\phi}$ (Griffiths and Steyvers, 2004).

In this paper we use two LDA based similarity measures in (Celikyilmaz et al., 2010) to measure the similarity between short information need texts. The first LDA similarity method uses KL divergence to measure the similarity between two documents under each given topic:

$$
sim_{LDA1}(D_i, D_j) = \frac{1}{K} \sum_{k=1}^{K} 10^{W(D_i^{(z=k)}, D_j^{(z=k)})}
$$

$$
W(D_i, D_j) = -KL\left(D_i \,\Big\|\, \frac{D_i + D_j}{2}\right) - KL\left(D_j \,\Big\|\, \frac{D_i + D_j}{2}\right)
$$
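As an illustration only, the two measures above can be sketched in Python. This is not the authors' implementation. The names `mcs`, `sim_lda1`, `word_sim`, and `idf` are hypothetical, and the sketch makes several assumptions: `word_sim` stands in for any WordNet-based word-to-word similarity, each text is a list of tokens summed over its distinct words, each document under topic k is given as a normalized probability vector over a shared vocabulary, and KL divergence uses the natural logarithm.

```python
import math
from typing import Callable, Mapping, Sequence


def mcs(d_i: Sequence[str], d_j: Sequence[str],
        word_sim: Callable[[str, str], float],
        idf: Mapping[str, float]) -> float:
    """Sum of the two directed, idf-weighted maxSim scores for D_i and D_j."""

    def directed(src: Sequence[str], tgt: Sequence[str]) -> float:
        words = set(src)  # assumption: sum over distinct words of the text
        total_idf = sum(idf.get(w, 0.0) for w in words)
        if total_idf == 0.0:
            return 0.0
        # maxSim(w, tgt): best similarity between w and any word of tgt
        weighted = sum(
            max((word_sim(w, v) for v in tgt), default=0.0) * idf.get(w, 0.0)
            for w in words
        )
        return weighted / total_idf

    return directed(d_i, d_j) + directed(d_j, d_i)


def kl(p: Sequence[float], q: Sequence[float]) -> float:
    """KL(p || q) with natural log; terms with p_x = 0 contribute nothing."""
    return sum(px * math.log(px / qx) for px, qx in zip(p, q) if px > 0.0)


def w_score(p: Sequence[float], q: Sequence[float]) -> float:
    """W(D_i, D_j) = -KL(D_i || M) - KL(D_j || M), with M the midpoint."""
    m = [(a + b) / 2.0 for a, b in zip(p, q)]
    return -kl(p, m) - kl(q, m)


def sim_lda1(topics_i: Sequence[Sequence[float]],
             topics_j: Sequence[Sequence[float]]) -> float:
    """Average of 10 ** W over the K topics.

    topics_i[k] is the (assumed normalized) representation of D_i under topic k.
    """
    k = len(topics_i)
    return sum(10.0 ** w_score(p, q) for p, q in zip(topics_i, topics_j)) / k
```

Because the midpoint distribution is nonzero wherever either input is nonzero, each KL term stays finite. W is never positive, so every per-topic term `10 ** W` lies in (0, 1] and equals 1 only when the two distributions coincide.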

Excerpt from: "Improving Question Recommendation by Exploiting Information Need"