5.4 Similarity Measure

The first experiment conducted question recommendation based on the questions' information need parts. The different text similarity methods described in Section 3 were used to measure the similarity between the information need texts. In the TFIDF similarity measure (TFIDF), the idf value for each word was computed from frequency counts over the entire Aquaint corpus [8]. For the word-to-word knowledge-based similarity, a WordNet::Similarity Java implementation [9] of the similarity measures lin (Knowledge2) and jcn (Knowledge1) was used in this paper. For the topic model based similarity, we estimated two LDA models from 'Train t' and 'Train c' using GibbsLDA++ [10]. We treated each question, including the question title and the information need part, as a single document consisting of a sequence of words. These documents were preprocessed before being fed into the LDA models, and each model was estimated with 1800 Gibbs sampling iterations and 200 topics.

[8] https://traloihay.net
[9] https://traloihay.net
[10] https://traloihay.net
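As a concrete illustration of the TFIDF measure, the following is a minimal sketch, not the paper's implementation: idf values come from document frequency counts (standing in for the Aquaint counts), and two information need texts are compared by the cosine of their tf-idf vectors. All counts and texts below are toy placeholders.

import math
from collections import Counter

def tfidf_vector(tokens, doc_freq, num_docs):
    # tf from the text itself; idf from corpus-wide document frequency
    # counts (the paper derives these from the Aquaint corpus).
    tf = Counter(tokens)
    return {w: tf[w] * math.log(num_docs / (1.0 + doc_freq.get(w, 0)))
            for w in tf}

def cosine(u, v):
    # Cosine similarity between two sparse vectors stored as dicts.
    dot = sum(val * v.get(w, 0.0) for w, val in u.items())
    nu = math.sqrt(sum(x * x for x in u.values()))
    nv = math.sqrt(sum(x * x for x in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

# Toy stand-ins for Aquaint document frequencies and two InfoN texts.
doc_freq = {'spring': 120, 'break': 95, 'travel': 300, 'family': 400}
num_docs = 10000
a = tfidf_vector('cheap travel destination for spring break'.split(), doc_freq, num_docs)
b = tfidf_vector('family trip somewhere warm spring break'.split(), doc_freq, num_docs)
print(cosine(a, b))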

The results in Table 2 show that the TFIDF and LDA1 methods perform better at recommending questions than the others. Further analysis of the questions recommended by the two methods revealed that their orderings of recommended questions are quite different: the TFIDF method prefers texts that share more words, while the LDA1 method can relate the non-common words of two short texts through a set of third-party topics. The LDA1 method outperforms the TFIDF method in two ways: (1) the information needs of its top recommended questions share fewer words with that of the query question; (2) its top recommended questions span wider topics. The questions highly recommended by LDA1 can therefore suggest more useful topics to the user.
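To show how third-party topics can link non-overlapping vocabularies, here is a minimal sketch of an LDA1-style similarity. The paper estimated its models with GibbsLDA++ (200 topics, 1800 Gibbs iterations); this sketch substitutes gensim's LdaModel and a toy corpus, so all names and numbers are illustrative only.

import math
from gensim import corpora, models

# Toy corpus standing in for the preprocessed 'Train t' questions
# (title + InfoN concatenated into one document each).
train_docs = ['cheap travel destination spring break family',
              'laptop college student price dell mac',
              'warm resort family spring break mexico']
texts = [d.split() for d in train_docs]
dictionary = corpora.Dictionary(texts)
corpus = [dictionary.doc2bow(t) for t in texts]
lda = models.LdaModel(corpus, id2word=dictionary, num_topics=5)  # 200 in the paper

def topic_distribution(text):
    bow = dictionary.doc2bow(text.split())
    return dict(lda.get_document_topics(bow, minimum_probability=0.0))

def lda_similarity(a, b):
    # Cosine similarity between inferred topic mixtures: two texts with
    # no words in common can still match through shared topics.
    ta, tb = topic_distribution(a), topic_distribution(b)
    dot = sum(ta[k] * tb.get(k, 0.0) for k in ta)
    na = math.sqrt(sum(v * v for v in ta.values()))
    nb = math.sqrt(sum(v * v for v in tb.values()))
    return dot / (na * nb) if na and nb else 0.0

print(lda_similarity('warm family resort', 'cheap spring break trip'))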

The knowledge-based methods are shown to perform worse than TFIDF and LDA1. We found that some words were mis-tagged, so they were not included in the word-to-word similarity calculation. Another reason for the worse performance is that words outside the WordNet dictionary were likewise excluded from the similarity calculation.
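For the knowledge-based measures, the paper used a WordNet::Similarity Java implementation; the sketch below approximates lin and jcn with NLTK's WordNet interface (assuming the nltk wordnet and wordnet_ic data are installed). It also exhibits the failure mode just described: a word that is mis-tagged or absent from WordNet yields no synsets and simply drops out of the calculation.

from nltk.corpus import wordnet as wn, wordnet_ic

# Both lin and jcn require an information-content file.
brown_ic = wordnet_ic.ic('ic-brown.dat')

def word_sim(w1, w2, pos=wn.NOUN, measure='lin'):
    # Maximum lin/jcn similarity over synset pairs; returns 0.0 when a
    # word is out of WordNet or carries the wrong POS tag, the two
    # failure modes noted above.
    best = 0.0
    for a in wn.synsets(w1, pos=pos):
        for b in wn.synsets(w2, pos=pos):
            try:
                sim = (a.lin_similarity(b, brown_ic) if measure == 'lin'
                       else a.jcn_similarity(b, brown_ic))
            except Exception:  # incompatible synset pairs
                continue
            best = max(best, sim)
    return best

print(word_sim('trip', 'vacation'))    # in-vocabulary pair
print(word_sim('gforce', 'computer'))  # OOV word -> 0.0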

The Mean Reciprocal Rank scores for TFIDF and LDA1 are above 80%. That is to say, we are able to recommend questions to users by measuring their information needs. The first two questions recommended for Q1 and Q2 by the LDA1 method are shown in Table 4, where InfoN denotes the information need part associated with each question.

Table 4: Question recommendation results by LDA, measuring the similarity between information needs

Q2     Where should my family go for spring break?
InfoN  ... family wants to go somewhere for a couple days during spring break ... prefers a warmer climate and we live in IL, so it shouldn't be SUPER far away. ... a family road trip. ...
RQ1    Whats a cheap travel destination for spring break?
InfoN  I live in houston texas and i'm trying to find i inexpensive place to go for spring break with my family. My parents don't want to spend a lot of money due to the economy crisis, ... a fun road trip ...
RQ2    Alright you creative deal-seekers, I need some help in planning a spring break trip for my family
InfoN  Spring break starts March 13th and goes until the 21st ... Someplace WARM!!! Family-oriented hotel/resort ... North American Continent (Mexico, America, Jamaica, Bahamas, etc.) Cost= Around $5,000 ...
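For reference, the Mean Reciprocal Rank averages, over the query questions, the reciprocal rank of the first relevant recommendation, so a score above 0.8 means the first relevant question typically appears at rank one. A minimal computation with hypothetical ranks:

def mean_reciprocal_rank(first_relevant_ranks):
    # MRR = (1/|Q|) * sum(1 / rank_i) over the query questions.
    return sum(1.0 / r for r in first_relevant_ranks) / len(first_relevant_ranks)

# Hypothetical rank of the first relevant recommendation per query.
print(mean_reciprocal_rank([1, 1, 2, 1, 1]))  # 0.9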

In the preprocessing step, some misspelled words were successfully corrected, as in "What should I do this saturday? ... and staying in a hotell ..." and "my faimly is traveling to florda ...". However, a small number of texts, such as "How come my Gforce visualization doesn't work?" and "Do i need an Id to travel from new york to maimi?", still failed to be corrected. In the future, a better method is expected to handle these failure cases.
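The paper does not specify how the preprocessing corrects spellings; below is a hedged sketch of one standard approach, a dictionary lookup over single-edit candidates. It also shows why strings like "gforce" survive: they have no in-vocabulary neighbor within one edit. The vocabulary here is a toy placeholder.

def edits1(word):
    # All strings one edit away: deletes, transposes, replaces, inserts.
    letters = 'abcdefghijklmnopqrstuvwxyz'
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [L + R[1:] for L, R in splits if R]
    transposes = [L + R[1] + R[0] + R[2:] for L, R in splits if len(R) > 1]
    replaces = [L + c + R[1:] for L, R in splits if R for c in letters]
    inserts = [L + c + R for L, R in splits for c in letters]
    return set(deletes + transposes + replaces + inserts)

def correct(word, vocab):
    # Keep known words; otherwise pick an in-vocabulary one-edit variant.
    # A real system would rank candidates by corpus frequency.
    if word in vocab:
        return word
    candidates = edits1(word) & vocab
    return min(candidates) if candidates else word

vocab = {'hotel', 'family', 'florida', 'travel', 'spring'}
print(correct('hotell', vocab))  # -> 'hotel'
print(correct('faimly', vocab))  # -> 'family'
print(correct('gforce', vocab))  # -> 'gforce' (no close known word)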

The question and information need pairs in both 'Train t' and 'Train c' training sets were used to train two IBM-4 translation models with the GIZA++ toolkit. These pairs were also preprocessed before training, and the pairs whose information need part became empty after preprocessing were disregarded.

During the experiment, we found that some of the generated words in the information need parts are the question words themselves. This is caused by the self-translation problem in translation models: the highest translation score for a word is usually given to the word itself when the target and source languages are the same (Xue et al., 2008). This has always been a difficult trade-off: not using self-translated words can reduce retrieval performance, since the information need parts need those terms to represent the semantic meaning, while using self-translated words does not take advantage of the translation approach. To tackle this problem, we control the number of words predicted by the translation model to be exactly twice the number of words in the corresponding preprocessed question.
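A minimal sketch of that capped prediction step, under stated assumptions: trans_prob is a hypothetical word-translation table of the kind a GIZA++-trained IBM model yields, candidate information need words are scored by their summed translation probability from the question words, and the output is cut off at exactly twice the question length. Because p(w | w) dominates, the top of the ranking also illustrates the self-translation effect.

def predict_info_need_words(question_words, trans_prob):
    # Score each candidate target word by its total translation
    # probability from the question words, then keep exactly
    # 2 * len(question_words) candidates, as described above.
    scores = {}
    for s in question_words:
        for t, p in trans_prob.get(s, {}).items():
            scores[t] = scores.get(t, 0.0) + p
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:2 * len(question_words)]

# Toy table; the large p(w | w) entries mimic the self-translation problem.
trans_prob = {
    'laptop': {'laptop': 0.50, 'dell': 0.20, 'mac': 0.15, 'price': 0.10},
    'college': {'college': 0.60, 'student': 0.25, 'price': 0.05},
}
print(predict_info_need_words(['laptop', 'college'], trans_prob))
# -> ['college', 'laptop', 'student', 'dell'] (4 = 2 * question length)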

The predicted information need words for the retrieved questions are shown in Table 5. For Q1, the information need behind the question "recommend website for custom built computer parts" may imply that the user needs information about computer parts such as "ram" and "motherboard" for a particular purpose such as "gaming". For Q2, the user may want to compare computers of different brands, such as "dell" and "mac", or consider the "price" factor when "purchasing a laptop for a college student".

We also did a small scale comparison between the generated information needs and the real questions whose information need parts are not empty.