Section 2.5). Cross-validation was conducted. For the SΠ system, which required no training, all of the 300 summaries were used as the test set. SΣ outperformed the baseline in Recall (R) but not in Precision (P); nevertheless, the combined F-1 score (F) was appreciably higher (around 5 percentage points). On the other hand, our SΠ system showed very consistent improvements on the order of 10 to 15 percentage points over the baseline on all measures; we would like to draw attention to the fact that even if Precision scores are higher, it is on Recall scores that greater improvements were achieved. This, together with the results obtained by SΣ, suggests performances could benefit [...]

10 Available at https://traloihay.net.

[...] ROUGE-2 score, which seems to be particularly sensitive to Novelty: no matter what combination of measures is used (R alone, RQ, RQC), changes in ROUGE-2 score remain under one percentage point. Once Novelty is added, performances rise abruptly to the system’s highest. A summary example, along with the question and the best answer, is presented in Table 2.

HOW TO PROTECT YOURSELF FROM A BEAR?
https://traloihay.net
20060818062414AA7VldB

***BEST ANSWER***
Great question. I have done alot of trekking through California, Montana and Wyoming and have met Black bears (which are quite dinky and placid but can go nuts if they have babies), and have been half an hour away from (allegedly) the mother of all grizzley s whilst on a trail through Glacier National park - so some other trekkerers told me... What the park wardens say is SING, SHOUT, MAKE NOISE...do it loudly, let them know you are there..they will get out of the way, it is a surprised bear wot will go mental and rip your little legs off..No fun permission: anything that will confuse them and stop them in their tracks...I have been told be an native american buddy that to keep a bottle of perfume in your pocket...throw it at the ground near your feet and make the place stink: they have good noses, them bears, and a mega concentrated dose of Britney Spears Obsessive Compulsive is gonna give em something to think about...Have you got a rape alarm? Def take that...you only need to distract them for a second then they will lose interest..Stick to the trails is the most important thing, and talk to everyone you see when trekking: make sure others know where you are.

***SUMMARIZED ANSWER***
[...]In addition if the bear actually approaches you or charges you.. still stand your ground. Many times they will not actually come in contact with you, they will charge, almost touch you than run away. [...]The actions you should take are different based on the type of bear. for example adult Grizzlies can t climb trees, but Black bears can even when adults. They can not climb in general as thier claws are longer and not semi-retractable like a Black bears claws.[...]I truly disagree with the whole play dead approach because both Grizzlies and Black bears are oppurtunistic animals and will feed on carrion as well as kill and eat animals. Although Black bears are much more scavenger like and tend not to kill to eat as much as they just look around for scraps. Grizzlies on the other hand are very accomplished hunters and will take down large prey animals when they want.[...]I have lived in the wilderness of Northern Canada for many years and I can honestly say that Black bears are not at all likely to attack you in most cases they run away as soon as they see or smell a human, the only places where Black bears are agressive is in parks with visitors that feed them, everywhere else the bears know that usually humans shoot them and so fear us.[...]

Table 2: A summarized answer composed of five different portions of text generated with the SΠ scoring function; the chosen best answer is presented for comparison. The richness of the content and the good level of readability make it a successful instance of metadata-aware summarization of information in cQA systems. Less satisfying examples include summaries to questions that require a specific order of sentences or a compromise between strongly discordant opinions; in those cases, the summarized answer might lack logical consistency.

4 Discussion and Future Directions

We conclude by discussing a few alternatives to the approaches we presented. The length constraint M for the final summary (Section 2.6) could have been determined by making use of external knowledge such as TKq: since TKq represents the total knowledge available about q, a coverage estimate of the final answers against it would have been ideal. Unfortunately, the lack of metadata about those answers prevented us from proceeding in that direction. This consideration suggests the idea of building TKq using similar answers in the dataset itself, for which metadata is indeed available. Furthermore, similar questions in the dataset could have been used to augment the set of answers used to generate the final summary with answers coming from similar questions. Wang et al. (2009a) present a method to retrieve similar questions that could be worth taking into consideration for the task. We suggest that the retrieval method could be made Quality-aware. A Quality feature space for questions is presented by Agichtein et al. (2008) and could be used to rank the quality of questions in a way similar to how we ranked the quality of answers.

The Quality assessing component itself could be built as a module that can be adjusted to the kind of Social Media in use; the creation of customized Quality feature spaces would make it possible to handle different sources of UGC (forums, collaborative authoring websites such as Wikipedia, blogs etc.). A great obstacle is the lack of systematically available high quality training examples: a tentative solution could be to make use of clustering algorithms in the feature space; high and low quality clusters could then be labeled by comparison with examples of virtuous behavior (such as Wikipedia’s Featured Articles). The quality of a document could then be estimated as a function of distance from the centroid of the cluster it belongs to. More careful estimates could take the position of other clusters and the concentration of nearby documents into consideration.

Finally, in addition to the chosen best answer, a DUC-styled query-focused multi-document summary could be used as a baseline against which the performances of the system can be checked.

5 Related Work

A work with a similar objective to our own is that of Liu et al. (2008), where standard multi-document summarization techniques are employed along with taxonomic information about questions. Our approach differs in two fundamental aspects: it takes into consideration the peculiarities of the input data by exploiting the nature of UGC and available metadata; additionally, along with relevance, we addressed challenges that are specific to Question Answering, such as Coverage and Novelty. For an investigation of Coverage in the context of Search Engines, refer to Swaminathan et al. (2009).

At the core of our work lie information trustfulness, summarization techniques and alternative concept representation. A general approach to the broad problem of evaluating information credibility on the Internet is presented by Akamine et al. (2009) with a system that makes use of semantic-aware Natural Language Preprocessing techniques. With analogous goals, but a focus on UGC, are the papers of Stvilia et al. (2005), McGuinness et al. (2006), Hu et al. (2007) and Zeng et al. (2006), which present a thorough investigation of Quality and trust in Wikipedia. In the cQA domain, Jeon et al. (2006) present a framework to use Maximum Entropy for answer quality estimation through non-textual features; with the same purpose, more recent methods based on the expertise of answerers are proposed by Suryanto et al. (2009), while Wang et al. (2009b) introduce the idea of ranking answers by taking their relation to questions into consideration. The paper that we regard as most authoritative on the matter is the work by Agichtein et al. (2008), which inspired us in the design of the Quality feature space presented in [...]

[...] state-of-the-art summarization systems is ongoing.

Acknowledgments

This work was partly supported by the Chinese Natural Science Foundation under grant No. 60803075, and was carried out with the aid of a grant from the International Development Research Center, Ottawa, Canada. We would like to thank Prof. Xiaoyan Zhu, Mr. Yang Tang and Mr. Guillermo Rodriguez for the valuable discussions and comments and for their support. We would also like to thank Dr. Chin-Yew Lin and Dr. Eugene Agichtein from Emory University for sharing