THE TOTAL POINTS THAT THE AUTHOR WHOTURES ARE APPLIED

Question

13. Total Points: The total points that the author whotures are applied.gives the answer sentence receives.Step 2. Context assignment: every context sen-tence is assigned to the most relevant question sen-The previous literature (Shah et al., 2010) hintedtence. We compute the semantic similarity(Simpsonthat some cQA features, such as Sentence Length,Has Link and Best Answer Star, may be more im-and Crowe, 2005) between sentences or sub ques-Figure 1: Four kinds of the contextual factors are considered for answer summarization in our general CRF basedmodels.tions as:swer sentences xi , xj and their correspondingreplied questions Qri, Qrj. If the similarity of Qrisim(w1, w2)sim(x, y) = 2 × ∑and Qrj is above some upper threshold τuq, this| x | + | y | (2)(w1,w2)∈M(x,y)means that xi and xj are very similar and likely toprovide similar viewpoint to answer similar ques-where M (x, y) denotes synset pairs matched in sen-tions. In this case, we want to select either xi ortences x and y; and the similarity between the twoxjas answer. This is done by setting the contextualsynsets w1 and w2 is computed to be inversely pro-factor cf2such that xiand xj have opposite labels,portional to the length of the path in Wordnet.{One answer sentence may related to more thanexp ν, yi∗ yj = − 1cf2=one sub questions to some extent. Thus, we de-exp − ν, otherwisefine the replied question Qri as the sub questionwith the maximal similarity to sentence xi: Qri =Assuming that sentence xi is selected as a sum-argmaxQjsim(xi, Qj). It is intuitive that differentmary sentence, and its next local neighborhood sen-summary sentences aim at answering different subtence xi+1by the same author is dissimilar to it butquestions. Therefore, we design the following twoit is relevant to the original multi-sentence question,contextual factors based on the similarity of repliedthen it is reasonable to also pick xi+1as a summaryquestions.sentence because it may offer new viewpoints byDissimilar Replied Question Factor: Given twothe author. Meanwhile, other local and non-localanswer sentences xi , xj and their correspondingsentences which are similar to it at above the up-replied questions Qri, Qrj. If the similarity2of Qriper threshold will probably not be selected as sum-and Qrj is below some threshold τlq, it means thatmary sentences as they offer similar viewpoint asxiand xjwill present different viewpoints to answerdiscussed above. Therefore, we propose the follow-different sub questions. In this case, it is likely thating two kinds of contextual factors for selecting thexi and xj are both summary sentences; we ensureanswer sentences in the CRF model.this by setting the contextual factor cf1with a largeLocal Novelty Factor: If the similarity of answervalue of exp ν , where ν is a positive real constantsentence xi and xi+1 given by the same author isoften assigned to value 1; otherwise we set cf1 tobelow a lower threshold τls, but their respective sim-exp − ν for penalization.ilarities to the sub questions both exceed an upperthreshold τus, then we will boost the probability ofexp ν, yi= yj = 1selecting both as summary sentences by setting:cf1=exp ν, yi= yi+1 = 1cf3=Similar Replied Question Factor: Given two an-2We use the semantic similarity of Equation 2 for all ourRedundance Factor: If the similarity of answersimilarity measurement in this paper.sentence xi and xj is greater than the upper thresh-where N denotes the total number of training sam-old τus, then they are likely to be redundant andples. we compute the log-likelihood gradient com-hence should be given opposite labels. This is doneponent of θ in the first term of Equation 4 as inby setting:usual CRFs. However, the second term of Equation{ exp ν, yi∗ yj = − 1θg∥2be-4 is non-differentiable when some special ∥ − →comes exactly zero. To tackle this problem, an ad-cf4 =ditional variable is added for each group (Schmidt ,θg∥2 with