3. DATA STATISTICS TABLES 4, 5 SHOW STATISTICAL INFORMATION ON...

4.3. Data statistics

Tables 4, 5 show statistical information on our dataset. Totally, we have 3,600

annotated questions with the average length in words is 11.14. On average, each

question contains about 1.02 entity with the average length of 3.04 words. The most

popular entities include UniName (952), MajorName (733), Datetime (509),

MajorMode (241), ScholarName (219), and AdmissionType (200).

Table 4. Statistical information on the dataset Number of questions 3,600 Average length (in words) of questions 11.14 Average number of entities per question 1.02 Average length (in words) of entities 3.04 Table 5. Statistical information on entity types No Entity type Quantity No Entity Type Quantity 1 UniName 952 8 ScholarName 219 2 CampusName 119 9 AdmissionType 200 3 DeptName 39 10 MajorMode 241 4 TeacherName 38 11 KYears 30 5 MajorName 733 12 Duration 80 6 SubjectName 120 13 Datetime 509 7 DocsName 171 14 Number 197