NTU-Exam board



Course name: Information Retrieval
Course type: Required, third year, Department of Library and Information Science
Instructor: 唐牧群
College: College of Liberal Arts
Department: Department of Library and Information Science
Exam date (Y/M/D): 101/01/10
Time limit (minutes): 1.20~4.20
Reward requested: Yes! (No reward is issued unless explicitly requested.)

Exam questions:

1. With an imaginary database that contains only the following 5 documents: (20 points)

   D1: "a dog barks at a cat and a dog in a tree"
   D2: "a dog watches ants eat the bark of a tree"
   D3: "a dog watches another dog by a tree"
   D4: "a dog barks at a cat on a tree"
   D5: "the bark fell from the tree as a cat watches"

   (Terms in the stop word list have been marked with a lighter hue.)

   Please:
   1. Create an inverted file for the database where each cell contains the TF (Term Frequency) weight of each term in each of the documents.
   2. Calculate the document frequency (DF) and IDF weight for each index term (simply use N/n without the logarithm).
   3. Give the ranking after the user submits the query "dog barks cat".
   4. After the first iteration, the user examines the results and marks D1 and D4 as relevant, and D2 and D5 as non-relevant. Produce the new ranking using Rocchio's method where α=1.0, β=1.0, γ=1.0.

Answer 4 out of the following 5 questions; each will account for 20 points.

2. Unlike data retrieval, where perfect precision and recall are guaranteed, information retrieval is more of a probabilistic process where the information conveyed in the retrieved documents might or might not answer users' information needs. What are the possible causes behind the uncertainty of IR?

3. Define the following concepts and explain how they are related to one another: "specificity", "precision" and "IDF (Inverse Document Frequency)"; "exhaustivity", "recall" and "TF (Term Frequency)". There is often a trade-off between precision and recall; is there also a trade-off between specificity and exhaustivity?

4. Explain the three basic models in information retrieval: Boolean, Vector Space, and Probabilistic.

5. Explain the rationales behind eliciting users' relevance feedback and how it can improve search results. What are the two mechanisms with which relevant terms can be identified and extracted (hint: IQE and AQE)?

6. How does retrieval on the Web differ from retrieval with traditional bibliographic databases (e.g. the nature of Web documents and the Web environment, the "structuredness" of indexing, and the use of link data, etc.)? Give the formula of Google's PageRank and explain its rationale.

--
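For anyone who wants to check their arithmetic on Question 1, here is a minimal Python sketch of steps 1 to 4. It is not the instructor's answer key, and several choices in it are assumptions that the exam text does not settle: the lighter-hue stop word marking is not visible in this plain-text copy, so the STOP_WORDS set is a guess; no stemming is applied, so "barks" and "bark" stay separate terms; the query is weighted by TF x IDF like the documents; similarity is a plain inner product; and negative Rocchio weights are clipped to zero, which is a common convention rather than something the exam asks for.

```python
from collections import Counter

DOCS = {
    "D1": "a dog barks at a cat and a dog in a tree",
    "D2": "a dog watches ants eat the bark of a tree",
    "D3": "a dog watches another dog by a tree",
    "D4": "a dog barks at a cat on a tree",
    "D5": "the bark fell from the tree as a cat watches",
}

# ASSUMPTION: the exam's stop word marking is lost in this copy; this list is a guess.
STOP_WORDS = {"a", "at", "and", "in", "the", "of", "by", "on", "from", "as", "another"}


def tokenize(text):
    """Lowercase, split on whitespace, drop stop words (no stemming)."""
    return [w for w in text.lower().split() if w not in STOP_WORDS]


def build_inverted_file(docs):
    """Step 1: map each term to {doc_id: raw term frequency}."""
    inverted = {}
    for doc_id, text in docs.items():
        for term, tf in Counter(tokenize(text)).items():
            inverted.setdefault(term, {})[doc_id] = tf
    return inverted


def idf_weights(inverted, n_docs):
    """Step 2: IDF = N / n, where n is the document frequency (no logarithm)."""
    return {term: n_docs / len(postings) for term, postings in inverted.items()}


def doc_vector(doc_id, inverted, idf):
    """TF x IDF weights for one document."""
    return {t: p[doc_id] * idf[t] for t, p in inverted.items() if doc_id in p}


def query_vector(query, idf):
    """ASSUMPTION: the query is weighted by TF x IDF, like the documents."""
    return {t: tf * idf[t] for t, tf in Counter(tokenize(query)).items() if t in idf}


def score(q, d):
    """ASSUMPTION: similarity is the inner product of the two weight vectors."""
    return sum(w * d.get(t, 0.0) for t, w in q.items())


def rank(q, doc_vectors):
    """Step 3: doc ids by descending score; ties broken by doc id."""
    return sorted(doc_vectors, key=lambda d: (-score(q, doc_vectors[d]), d))


def rocchio(q, doc_vectors, relevant, non_relevant, alpha=1.0, beta=1.0, gamma=1.0):
    """Step 4: q' = alpha*q + beta*centroid(relevant) - gamma*centroid(non_relevant)."""
    terms = set(q)
    for d in relevant + non_relevant:
        terms |= set(doc_vectors[d])
    new_q = {}
    for t in terms:
        pos = sum(doc_vectors[d].get(t, 0.0) for d in relevant) / len(relevant)
        neg = sum(doc_vectors[d].get(t, 0.0) for d in non_relevant) / len(non_relevant)
        w = alpha * q.get(t, 0.0) + beta * pos - gamma * neg
        new_q[t] = max(w, 0.0)  # ASSUMPTION: clip negative weights to zero
    return new_q


inverted = build_inverted_file(DOCS)
idf = idf_weights(inverted, len(DOCS))
vectors = {d: doc_vector(d, inverted, idf) for d in DOCS}

query = query_vector("dog barks cat", idf)
first_ranking = rank(query, vectors)

new_query = rocchio(query, vectors, relevant=["D1", "D4"], non_relevant=["D2", "D5"])
second_ranking = rank(new_query, vectors)

print("first ranking: ", first_ranking)
print("second ranking:", second_ranking)
```

Under these particular assumptions both the initial ranking and the ranking after feedback come out as D1, D4, D3, D5, D2. A different stop list, stemming "barks" to "bark", or cosine normalization could change the weights and possibly the order, so treat this as one possible reading of the question.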



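※ Origin: PTT (批踢踢实业坊, ptt.cc)
◆ From: 140.112.4.195
1F: push yoyo8089: >< 01/10 16:08
2F: → yoyo8089: Archived for the Department of Library and Information Science 01/10 16:09
3F: push abacada: 囧 (a pat on the back for the junior board moderator in 1F?) 01/11 08:07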






