NTU-Exam 板


LINE

課程名稱︰資訊檢索與擷取 課程性質︰資工系選修 課程教師︰陳信希 開課學院:電機資訊學院 開課系所︰資訊工程學系 考試日期(年月日)︰2021/11/11 考試時限(分鐘):180 試題 : 1. Term frequency and inverse document frequency are commonly used to measure the importance of a term in a document and a query. We aim to select terms with discriminative power within a document and between documents to repre- sent a document. How term frequency and inverse document frequency achieve the goal? (10 points) 2. A long document is usually composed of passages describing several topics. On the one hand, it is relatively easier to retrieve long documents than short documents with keyword-based approach. On the other hand, the repre- sentation of long documents tends to be vague when average word (term) em- bedding approach is used for aggregation. Do you have any ideas to deal with these issues in keyword-based approach and term embedding-based approach? (10 points) 3. In language modeling, each individual document can be considered as a docu- ment model for retrieval. Besides, a document collection can be also used to learn a collection model for smoothing in retrieval. Please describe the idea of integrating document model and collection model for IR. (10 points) 4. To model term-term relationship is important in information retrieval. Va- rious methods from conventional counting-based approach to current predic- tion-based approach have been proposed. Please show one method from each ap- proach to compute inter-term relationship. (10 points) 5. (a) What are the typical similarities and topical similarities? (5 points) (b) Term representations learned from models based on different size of con- texts (e.g., document, short window size, or short context) may capture different similarities (typical similarities or topical similarities). Please explain this statement. (5 points) (c) Exact matching and embedding space based matching have different effects on retrieval. Please discuss this point. (5 points) 6. An IR model is a quadruple $[D, Q, F, R(q_i, d_j)]$ where $D$ is a set of logical views for the documents in the collection, $Q$ is a set of logical views for the user queries, $F$ is a framework for modeling documents and queries, and $R(q_i, d_j)$ is a ranking function. Please specify the framework $F$ and the ranking function $R$ for each of the following models. (15 points) (a) BM25 Model (b) Translation Model (c) Term Embedding Model 7. Query expansion aims to introduce new query terms to the original query. Please specify how query expansion is introduced to each of the following models. (15 points) (a) Vector Space Model (b) Language Model (c) Term Embedding Model 8. In SIGIR 2016, two tutorial speakers classify "Question Answering from Docu- ments" into an "easy" problem in IR. In contrast, they regard "Question Ans- wering from Knowledge Base" as a "hard" problem in IR. Do you agree such a classification? Please show your thoughts. (10 points) 9. Neural information retrieval systems typically use chaining pipeline. Are there any practical considerations? Please suggest a cascade pipeline to ex- plain your idea. (10 points) 10. We often encounter mis-conception, mis-translation, and mis-formulation pro- blems to transform an information need to a query in ad hoc retrieval. You have learned fundamentals of information retrieval during the first half of semester. Please show the lessons to deal with these problems. (10 points) -- 第01話 似乎在課堂上聽過的樣子 第02話 那真是太令人絕望了 第03話 已經沒什麼好期望了 第04話 被當、21都是存在的 第05話 怎麼可能會all pass 第06話 這考卷絕對有問題啊 第07話 你能面對真正的分數嗎 第08話 我,真是個笨蛋 第09話 這樣成績,教授絕不會讓我過的 第10話 再也不依靠考古題 第11話 最後留下的補考 第12話 我最愛的學分 --



※ 發信站: 批踢踢實業坊(ptt.cc), 來自: 111.249.65.236 (臺灣)
※ 文章網址: https://webptt.com/m.aspx?n=bbs/NTU-Exam/M.1767058471.A.939.html
1F:→ rod24574575 : 收錄資訊系! 12/30 22:55







like.gif 您可能會有興趣的文章
icon.png[問題/行為] 貓晚上進房間會不會有憋尿問題
icon.pngRe: [閒聊] 選了錯誤的女孩成為魔法少女 XDDDDDDDDDD
icon.png[正妹] 瑞典 一張
icon.png[心得] EMS高領長版毛衣.墨小樓MC1002
icon.png[分享] 丹龍隔熱紙GE55+33+22
icon.png[問題] 清洗洗衣機
icon.png[尋物] 窗台下的空間
icon.png[閒聊] 双極の女神1 木魔爵
icon.png[售車] 新竹 1997 march 1297cc 白色 四門
icon.png[討論] 能從照片感受到攝影者心情嗎
icon.png[狂賀] 賀賀賀賀 賀!島村卯月!總選舉NO.1
icon.png[難過] 羨慕白皮膚的女生
icon.png閱讀文章
icon.png[黑特]
icon.png[問題] SBK S1安裝於安全帽位置
icon.png[分享] 舊woo100絕版開箱!!
icon.pngRe: [無言] 關於小包衛生紙
icon.png[開箱] E5-2683V3 RX480Strix 快睿C1 簡單測試
icon.png[心得] 蒼の海賊龍 地獄 執行者16PT
icon.png[售車] 1999年Virage iO 1.8EXi
icon.png[心得] 挑戰33 LV10 獅子座pt solo
icon.png[閒聊] 手把手教你不被桶之新手主購教學
icon.png[分享] Civic Type R 量產版官方照無預警流出
icon.png[售車] Golf 4 2.0 銀色 自排
icon.png[出售] Graco提籃汽座(有底座)2000元誠可議
icon.png[問題] 請問補牙材質掉了還能再補嗎?(台中半年內
icon.png[問題] 44th 單曲 生寫竟然都給重複的啊啊!
icon.png[心得] 華南紅卡/icash 核卡
icon.png[問題] 拔牙矯正這樣正常嗎
icon.png[贈送] 老莫高業 初業 102年版
icon.png[情報] 三大行動支付 本季掀戰火
icon.png[寶寶] 博客來Amos水蠟筆5/1特價五折
icon.pngRe: [心得] 新鮮人一些面試分享
icon.png[心得] 蒼の海賊龍 地獄 麒麟25PT
icon.pngRe: [閒聊] (君の名は。雷慎入) 君名二創漫畫翻譯
icon.pngRe: [閒聊] OGN中場影片:失蹤人口局 (英文字幕)
icon.png[問題] 台灣大哥大4G訊號差
icon.png[出售] [全國]全新千尋侘草LED燈, 水草

請輸入看板名稱,例如:Soft_Job站內搜尋

TOP