NTU-Exam board



Course name: Introduction to Digital Speech Processing (数位语音处理概论)
Course type: Elective, Department of Electrical Engineering / Department of Computer Science and Information Engineering
Instructor: 李琳山 (Lin-shan Lee)
College: College of Electrical Engineering and Computer Science
Department: Department of Electrical Engineering
Exam date (yy.mm.dd): 108.11.09
Time limit (minutes): 120

Exam questions:

Note: some of the mathematical expressions below are written in LaTeX syntax.

1. (8 pts) What is a GMM? How do we use it with HMMs for continuous speech recognition?

2. (12 pts) Given an HMM with parameters \lambda = (A, B, \pi), an observation sequence \bar{O} = o_1,...,o_t,...,o_T and a state sequence \bar{q} = q_1,...,q_t,...,q_T, define
   \alpha_t(i) = Prob[o_1,...,o_t, q_t = i | \lambda]
   \beta_t(i) = Prob[o_{t+1},...,o_T | q_t = i, \lambda]
   We usually assume Prob[\bar{O}, q_t = i | \lambda] = \alpha_t(i)\beta_t(i).
   (3 pts) Show that Prob(\bar{O} | \lambda) = \sum_{i=1}^N[\alpha_t(i)\beta_t(i)].
   (3 pts) Show that Prob(q_t = i | \bar{O}, \lambda) = \frac{\alpha_t(i)\beta_t(i)}{\sum_{i=1}^N[\alpha_t(i)\beta_t(i)]}.
   (6 pts) Formulate and describe the procedure of the Viterbi algorithm for finding the best state sequence \bar{q}^* = q_1^*,...,q_t^*,...,q_T^*.

3. (10 pts) Please explain how the LBG algorithm and the K-means algorithm work, respectively. Does the K-means algorithm always yield the same result regardless of the initialization?

4. (10 pts) While training triphone acoustic models, data and parameter sharing is a common approach to ensure that there is enough data to train each acoustic model. Such sharing usually occurs at the state level. Please explain what this means.

5. (15 pts) You are on an adventure in the Mabao forest. There are only four kinds of animals in the forest: otters, foxes, squirrels and duckbills. You know that the population percentages of the four kinds of animals are 30%, 20%, 40% and 10%, respectively.
   One morning, you see a brown-colored creature with white stripes on its back and a black tail run away swiftly, but it happens so suddenly that you cannot clearly recognize which species it is.
   Luckily, you have the probabilities of the three observed characteristics for each of the four species from previous research, listed in Table 1, where o_1, o_2, o_3 refer to "brown-colored", "white-striped" and "black-tailed".
   Moreover, you know that for each of the four species, the three characteristics occur independently, that is, \forall i \neq j, o_i \perp o_j | c_k.
   In order to make your guessing more efficient, so that you can spend most of your time enjoying the wilderness, you decide to build a decision tree for animal classification based on three questions: "Whether it is brown-colored", "Whether it has white stripes" and "Whether it has a black tail". The decision tree is like the one in Figure 1. Please build this decision tree by putting the three questions into the three nodes. What is the entropy reduction resulting from the uppermost node of the tree? You are allowed to leave the logarithmic terms in your answer instead of giving a numerical solution.

                    | p(o_1 | c_i) | p(o_2 | c_i) | p(o_3 | c_i) |
     otter(c_1)     |     0.8      |     0.3      |     0.8      |
     fox(c_2)       |     0.1      |     0.3      |     0.4      |
     squirrel(c_3)  |     0.2      |     0.7      |     0.4      |
     duckbill(c_4)  |     0.8      |     0.3      |     0.2      |
   Table 1: The Posterior Probability of the Three Characteristics

                      question A
                    /(T)       \(F)
           question B           question C
           /(T)   \(F)          /(T)   \(F)
       class a   class b    class c   class d
   Figure 1: Sample Decision Tree

   (Hint: You do not need to actually compute the entropy of the whole tree. Instead, you should be able to come up with a "best" tree structure by simply looking at the posterior probabilities of the three characteristics. Trust your intuition!)

6. (10 pts) Explain: What is entropy? What is the perplexity of a language model with respect to a test corpus?

7. (10 pts) Explain the OOV problem and how this problem can be solved for high-frequency OOV words in the Chinese language.

8. (10 pts) Explain the following two things:
   (5 pts) What are excitation and formant structure? Which one is more important in speech recognition? Why?
   (5 pts) What is voiced speech? What is pitch? How is it related to the tones in Mandarin Chinese?

9. (6 pts) Describe the precise way of measuring the recognition errors between the following two strings in digit string recognition:
   (3 pts) a as the reference and b as the machine output
   (3 pts) b as the reference and a as the machine output
   (a) 52030325
   (b) 5940345

10. (9 pts) Explain what beam search is. What are the advantages of using it in a large-vocabulary continuous speech recognition system? What are the trade-offs in choosing the beam width for it?

--
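For readers working through question 9, the sketch below shows one common way to count recognition errors between a reference string and a machine output: align the two with dynamic programming and take the minimum number of substitutions, deletions and insertions. This is an illustrative assumption about what the question is after, not the official solution. The names `edit_distance` and `error_rate` are made up for this example.

```python
def edit_distance(ref, hyp):
    """Minimum number of substitutions, deletions and insertions
    needed to turn the reference string into the hypothesis string."""
    # prev[j] = distance between the first i-1 symbols of ref and the first j of hyp
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # aligning i reference symbols with an empty output costs i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion: r is missing from the output
                            curr[j - 1] + 1,      # insertion: h is extra in the output
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def error_rate(ref, hyp):
    """Total errors divided by the reference length (assumed definition)."""
    return edit_distance(ref, hyp) / len(ref)


a = "52030325"
b = "5940345"
print(error_rate(a, b))  # a as the reference, b as the machine output
print(error_rate(b, a))  # b as the reference, a as the machine output
```

The number of errors is the same in both directions (deletions and insertions swap roles), but the error rate is not, because it is divided by the length of whichever string serves as the reference: 8 digits for a, 7 digits for b.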



※ Origin: 批踢踢实业坊 (ptt.cc), from: 114.24.173.199 (Taiwan)
※ Article URL: https://webptt.com/cn.aspx?n=bbs/NTU-Exam/M.1624746149.A.C30.html
1F: push rod24574575: Archived under the CSIE department! 06/27 11:02






