久久久久久精品无码人妻_青春草无码精品视频在线观_无码精品国产VA在线观看_国产色无码专区在线观看

COM6511代寫、Python語言編程代做

時間:2024-05-09  來源:  作者: 我要糾錯



COM4511/COM6511 Speech Technology - Practical Exercise -
Keyword Search
Anton Ragni
Note that for any module assignment full marks will only be obtained for outstanding performance that
goes well beyond the questions asked. The marks allocated for each assignment are 20%. The marks will be
assigned according to the following general criteria. For every assignment handed in:
1. Fulfilling the basic requirements (5%)
Full marks will be given to fulfilling the work as described, in source code and results given.
2. Submitting high quality documentation (5%)
Full marks will be given to a write-up that is at the highest standard of technical writing and illustration.
3. Showing good reasoning (5%) Full marks will be given if the experiments and the outcomes are explained to the best standard.
4. Going beyond what was asked (5%)
Full marks will be given for interesting ideas on how to extend work that are well motivated and
described.
1 Background
The aim of this task is to build and investigate the simplest form of a keyword search (KWS) system allowing to find information
in large volumes of spoken data. Figure below shows an example of a typical KWS system which consists of an index and
a search module. The index provides a compact representation of spoken data. Given a set of keywords, the search module
Search Results
Index
Key− words
queries the index to retrieve all possible occurrences ranked according to likelihood. The quality of a KWS is assessed based
on how accurately it can retrieve all true occurrences of keywords.
A number of index representations have been proposed and examined for KWS. Most popular representations are derived
from the output of an automatic speech recognition (ASR) system. Various forms of output have been examined. These differ
in terms of the amount of information retained regarding the content of spoken data. The simplest form is the most likely word
sequence or 1-best. Additional information such as start and end times, and recognition confidence may also be provided for
each word. Given a collection of 1-best sequences, the following index can be constructed
w1 (f1,1, s1,1, e1,1) . . . (f1,n1 , s1,n1 , e1,n1 )
w2 (f1,1, s1,1, e1,1) . . . (f1,n1 , s1,n1 , e1,n1 )
.
.
.
wN (fN,1, sN,1, eN,1) . . . (fN,nN , sN,nN , eN,nN )
(1)
1
where wi is a word, ni is the number of times word wi occurs, fi,j is a file where word wi occurs for the j-th time, si,j and ei,j
is the start and end time. Searching such index for single word keywords can be as simple as finding the correct row (e.g. k)
and returning all possible tuples (fk,1, sk,1, ek,1), . . ., (fk,nk , sk,nk , ek,nk ).
The search module is expected to retrieve all possible keyword occurrences. If ASR makes no mistakes such module
can be created rather trivially. To account for possible retrieval errors, the search module provides each potential occurrence
with a relevance score. Relevance scores reflect confidence in a given occurrence being relevant. Occurrences with extremely
low relevance scores may be eliminated. If these scores are accurate each eliminated occurrence will decrease the number of
false alarms. If not then the number of misses will increase. What exactly an extremely low score is may not be very easy
to determine. Multiple factors may affect a relevance score: confidence score, duration, word confusability, word context,
keyword length. Therefore, simple relevance scores, such as those based on confidence scores, may have a wide dynamic range
and may be incomparable across different keywords. In order to ensure that relevance scores are comparable among different
keywords they need to be calibrated. A simple calibration scheme is called sum-to-one (STO) normalisation
rˆi,j = r
γ
 
i,j
ni
k=1 r
γ
i,k
(2)
where ri,j is an original relevance score for the j-th occurrence of the i-th keyword, γ is a scale enabling to either sharpen or
flatten the distribution of relevance scores. More complex schemes have also been examined. Given a set of occurrences with
associated relevance scores, there are several options available for eliminating spurious occurrences. One popular approach
is thresholding. Given a global or keyword specific threshold any occurrence falling under is eliminated. Simple calibration
schemes such as STO require thresholds to be estimated on a development set and adjusted to different collection sizes. More
complex approaches such as Keyword Specific Thresholding (KST) yield a fixed threshold across different keywords and
collection sizes.
Accuracy of KWS systems can be assessed in multiple ways. Standard approaches include precision (proportion of relevant retrieved occurrences among all retrieved occurrences) and recall (proportion of relevant retrieved occurrences among all
relevant occurrences), mean average precision and term weighted value. A collection of precision and recall values computed
for different thresholds yields a precision-recall (PR) curve. The area under PR curve (AUC) provides a threshold independent summative statistics for comparing different retrieval approaches. The mean average precision (mAP) is another popular,
threshold-independent, precision based metric. Consider a KWS system returning 3 correct and 4 incorrect occurrences arranged according to relevance score as follows: ✓ , ✗ , ✗ , ✓ , ✓ , ✗ , ✗ , where ✓ stands for correct occurrence and ✗ stands
for incorrect occurrence. The average precision at each rank (from 1 to 7) is 1
1 , 0
2 , 0
3 , 2
4 , 3
5 , 0
6 , 0
7 . If the number of true correct
occurrences is 3, the mean average precision for this keyword 0.7. A collection-level mAP can be computed by averaging
keyword specific mAPs. Once a KWS system operates at a reasonable AUC or mAP level it is possible to use term weighted
value (TWV) to assess accuracy of thresholding. The TWV is defined by
TWV(K, θ) = 1 −
 
1
|K|
 
k∈K
Pmiss(k, θ) + βPfa(k, θ)
 
(3)
where k ∈ K is a keyword, Pmiss and Pfa are probabilities of miss and false alarm, β is a penalty assigned to false alarms.
These probabilities can be computed by
Pmiss(k, θ) = Nmiss(k, θ)
Ncorrect(k) (4)
Pfa(k, θ) = Nfa(k, θ)
Ntrial(k) (5)
where N<event> is a number of events. The number of trials is given by
Ntrial(k) = T − Ncorrect(k) (6)
where T is the duration of speech in seconds.
2 Objective
Given a collection of 1-bests, write a code that retrieves all possible occurrences of keyword list provided. Describe the search
process including index format, handling of multi-word keywords, criterion for matching, relevance score calibration and
threshold setting methodology. Write a code to assess retrieval performance using reference transcriptions according to AUC,
mAP and TWV criteria using β = 20. Comment on the difference between these criteria including the impact of parameter β.
Start and end times of hypothesised occurrences must be within 0.5 seconds of true occurrences to be considered for matching.
2
3 Marking scheme
Two critical elements are assessed: retrieval (65%) and assessment (35%). Note: Even if you cannot complete this task as a
whole you can certainly provide a description of what you were planning to accomplish.
1. Retrieval
1.1 Index Write a code that can take provided CTM files (and any other file you deem relevant) and create indices in
your own format. For example, if Python language is used then the execution of your code may look like
python index.py dev.ctm dev.index
where dev.ctm is an CTM file and dev.index is an index.
Marks are distributed based on handling of multi-word keywords
• Efficient handling of single-word keywords
• No ability to handle multi-word keywords
• Inefficient ability to handle multi-word keywords
• Or efficient ability to handle multi-word keywords
1.2 Search Write a code that can take the provided keyword file and index file (and any other file you deem relevant)
and produce a list of occurrences for each provided keyword. For example, if Python language is used then the
execution of your code may look like
python search.py dev.index keywords dev.occ
where dev.index is an index, keywords is a list of keywords, dev.occ is a list of occurrences for each
keyword.
Marks are distributed based on handling of multi-word keywords
• Efficient handling of single-word keywords
• No ability to handle multi-word keywords
• Inefficient ability to handle multi-word keywords
• Or efficient ability to handle multi-word keywords
1.3 Description Provide a technical description of the following elements
• Index file format
• Handling multi-word keywords
• Criterion for matching keywords to possible occurrences
• Search process
• Score calibration
• Threshold setting
2. Assessment Write a code that can take the provided keyword file, the list of found keyword occurrences and the corresponding reference transcript file in STM format and compute the metrics described in the Background section. For
instance, if Python language is used then the execution of your code may look like
python <metric>.py keywords dev.occ dev.stm
where <metric> is one of precision-recall, mAP and TWV, keywords is the provided keyword file, dev.occ is the
list of found keyword occurrences and dev.stm is the reference transcript file.
Hint: In order to simplify assessment consider converting reference transcript from STM file format to CTM file format.
Using indexing and search code above obtain a list of true occurrences. The list of found keyword occurrences then can
be assessed more easily by comparing it with the list of true occurrences rather than the reference transcript file in STM
file format.
2.1 Implementation
• AUC Integrate an existing implementation of AUC computation into your code. For example, for Python
language such implementation is available in sklearn package.
• mAP Write your own implementation or integrate any freely available.
3
• TWV Write your own implementation or integrate any freely available.
2.2 Description
• AUC Plot precision-recall curve. Report AUC value . Discuss performance in the high precision and low
recall area. Discuss performance in the high recall and low precision area. Suggest which keyword search
applications might be interested in a good performance specifically in those two areas (either high precision
and low recall, or high recall and low precision).
• mAP Report mAP value. Report mAP value for each keyword length (1-word, 2-words, etc.). Compare and
discuss differences in mAP values.
• TWV Report TWV value. Report TWV value for each keyword length (1-word, 2-word, etc.). Compare and
discuss differences in TWV values. Plot TWV values for a range of threshold values. Report maximum TWV
value or MTWV. Report actual TWV value or ATWV obtained with a method used for threshold selection.
• Comparison Describe the use of AUC, mAP and TWV in the development of your KWS approach. Compare
these metrics and discuss their advantages and disadvantages.
4 Hand-in procedure
All outcomes, however complete, are to be submitted jointly in a form of a package file (zip/tar/gzip) that includes
directories for each task which contain the associated required files. Submission will be performed via MOLE.
5 Resources
Three resources are provided for this task:
• 1-best transcripts in NIST CTM file format (dev.ctm,eval.ctm). The CTM file format consists of multiple records
of the following form
<F> <H> <T> <D> <W> <C>
where <F> is an audio file name, <H> is a channel, <T> is a start time in seconds, <D> is a duration in seconds, <W> is a
word, <C> is a confidence score. Each record corresponds to one recognised word. Any blank lines or lines starting with
;; are ignored. An excerpt from a CTM file is shown below
7654 A 11.34 0.2 YES 0.5
7654 A 12.00 0.34 YOU 0.7
7654 A 13.30 0.5 CAN 0.1
• Reference transcript in NIST STM file format (dev.stm, eval.stm). The STM file format consists of multiple records
of the following form
<F> <H> <S> <T> <E> <L> <W>...<W>
where <S> is a speaker, <E> is an end time, <L> topic, <W>...<W> is a word sequence. Each record corresponds to
one manually transcribed segment of audio file. An excerpt from a STM file is shown below
2345 A 2345-a 0.10 2.03 <soap> uh huh yes i thought
2345 A 2345-b 2.10 3.04 <soap> dog walking is a very
2345 A 2345-a 3.50 4.59 <soap> yes but it’s worth it
Note that exact start and end times for each word are not available. Use uniform segmentation as an approximation. The
duration of speech in dev.stm and eval.stm is estimated to be 57474.2 and 25694.3 seconds.
• Keyword list keywords. Each keyword contains one or more words as shown below
請加QQ:99515681  郵箱:99515681@qq.com   WX:codinghelp




















 

標簽:

掃一掃在手機打開當前頁
  • 上一篇:EBU6304代寫、Java編程設(shè)計代做
  • 下一篇:COM4511代做、代寫Python設(shè)計編程
  • 無相關(guān)信息
    昆明生活資訊

    昆明圖文信息
    蝴蝶泉(4A)-大理旅游
    蝴蝶泉(4A)-大理旅游
    油炸竹蟲
    油炸竹蟲
    酸筍煮魚(雞)
    酸筍煮魚(雞)
    竹筒飯
    竹筒飯
    香茅草烤魚
    香茅草烤魚
    檸檬烤魚
    檸檬烤魚
    昆明西山國家級風景名勝區(qū)
    昆明西山國家級風景名勝區(qū)
    昆明旅游索道攻略
    昆明旅游索道攻略
  • 短信驗證碼平臺 理財 WPS下載

    關(guān)于我們 | 打賞支持 | 廣告服務(wù) | 聯(lián)系我們 | 網(wǎng)站地圖 | 免責聲明 | 幫助中心 | 友情鏈接 |

    Copyright © 2025 kmw.cc Inc. All Rights Reserved. 昆明網(wǎng) 版權(quán)所有
    ICP備06013414號-3 公安備 42010502001045

    久久久久久精品无码人妻_青春草无码精品视频在线观_无码精品国产VA在线观看_国产色无码专区在线观看

    午夜免费视频网站| 国产视频在线观看网站| 国产美女视频免费看| 艹b视频在线观看| 涩多多在线观看| 99爱视频在线| a级网站在线观看| 欧美三级理论片| 国内自拍在线观看| 成人毛片100部免费看| 成年网站免费在线观看| 免费黄色福利视频| 亚洲激情免费视频| 伊人五月天婷婷| 日韩精品你懂的| 久草青青在线观看| 亚洲精品在线网址| 深夜黄色小视频| 午夜欧美福利视频| 日本在线观看a| 国产欧美日韩小视频| 182午夜视频| 亚洲a级黄色片| 簧片在线免费看| 黄色影院一级片| 五十路熟女丰满大屁股| 9色porny| 免费拍拍拍网站| 国产爆乳无码一区二区麻豆| 色姑娘综合天天| 91aaa精品| 久久久久久蜜桃一区二区| 日韩欧美xxxx| 欧美 日韩 激情| 久久综合九色综合88i| 在线观看17c| 欧美xxxxxbbbbb| av在线网站免费观看| 亚洲精品第三页| 亚洲一级片免费观看| 五月六月丁香婷婷| 黄色免费高清视频| 91国在线高清视频| 18禁裸男晨勃露j毛免费观看| 97国产精东麻豆人妻电影| 国产精品专区在线| 国产婷婷一区二区三区| 欧美精品自拍视频| 91国视频在线| xxxx一级片| 久久精品午夜福利| 黄色成人免费看| 三级性生活视频| 97超碰人人爱| 久久精品视频16| 免费男同深夜夜行网站| 欧美一级裸体视频| 国产5g成人5g天天爽| 日韩精品一区二区三区电影| 香港三级日本三级a视频| 日韩精品免费一区| 国产成人在线小视频| 黄色国产一级视频| 丝袜制服一区二区三区| 激情久久综合网| 欧美这里只有精品| 欧美 激情 在线| 国产美女视频免费看| 国产av熟女一区二区三区| 又粗又黑又大的吊av| 国产成年人视频网站| 99视频精品全部免费看| 国内外成人激情视频| 天堂视频免费看| 国产女主播自拍| 欧美午夜aaaaaa免费视频| 热久久最新地址| 国产成人无码一二三区视频| 色啦啦av综合| 女人帮男人橹视频播放| 成人免费毛片播放| 免费看污污视频| 国产激情在线观看视频| 免费在线观看污污视频| 成人在线免费观看av| 中文字幕色网站| 国产欧美在线一区| 91看片淫黄大片91| 中文字幕视频在线免费观看| 日本福利视频在线观看| 午夜视频在线瓜伦| www婷婷av久久久影片| 国产wwwxx| 激情五月宗合网| 波多野结衣免费观看| 人人妻人人澡人人爽欧美一区双| 亚洲自偷自拍熟女另类| 亚洲第一精品区| 国产视频一区二区三区在线播放| 欧美性受xxxx黑人猛交88| 乱子伦视频在线看| 国产aaa免费视频| 日韩欧美中文视频| 激情视频综合网| 国产中文字幕二区| www亚洲国产| 欧美日韩黄色一级片| 97精品国产97久久久久久粉红| 精品中文字幕av| 亚洲区成人777777精品| 手机在线成人免费视频| avav在线播放| www.51色.com| 男女男精品视频站| 国产乱人伦精品一区二区三区| 人人妻人人做人人爽| 三级av免费看| jizz欧美性11| 亚洲综合日韩欧美| 欧美婷婷精品激情| 午夜dv内射一区二区| 日韩伦理在线免费观看| 中文字幕永久视频| 成人免费视频久久| 美女网站视频黄色| 日本中文字幕精品—区二区| 亚洲欧美一二三| 青青草综合视频| 欧美精品久久久久久久久久久| 今天免费高清在线观看国语| www.亚洲一区二区| 警花观音坐莲激情销魂小说| 深夜做爰性大片蜜桃| 五月天色婷婷综合| 国产精品无码一区二区在线| 日本少妇高潮喷水视频| 色欲av无码一区二区人妻| 激情五月开心婷婷| 中文字幕天天干| 熟妇熟女乱妇乱女网站| av动漫在线免费观看| 日韩 欧美 视频| 免费av观看网址| 成人亚洲视频在线观看| 欧美 亚洲 视频| 免费一级特黄毛片| 东京热加勒比无码少妇| 杨幂毛片午夜性生毛片| 日本中文字幕影院| 日韩亚洲欧美一区二区| 日本免费不卡一区二区| 亚洲36d大奶网| 欧美 国产 精品| 美女福利视频在线| 天美一区二区三区| 野外做受又硬又粗又大视频√| 欧美日本视频在线观看| 免费看涩涩视频| 久久久999视频| 亚欧美在线观看| 中文字幕日韩精品无码内射| 国产免费黄视频| 中文字幕第22页| 亚洲熟妇无码一区二区三区| 亚洲一区二区三区四区五区xx| а 天堂 在线| 欧美 日韩 亚洲 一区| 99热一区二区| 久久国产精品网| 日韩av卡一卡二| 97干在线视频| 五月激情婷婷在线| 欧美一级片免费播放| 中国女人做爰视频| av动漫在线观看| 日韩视频在线免费播放| 91精品91久久久中77777老牛| 日本a√在线观看| 91国在线高清视频| 别急慢慢来1978如如2| 国产精品久久成人免费观看| 狠狠97人人婷婷五月| 在线观看av免费观看| 国产极品粉嫩福利姬萌白酱| 九九九九九国产| 日本黄大片一区二区三区| youjizz.com在线观看| 午夜激情av在线| 六月丁香激情网| 神马午夜伦理影院| 在线观看国产一级片| 黄色网页免费在线观看| www.-级毛片线天内射视视| av无码精品一区二区三区| 大陆av在线播放| 在线观看中文av| www.色就是色| 国产成人a亚洲精v品无码| 国产精品wwwww| 人妻激情另类乱人伦人妻|