Based on my project, I am getting started to review the ASR course from CMU.  This will help me understand what happens under the hood. To be honest, the slides from CMU is among the best that I have ever seen for ASR courses. This post will do a short review of the Lec 1… Continue reading

MSc Final Dissertation Note (Kaldi Installation)

I am doing my final project that doing experiments about the function of Dropout in the LSTM for TIMIT corpus. Kaldi and CNTK are the tools chosen for this one. Kaldi is a really powerful tool for decoding waveform files into features. The prerequisites for this are to install Kaldi and its relative dependencies the… Continue reading MSc Final Dissertation Note (Kaldi Installation)

TIMIT Introduction (the Official Doc)

The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus (TIMIT) Training and Test Data NIST Speech Disc CD1-1.1 The TIMIT corpus of read speech has been designed to provide speech data for the acquisition of acoustic-phonetic knowledge and for the development and evaluation of automatic speech recognition systems. TIMIT has resulted from the joint efforts of several… Continue reading TIMIT Introduction (the Official Doc)

Leetcode 357. Count Numbers with Unique Digits

This post I will make a short review about Python data structure by solving the problem of Leetcode 357. The problem is as below: Given a non-negative integer n, count all numbers with unique digits, x, where 0 ≤ x < 10n. Example: Given n = 2, return 91. (The answer should be the total… Continue reading Leetcode 357. Count Numbers with Unique Digits

Extreme Computing (Hadoop Map Reduce) 4

This one will continue the discussion for the second assignment of Extreme Computing in the University of Edinburgh Task 4 For this task you will use a dataset from StackOverflow and extract specific pieces of information. Initially, you should understand the format of the dataset, next you will need to do parse each post, and… Continue reading Extreme Computing (Hadoop Map Reduce) 4

Extreme Computing (Hadoop Map Reduce) 3

This blog will be the second assignment of Extreme Computing from the University of Edinburgh. Extreme Computing Second assignment Based on an Assignment by Michail Basios and Stratis Viglas In this assignment, you will address real-world cases where MapReduce can be used. Initially, you will deal with a problem related to information retrieval: you will… Continue reading Extreme Computing (Hadoop Map Reduce) 3

Extreme Computing (Hadoop Map Reduce) 2

In this part, we will still discuss the remaining problems solved in the coursework of Extreme Computing in the University of Edinburgh.  If you would like to see the whole introduction, please go to Part 1. Task 5 Create a version of the two-word counting program that uses a combiner. Is it faster? Mapper (Python)… Continue reading Extreme Computing (Hadoop Map Reduce) 2

Extreme Computing (Hadoop Map Reduce) 1

Now our network generates more and more data as time goes by. Computer scientists are trying to understand the pattern under all of the raw data. Hadoop could be used to clean data and send then results to the pipeline, or do some SQL style manipulations or many more advanced tasks on different types of data. Today… Continue reading Extreme Computing (Hadoop Map Reduce) 1