This course is set for graduate and senior undergraduate students, who have a basic background in statistics.
The course teaches the fundamental theory and algorithms behind machine learning (ML), a key issue in the artificial intelligence. It aims to equip students with a deeper understanding and adept skills in the field of ML, to solve various pragmatic questions in broader areas. ML has a wide range of applications, including web search, social network, speech recognition, image processing, robotics and finance. We choose the application of statistical machine translation to demonstrate how the ML study is motivated and applied. Machine translation is one of the hottest area nowadays, with governments and large companies such as Microsoft, Baidu and Google investing huge resources to the task.
The syllabus contains both theory and practice: First, we describe the training criteria (Bayesian decision rule, maximum likelihood), statistical models (CRF, SVM, HMM and neural network) and training algorithms (EM and Gibbs Sampling); Then, advanced machine translation methods are introduced based on the statistical approaches (word alignment, phrase training, decoding and optimization).
The course is mainly conducted through lectures and discussions. The students are encouraged to join research projects to realize and to develop novel ideas.