5 credits
30.0 h + 15.0 h
Q1
Teacher(s)
Dupont Pierre; Fairon Cédrick;
Language
English
Main themes
- Basics in phonology, morphology, syntax and semantics
- Linguistic resources
- Part-of-speech tagging
- Statistical language modeling (N-grams and Hidden Markov Models)
- Robust parsing techniques, probabilistic context-free grammars
- Linguistics engineering applications such as spell or syntax checking software, POS tagging, document indexing and retrieval, text categorization
Aims
At the end of this learning unit, the student is able to : | |
1 | Given the learning outcomes of the "Master in Computer Science and Engineering" program, this course contributes to the development, acquisition and evaluation of the following learning outcomes:
Given the learning outcomes of the "Master [120] in Computer Science" program, this course contributes to the development, acquisition and evaluation of the following learning outcomes:
Students completing successfully this course should be able to
Students will have developed skills and operational methodology. In particular, they have developed their ability to
|
The contribution of this Teaching Unit to the development and command of the skills and learning outcomes of the programme(s) can be accessed at the end of this sheet, in the section entitled “Programmes/courses offering this Teaching Unit”.
Content
- Linguistic essentials: morphology, part-of-speech, phrase structure, semantics and pragmatics
- Mathematical foundations: formal languages, and elements of information theory
- Corpus analysis: formating, tokenization, morphology, data tagging
- N-grams: maximum likelihood estimation and smoothing
- Hidden Markov Models: definitions, Baum-Welch and Viterbi algorithms
- Part-of-Speech Tagging
- Probabilistic Context-Free Grammars: parameter estimation and parsing algorithms, tree banks
- Machine Translation: classical and statistical methods (IBM models, Phrase-based models), evaluation
-
Applications: SMS predictors, POS taggers, information extraction
http://moodleucl.uclouvain.be/course/view.php?id=7865
Teaching methods
- 12 lectures
- 3 miniprojects
- feedback sessions about the miniprojects
Evaluation methods
25% for practical works + 75% final exam (closed book)
No possibility to present again practical works in the second session
No possibility to present again practical works in the second session
Online resources
Bibliography
Slides obligatoires disponibles sur le site :
http://moodleucl.uclouvain.be/course/view.php?id=7865 1 textbook conseillé :
http://moodleucl.uclouvain.be/course/view.php?id=7865 1 textbook conseillé :
Faculty or entity
INFO
Programmes / formations proposant cette unité d'enseignement (UE)
Title of the programme
Sigle
Credits
Prerequisites
Aims
Master [120] in Data Science Engineering
Master [120] in Computer Science and Engineering
Master [120] in Linguistics
Master [120] in Computer Science
Master [120] in data Science: Statistic
Master [120] in data Science: Information technology