5.00 credits
30.0 h + 30.0 h
Q1
Teacher(s)
. SOMEBODY; Hainaut Donatien; Jacques Laurent;
Language
French
Prerequisites
To follow this course the student must have a basic knowledge of probabilities such as taught in courses LEPL1108 or LBIR1212.
The prerequisite(s) for this Teaching Unit (Unité d’enseignement – UE) for the programmes/courses that offer this Teaching Unit are specified at the end of this sheet.
The prerequisite(s) for this Teaching Unit (Unité d’enseignement – UE) for the programmes/courses that offer this Teaching Unit are specified at the end of this sheet.
Main themes
This course presents the fundamental statistical concepts in an engineering context (exploratory analysis, inference, simulation) as well as basis method for analysing multivariate databases (like the linear regression, the principal component analysis and the classification).
Learning outcomes
At the end of this learning unit, the student is able to : | |
1 |
|
Content
- Exploratory analysis and sampling
- Introduction to multivariate data analysis
- Parametric estimate (methods of moments and log-likelihood maximization) and properties of estimators (bias, variance, mean-squared error).
- Statistical inference (confidence intervals and significance tests): comparison of means of two or several normal populations, proportions, variance testing.
- Linear regression, including the analysis of coefficients and significance tests.
- Panorama of learning techniques, supervised and unsupervised learning methods
- Links between objectives of data analysis methods and their mathematical representation.
- Regression and classification methods (such as linear models and least square, k-nearest neighbors, logistic regression)
- Training, test error and generalization error, the Bias-Variance tradeoff, and elements of statistical decision theory
- Resampling techniques for model selection/evaluation (e.g., validation set, K-fold cross validation, bootstrap)
- Unsupervised learning: reduction of dimension (principal component analysis) and methods of clustering (K-means).
- Introduction to multivariate data analysis
- Parametric estimate (methods of moments and log-likelihood maximization) and properties of estimators (bias, variance, mean-squared error).
- Statistical inference (confidence intervals and significance tests): comparison of means of two or several normal populations, proportions, variance testing.
- Linear regression, including the analysis of coefficients and significance tests.
- Panorama of learning techniques, supervised and unsupervised learning methods
- Links between objectives of data analysis methods and their mathematical representation.
- Regression and classification methods (such as linear models and least square, k-nearest neighbors, logistic regression)
- Training, test error and generalization error, the Bias-Variance tradeoff, and elements of statistical decision theory
- Resampling techniques for model selection/evaluation (e.g., validation set, K-fold cross validation, bootstrap)
- Unsupervised learning: reduction of dimension (principal component analysis) and methods of clustering (K-means).
Teaching methods
(Remark: In 2021-2022, this course will be taught in French)
The course is composed of:
- 9 lectures on the topics listed in the course content;
- 7 practical sessions, both classical and numerical;
- 4 hackathons, representing 2 x 2 hours each, associated with small Python projects realized in group on subjects discovered both in the lectures and in the practical sessions.
The course is composed of:
- 9 lectures on the topics listed in the course content;
- 7 practical sessions, both classical and numerical;
- 4 hackathons, representing 2 x 2 hours each, associated with small Python projects realized in group on subjects discovered both in the lectures and in the practical sessions.
Evaluation methods
Written individual exam to evaluate the understanding of concepts and techniques The hackathons represents 25% of the final mark. Lecturers keep the right to orally question students about their exam and hackathons.
Other information
To follow this course the student must have a basic knowledge of probabilities such as taught in courses LEPL1108 or LBIR1212. The schedule of course is subject to modifications due to sanitary conditions. Please check the Moodle website for more details.
Online resources
The totality of teaching material is available on the companion moodle website of the course. The schedule of course is subject to modification due to sanitary conditions, please consult the Moodle website of the course for additional information.
Faculty or entity
EPL