Due to the COVID-19 crisis, the information below is subject to change,
in particular that concerning the teaching mode (presential, distance or in a comodal or hybrid format).
5 credits
30.0 h
Q2
Teacher(s)
Kolp Manuel; Saerens Marco;
Language
English
Main themes
Nowadays, data are everywhere. For most organizations, potentially every area of its business, as well as every relationship related to its business, can now be quantified and recorded. Such amount of data led to the emergence of powerful methods for storing, processing, querying, and extracting useful information/knowledge from these data. This course will be focused on methods for data understanding, design, management, preparation, modeling, querying, and visualization, as a global means for the organization of making better decisions. As a central element in data analytics, modeling and methodology will play an important role in this course, including, e.g., data design for business intelligence analytics, predictive modeling, or fitting statistical models to data.
Aims
At the end of this learning unit, the student is able to : | |
1 |
Having regard to the LO of the programme, this activity contributes to the development and acquisition of the following LO :
|
Content
The scope of the course is broad and the instructor will certainly not be able to cover all of the material concerning data analytics in business. Depending of his background, interests and experience, he will focus on some specific techniques or skim through a broad range of methods.
Potential covered topics are (but not limited to): database design for data analytics, business intelligence techniques, dimensionality reduction for data visualization, extracting recurrent patterns from data, cluster analysis, predictive modeling (supervised classification and regression methods), modeling relationships by latent variable techniques, data analysis algorithms scaling to big data, etc. All these techniques will be illustrated through business applications.
Typically, these last years, the course was split into two parts: "Data management techniques" and "Machine learning techniques for supervised classification".
Potential covered topics are (but not limited to): database design for data analytics, business intelligence techniques, dimensionality reduction for data visualization, extracting recurrent patterns from data, cluster analysis, predictive modeling (supervised classification and regression methods), modeling relationships by latent variable techniques, data analysis algorithms scaling to big data, etc. All these techniques will be illustrated through business applications.
Typically, these last years, the course was split into two parts: "Data management techniques" and "Machine learning techniques for supervised classification".
Teaching methods
Due to the COVID-19 crisis, the information in this section is particularly likely to change.
Classical courses (either on-site or remotely, depending on the situation) and case studies
Evaluation methods
Due to the COVID-19 crisis, the information in this section is particularly likely to change.
Continuous evaluation- Date: Will be specified later
- Type of evaluation: Project with report
- Comments: 60% of the final result
- Oral: No
Written: No
Unavailability or comments: No
- Oral: No
- Written: Yes
- Comments: 40% of the final result
Other information
Prerequisites : Bachelor in Business Engineering or at least :
- A first course in programming
- A first course in information systems analysis and design
- A first course in multivariate calculus
- A first course in linear algebra and matrix theory
- A first course in probability and statistics (including maximum likelihood estimation)
- A first course in multivariate statistical analysis
Bibliography
Potential sources:
- Provost & Fawcett (2013), Data science for business. O Reilly.
- Sherman (2014), Business intelligence guidebook: from data integration to analytics. Morgan Kaufmann.
- Efraim, Sharda & Delen (2010), Decision support and business intelligence Systems. Pearson.
- Leskovec, Rajaraman & Ullman (2014), Mining of massive datasets, 2nd ed. Cambridge University Press.
- Kelleher, Mac Namee & D Arcy (2015), Fundamentals of machine learning for predictive data analytics. MIT Press.
- Hastie, Tibshirani & Friedman (2009), The elements of statistical learning, 2nd ed. Springer-Verlag.
- Izenman (2008), Modern multivariate statistical techniques: regression, classification, and manifold learning. Springer.
- Bellanger & Tomassone (2014), Exploration de données et méthodes statistiques : data analysis & data mining avec le Logiciel R. Ellipses.
Faculty or entity
CLSM
Force majeure
Teaching methods
Depending on the covid 19 situation, the course, assignments and exam could be delivered on-site, on-line or via a mix of both means (comodal).
Evaluation methods
Depending on the covid 19 situation, the course, assignments and exam could be delivered on-site, on-line or via a mix of both means (comodal).
MCQ online on Moodle Quiz with statistical standard setting (seuil de réussite statistique) as defined by LLL
MCQ online on Moodle Quiz with statistical standard setting (seuil de réussite statistique) as defined by LLL