4.00 credits
15.0 h + 15.0 h
Q2
Teacher(s)
Schnor Christine;
Language
English
Prerequisites
Preferably, the students should have acquired some basic knowledge on Stata (e.g. through the introductory course to STATA LDEMO2630) and have some knowledge about datasets.
However, no statistical expertise is required since statistical methods are kept to a minimum.
However, no statistical expertise is required since statistical methods are kept to a minimum.
Main themes
Database management and processing provides the foundations needed to gather, handle and analyze complex survey or census data with STATA.
The course focuses on 7 themes:
1. Introduction to Stata
2. Variable management (generating and modifying variables, dealing with string variables)
3. Data cleaning (dealing with missing data, duplicates, and date processing)
4. Organizing and documenting scripts
5. Data manipulation in subsets of data and across subgroups
6. Combining or reshaping datasets
7. Using loops and other tools to repeat commands over different files or segments of datasets
8. Visualizations and maps
The course focuses on 7 themes:
1. Introduction to Stata
2. Variable management (generating and modifying variables, dealing with string variables)
3. Data cleaning (dealing with missing data, duplicates, and date processing)
4. Organizing and documenting scripts
5. Data manipulation in subsets of data and across subgroups
6. Combining or reshaping datasets
7. Using loops and other tools to repeat commands over different files or segments of datasets
8. Visualizations and maps
Learning outcomes
At the end of this learning unit, the student is able to : | |
1. | be enabled to prepare efficiently survey or census datasets for analysis ; |
2. | handle survey and census data: clean the data, merge and reshape datasets, extract relevant information, apply functions over subset of the data, combine multiple datasets in one project ; |
3. | use data visualizations (plots or maps) as tools to check the data. |
Content
Database management and processing provides the foundations needed to gather, handle and analyze complex survey or census data with STATA.
The course focuses on 7 themes:
The course focuses on 7 themes:
- Introduction to Stata
- Variable management (generating and modifying variables, dealing with string variables)
- Data cleaning (dealing with missing data, duplicates, and date processing)
- Organizing and documenting scripts
- Data manipulation in subsets of data and across subgroups
- Combining or reshaping datasets
- Using loops and other tools to repeat commands over different files or segments of datasets
- Visualizations and maps
Teaching methods
All lessons are a mix of a standard lecture and computer-based practical sessions based on real-life examples. The lectures provide the main concepts and tools, as well as basic knowledge required to do the exercises. Assignments are scheduled after each session to apply the procedures on datasets and verify the assimilation of concepts and tools. Corrections are offered at the beginning of each course.
Evaluation methods
The formal mid-term and end-term assessments are based on specific survey datasets. Evaluations are weighted in the following way:
- 10% mini exam on basic knowledge in Stata during lecture time (60 mn)
- 20% homework exercises
- 30% two assignments (small research projects)
- 40% oral exam at end of term
Other information
Prerequisites
Preferably, the students should have acquired some basic knowledge on Stata (e.g. through the introductory course to STATA LDEMO2630) and have some knowledge about datasets.
However, no statistical expertise is required since statistical methods are kept to a minimum.
Preferably, the students should have acquired some basic knowledge on Stata (e.g. through the introductory course to STATA LDEMO2630) and have some knowledge about datasets.
However, no statistical expertise is required since statistical methods are kept to a minimum.
Faculty or entity
PSAD