This course introduces Python programming for statisticians working with survey and census data. You will learn to set up reproducible analysis environments, load and clean data from various formats, handle missing values and duplicates, and combine datasets through merges and SQL queries.
The course covers exploratory data analysis techniques, data visualization for statistical reporting, and foundations of statistical computation including confidence intervals, hypothesis tests, and survey-weighted estimation.
You will also be introduced to predictive modeling with classification methods, and learn strategies for working with datasets that exceed memory limits, including chunked processing and modern file formats like Parquet.
- Teacher: arseniy g
- Teacher: Admin User