UFCEQE-15-M Statistical Learning
Overview
This module will provide an overview of best practice in using statistical methods to build models for analysing data. It will develop skills in scripting for data analysis, data visualisation and reproducible research.
The aim of this module is to provide a sound understanding of the role of statistical inference in the field of data science.
This will extend to best practice in study design and to understanding the consequences of working with "found" rather than designed data.
Embedded within the module will be the principles of reproducible research and the data modelling cycle.
Objectives
- Apply the principles of statistical inference following a structured modelling cycle to solve problems.
- Communicate the findings of statistical analysis to specific audiences.
- Design, develop and validate a range of statistical models.
Curriculum
Topics are likely to include but are not limited to:
- Introduction to the concepts of reproducible research using R / R Markdown.
- Use of data analysis plans.
- Exploratory Data Analysis - highlighting the importance of visualisation as an analysis tool as well as a communication tool.
- Data management and metadata.
- Hypothesis testing - using both traditional methods and simulation approaches.
- Model building using a statistical framework.
- Model selection and validation.
Assessment
Exam (100%)