Recent years have brought a rapid growth in the amount and complexity of health data captured. Data collected in imaging, genomics, health registries, wearables, and among other applications call for new statistical techniques in both predictive and descriptive learning. Machine learning algorithms for classification and prediction complement classical statistical tools in the analysis of these data. This unit will cover modern machine learning methods particularly useful for large and complex health data. Topics include: linear regression and K-nearest neighbours; classification; bootstrapping and cross-validation resampling methods; model selection and regularization; non-linear approaches including splines and generalised additive models; and tree-based methods. The statistical software R will be used throughout the unit.
Unit details and rules
Academic unit | Public Health |
---|---|
Credit points | 6 |
Prerequisites
?
|
(PUBH5010 or BSTA5011 or CEPI5100) and (BSTA5007 or BSTA5210 or PUBH5217) |
Corequisites
?
|
None |
Prohibitions
?
|
None |
Assumed knowledge
?
|
None |
Available to study abroad and exchange students | No |
Teaching staff
Coordinator | Andrew Grant, andrew.grant1@sydney.edu.au |
---|---|
Lecturer(s) | Andrew Grant, andrew.grant1@sydney.edu.au |
Armando Teixeira-Pinto, armando.teixeira-pinto@sydney.edu.au |