The focus of this unit is on understanding and applying relevant concepts, techniques, algorithms, and tools for the analysis, management and visualisation of data- with the goal of enabling discovery of information and knowledge to guide effective decision making and to gain new insights from large data sets. To this end, this unit of study provides a broad introduction to data management, analysis, modelling and visualisation using the Python programming language. Development of custom software using the powerful, general-purpose Python scripting language; Data collection, cleaning, pre-processing, and storage using various databases; Exploratory data analysis to understand and profile complex data sets; Mining unlabelled data to identify relationships, patterns, and trends; Machine learning from labelled data to predict into the future; Communicate findings to varied audiences, including effective data visualisations. Core data science content will be taught in normal lecture + tutorial delivery mode. Python programming will be taught through an online learning platform in addition to the weekly face-to-face lecture/tutorials. The unit of study will include hands-on exercises covering the range of data science skills above.
Unit details and rules
Academic unit | Computer Science |
---|---|
Credit points | 6 |
Prerequisites
?
|
None |
Corequisites
?
|
None |
Prohibitions
?
|
COMP5310 or INFO3406 |
Assumed knowledge
?
|
Good understanding of relational data model and database technologies as covered in ISYS2120 or COMP9120 (or equivalent UoS from different institutions) |
Available to study abroad and exchange students | No |
Teaching staff
Coordinator | Nataliia Stratiienko, nataliia.stratiienko@sydney.edu.au |
---|