University of Sydney Handbooks - 2018 Archive

Page archived at: Fri, 21 Sep 2018 05:39:44 +0000

Data Science

Study in the discipline of Data Science is jointly offered by the School of Mathematics and Statistics in the Faculty of Science and the School of Information Technologies in the Faculty of Engineering and Information Technologies. Units of study in this major are available at standard and advanced level.

About the major

Data is an essential asset in many organisations because it enables informed decision making in many areas, including market intelligence and science. In the Data Science major, you will learn computational and analytical skills drawn from statistics and computer science to manage, interpret, understand, analyse and derive key knowledge from data.

You will develop critical thinking about data and its use, a deep understanding of the core technical skills required and an appreciation for the context in which that data was collected. At the 3000-level of study and beyond, you will develop the ability to understand problems from many disciplines and place a data-driven problem into an analytical framework, solve the problem through computational means, interpret the results and communicate them to clients or collaborators.

Requirements for completion

A major in Data Science requires 48 credit points, consisting of:

(i) 6 credit points of 1000-level core units
(ii) 6 credit points of 1000-level units according to one of the following rules:
(a) 6 credit points of selective units, or
(b) 3 credit points of statistics units and 3 credit points of computation units, or
(c) 3 credit points of advanced statistics and 3 credit points of calculus and linear algebra units
(iii) 12 credit points of 2000-level core units
(iv) 6 credit points of 2000-level selective units
(v) 6 credit points of 3000-level interdisciplinary project units
(vi) 6 credit points of 3000-level methodology-focussed units
(vii) 6 credit points of 3000-level methodology or application and discipline-focussed units

A minor in Data Science is available and articulates to this major.

First year

DATA1001 Foundations of Data Science is a foundational unit in the Data Science major. The unit focuses on developing critical and statistical thinking skills for all students.

DATA1002 Informatics: Data and Computation is a foundational unit in the Data Science major. This unit covers computation and data handling, integrating sophisticated use of existing productivity software, e.g. spreadsheets, with the development of custom software using the general-purpose Python language.

Students are strongly encouraged to take DATA1001 and DATA1002 for this major. However, some selective units are equivalent to DATA1001, and students can choose from: ENVX1002, MATH1005, MATH1015, MATH1115, MATH1905, MATH1021, MATH1921, MATH1931, MATH1023, MATH1923, MATH1933, MATH1002, MATH1902. Students should refer to Table A for specific 1000-level requirements.

Second year

DATA2001 – Data Science: Scale and Data Diversity focuses on methods and techniques to efficiently explore and analyse large data collections.

DATA2002 – Data Analytics: Learning from Data focuses on developing data analytic skills for a wide range of problems and data.

Students also complete one unit from a selection: COMP2123, COMP2823, STAT2X11, QBUS2830.

Third year

DATA3001 – Interdisciplinary Data Science Project is the capstone 3000-level unit for the major and will include both the disciplinary and the interdisciplinary project. The main component of the unit will be a nine-week project that applies the candidates’ skills and knowledge to analyse a real, messy dataset from a knowledge domain outside data science and statistics.

Students will also select 6 credit points from a selection of DATA and STAT units focusing on methodology, and 6 credit points from a selection of methodology or application and discipline-focussed units.

Note that the following units will also be available in 2019 at 3000-level: COMP3308, COMP3027, COMP3608, COMP3927.

Fourth year

The fourth year is only offered within the combined Bachelor of Science/Bachelor of Advanced Studies course.

Advanced coursework
The Bachelor of Advanced Studies advanced coursework option consists of 48 credit points, which must include a minimum of 24 credit points in a single subject area at 4000-level, including a project unit of study worth at least 12 credit points. Space is provided for 12 credit points towards the second major (if not already completed). 24 credit points of advanced study will be included in the table for 2020.

Honours
Requirements for Honours in the area of Data Science: completion of 24 credit points of project work and 24 credit points of coursework.

Honours units of study will be available in 2020.

Contact and further information

W www.maths.usyd.edu.au/

All enquiries phone: +61 2 9351 5804 or +61 2 9351 5787

Address:
School of Mathematics and Statistics
Level 5, Carslaw Building F07
University of Sydney NSW 2006

Professor Jean Yang
T +61 2 9351 3012

Learning Outcomes

Students who graduate from Data Science will be able to demonstrate:

Interdisciplinary Skills

  1. Ability to engage with problems from many diverse areas of application and to understand the relationships between a given problem and data collected to solve the problem.
  2. Ability to relate context specific knowledge to data, to understand how data can be used to generate context specific knowledge, and know how this knowledge can guide data analytics.

Foundational Understanding

  1. Understanding of the importance of experimental design, its relationship with data output, and how this data should be analysed and evaluated, including potential pitfalls.
  2. Ability to identify, at a general level, the type of data analytical approach required for a particular problem, whether that is data analysis, simulation-based modelling or equation-based modelling.
  3. Understanding of how the data context, organisational constraints and quality issues have flow-on implications for later stages of the analysis.

Data Science Methods and Tools

  1. Skills in data management with an understanding of how data, metadata, and derived knowledge (including analytical models) are stored, accessed, and administered.
  2. A range of computational skills including programming, choosing scientific data formats, creating and using databases (for storing and accessing metadata) and use of graphical information systems (for mapping and sharing high-dimensional data). These skills also include understanding the principles of programming and the ability to translate this knowledge to new computational code and to create tools.
  3. Data analytical competencies that include, but are not limited to, the appropriate use of quantitative models or visualisation methods on multiple data types to:
  • enable prediction of outcome
  • recognise significant patterns and trends
  • critically assess the strengths and weaknesses of different analytical approaches.

Communication Skills

  1. The ability and experience to use one’s data analytical competency confidently, both to communicate discipline-specific outcomes in written and verbal form and to support decision making.

Problem Awareness

  1. An awareness of data integrity issues including appreciation of data privacy and ethical issues.
  2. General understanding of how data analytical tools can be automated and implemented efficiently and up-scaled if necessary using the available technologies.