Skip to main content
Unit of study_

ODAT5013: Data Wrangling and Databases

2025 unit information

This unit provides conceptual and practical introduction covering data wrangling and database management. Students will gain a broad understanding of the capabilities for data wrangling and database management, on which effective data analysis depends. It will provide understanding of the implications for their analysis work, of the data management capabilities, and the language and ideas to communicate with the data engineers. The unit covers topics such as 1) data storage architectures 2) relational and other data models 3) data integrity 4) data privacy and security 5) data cleaning and pre-processing, and 6) data consolidation/integration.

Unit details and rules

Managing faculty or University school:

Engineering

Study level Postgraduate
Academic unit Computer Science
Credit points 6
Prerequisites:
? 
None
Corequisites:
? 
None
Prohibitions:
? 
None
Assumed knowledge:
? 
Basic computer literacy

At the completion of this unit, you should be able to:

  • LO1. Explain the primary concerns and capabilities of data management and compare various architectures for data storage and processing.
  • LO2. Apply conceptual database modelling techniques to design domain-specific relational databases and understand the connection between relational and data frame models.
  • LO3. Use SQL to manipulate relational data, including performing data aggregation, filtering, joining tables, and grouping.
  • LO4. Implement efficient query processing and optimisation techniques using a formal query language, i.e., relational algebra.
  • LO5. Analyse non-tabular data types such as document, graph, spatial, and timeseries and understand their unique characteristics and use cases.
  • LO6. Explain the importance of data integrity, understand the challenges involved, and apply mechanisms to define and enforce constraints to maintain data integrity.
  • LO7. Evaluate and apply access control, anonymisation, and encryption mechanisms to secure data and ensure privacy.
  • LO8. Consolidate data from different sources and apply data cleaning and pre-processing techniques to create a unified and coherent dataset in preparation for analysis.

Unit availability

This section lists the session, attendance modes and locations the unit is available in. There is a unit outline for each of the unit availabilities, which gives you information about the unit including assessment details and a schedule of weekly activities.

The outline is published 2 weeks before the first day of teaching. You can look at previous outlines for a guide to the details of a unit.

Session MoA ?  Location Outline ? 
Semester 2a 2024
Online Online Program
Session MoA ?  Location Outline ? 
Semester 2a 2025
Online Online Program
Outline unavailable
There are no availabilities for previous years.

Find your current year census dates

Modes of attendance (MoA)

This refers to the Mode of attendance (MoA) for the unit as it appears when you’re selecting your units in Sydney Student. Find more information about modes of attendance on our website.