Introduction to 'big data' databases and programming for transport analytics

17 to 20 July 2018

Institute of Transport and Logistics Studies
University of Sydney Business School

Course Overview

Transport researchers are increasingly collecting or using 'big data' containing millions of observations from sources such as GPS, smartphones, smartcards and Bluetooth sensors. Governments and businesses are also increasingly offering access to their own transport datasets through application programming interfaces (APIs). These can be used to retrieve data on the real-time location of buses, passengers and people in specific locations or on certain roads. To make effective use of these datasets, it is necessary to process and analyse the data at a disaggregate level. Accomplishing this requires knowledge of databases (to store and manage data) and programming skills (to implement the appropriate logic for processing). In addition, while some researchers have access to programmers and database analysts to perform these tasks, it is nonetheless crucial to understand the capabilities and issues involved in managing and processing the data, since how this is done has a direct effect on any results.

This short course will provide transport researchers with the knowledge and tools to manage and process these large datasets. Attendees will first be introduced to relational databases, enabling them to store, manage and retrieve data. Subsequently, an introduction to programming using R will give attendees the tools to create algorithms to process raw data, retrieve data from APIs and merge datasets to make them usable for a variety of transport analyses, including statistical modelling and spatial analysis. The course will be taught as a mixture of lectures and practical tutorials across all four days.

By the end of the course you will be able to:

  • Work with raw data from a variety of sources
  • Store, edit, retrieve and manage related large datasets within a database
  • Combine data from several sources at the same time
  • Retrieve data to perform statistical analyses
  • Create powerful visualisations from raw and processed data
  • Work with geographic and non-geographic data

Participant Feedback

This short course has been attended by participants from government (state and federal), industry and academia. Below is a selection of feedback received in 2015 and 2016.

  • Teaching from easy to complex make it easy to understand
  • It is an intensive course but quite inspiring
  • The guys really know their stuff and present well, very polished and comprehensive
  • Adrian and Richard are very knowledgeable and were able to answer all my questions!
  • Excellent compilation of notes and resources
  • Very relevant knowledge for my work and generally a great intro to big data programming and visualisation
  • Detailed guidance, hands-on use and practice on computer. Seeing real data and analysis
  • A great course; a credit to Richard and Adrian
  • Interesting and resourceful training course
  • Thanks for organising the big databases training. By far the best training I have attended in years.


Dr Adrian Ellison and Dr Richard Ellison


The course is fully booked for 2018.

To be placed on the waitlist should a space become available, please email:

Further information

Please email: or phone +61 2 9114 1813.


Veriu Camperdown

Adina Central


Lab 4, Codrington Building (H69)
Room 5040, Level 5, Abercrombie Building (H70)

