Spherical multi-modal scene analysis
Summary
The next generation of webcam is likely to be a multi-modal spherical scene analysis device that improves human-computer interaction. This research project explores how to successfully combine spherical acoustics, spherical cameras, and range information for multi-modal human computer interaction.
Supervisor(s)
Associate Professor Craig Jin, Associate Professor Philip Leong, Dr Alistair McEwan
Research Location
Electrical and Information Engineering
Program Type
Masters/PHD
Synopsis
This leading research project explores computational methods (software, graphical processing units, field-programmable gate arrays) for spherical scene analysis using spherical microphone arrays and spherical cameras.. Topics and tools required include multi-modal software development, scene analysis techniques, and custom computing techniques.
Additional Information
Successful candidates likely have a background in electrical engineering, software programming, graphical processing units, custom computing, mathematics, or physics with an interest in spherical scene analysis and human-computer interaction.
http://www.ee.usyd.edu.au/carlab
Want to find out more?
Contact us to find out what’s involved in applying for a PhD.
Contact Research Expert to find out more about participating in this opportunity.
Browse for other opportunities within the Electrical and Information Engineering .
Keywords
Spherical Acoustics, spherical cameras, multi-modal signal processing, human-computer interfaces, scene analysis, spherical microphone arrays
Opportunity ID
The opportunity ID for this research opportunity is: 1358
Other opportunities with Associate Professor Craig Jin
- Pattern analysis techniques for sound synthesis
- Interpolation of binaural impulse responses for virtual auditory displays
- Sound field recording and recreation
- Beamforming with acoustic vector sensors - Multiple acoustic source localisation using acoustic vector sensor arrays
- Speech separation and localisation using particle filtering
- Mapping 2D Images to 3D Shape
- New technique for studying human brain activity
- Next Generation Audio Coding
- Statistical models of ear shape and ear acoustics
- Binaural signal processing algorithms for hearing aids
- Electrical Impedance Tomography for stroke, biophysical monitoring and medical device design
- Impedance tomography for cardiac imaging: high speed tomography
- Medical diagnostics for neonates in the developing world
- FPGA-based Low Latency Trading
- Floating Point FPGA Architectures
- Placement-aware Hardware Description Languages
- Scalable vision machines
- Modelling Parkinson's disease using control models
- Novel Electrodes for rapid electrophysiological recording
Other opportunities with Associate Professor Philip Leong
- other research opportunities available at Faculty of Engineering and Information Technologies
- FPGA-based Low Latency Trading
- Floating Point FPGA Architectures
- Placement-aware Hardware Description Languages
- Scalable vision machines
- Modelling Parkinson's disease using control models
- Mapping 2D Images to 3D Shape
- New technique for studying human brain activity
- Next Generation Audio Coding
- Statistical models of ear shape and ear acoustics
- Medical diagnostics for neonates in the developing world
- Electrical Impedance Tomography for stroke, biophysical monitoring and medical device design
- Impedance tomography for cardiac imaging: high speed tomography
- Novel Electrodes for rapid electrophysiological recording
- Binaural signal processing algorithms for hearing aids
Other opportunities with Dr Alistair McEwan
- Medical diagnostics for neonates in the developing world
- Electrical Impedance Tomography for stroke, biophysical monitoring and medical device design
- Impedance tomography for cardiac imaging: high speed tomography
- Novel Electrodes for rapid electrophysiological recording
- Mapping 2D Images to 3D Shape
- New technique for studying human brain activity
- Next Generation Audio Coding
- Statistical models of ear shape and ear acoustics
- FPGA-based Low Latency Trading
- Floating Point FPGA Architectures
- Placement-aware Hardware Description Languages
- Scalable vision machines
- Modelling Parkinson's disease using control models
- Binaural signal processing algorithms for hearing aids