Statistical Natural Language Processing

COMP5046

This unit deals with techniques for the automatic processing of natural languages (such as English and French) and the engineering of software systems that perform such processing. Engineering processes will be described in the context of methods for creating effective tools for information retrieval and extraction, question answering, and the classification and clustering of documents in large corpora. Processing sub-systems for tasks such as tokenisation, lexical verification, part-of-speech tagging, parsing and word sense disambiguation will be described. Particular emphasis is given to methods that analyse the meaning of texts and to the general application of machine learning methods to these topics. Various applications of these methods to research on health texts, and in other contexts being pursued at the University of Sydney, will be explored.

Unit of study details

Unit of study level: Postgraduate

Credit points: 6

Commencing semesters: 1

Further unit of study information

Unit of study handbook: COMP5046

Costs and scholarships information: Costs and Scholarships

Final dates to withdraw from units of study: Census Dates

Available for study abroad and exchange: No
