Statistical Natural Language Processing
COMP5046
This unit covers techniques for the automatic processing of natural languages (such as English and French) and the engineering of software systems that perform such processing. Engineering processes will be described in the context of methods for building effective tools for information retrieval and extraction, question answering, and the classification and clustering of documents in large corpora. Processing subsystems for tasks such as tokenisation, lexical verification, part-of-speech tagging, parsing and word sense disambiguation will be described. Particular emphasis is given to methods that analyse the meaning of texts and to the general application of machine learning methods to these topics. Applications of these methods to research on health texts, and in other contexts being pursued at the University of Sydney, will also be explored.
Unit of study details
Unit of study level: Postgraduate
Credit points: 6
Commencing semesters: 1
Further unit of study information
Unit of study handbook: COMP5046
Available for study abroad and exchange: No
Our courses that offer this unit of study
- Master of Information Technology
- Graduate Diploma in Information Technology
- Graduate Certificate in Information Technology
- Bachelor of Information Technology
- Bachelor of Computer Science and Technology (Honours)
- Bachelor of Computer Science and Technology (Honours) (Advanced)
- Bachelor of Information Technology and Bachelor of Arts
- Bachelor of Information Technology and Bachelor of Medical Science
- Bachelor of Information Technology and Bachelor of Science
- Bachelor of Information Technology and Bachelor of Laws