This course is an introduction to Natural Language Processing (NLP) for students already proficient in programming and machine learning. The aim is to provide a solid background in theory and techniques used to accomplish different NLP tasks such as understanding and generating natural language. As NLP technologies are used by many people every day, and inform many other "AI" systems, special focus will be given to questions of ethics, equity, and the social impact of these technologies.
The course covers a mix of techniques, including rule-based, statistical, and machine learning methods for NLP. Since language data is at the core of many modern NLP techniques, the course will additionally cover assessment of data quality, as well as developing an understanding of complex issues of representation and data ownership.
Basic concepts and methodology from linguistics are introduced, including aspects of how language is constructed and used, and the importance of context. These are used to ground an understanding both of how effective solutions to NLP tasks are constructed, and the challenges of doing so for various languages.
Beyond this theoretical grounding, there will be practical exercises and assignments focusing on applying various techniques to address tasks within NLP. The coursework also includes actively participating in seminars and writing reports.
Univ: To be admitted you must have (or equivalent) 90 ECTS-credits including 60 ECTS-credits in Computing Science or three years of completed studies within a study programme (180 ECTS-credits). In both cases, includning * a course (7.5 ECTS-credits) in Machine learning (e.g. 5DV194) that includes Naive Bayes, Hidden Markov Models, Decision Trees and Neural Networks including how backpropagation works * a course (7.5 ECTS-credits) in Formal languages (e.g. 5DV208 CS3: Computations and languages or 5DV037 Fundamentals of Computer Science) that includes Automata, Turing Machines, Regular languages, Context-free languages, pumping lemma (regular, context free), CYK parser (also passing familiarity with shift-reduce)
It is recommended to have some familiarity with Python (we will use Python in exercises/assignments, so students should either know how to code in Python or be in a situation where they feel confident they can quickly pick it up)
Proficiency in English equivalent to Swedish upper Secondary course English A/5. Where the language of instruction is Swedish, applicants must prove proficiency in Swedish to the level required for basic eligibility for higher studies.
Entry requirements
Selection
Academic credits
Applicants in some programs at Umeå University have guaranteed admission to this course. The number of places for a single course may therefore be limited.
Application code
UMU-57230
Application
Application deadline was
19 April 2022.
Please note: This second application round is intended only for EU/EEA/Swiss citizens.
Submit a
late application
at Universityadmissions.se.
As a citizen of a country outside the European Union (EU), the European Economic Area (EEA) or Switzerland, you are required to pay application and tuition fees for studies at Umeå University.