Skip to content
printicon

Big Data and high-dimensional data analysis

  • Number of credits 7.5 Credits
  • Level Master’s level
  • Starting Autumn Term 2018

About the course

Element 1 (2 hp): Theory.
In this Element we discuss what characterizes big data and high-dimensional data, including a historical background and examples of applications. Regression analysis including the maximum likelihood- and least squares methods are repeated. The general classification problem is introduced. The goals of classification and hoe performance is measured, are discussed. Furthermore validation methods including cross validation, and evaluation with independent test data, are included. The theory and applications of logistic regression analysis and linear and quadratic discriminant analysis (LDA and QDA) are covered. Variable selection for classification problems, ridge regressio, lasso and principal component analysis (PCA) are treated, as well as how these methods can be used together with logistic regression, LDA and QDA. The statistical software R and interestin program libraries in it are introduced, including a discussion on a worked through exampl containing variable selection, classification and evaluation. Furthermore, the methods K-nearest neighbour (KNN), system vector machines (SVM) and random forest are covered. The general problem of cluster analysis is introduced. The goals of cluster analysis and how performance (robustness) is measured, are discussed. In conection to this, hierarchical cluster analysis, k-means, ans self-organizing maps (SOM) are treated.

Element 2 (5.5 hp) Computer labs.
The Element covers analysis of several data sets, using the statistical methods that are included in the course. The analyses are conducted in the programming language R. In the element, the students are supposed to write thorough reports of the analyses and the results from them.

In a degree, this course may not be included together with another course with a similar content. If unsure, students should ask the Director of Studies in Mathematics and Mathematical Statistics. The course can also be included in the subject area of computational science and engineering. 

Application and eligibility

Big Data and high-dimensional data analysis, 7.5 hp

Visa tillfällen för föregående termin Autumn Term 2018 Det finns inga senare terminer för kursen

Starts

Lectures begin on week starting 5 November 2018

Ends

Lectures end during the week of 14 January 2019

Study location

Umeå

Language

English (upon request)

Type of studies

Daytime, 50%

Required Knowledge

The course requires 90 ECTS including 12 ECTS Mathematical Statistics and 7,5 ECTS Computer Programming or equivalent. Proficiency in English equivalent to Swedish upper secondary course English 5/A. Where the language of instruction is Swedish, applicants must prove proficiency in Swedish to the level required for basic eligibility for higher studies.
 

Selection

Guaranteed place Applicants in some programs at Umeå University have guaranteed admission to this course. The number of places for a single course may therefore be limited.

Application code

UMU-58216

Application

The online application opens 15 March 2018 at 13:00 CET. Application deadline is 16 April 2018.

Application and Tuition fees

As a citizen of a country outside the European Union (EU), the European Economic Area (EEA) or Switzerland, you are required to pay application and tuition fees for studies at Umeå University.

Application fee

SEK900

Tuition fee, first instalment

SEK16,875

Total fee

SEK16,875

Contact us

Contactperson for the course is: Study counselor Lars-Daniel Öhman

Please be aware that the university is a state body and that what you write here can be included in a public document. Therefore, be careful if you are writing about sensitive or personal matters in this contact form. If you have a such an enquiry, please call us instead. All data will be treated in accordance with The Personal Data Act.

Course is given by
Dept of Mathematics and Mathematical Statistics