851325 Exploratory data analysis with R


Type
Lecture and seminar
Semester hours
2
Lecturer (assistant)
Organisation
Offered in
Wintersemester 2022/23
Languages of instruction
Englisch

Content

Data science methods are becoming increasingly important in scientific research as well as business applications. R is a flexible and powerful statistics program that is freely available and provides a large library of methods for the exploration of data. The course will teach exploratory data analysis in R, with a focus on workflows, different types of real-world data, visualization and reproducible analysis.

Topics:
- Working with data: concept of tidy data, data exploration and wrangling with the tidyverse packages
- Data Visualization: the grammar of graphics, visualization of multivariate data, animated and interactive graphics
- Identification and handling of missing values and outlying points
- Exploring spatial and temporal data
- Application of exploratory methods such as principal components analysis and clustering
- Reproducible reporting with R Markdown
- Interactive dashboards with R Shiny

Previous knowledge expected

Basic knowledge of statistics and R is recommended, for example Statistics with R (851309, 851402) or First Steps with R (851016).

Objective (expected results of study and acquired competences)

By completing this course, students should be able to perform exploratory data analyses for a range of data types and formats. The students have acquired a repertoire of explorative methods, are able to name and describe them, understand their methodological assumptions, and have the competence to apply them to concrete problems with the use of statistical software. The skills comprise generating reproducible reports and interactive dashboards, and to interpret the results.
You can find more details like the schedule or information about exams on the course-page in BOKUonline.