Introduction to the statistical software R
medRSD Workshop
Target group | Doctoral students in medicine (members of medRSD); supervisors and postdocs; scientific researchers |
Language | English, German |
Length of the workshop | Three days |
Places | 13 |
Costs | 650 € for external participants; free for doctoral students and researchers at the Medical Faculty of HHU |
Registration | medRSD: See below how to register |
In the training program of medRSD:
3 event days in the area of core competencies
In the basic curriculum of the PhD program:
3 event days after consultation with Dr. Gätjens
Dr. Katherine Ogurtsova
Dr. Ogurtsova works at the Institute for Health Services Research and Health Economics at the German Diabetes Center. Her expertise covers statistical methods in medicine, epidemiology, and public health, with a main focus on R programming and methodological issues.
Dr. Ralf Schäfer
Dr. Schäfer is a psychologist and head of the Psychological Laboratory at the Clinical Institute for Psychosomatic Medicine and Psychotherapy of the University Clinic at Heinrich Heine University. His research interests are psychophysiological questions related to the processing of emotions and the effectiveness of specific psychotherapeutic interventions, such as stress management training.
Lotte Wagner-Douglas
Lotte Wagner-Douglas is a student in the master's program in psychology at Heinrich Heine University and has been part of Dr. Schäfer's team for several years. With many years of programming experience focused on R and a broad knowledge of statistical methods, she concentrates on processing and statistically analysing data in R and takes an active part in current research topics.
Prerequisites
- Good general PC knowledge
- Programming skills are an advantage
- Basic statistical knowledge is an advantage
Concept of the course
"R" (https://www.r-project.org/) is a programming language and application environment for statistical analysis and graphics. R is free software and part of the GNU project; it can be installed and used free of charge. Unlike many commercial systems (e.g. SPSS, SAS), the R project is continuously developed by leading scientists and the wider statistical community. All procedures and functions are transparent, i.e. the source code can be viewed and checked at any time. A large number of packages cover a very wide range of statistical questions and methods.
The seminar gives a first impression of R's functionality and of how to work with a scripting language. The workshop is application-oriented. Standard procedures in R are demonstrated and practised through examples and the participants' own calculations on a learning dataset. The first two days cover the basics of R. The third day may also be of interest to advanced users.
Learning aims
- Working with objects, vectors and matrices. Importing and exporting data from/to SPSS and Excel sheets. Basic data management and data types.
- Creating simple and complex graphics; running simple and more sophisticated descriptive and inferential statistical analyses.
- Understanding how complex statistical analyses can be performed in R (regression, multilevel analysis, survival analysis)
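To give a first impression of what these aims look like in practice, here is a minimal sketch in R. It is not taken from the course materials: the object names (x, m, fit) and the use of R's built-in mtcars dataset are assumptions chosen purely for illustration.

```r
# Illustrative sketch only; names and data are assumptions, not course material.

# A numeric vector and two simple descriptive statistics
x <- c(4.1, 5.3, 6.0, 5.5, 4.8)
mean(x)   # arithmetic mean
sd(x)     # sample standard deviation

# A 2 x 3 matrix, filled column by column
m <- matrix(1:6, nrow = 2)
dim(m)    # number of rows and columns
m[2, 3]   # element in row 2, column 3

# A simple linear regression on the built-in mtcars dataset:
# fuel consumption (mpg) as a function of car weight (wt)
fit <- lm(mpg ~ wt, data = mtcars)
summary(fit)

# A basic scatter plot with the fitted regression line
plot(mtcars$wt, mtcars$mpg, xlab = "Weight", ylab = "Miles per gallon")
abline(fit)
```

Each line can be run on its own in the R console or in RStudio, which is the usual way of exploring a scripting language step by step.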
Course content
- Brief introduction to R, the concept of R and the basic functions and conventions
- Practical exercises with simple calculations and application procedures
- Getting to know the basic vocabulary of the scripting language
- Reading in complex data sets
- Data sorting and data retrieval
- Exploratory data analysis
- Creating graphics
- Running descriptive and inferential statistical procedures.
- Basic regression analysis in R, diagnostic techniques for the quality of regressions, statistical testing, diagrams.
- Multilevel analysis in R and graphs (ggplot2)
- Survival analysis in R and graphs.
- Working with RStudio
Other
Please install the following software on your own laptop before the course.
R: https://cran.uni-muenster.de/
RStudio: https://rstudio.com/products/rstudio/download/
YouTube installation guide for Windows 10:
https://www.youtube.com/watch?v=_2sewGCA0y4&ab_channel=BecomingaDataScientist
YouTube installation guide for macOS:
https://www.youtube.com/watch?v=LanBozXJjOk&ab_channel=DataSciencewithTom
MS Teams: https://www.microsoft.com/en/microsoft-teams/download-app
Please install R first and then RStudio. On Windows 7 or later, the programs must be granted administrator rights. Installation on macOS is also possible.
Data:
- Difference between sample and population
https://www.youtube.com/watch?v=eIZD1BFfw8E
https://www.youtube.com/watch?v=Mb9BuEkbaHQ
- Sampling and sampling bias
https://www.youtube.com/watch?v=z0Ry_3_qhDw
https://www.youtube.com/watch?v=PdXDLNNXPik
- Normal Distribution
https://www.youtube.com/watch?v=mtbJbDwqWLE&ab_channel=SimpleLearningPro
- Levels of measurement: nominal/categorical, ordinal, interval/metric, ratio
https://www.youtube.com/watch?v=LuBD49SFpWs
- Causality
https://www.youtube.com/watch?v=ROpbdO-gRUo
- Here you can find colorful and fairly simple explanations of all kinds of topics
https://www.youtube.com/c/Simplelearningpro/videos
Descriptive statistics:
- Measures of central tendency (mean, mode, median)
https://www.youtube.com/watch?v=kn83BA7cRNM&list=PL8dPuuaLjXtNM_Y-bUAhblSAdWRnmBUcr&index=4
- Variability (variance, standard deviation, standard error)
https://www.youtube.com/watch?v=wDAd_QHKoOg
https://www.youtube.com/watch?v=Cx2tGUze60s
https://www.youtube.com/watch?v=3UPYpOLeRJg
- Covariance and correlation
https://www.youtube.com/watch?v=xGbpuFNR1ME
https://www.youtube.com/watch?v=4EXNedimDMs
- Types of correlation
https://www.youtube.com/watch?v=Ypgo4qUBt5o&ab_channel=DrMaggard
- Linear regression
https://www.youtube.com/watch?v=ZkjP5RJLQF4
https://www.youtube.com/watch?v=iAgYLRy7e20
https://www.youtube.com/watch?v=kHZBy1uVNnM
https://www.youtube.com/watch?v=kHZBy1uVNnM&list=RDCMUCFrjdcImgcQVyFbK04MBEhA&start_radio=1&rv=kHZBy1uVNnM&t=367
- Logistic regression
https://www.youtube.com/watch?v=yIYKR4sgzI8
Inferential statistics:
- Hypotheses, alpha and beta error probabilities
https://www.youtube.com/watch?v=hYWQT5nt6DU www.youtube.com/watch
- Alpha error accumulation
- Statistical power and effect size
https://www.youtube.com/watch?v=7mE-K_w1v90
https://www.youtube.com/watch?v=9LVD9oLg1A0
- The general linear model
https://www.youtube.com/watch?v=wfhD_ox4Srw
- Difference between linear models and generalized linear models
https://www.youtube.com/watch?v=ddCO2714W-o&ab_channel=MeerkatStatistics
- t-tests and ANOVA (regression revisited)
https://www.youtube.com/watch?v=pTmLQvMM-1M&ab_channel=BozemanScience
https://www.youtube.com/watch?v=nk2CQITm_eo
https://www.youtube.com/watch?v=NF5_btOaCig
- Differences between parametric and non-parametric methods
https://www.youtube.com/watch?v=ftnOBcXtBEQ
- Choosing a statistical test (parametric vs. non-parametric)
https://www.youtube.com/watch?v=ulk_JWckJ78&ab_channel=DanielM
Survival Analysis:
- https://www.youtube.com/watch?v=v1QqpG0rR1k&list=PLTNMv857s9WUclZLm6OFUW3QcXgRa97jx
- https://www.youtube.com/watch?v=K-_sblQZ5rE&list=PLTNMv857s9WUclZLm6OFUW3QcXgRa97jx&index=2
- https://www.youtube.com/watch?v=Dfe59glNXAQ&list=PLTNMv857s9WUclZLm6OFUW3QcXgRa97jx&index=3
- https://www.youtube.com/watch?v=lxoWsVco_iM&list=PLTNMv857s9WUclZLm6OFUW3QcXgRa97jx&index=4
Epidemiology: