# Humboldt-Universität zu Berlin - Statistics

Datenanalyse II (VL+UE)

## Datenanalyse II (VL+UE)

The lecture covers classical statistical topics and we will work half of the time with SPSS and/or R in a PC-Pool. Note: The lecture will be held in german, but slides and exercise will be in english.

Bitte registrieren Sie sich im zugehörigen Moodle-Kurs.

Data analysis I:

• Survey design
• Operationalization
• Validity
• Short repetition of Statistics I+II
• Sampling
• Scaling
• Test theory
• Estimation theory
• Univariate statistics
• Graphical representation (stem-and-leaf, strip plot, kernel density, violin plot)
• Coeffcients (quantile, entropy, higher moments, hinges and spreads)
• Tests (t-Tests, Mann-Withney-U, Median, Wilcoxon, Kruskal-Wallis, ANOVA, Friedman)
• Transformations (Power, Box-Cox)
• Outliers
• Identification of ouitliers
• Robust coefficients for location and dispersion (L- and M-estimators for location)
• Treatment of missing values
• Types (MAR, MCAR, MNAR)
• Imputation methods (single, multiple)
• Bivariate statistics
• Graphical representation (sunflower plot, mosaic plot, trellis display)
• Subgroup analysis

Data analysis II:

• Bivariate statistics
• Coeffcients and tests (association and PRE- measure, Cohen's kappa, relative risk, odds ratio)
• Multivariate statistics
• Principal component analysis
• Exploratory factor analysis (reliability for sum scores)
• Cluster analysis
• Regression methods
• Simple linear regression
• Multiple linear regression
• Generalized linear regression
• Non- and Semiparametric Regression
• Classification and regression trees
• Neural networks

# Literature

• Barnett, V., Lewis, T. (1994) Outliers in statistical data, 3rd. Edition, Wiley, New York
• Berry, D.A., Lindgren, B.W. (1990), Statistics: Theory and Methods, Brooks/Cole Publishing Company, Pacific Grove
• Bortz, J. (1993), Statistik, Springer, Berlin et al.
• Bosch, K. (1992), Statistik-Taschenbuch, Oldenbourg, München, Wien
• Bühl, A., Zöfel, P. (1994), SPSS unter Windows Version 6, Addison-Wesley, Bonn et al.
• Böning, H., Trenkler, G. (1978), Nichtparametrische statistische Methoden, Walter de Gruyter, Berlin, New York
• Böning, H. (1991), Robuste und adaptive Tests, Walter de Gruyter, Berlin, New York
• Hartung, J., Elpelt, B., Klösener, K.-H. (1993), Statistik, Oldenbourg Verlag, München
• Heiler, S., Michels, P. (1994), Deskriptive und explorative Datenanalyse, Oldenbourg, München, Wien
• Jobson, J.D. (1991), Applied Multivariate Data Analysis, Vol. I: Regression and Experimental Design, Springer, Berlin et al.
• Köhler, W.-M. (1994), SPSS für Windows, Vieweg, Wiesbaden
• Mann, P. S. (1992), Introductory Statistics, John Wiley, New York at al.
• Rasmussen, S. (1992), An Introduction to Statistics with Data Analysis, Brooks/Cole Publishing Company, Pacific Grove
• Rönz: Skript zur Vorlesung "Computergestützte Statistik I", 2001
• Rönz, B., Strohe, H. G. (Hrsg., 1994), Lexikon Statistik, Gabler-Verlag, Wiesbaden
• Schlittgen, R. (1990), Einführung in die Statistik, Oldenbourg, München, Wien

### Anrechnungen

 Kurs HU Country University Course Datenanalyse I NL Vrije Universiteit Amsterdam Econometrics and Operations Research: Econometrics and Data Science Data Analysis 1 Datenanalyse II DE Technische Universität Berlin Institute of Software Engineering and Theoretical Computer Science Machine Learning 1