DS803: Statistics for Data Science
Study Board of Science
Teaching language: Danish or English depending on the teacher
EKA: N340044102
Assessment: Second examiner: Internal
Grading: 7-point grading scale
Offered in: Odense
Offered in: Autumn
Level: Master
STADS ID (UVA): N340044101
ECTS value: 5
Date of Approval: 27-02-2019
Duration: 1 semester
Version: Approved - active
Comment
Entry requirements
Academic preconditions
Students taking the course are recommended to:
- Have knowledge of mathematics at high school level
- Be able to use PC
The course cannot be chosen by students who: Have passed courses in basic or advanced statistics courses at the level of ST520 Applied statistics or higher.
Course introduction
The aim of the course is to enable the student to
- Understand concepts in probability and distribution theory.
- Utilize graphics and summary methods for descriptive data analysis.
- Describe data using key statistics such as mean, variance, and correlation.
- Construct confidence intervals for key statistics.
- Test simple statistical hypotheses.
- Analyze data using simple regression models.
- Design data collection.
- Understand central elements in published results from statistical analyses of actual data.
- Critically evaluate the appropriateness of employed methods and inferences based on these.
- Present statistical results in non-technical terms.
- Use the statistical software R for analysing actual data, which is important in regard to being able to work academically with data science problems.
The course builds on the knowledge acquired in the courses in the individual student’s bachelor programme; and gives an academic basis for studying the all later topics in the curriculum, as well as the master project.
In relation to the competence profile of the degree it is the explicit focus of the course to:
- Give the competence to working critically with own projects and data.
- Give skills to critically evaluate scientific publications.
- Give knowledge and understanding of choice and use of appropriate statistical methods.
Expected learning outcome
The learning objective of the course is that the student demonstrates the ability of:
- Utilizing graphics and summary methods for descriptive data analysis.
- Describing data using key statistics such as mean, variance, and correlation.
- Constructing confidence intervals for key statistics.
- Testing simple statistical hypotheses.
- Analyzing data using simple regression models.
- Designing data collection.
- Understanding central elements in published results from statistical analyses of actual data.
- Critically evaluating the appropriateness of employed methods and inferences based on these.
- Presenting statistical results in non-technical terms.
- Use R for simple statistical analyses.
Content
The following main topics are contained in the course:
- The foundation for statistical considerations.
- From population to sample and back again.
- Basic parameters and their estimation.
- Descriptive statistics (tables and graphics).
- Basal calculus
- Probabilities and distributions.
- Hypotheses and principles for tests.
- Examples of test methods: t-test, chi-square-test.
- Basic concepts underlying linear models starting from simple linear regression.
- Basic concepts with regard to study design.
- Common problems in applied statistics (types of inferential error, mass significance).
- In the course the statistical software R is used.
Literature
Examination regulations
Exam element a)
Timing
Efterår
Tests
Portfolio
EKA
N340044102
Assessment
Second examiner: Internal
Grading
7-point grading scale
Identification
Full name and SDU username
Language
Normally, the same as teaching language
Examination aids
Allowed, a closer description of the exam rules will be posted in itslearning
ECTS value
5
Additional information
Portfolio consists of quizzes, e-tests and home assignments that are assessed together.
Indicative number of lessons
Teaching Method
At the faculty of science, teaching is organized after the three-phase model ie. intro, training and study phase.
- Intro phase (lectures) - 26 hours
- Training phase: 22 hours, including 11 hours tutorials and 11hours laboratory
In the intro phase a modified version of the classical lecture is employed, where the terms and concepts of the topic are presented, from theory as well as from examples based on actual data. In these hours there is room for questions and discussions. In the training phase the students work with data-based problems and discussion topics, related to the content of the previous lectures in the intro phase. In these hours there is a possibility of working specifically with selected difficult concepts. In the study phase the students work independently with problems and the understanding of the terms and concepts of the topic. Questions from the study phase can afterwards be presented in either the intro phase hours or the study phase hours.
Study phase activities:
- Work on specific problems not covered in the training phase hours.
- Discussion of the terms and concepts and problems in regard to data collection and data quality.