BMB830: Biostatistics in R I (5 ECTS)

STADS: 01014301

Level
Master's level course

Teaching period
The course is offered in the autumn semester.

Teacher responsible
Email: veits@bmb.sdu.dk

Timetable
Group Type Day Time Classroom Weeks Comment
Common I Monday 14-16 U142 36
Common I Monday 14-16 U30 39
Common I Tuesday 14-16 T8 37
Common I Tuesday 16-18 U154 40
Common I Wednesday 10-12 U142 36
Common I Wednesday 10-12 U154 39
Common I Wednesday 10-12 U21 40
Common I Wednesday 10-12 U26A 41
Common I Thursday 08-10 U154 37
Common I Friday 08-10 U142 35
H1 TE Monday 16-18 U24A 41
H1 TE Tuesday 16-18 U24A 36
H1 TE Tuesday 14-16 U24A 39
H1 TE Thursday 10-12 U154 37
H1 TE Thursday 10-12 U21 40
H1 TE Friday 14-16 U142 36,41
H1 TE Friday 14-16 U154 39
Show entire timetable
Show personal time table for this course.

Prerequisites:
None

Academic preconditions:
Students taking the course are expected to:
  • Have knowledge in statistics 
  • Understand the basic principles of molecular biology


Course introduction
Modern experimental platforms generate large sets of often noisy data that requires its processing by appropriate analytic and statistical methods. High-confidence data interpretation is built upon correct application of methods such as statistical models and pattern recognition. Furthermore, proper visualization of the results helps presenting and understanding the results. This course introduces the students to the main concepts of biostatistics, data analysis and visualization, so they understand the principles to design and apply work flows that handle a certain data type. The course will have a theoretical and a practical part, with the objective to provide general understanding of data analysis and application of bioinformatics tools.

Among currently available software suits, the R scripting language became very popular to deal with biostatistics and analysis of large data sets, as it (i) provides a vast number of statistical tools, (ii) allows adaptation of the analysis to any experimental design, (iii) offers simple commands to operate on entire data sets, (iv) provides a wide range of methods for data visualization and (v) has a large and active community of researchers developing new tools. However, it requires the user to acquire scripting skills to take advantage of the many features.
The course will introduce the students to basic programming of R scripts, data visualization and basic statistical models necessary to deal with data from modern high-throughput experiments.



Expected learning outcome
The learning objectives of the course are that the student demonstrates the ability to:
  • independently analyze biological data sets. 
  • work with large data amounts and carry out standard statistical analysis to identify relevant features. 
  • use standard algorithms for multi-variate analysis
  • design scripts for detailed visualization of their results. 
  • apply tools for data interpretation.
  • know how to objectively discuss applied data analysis methods presented e.g. in publications.
Subject overview
The following main topics are contained in the course:
  • basic probability
  • different types of data modeling
  • basic statistical models
  • data visualization
  • data interpretation
  • basic multi-variate analysis
Literature
    Der er i øjeblikket ikke angivet nogle materialer for kurset.


Website
This course uses e-learn (blackboard).

Prerequisites for participating in the exam
Tutorial and exercises. Pass/fail, internal marking by teacher (01014312)

Assessment and marking:
Oral examination. External marking, Danish 7-mark scale (01014302)

Allowed aids: Blackboard / Whiteboard.



Expected working hours
The teaching method is based on three phase model.
Intro phase: 20 hours
Skills training phase: 16 hours, hereof:
 - Tutorials: 16 hours

Educational activities

Educational form

Language
This course is taught in English.

Course enrollment
See deadline of enrolment.

Tuition fees for single courses
See fees for single courses.