English

DM847: Introduction to Bioinformatics (10 ECTS)

STADS: 15017301

Level
Master's level course

Teaching period
The course is offered when needed.

Teacher responsible

Email: jbaumbac@imada.sdu.dk

Timetable

Group	Type	Day	Time	Classroom	Weeks	Comment
Common	I	Monday	14-16	U107	38	Bioinfo seminar series
Common	I	Tuesday	12-14	IMADA semi	49
Common	I	Wednesday	12-14	IMADA semi	36-40,43,45-49
Common	I	Wednesday	12-14	U48	41	DM847
Common	I	Wednesday	12-14	U105	44
Common	I	Thursday	10-12	IMADA semi	36-41,43-48

Show entire timetable
Show personal time table for this course.

Comment:
Ubegrænset deltagerantal

Prerequisites:
None

Academic preconditions:
The content of DM507 Algorithms and Data Structures should be known.

Course introduction
The purpose of this course is to give an introduction to bioinformatics research. In each class, we will start with a concrete biological and/or medical question, transform it into a computational problem formulation, design a mathematical model, solve it, and finally derive and evaluate real-world answers from within the model. The course aims at providing the basic insights in modern bioinformatics research. It will be designed as prerequisite for a planned Bioinformatics II special course.

Expected learning outcome

Explain and understand the central dogma of molecular biology, central aspects of gene regulation, the basic principle of epigenetic DNA modifications, and specialties w.r.t. bacteria & phage genetics
Model ontologies for biomedical data dependencies
Design of systems biology databases
Explain and implement DNA & amino acid sequence analysis methods (HMMs, scoring matrices, and efficient statistics with them on data structures like suffix arrays)
Explain and implement statistical learning methods on biological networks (network enrichment, GraphLets)
Explain the specialties of bacterial genetics (the operon prediction trick).
Explain and implement methods for suffix trees, suffix arrays, and the Burrows-Wheeler transformation
Explain de novo sequence pattern screening with EM algorithm and entropy models.

Subject overview
Central dogma of molecular genetics, epigenetics, and bacterial and phage genetics, design of online databases for molecular biology content (ontologies, and example databases: NCBI, CoryneRegNet, ONDEX), DNA and amino acid sequence pattern models (HMMS, scoring matrices, mixed models, efficient statistics with them on big data sets), specialities in bacterial genetics (sequence models and functional models for operons prediction), de novo identification of transcription factor binding motifs (recursive expectation maximization, entropy-based models), analysis of next-generation DNA sequencing data sets (memory-aware short sequence read mapping data with Burrows Wheeler transformation and suffix arrays, bi-modal peak calling), visualization of biological networks (graph layouting: small but highly variable graphs vs. huge but rather static graphs), systems biology and statistics on networks (network enrichment with CUSP, jActiveModules and KeyPathwayMiner, Graphlet degree signatures)

Literature

Meddeles ved kursets start

Website
This course uses e-learn (blackboard).

Prerequisites for participating in the exam
None

Assessment and marking:
Oral exam, Danish 7-mark scale, external examiner.

Expected working hours
The teaching method is based on three phase model.
Intro phase: 41 hours
Skills training phase: 41 hours, hereof:
- Tutorials: 41 hours

Educational activities

Language
This course is taught in English.

Course enrollment
See deadline of enrolment.

Tuition fees for single courses
See fees for single courses.