DM837: Big Data (10 ECTS)

STADS: 15017001

Level
Master's level course

Teaching period
The course is offered in the spring semester.

Teacher responsible
Email: zhou@imada.sdu.dk

Timetable
Group   Type  Day       Time   Classroom         Weeks                    Comment
Common  I     Tuesday   10-12  IMADA Seminarrum  06-08,10-13,15-22
H1      TE    Monday    12-14  IMADA Seminarrum  20
H1      TE    Thursday  10-12  IMADA Seminarrum  06-08,10-13,15-19,21-22

Comment:
Unlimited number of participants

Prerequisites:
None

Academic preconditions:
Students are expected to know the content of DM505 Database Design and Programming and DM532 Principles of Database Systems. This course cannot be taken if DM822 Cloud Computing has already been taken or is currently being taken.

Course introduction
The main purpose of this course is to give participants an understanding of the technologies used for Big Data analysis and management. It covers traditional methods used in data warehouses and parallel database systems, real-time data stream processing systems, and modern technologies for cloud computing and massively parallel data analysis platforms.

Expected learning outcome
  • Explain the techniques of data warehouses and parallel database systems.
  • Explain the techniques of data stream processing.
  • Account for the theories behind massively parallel data analysis systems.
  • Explain the design of, and trade-offs in, the modern systems introduced in the course.
  • Develop programs and apply tools for big data management and analysis, and deploy them on a cloud computing platform.
  • Report the work done in the assignments in clear and precise language, and in a structured fashion.

Subject overview
Data warehouses, parallel database systems, massively parallel data analysis, approximate query processing, parallel data stream processing, fault tolerance, load balancing, load shedding, dynamic scaling, data partitioning, distributed transactions, Hadoop, MapReduce, GraphLab, Spark, Storm.

Literature
    To be announced at the start of the course.


Website
This course uses e-learn (Blackboard).

Prerequisites for participating in the exam
Assignments. Pass/fail, internal evaluation. (15017022)

Assessment and marking:
  1. Oral exam. External examiner, graded on the Danish 7-point scale (5 ECTS). (15017002)
  2. Project. External examiner, graded on the Danish 7-point scale (5 ECTS). (15017012)

Expected working hours
The teaching method is based on the three-phase model.
Intro phase: 32 hours
Skills training phase: 24 hours

Educational activities
Study phase: 24 hours

Language
This course is taught in English.

Course enrollment
See the enrollment deadline.

Tuition fees for single courses
See fees for single courses.