Email: rolf@imada.sdu.dk
During the course, the participants will implement a search engine. This will take place in work groups of somewhat larger sizes than usual for course projects, in order to gain experience with cooperation and project management in sizable software projects.
Expected learning outcome
After the course, the student is expected to be able to:
- explain in details the algorithmic and mathematical methods behind the main components of a search engine for WWW, including the components for data collection, indexing, query answering, and ranking of results.
- list the statistical aspects of the web graph taught in the course.
- explain the web graph models taught in the course, and describe their properties.
- explain other algorithmic and mathematical methods taught in the course of relevance to analysis of the Internet, including measures for web page similarity.
- during execution of the above demonstrate precision in wording and in use of mathematics and logic, as well as ability to select central and important parts of the subject under discussion.
- construct a well-functioning prototype of a web search engine by implementing the algorithmic methods behind the main components of such a search engine, including the components for data collection, indexing, query answering, and ranking of results.
- in a clear and well-structured language document the work done and the design choices made, including a description of the main structure of the program and of the principles of the algorithmically and programming-wise central parts of the program.
Subject overview
The anatomy of a search engine: web crawling, indexing, ranking, query handling. Specific subjects include: Internet protocols, algorithms and data structures for textual data, handling of massive data sets, compression, and link-based ranking. Additional research subjects to be discussed include: classic information retrieval, clustering, graph models for the Internet, web caching, and applications of game theory on the Internet.
Literature