International PhD School in Formal Languages and Applications

Text Retrieval

Tarragona, September 2004


Maxime Crochemore, mac@univ-mlv


This course is devoted to algorithms that process strings and texts efficiently. Such algorithms are used in software for operating system utilities, Internet search engines, data retrieval systems, analysis of genetic sequences, and natural language processing, among other domains. The underlying methodology borrows elements from automata theory and combinatorics on words.


Lectures include the following topics:

Related textbooks

Visualization of algorithms

Text Searching

Searching a list of strings --- Suffix Arrays

Structures for indexing texts
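
As a rough illustration of the suffix array topic listed above, here is a minimal sketch in Python. It is not taken from the course material: the function names build_suffix_array and find_occurrences are hypothetical, and the construction simply sorts all suffixes, which is far slower than the dedicated construction algorithms a course on the subject would cover. The search uses two binary searches over the sorted suffixes to locate every occurrence of a pattern.

```python
def build_suffix_array(text):
    """Return the start positions of all suffixes of text, in lexicographic order.

    Naive construction (an assumption made for brevity): sort the positions
    by the suffix they denote. Efficient algorithms avoid this cost.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text, sa, pattern):
    """Return the sorted start positions of pattern in text, using suffix array sa."""
    m = len(pattern)

    # First binary search: the first suffix whose length-m prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    first = lo

    # Second binary search: the first suffix whose length-m prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    # Suffixes in sa[first:lo] are exactly those that start with pattern.
    return sorted(sa[first:lo])


if __name__ == "__main__":
    text = "banana"
    sa = build_suffix_array(text)
    print(sa)
    print(find_occurrences(text, sa, "ana"))
```

The key observation is that all suffixes sharing a given prefix occupy a contiguous range of the suffix array, so a pattern of length m can be located with O(m log n) character comparisons once the array is built.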

