Conference Home Page

Toronto Life Magazine

eye Magazine

toronto.com

Important Dates

Submitting Research Papers

Submitting Posters and Demos

Submitting Workshop Proposals

Submitting Tutorial Proposals

Printed Call for Papers (PDF)

Events

Keynote

Banquet

Tutorials

Workshops

Registration

Accommodation

Schedule at a Glance (PDF)

Area Coordinators

Conference Committee

Local Arrangements Committee

Contact

The Association for Computing Machinery
 
Microsoft Research
 
IBM Research
 
Altavista
 
Hummingbird
 
University of Waterloo
 
University of Toronto
 

SIGIR 2003 Afternoon Tutorial
XML Information Retrieval
July 28, 14:00-17:30 at the Jackson Room, Hilton Hotel
Ricardo Baeza-Yates & Norbert Fuhr

This tutorial covers the main concepts related to retrieving information from data structured in XML. The content is divided in three parts. The first covers the main concepts of XML, including defining, displaying and querying XML data. The second part addresses the challenges of retrieving information from XML data, in particular ranking and benchmarking. The last part covers indices for structure and their algorithms, focusing in the trade-off of expressivity vs. efficiency. Along the tutorial the state of the art of the above issues is emphasized, including current available software.

Ricardo Baeza-Yates is Ph.D. in Computer Science (University of Waterloo, Canada). Presently he is Professor and Chair of the Computer Science Department of the Universidad de Chile. He is also president of the Latin-American Center for Studies in Informatics (CLEI), an association of more than 50 computer science departments in Latin America; member of the IEEE-CS Board of Governors. His fields of research are information retrieval, algorithms, and information visualization. He is co-author of the book Modern Information Retrieval, published in 1999 by Addison-Wesley, as well as co-author of the 2nd edition of the Handbook of Algorithms and Data Structures, Addison-Wesley, 1991; and co-editor of Information Retrieval: Algorithms and Data Structures, Prentice-Hall, 1992, among other publications in journals published by ACM, IEEE or SIAM.

Norbert Fuhr holds a Ph.D. (Dr.) in computer science from the University of Darmstadt, Germany, From 1991-2002, he was professor at the computer science department of the University of Dortmund. Since 2002, he is professor in the computer science department of the University of Duisburg-Essen. He is well known for his work on probabilistic and logic-based IR models. His current research interests are IR methods for XML, user-oriented access to digital libraries, retrieval in networked information systems, and evaluation methods for IR. He is the founder and co-leader of the INEX initiative for the evaluation of XML