![]() |
||||||||||||||||||||||||||||||||||||||||||
![]() |
||||||||||||||||||||||||||||||||||||||||||
|
This tutorial covers the main concepts related to retrieving information from data structured in XML. The content is divided in three parts. The first covers the main concepts of XML, including defining, displaying and querying XML data. The second part addresses the challenges of retrieving information from XML data, in particular ranking and benchmarking. The last part covers indices for structure and their algorithms, focusing in the trade-off of expressivity vs. efficiency. Along the tutorial the state of the art of the above issues is emphasized, including current available software. Ricardo Baeza-Yates is Ph.D. in Computer Science (University of Waterloo, Canada). Presently he is Professor and Chair of the Computer Science Department of the Universidad de Chile. He is also president of the Latin-American Center for Studies in Informatics (CLEI), an association of more than 50 computer science departments in Latin America; member of the IEEE-CS Board of Governors. His fields of research are information retrieval, algorithms, and information visualization. He is co-author of the book Modern Information Retrieval, published in 1999 by Addison-Wesley, as well as co-author of the 2nd edition of the Handbook of Algorithms and Data Structures, Addison-Wesley, 1991; and co-editor of Information Retrieval: Algorithms and Data Structures, Prentice-Hall, 1992, among other publications in journals published by ACM, IEEE or SIAM. Norbert Fuhr holds a Ph.D. (Dr.) in computer science from the University of Darmstadt, Germany, From 1991-2002, he was professor at the computer science department of the University of Dortmund. Since 2002, he is professor in the computer science department of the University of Duisburg-Essen. He is well known for his work on probabilistic and logic-based IR models. His current research interests are IR methods for XML, user-oriented access to digital libraries, retrieval in networked information systems, and evaluation methods for IR. He is the founder and co-leader of the INEX initiative for the evaluation of XML |