Jamie Callan is a Professor of Computer Science at Carnegie Mellon's Language Technologies Institute and School of Information Systems & Management. Prior to joining CMU he was a Research Assistant Professor at the University of Massachusetts, where he also received his Ph.D. His research and teaching focus on text-based information retrieval, primarily search engine architectures, federated search of groups of search engines, adaptive information filtering, text mining, and information retrieval for educational applications. He has published more than 150 scientific papers. He is the Editor-in-Chief of ACM Transactions on Information Systems (TOIS) and was a founding Editor-in-Chief of Foundations and Trends in Information Retrieval (FnTIR). He has served as Chair of ACM SIGIR, and Program Chair of the SIGIR and CIKM conferences.
Speech Title: Search Engine Support for Software Applications (Click here for pdf file)
Abstract:Question-answering, computer-assisted language learning, text mining, and other software applications that use a full-search engine to find information in a large text corpus are becoming common. A software application may use metadata and text annotations to reduce the mismatch between the concept-based representations convenient for inference and the word-based representations typically used for text retrieval. Software applications may also be able to specify detailed requirements that retrieved passages must satisfy. This use of text search is very different than the ad-hoc, interactive search that information retrieval research typically studies.
Divesh Srivastava is the head of the Database Research Department at AT&T Labs-Research. He received his Ph.D. from the University of Wisconsin, Madison, and his B.Tech from the Indian Institute of Technology, Bombay. He is on the board of trustees of the VLDB Endowment, the associate Editor-in-Chief of the IEEE Transactions on Knowledge and Data Engineering, and an associate editor of the ACM Transactions on Database Systems. He has served as the program committee co-chair of many conferences, including VLDB 2007. His research interests and publications span a variety of topics in data management.
Speech Title: Schema Extraction (Click here for pdf file)
Abstract: Understanding the schema of a complex database is a crucial step in exploratory data analysis.
However, gaining such an understanding is challenging for new users for many reasons.
Susan Dumais is a Principal Researcher and manager of the Context, Learning and User Experience for Search (CLUES) Group at Microsoft Research. Prior to joining Microsoft Research, she was at Bellcore and Bell Labs for many years, where she worked on Latent Semantic Indexing (a statistical method for concept-based retrieval), interfaces for combining search and navigation, and organizational impacts of new technology. Her current research focuses on user modeling and personalization, context and information retrieval, temporal dynamics of information, interactive retrieval, and novel evaluation methods. She has worked closely with several Microsoft groups (Bing, Windows Desktop Search, SharePoint Portal Server, and Office Online Help) on search-related innovations. Susan has published more than 200 articles in the fields of information science, human-computer interaction, and cognitive science, and holds several patents on novel retrieval algorithms and interfaces. Susan is also an adjunct professor in the Information School at the University of Washington. She is Past-Chair of ACM's Special Interest Group in Information Retrieval (SIGIR), and serves on several editorial boards, technical program committees, and government panels. She was elected to the CHI Academy in 2005, an ACM Fellow in 2006, and received the Gerard Salton Award from SIGIR for Lifetime Achievement in 2009.
Speech Title: Temporal Dynamics and Information Retrieval (Click here for pdf file)
Abstract: Many digital resources, like the Web, are dynamic and ever-changing collections of information. However, most of the tools information retrieval and management that have been developed for interacting with Web content, such as browsers and search engines, focus on a single static snapshot of the information. In this talk, I will present analyses of how Web content changes over time, how people re-visit Web pages over time, and how re-visitation patterns are influenced by changes in user intent and content. These results have implications for many aspects of information retrieval and management including crawling, ranking and information extraction algorithms, result presentation, and evaluation. I will describe a prototype system that supports people in understanding how the information they interact with changes over time, and a new retrieval model that incorporates features about the temporal evolution of content to improve core ranking. Finally, I will conclude with an overview of some general challenges that need to be addressed to fully incorporate temporal dynamics in information retrieval and information management systems.
Biographical sketch: Gregory Grefenstette (Stanford'78, University of Pittsburgh'93) is chief science officer of Exalead, part of DassaultSystemes. An expert in natural language processing, Grefenstette established the field of Cross Language Information Retrieval by organizing its first Workshop at SIGIR'96. He is also one of the pioneers of distributional semantics, following his PhD work "Exploring Automatic Thesaurus Generation" (Kluwer, 1994). Involved in information retrieval since the early TREC days, he has always been keen on large scale solutions to natural language processing problems, co-editing with Adam Kilgarriff a special issue of "Computational Linguistics" in 2003. Former chief scientist at the Xerox Research Centre Europe (93-01), at Clairvoyance Corporation (01-04), and with the French CEA (04-08), he has been active in transferring research into products as inventor in 16 U.S patents. In recent years, he has been working with Adrian Popescu on Geographical Indexing. With Dassault Systemes, he is interested in exploring multidimensional access to information.
Speech Title: Use of Semantics in Real Life Applications (Click here for pdf file)
Abstract: Much research in computer exploitable semantics performed in the 1990s and early 2000s is finally finding its way into practical applications. This talk will explore these paths from research to industry, illustrated by current products on the market.