National Science Foundation Award #0237381

CAREER: Improving Information Access by Learning from User Interactions

 
Investigator(s): Thorsten Joachims (PI)
Sponsor: Cornell University - Endowed, NY 14853, phone 607-255-5014
Start Date/Expiration Date: 2003-09-01 to 2006-08-31 (amended 2005-07-08)
Awarded Amount to Date: $240,000
Abstract: The project takes a machine learning approach to improving the effectiveness of information access tools, in particular the retrieval quality of search engines. The ability to learn enables a search engine to automatically adapt its retrieval strategy to individual users, to specific user groups, and to particular WWW sites. A search engine should learn, for example, that a query for "Michael Jordan" issued by a user at cs.cornell.edu is much more likely to refer to the professor at UC Berkeley than the same query issued by an average user. Similarly, a search engine should be able to adapt to collection properties, for example, that in a particular intranet the H1 headlines, not the TITLE field, contain the most important information. Since explicit user feedback is rarely available, implicit feedback derived from observable user behavior is used as the input to the learning algorithms. Such implicit feedback requires new machine learning methods, since it comes in forms that differ from the standard machine learning settings. For example, in search engines it is more reasonable to exploit clickthrough data as feedback in the form of pairwise preferences (e.g. "for query Q, document A should be ranked higher than document B") than as absolute relevance feedback. The project analyzes the reliability of implicit clickthrough data, designs and analyzes learning methods, and evaluates their applicability on an educational database, providing a service to the scientific community. Beyond this direct contribution, the technology can be used to improve the performance of general-purpose search engines such as Google and hence has broader impacts beyond the scientific community. Information on this project is available on the web at http://www.cs.cornell.edu/People/tj/career. The resulting software will be made available for download.
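Illustration: the pairwise-preference form of feedback mentioned in the abstract can be sketched in a few lines of Python. This is a minimal sketch, not the project's method. It assumes one common heuristic that the abstract does not state: a clicked document is taken to be preferred over every unclicked document that was ranked above it. The function name preferences_from_clicks and the sample data are hypothetical.

```python
from typing import List, Set, Tuple


def preferences_from_clicks(ranking: List[str], clicked: Set[str]) -> List[Tuple[str, str]]:
    """Turn one query's clickthrough log into (preferred, less_preferred) pairs.

    Assumed heuristic (not from the award abstract): a clicked document is
    preferred over each unclicked document that appeared above it.
    """
    prefs: List[Tuple[str, str]] = []
    for position, doc in enumerate(ranking):
        if doc not in clicked:
            continue
        # Every unclicked document shown above a clicked one was skipped.
        for above in ranking[:position]:
            if above not in clicked:
                prefs.append((doc, above))
    return prefs


# Hypothetical log for a query Q: four results shown, the user clicked only "C".
ranking = ["A", "B", "C", "D"]
pairs = preferences_from_clicks(ranking, {"C"})
print(pairs)
```

Each pair reads "for query Q, the first document should be ranked higher than the second". Nothing is inferred about document D, because it was shown below the click and the user may never have seen it.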
NSF Org: IIS - Division of Information & Intelligent Systems
Award Number: 0237381
Award Instrument: Continuing grant
Program Manager: Maria Zemankova
IIS Division of Information & Intelligent Systems
CSE Directorate for Computer & Information Science & Engineering
NSF Program(s): INFORMATION & KNOWLEDGE MANAGE
Field Application(s): Human Subjects, Information Systems
Program Reference Code(s): ADVANCED SOFTWARE TECH & ALGOR, 9216
FACULTY EARLY CAREER DEVELOPMENT PROGRAM, 1045
Program Element Code(s): 6855