Advanced Search »
Newsletter
Unsubscribe »
National Science Foundation Award #0312200

ITR: Querying Web Resources Using Metadata in a Database

 
Investigator(s): Gultekin Ozsoyoglu (PI) ; Z. Meral Ozsoyoglu (Co-PI)
Sponsor: Case Western Reserve University, OH 44106 2163684510
Start Date/Expiration Date 2003-08-01 to 2006-07-31 (amended 2005-06-06)
Awarded Amount to Date: $281,000
Abstract: This project investigates web querying techniques for accessing web information resources. The term information resource refers to large web-accessible resources such as the ACM Digital Library. We propose a semantics-based way of accessing a web information resource: extract metadata about topics and relationships from the web resource, extend the metadata with "importance scores", and query it from a database. The query language is extended with constructs (a) to propagate importance scores to the query output to rank query output, and (b) to define "stopping conditions" to reduce query evaluation times. For some query requests, the metadata in the database may not be sufficient to answer queries. Our research direction is to locate more informative query answers by mixing database querying with "focused crawling" in the web information resource, at the algebraic operator level of SQL queries. These queries allow time constraints, and relax the closed world assumption, making it necessary to redefine the notion of well-defined queries. Data extraction techniques will be employed to extract metadata. Variations of our basic approach that do not require direct database query engine changes will be evaluated. Standalone web applications will be developed, and made available.
NSF Org: IIS - Division of Information & Intelligent Systems
Award Number: 0312200
Award Instrument: Continuing grant
Program Manager: Maria Zemankova
IIS Division of Information & Intelligent Systems
CSE Directorate for Computer & Information Science & Engineering
NSF Program(s): ITR SMALL GRANTS
Field Application(s): Information Systems
Program Reference Code(s): ADVANCED SOFTWARE TECH & ALGOR, 9216
BASIC RESEARCH & HUMAN RESORCS, 9218
INFORMATION MANAGEMENT, 1655
RES EXPER FOR UNDERGRAD-SUPPLT, 9251
UNDERGRADUATE EDUCATION, 9178
Program Element Code(s): 1686