Efficient Prediction of Difficult Keyword Queries over
Databases
ABSTRACT:
Keyword queries
on databases provide easy access to data, but often suffer from low ranking
quality, i.e., low precision and/or recall, as shown in recent benchmarks. It
would be useful to identify queries that are likely to have low ranking quality
to improve the user satisfaction. For instance, the system may suggest to the
user alternative queries for such hard queries. In this paper, we analyze the
characteristics of hard queries and propose a novel framework to measure the
degree of difficulty for a keyword query over a database, considering both the
structure and the content of the database and the query results. We evaluate
our query difficulty prediction model against two effectiveness benchmarks for
popular keyword search ranking methods. Our empirical results show that our
model predicts the hard queries with high accuracy. Further, we present a suite
of optimizations to minimize the incurred time overhead.
EXISTING SYSTEM:
Ø There have been collaborative efforts to provide
standard benchmarks and evaluation platforms for keyword search methods over
databases. One effort is the data-centric track of INEX WorkshopQueries were
provided by participants of the workshop. Another effort is the series of
Semantic Search Challenges (SemSearch).The results indicate that even with structured data, finding the
desired answers tokeyword queries is still a hard task. More interestingly,
looking closer to the ranking quality of the best performing methods on both
workshops.
DISADVANTAGES
OF EXISTING SYSTEM:
Ø Suffer from low ranking quality.
Ø Performing very poorly on a subset of queries.
PROPOSED SYSTEM:
Ø We set forth a principled framework and proposed novel
algorithms to measure the degree of the difficulty of a query over a DB, using
the ranking robustness principle.
Ø Based on our framework, we propose novel algorithms
that efficiently predict the effectiveness of a keyword query.
ADVANTAGES
OF PROPOSED SYSTEM:
Ø Easily mapped to both XML and relational data.
Ø Higher prediction accuracy and minimize the incurred
time overhead.
SYSTEM
REQUIREMENTS:
HARDWARE REQUIREMENTS:
Ø
System : Pentium IV 2.4 GHz.
Ø
Hard Disk :
40 GB.
Ø
Floppy Drive : 1.44
Mb.
Ø
Monitor : 15
VGA Colour.
Ø
Mouse :
Logitech.
Ø Ram : 512 Mb.
SOFTWARE
REQUIREMENTS:
Ø Operating system : Windows
XP/7.
Ø Coding Language : JAVA/J2EE
Ø IDE : Netbeans 7.4
Ø Database : MYSQL
REFERENCE:
Shiwen Cheng,
Arash Termehchy, and Vagelis Hristidis,“Efficient Prediction of Difficult
Keyword Queries over Databases”,VOL. 26, NO. 6, JUNE 2014.
No comments:
Post a Comment