Exploiting multiple paths to express scientific queries
Title | Exploiting multiple paths to express scientific queries |
Publication Type | Conference Papers |
Year of Publication | 2004 |
Authors | Lacroix Z, Moths T, Parekh K, Raschid L, Vidal M-E |
Conference Name | 16th International Conference on Scientific and Statistical Database Management, 2004. Proceedings |
Date Published | 2004/06/21/23 |
Publisher | IEEE |
ISBN Number | 0-7695-2146-0 |
Keywords | access protocols, biology computing, BioNavigation system, complex queries, Costs, Data analysis, data handling, Data visualization, data warehouse, Data warehouses, Databases, diseases, distributed databases, hard-coded scripts, information resources, Information retrieval, mediation-based data integration system, multiple paths, query evaluation, Query processing, scientific data collection, scientific discovery, scientific information, scientific information systems, scientific object of interest, scientific queries, sequences, Web resources |
Abstract | The purpose of this demonstration is to present the main features of the BioNavigation system. Scientific data collection needed in various stages of scientific discovery is typically performed manually. For each scientific object of interest (e.g., a gene, a sequence), scientists query a succession of Web resources following links between retrieved entries. Each of the steps provides part of the intended characterization of the scientific object. This process is sometimes partially supported by hard-coded scripts or complex queries that will be evaluated by a mediation-based data integration system or against a data warehouse. These approaches fail in guiding the scientists during the collection process. In contrast, the BioNavigation approach presented in the paper provides the scientists with information on the available alternative resources, their provenance, and the costs of data collection. The BioNavigation system enhances a mediation-based integration system and provides scientists with support for the following: to ask queries at a high conceptual level; to visualize the multiple alternative resources that may be exploited to execute their data collection queries; to choose the final execution path to evaluate their queries. |
DOI | 10.1109/SSDM.2004.1311231 |