Page 1 |
Save page Remove page | Previous | 1 of 192 | Next |
|
small (250x250 max)
medium (500x500 max)
large ( > 500x500)
Full Resolution
All (PDF)
|
This page
All
Subset |
A DATA INTEGRATION APPROACH TO DYNAMICALLY FUSING
GEOSPATIAL SOURCES
by
Snehal Thakkar
A Thesis Presented to the
FACULTY OF THE GRADUATE SCHOOL
UNIVERSITY OF SOUTHERN CALIFORNIA
In Partial Fulfillment of the
Requirements for the Degree
MASTER OF SCIENCE
(COMPUTER SCIENCE)
December 2007
Copyright 2007 Snehal Thakkar
Object Description
| Title | A data integration approach to dynamically fusing geospatial sources |
| Author | Thakkar, Snehal M. |
| Author email | snehalth@usc.edu |
| Degree | Doctor of Philosophy |
| Document type | Thesis |
| Degree program | Computer Science |
| School | Viterbi School of Engineering |
| Date defended/completed | 2007-08-30 |
| Date submitted | 2007 |
| Restricted until | Unrestricted |
| Date published | 2007-09-19 |
| Advisor (committee chair) | Knoblock, Craig A. |
| Advisor (committee member) |
Ambite, Jose Luis Shahabi, Cyrus Wilson, John P. |
| Abstract | Accurate and efficient integration of geospatial data is an important problem with implications in critical areas such as emergency response and urban planning. Some of the key challenges in supporting large-scale geospatial data integration are: (1) automatically representing a large number of geospatial source available on the web by utilizing various geospatial data access standards, (2) handling different geospatial data formats and access patterns, (3) assessing the quality of the data provided by a large number of geospatial sources, and (4) automatically providing high quality answers to the user queries based on a quality criteria supplied by the user. In this thesis I describe my research on efficient and accurate integration of geospatial data from a large number of sources. In particular, I describe a representation methodology for declaratively describing the content and the quality of data provided by sources in a data integration system. I discuss methods to automatically generate the descriptions of both the content and the quality of data provided by geospatial sources. I describe a quality-driven query answering algorithm that exploits the descriptions of the content provided by the geospatial sources to generate an initial data integration plan that answers a given user query and optimizes the generated plan by utilizing the description of the quality of data provided by the sources and the quality criteria specified by the user. I also present a mapping of the generated integration plan into a program that can be efficiently executed by a streaming, dataflow-style execution engine. I implement my techniques in a framework called Quality-driven Geospatial Mediator (QGM). My experimental evaluation in automatically representing over 1200 real-world geospatial sources shows that QGM accurately generates the descriptions of the content and the quality of geospatial sources.; The empirical evaluation of QGM's query answering techniques using over 1200 real-world sources shows that QGM provides better quality data in response to the user queries compared to the traditional data integration systems and does so with lower response time. |
| Keyword | geospatial data quality; geospatial data integration |
| Language | English |
| Part of collection | University of Southern California dissertations and theses |
| Publisher (of the original version) | University of Southern California |
| Place of publication (of the original version) | Los Angeles, California |
| Publisher (of the digital version) | University of Southern California. Libraries |
| Type | texts |
| Legacy record ID | usctheses-m825 |
| Rights | Thakkar, Snehal M. |
| Repository name | Libraries, University of Southern California |
| Repository address | Los Angeles, California |
| Repository email | http://www.usc.edu/isd/libraries/services/ask_a_librarian/email/ |
| Filename | etd-Thakkar-20070919 |
| Archival file | uscthesesreloadpub_Volume14/etd-Thakkar-20070919.pdf |
Description
| Title | Page 1 |
| Full text | A DATA INTEGRATION APPROACH TO DYNAMICALLY FUSING GEOSPATIAL SOURCES by Snehal Thakkar A Thesis Presented to the FACULTY OF THE GRADUATE SCHOOL UNIVERSITY OF SOUTHERN CALIFORNIA In Partial Fulfillment of the Requirements for the Degree MASTER OF SCIENCE (COMPUTER SCIENCE) December 2007 Copyright 2007 Snehal Thakkar |
Comments
Post a Comment for Page 1

