How Does the Spatial Data Redundancy Affect Query Performance in Geographic Data Warehouses?

Rodrigo Costa Mateus; Thiago Luís Lopes Siqueira; Valéria Cesário Times; Ricardo Rodrigues Ciferri; Cristina Dutra de Aguiar Ciferri

Authors

Rodrigo Costa Mateus Universidade Federal de Pernambuco
Thiago Luís Lopes Siqueira Instituto Federal de Educação, Ciência e Tecnologia de São Paulo
Valéria Cesário Times Universidade Federal de Pernambuco
Ricardo Rodrigues Ciferri Universidade Federal de São Carlos
Cristina Dutra de Aguiar Ciferri Universidade de São Paulo

Keywords:

benchmark, geographic data warehouse, performance evaluation

Abstract

Geographic Data Warehouses (GDWs) are traditional data warehouses with spatial attributes that are used for defining spatial dimension tables, spatial measures and spatial hierarchies. Non-redundant spatial data warehouse schemas have been recognized as an essential issue in the GDW design.

Although the lack of spatial redundancy represents a gain in data storage, it implies in a need for performing expensive join operations to answer a given query that may refer to one or more query windows. In this paper, we investigate to what extent the separate storage of spatial and conventional data is recommended in GDW, according to increasing numbers of query windows.

We also investigate if the complexity of the spatial data (i.e. points versus polygons) influences the choice of storing spatial and conventional data in the same or in different dimension tables. Our experimental results indicated that if non-redundant spatial data are represented as point objects, an approach to avoid additional join costs by storing both point data and their descriptive data in a single table should be chosen. The results also showed that redundant GDW schemas introduce a severe drawback, as some spatial analytical queries cannot reuse previously fetched spatial data, impairing query performance.

Finally, based on the experimental results, we propose in this paper a set of guidelines for the design of logical GDW schemas, called ``Logical GDW Design Guidelines''.

Author Biographies

Rodrigo Costa Mateus, Universidade Federal de Pernambuco

Informatics Center, Federal University of Pernambuco,
50733-970, Recife, PE, Brazil
Thiago Luís Lopes Siqueira, Instituto Federal de Educação, Ciência e Tecnologia de São Paulo

São Paulo Federal Institute of Education, Science and Technology, 13565-905, São Carlos, SP, Brazil
Valéria Cesário Times, Universidade Federal de Pernambuco

Informatics Center, Federal University of Pernambuco,
50733-970, Recife, PE, Brazil
Ricardo Rodrigues Ciferri, Universidade Federal de São Carlos

Department of Computer Science, Federal University of São Carlos,
13565-905, São Carlos, SP, Brazil
Cristina Dutra de Aguiar Ciferri, Universidade de São Paulo

Department of Computer Science, University of São Paulo,
13560-970, São Carlos, SP, Brazil

How Does the Spatial Data Redundancy Affect Query Performance in Geographic Data Warehouses?

Authors

Keywords:

Abstract

Author Biographies

Downloads

Published

Issue

Section

Developed By

Language

Information