How Does the Spatial Data Redundancy Affect Query Performance in Geographic Data Warehouses?

Authors

  • Rodrigo Costa Mateus Universidade Federal de Pernambuco
  • Thiago Luís Lopes Siqueira Instituto Federal de Educação, Ciência e Tecnologia de São Paulo
  • Valéria Cesário Times Universidade Federal de Pernambuco
  • Ricardo Rodrigues Ciferri Universidade Federal de São Carlos
  • Cristina Dutra de Aguiar Ciferri Universidade de São Paulo

Keywords:

benchmark, geographic data warehouse, performance evaluation

Abstract

Geographic Data Warehouses (GDWs) are traditional data warehouses with spatial attributes that are used for defining spatial dimension tables, spatial measures and spatial hierarchies. Non-redundant spatial data warehouse schemas have been recognized as an essential issue in the GDW design.

Although the lack of spatial redundancy represents a gain in data storage, it implies in a need for performing expensive join operations to answer a given query that may refer to one or more query windows. In this paper, we investigate to what extent the separate storage of spatial and conventional data is recommended in GDW, according to increasing numbers of query windows.

We also investigate if the complexity of the spatial data (i.e. points versus polygons) influences the choice of storing spatial and conventional data in the same or in different dimension tables. Our experimental results indicated that if non-redundant spatial data are represented as point objects, an approach to avoid additional join costs by storing both point data and their descriptive data in a single table should be chosen. The results also showed that redundant GDW schemas introduce a severe drawback, as some spatial analytical queries cannot reuse previously fetched spatial data, impairing query performance.

Finally, based on the experimental results, we propose in this paper a set of guidelines for the design of logical GDW schemas, called ``Logical GDW Design Guidelines''.

Author Biographies

  • Rodrigo Costa Mateus, Universidade Federal de Pernambuco

    Informatics Center, Federal University of Pernambuco,

    50733-970, Recife, PE, Brazil

  • Thiago Luís Lopes Siqueira, Instituto Federal de Educação, Ciência e Tecnologia de São Paulo

    São Paulo Federal Institute of Education, Science and Technology, 13565-905, São Carlos, SP, Brazil

  • Valéria Cesário Times, Universidade Federal de Pernambuco

    Informatics Center, Federal University of Pernambuco,

    50733-970, Recife, PE, Brazil

  • Ricardo Rodrigues Ciferri, Universidade Federal de São Carlos

    Department of Computer Science, Federal University of São Carlos,

    13565-905, São Carlos, SP, Brazil

  • Cristina Dutra de Aguiar Ciferri, Universidade de São Paulo

    Department of Computer Science, University of São Paulo,

    13560-970, São Carlos, SP, Brazil

Downloads

Published

2010-09-09

Issue

Section

Regular Articles