Skip to main content

2014 | Buch

Data Mining Techniques in Sensor Networks

Summarization, Interpolation and Surveillance

verfasst von: Annalisa Appice, Anna Ciampi, Fabio Fumarola, Donato Malerba

Verlag: Springer London

Buchreihe : SpringerBriefs in Computer Science

insite
SUCHEN

Über dieses Buch

Sensor networks comprise of a number of sensors installed across a spatially distributed network, which gather information and periodically feed a central server with the measured data. The server monitors the data, issues possible alarms and computes fast aggregates. As data analysis requests may concern both present and past data, the server is forced to store the entire stream. But the limited storage capacity of a server may reduce the amount of data stored on the disk. One solution is to compute summaries of the data as it arrives, and to use these summaries to interpolate the real data. This work introduces a recently defined spatio-temporal pattern, called trend cluster, to summarize, interpolate and identify anomalies in a sensor network. As an example, the application of trend cluster discovery to monitor the efficiency of photovoltaic power plants is discussed. The work closes with remarks on new possibilities for surveillance enabled by recent developments in sensing technology.

Inhaltsverzeichnis

Frontmatter
Chapter 1. Sensor Networks and Data Streams: Basics
Abstract
Recent advances in pervasive computing and sensor technologies have significantly influenced the field of geosciences, by changing the type of dynamic environmental phenomena that can be detected, monitored, and reacted to. Another important aspect is the real-time data delivery of novel platforms. In this chapter, we describe the specific characteristics of sensor data and sensor networks. Furthermore, we identify the most promising streaming models, which can be embedded in intelligent sensor platforms and used to mine real-time data for a variety of analytical insights.
Annalisa Appice, Anna Ciampi, Fabio Fumarola, Donato Malerba
Chapter 2. Geodata Stream Summarization
Abstract
The management of massive amounts of geodata collected by sensor networks creates several challenges, including the real-time application of summarization techniques, which should allow the storage of this unbounded volume of georeferenced and timestamped data in a server with a limited memory for any future query. SUMATRA is a summarization technique, which accounts for spatial and temporal information of sensor data to produce the appropriate trade-off between size and accuracy of geodata summarization. It uses the count-based model to process the stream. In particular, it segments the stream into windows, computes summaries window-by-window, and stores these summaries in a database. The trend clusters are discovered as a summary of each window. They are clusters of georeferenced data, which vary according to a similar trend along the time horizon of the window. Signal compression techniques are also considered to derive a compact representation of these trends for storage in the database. The empirical analysis of trend clusters contributes to assess the summarization capability, the accuracy, and the efficiency of the trend cluster-based summarization schema in real applications. Finally, a stream cube, called geo-trend stream cube, is defined. It uses trends to aggregate a numeric measure, which is streamed by a sensor network and is organized around space and time dimensions. Space-time roll-up and drill-down operators allow the exploration of trends from a coarse-grained and inner-grained hierarchical view.
Annalisa Appice, Anna Ciampi, Fabio Fumarola, Donato Malerba
Chapter 3. Missing Sensor Data Interpolation
Abstract
Ubiquitous sensor stations continuously measure several geophysical variables over large zones and long (potentially unbounded) periods of time. However, observations can cover neither every space location nor every time. Interpolation, i.e., the estimation of unknown data in each location or time of interest, can be used to supplement station records. Although in GIScience there has been a tendency to treat space and time separately, there is now great interest in analyzing data in both the domains. This suggests that integrating space and time would yield better results than treating them separately, when interpolating several geophysical fields. This chapter contributes to the investigation of spatiotemporal interpolators in a remote-sensing scenario. We describe two interpolation techniques, which use trend clusters to interpolate missing data. The former performs the estimation phase by using the Inverse Distance Weighting approach, while the latter uses Kriging. Both have been adapted to a sensor network scenario. The proposed techniques have been evaluated in a large air-climate sensor network. The empirical study compares the accuracy and efficiency of both techniques.
Annalisa Appice, Anna Ciampi, Fabio Fumarola, Donato Malerba
Chapter 4. Sensor Data Surveillance
Abstract
A growing volume of geodata requires for appropriate data management systems, which ensure data acquisition and memory-preserving storage as well as continuous surveillance of this unbounded amount of georeferenced data. Trend cluster discovery, as a spatiotemporal aggregate operator, may play a crucial role in the surveillance process of the sensor data. We describe a computation-preserving algorithm, which employs an incremental learning strategy to continuously maintain sliding window trend clusters across a sensor network. The analysis of trend clusters, which are discovered at the consecutive sliding windows, is useful to look for possible changes in the data, as well as to produce forecasts of the future.
Annalisa Appice, Anna Ciampi, Fabio Fumarola, Donato Malerba
Chapter 5. Sensor Data Analysis Applications
Abstract
A PhotoVoltaic (PV) plant is a power station which converts sunlight energy into electric energy. In the last decade, PV plants have become ubiquitous in several countries of the European Union, due to a valuable policy of economic incentives (e.g., feed-in tariffs). Today, this ubiquity of PV plants has paved the way to the marketing of new smart systems, designed to monitor the energy production of a PV plant grid and supply intelligent services for customer and production applications. In this chapter, we start moving in this direction by fulfilling the urgent request of PV customers and PV companies to enjoy knowledge-based managing and monitoring services, integrated within a PV plant network. In particular, we illustrate a business intelligence solution developed to monitor the efficiency of the energy production of PV plants and a data mining solution for the fault diagnosis in PV plants.
Annalisa Appice, Anna Ciampi, Fabio Fumarola, Donato Malerba
Backmatter
Metadaten
Titel
Data Mining Techniques in Sensor Networks
verfasst von
Annalisa Appice
Anna Ciampi
Fabio Fumarola
Donato Malerba
Copyright-Jahr
2014
Verlag
Springer London
Electronic ISBN
978-1-4471-5454-9
Print ISBN
978-1-4471-5453-2
DOI
https://doi.org/10.1007/978-1-4471-5454-9