
In-Network Query Processing

Sam Madden CS294-1 9/30/03

Outline

• TinyDB
  – And demo!
• Aggregate Queries
• ACQP
• Break
• Adaptive Operator Placement
• …


Programming Sensor Nets Is Hard

• Months of lifetime required from small batteries
  – 3-5 days naively; can't recharge often
  – Must interleave sleep with processing
• Lossy, low-bandwidth, short-range communication
  » Nodes coming and going
  » Multi-hop
• Remote, zero-administration deployments
• Highly distributed environment
• Limited development tools
  » Embedded; LEDs for debugging!
⇒ Need a high-level abstraction

A Solution: Declarative Queries

• Users specify the data they want
  – Simple, SQL-like queries
  – Using predicates, not specific addresses
  – Our system: TinyDB
• Challenge is to provide:
  – An expressive & easy-to-use interface
  – High-level operators
  – "Transparent optimizations" that many programmers would miss
    » Sensor-net-specific techniques
  – A power-efficient execution framework

TinyDB Demo

TinyDB Architecture

SELECT AVG(temp) WHERE light > 400

[Architecture diagram: queries enter and results return through the multihop network. The on-mote query processor contains an aggregation operator (avg(temp), emitting tuples such as <T:1, AVG:225>, <T:2, AVG:250>), a selection (light > 400), tables, and samples acquired via the schema: got('temp') invokes getTempFunc(…).]

Schema: a "catalog" of commands & attributes, e.g.:
  Name: temp
  Time to sample: 50 µs
  Cost to sample: 90 µJ
  Calibration table: 3
  Units: deg. F
  Error: ± 5 deg. F

TinyDB is ~10,000 lines of embedded C code (3x larger than the 2nd-largest TinyOS program).

Declarative Queries for Sensor Networks

"Find the sensors in bright nests."

Example 1:

  SELECT nodeid, nestNo, light
  FROM sensors
  WHERE light > 400
  EPOCH DURATION 1s

Results:

  Epoch  Nodeid  nestNo  Light
  0      1       17      455
  0      2       25      389
  1      1       17      422
  1      2       25      405

Aggregation Queries

Example 2:

  SELECT AVG(sound)
  FROM sensors
  EPOCH DURATION 10s

"Count the number of occupied nests in each loud region of the island."

Example 3:

  SELECT region, CNT(occupied), AVG(sound)
  FROM sensors
  GROUP BY region
  HAVING AVG(sound) > 200
  EPOCH DURATION 10s

Results (regions w/ AVG(sound) > 200):

  Epoch  region  CNT(…)  AVG(…)
  0      North   3       360
  0      South   3       520
  1      North   3       370
  1      South   3       520

Benefits of Declarative Queries

• Specification of "whole-network" behavior
• Simple, safe
• Complex behavior via multiple queries and app logic
• Optimizable
  – Exploit (non-obvious) interactions, e.g.: ACQP operator ordering, adaptive join operator placement, lifetime selection, topology selection
• Versus other approaches, e.g., Diffusion
  – Black-box "filter" operators
  – Intanagonwiwat, "Directed Diffusion", MobiCom 2000

Outline

• TinyDB
  – And demo!
• Aggregate Queries
• ACQP
• Break
• Adaptive Operator Placement
• …

Tiny Aggregation (TAG)

• Not in today's reading
• In-network processing of aggregates
  – A common data analysis operation
    » Aka a gather operation, or a reduction in parallel programming
  – Communication-reducing
    » Operator-dependent benefit
  – Exploits query semantics to improve efficiency!

Madden, Franklin, Hellerstein, Hong. Tiny AGgregation (TAG). OSDI 2002.

Query Propagation Via Tree-Based Routing

• Tree-based routing
  – Used in:
    » Query delivery
    » Data collection
  – Topology selection is important; e.g.:
    » Krishnamachari, DEBS 2002; Intanagonwiwat, ICDCS 2002; Heidemann, SOSP 2001
    » LEACH/SPIN: Heinzelman et al., MOBICOM 99
    » SIGMOD 2003
  – A continuous process
    » Mitigates failures

[Diagram: the root broadcasts Q: SELECT … down the tree of nodes A-F; each node rebroadcasts Q to its children, and results R:{…} flow back up the tree edges.]

Basic Aggregation

• In each epoch:
  – Each node samples its local sensors once
  – Generates a partial state record (PSR)
    » Local readings
    » Readings from children
  – Outputs its PSR during its assigned communication interval
    » Communication scheduling for power reduction
• At the end of the epoch, the PSR for the whole network is output at the root
• A new result is produced on each successive epoch
• Extras:
  – Predicate-based partitioning via GROUP BY

[Diagram: five-node routing tree, each node labeled with its communication slot.]

Illustration: Aggregation

SELECT COUNT(*) FROM sensors

[Animation over five frames: a table of interval number (1-5) vs. sensor # (1-5) tracks each node's partial COUNT as it is transmitted up the routing tree during that node's assigned communication interval; by the last interval the root has merged everything and outputs the total count of 5. A toy code walk-through follows.]
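To make the frames concrete, here is a toy C walk-through of the same COUNT: each node's PSR is a single integer, and a parent adds its children's counts to its own before forwarding in its slot. The tree shape and all names are assumptions for illustration, not TinyDB code.

#include <stdio.h>

/* PSR for COUNT is one integer; merging is addition. */
static int count_subtree(int node, const int children[][2], const int n_children[]) {
    int psr = 1;  /* this node's local reading contributes 1 */
    for (int i = 0; i < n_children[node]; i++)
        psr += count_subtree(children[node][i], children, n_children);
    return psr;   /* transmitted upward during this node's comm. interval */
}

int main(void) {
    /* Assumed tree: node 1 is the root; 2 and 3 under 1; 4 and 5 under 3. */
    const int children[6][2] = { {0,0}, {2,3}, {0,0}, {4,5}, {0,0}, {0,0} };
    const int n_children[6]  = {   0,     2,     0,     2,     0,     0   };
    printf("COUNT(*) = %d\n", count_subtree(1, children, n_children)); /* 5 */
    return 0;
}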

Aggregation Framework

• As in extensible databases, TAG supports any aggregation function conforming to:

  Agg_n = {f_init, f_merge, f_evaluate}

  f_init{a_0}            → <a_0>           (a partial state record, PSR)
  f_merge{<a_1>, <a_2>}  → <a_12>
  f_evaluate{<a_1>}      → aggregate value

Example: AVERAGE

  AVG_init{v}                      → <v, 1>
  AVG_merge{<S_1,C_1>, <S_2,C_2>}  → <S_1+S_2, C_1+C_2>
  AVG_evaluate{<S, C>}             → S/C

Restriction: merge must be associative and commutative.
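A minimal C sketch of this three-function interface, instantiated for AVERAGE. Names such as psr_t, avg_init, and avg_merge are illustrative, not TinyDB's actual API.

#include <stdio.h>

typedef struct { long sum; long count; } psr_t;  /* partial state record */

psr_t  avg_init(long v)            { psr_t p = { v, 1 }; return p; }
psr_t  avg_merge(psr_t a, psr_t b) { psr_t p = { a.sum + b.sum, a.count + b.count }; return p; }
double avg_evaluate(psr_t p)       { return (double)p.sum / (double)p.count; }

int main(void) {
    /* One child merges two readings, then the root merges in its own. */
    psr_t child = avg_merge(avg_init(225), avg_init(275));  /* <500, 2> */
    psr_t root  = avg_merge(child, avg_init(250));          /* <750, 3> */
    printf("AVG = %.1f\n", avg_evaluate(root));             /* 250.0    */
    return 0;
}

Because merge is associative and commutative, PSRs may be combined in whatever order and grouping the routing tree happens to deliver them.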

Types of Aggregates

• SQL supports MIN, MAX, SUM, COUNT, AVERAGE
• Any function over a set can be computed via TAG
• In-network benefit for many operations
  – E.g., standard deviation, top/bottom N, spatial union/intersection, histograms, etc.
  – Benefit depends on the compactness of the PSR

Taxonomy of Aggregates

• TAG insight: classify aggregates according to various functional properties
  – Yields a general set of optimizations that can be applied automatically
• Properties:
  – Partial state
  – Monotonicity
  – Exemplary vs. summary
  – Duplicate sensitivity
• Drives an API!

Partial State

• Growth of PSR vs. number of aggregated values (n):
  – Distributive:      |PSR| = 1  (e.g., MIN)
  – Algebraic:         |PSR| = c  (e.g., AVG)
  – Holistic:          |PSR| = n  (e.g., MEDIAN)
  – Unique:            |PSR| = d  (e.g., COUNT DISTINCT), where d = # of distinct values
  – Content-sensitive: |PSR| < n  (e.g., HISTOGRAM)
  (Taxonomy after "Data Cube", Gray et al.)

Property: partial state. Examples: MEDIAN: unbounded PSR; MAX: 1 record. Affects: the effectiveness of TAG.

Benefit of In-Network Processing

Simulation results: 2500 nodes, 50x50 grid, depth ≈ 10, ≈ 20 neighbors, sensor values uniformly distributed over [0,100].

[Chart: total bytes transmitted vs. aggregation function, comparing EXTERNAL (no in-network processing) against in-network MEDIAN (holistic), COUNT DISTINCT (unique), AVERAGE (algebraic), and MAX (distributive).]

• The benefit is aggregate- and depth-dependent!

Outline

• TinyDB
  – And demo!
• Aggregate Queries
• ACQP
• Break
• Adaptive Operator Placement
• …

Acquisitional Query Processing (ACQP)

• Traditional DBMS: processes data already in the system
• Acquisitional DBMS: generates the data in the system!

An acquisitional query processor controls when, where, and with what frequency data is collected, versus traditional systems where data is provided a priori.

ACQP: What’s Different?

• Basic acquisitional processing
  – Continuous queries, with rates or lifetimes
  – Events for asynchronous triggering
• Avoiding acquisition through optimization
  – Sampling as a query operator
• Choosing where to sample via co-acquisition
  – Index-like data structures
• Acquiring data from the network
  – Prioritization, summary, and rate control

Lifetime Queries

• Lifetime vs. sample rate:

  SELECT … EPOCH DURATION 10 s
  SELECT … LIFETIME 30 days

• Extra: allow a MAX SAMPLE PERIOD
  – Discard some samples
  – Sampling is cheaper than transmitting

(Single-Node) Lifetime Prediction

SELECT nodeid, light LIFETIME 24 weeks

[Chart: voltage vs. time, measured vs. expected, for a lifetime goal of 24 weeks (4032 hours at 15 s per sample). A linear fit to the measured voltage (R^2 = 0.8455) tracks the expected curve; below V = 350 there is insufficient voltage to operate. A back-of-envelope sketch of the lifetime-to-rate computation follows.]
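The LIFETIME clause implies a simple translation into a sample period: spread the node's energy budget over the requested lifetime. A back-of-envelope C sketch; every constant here is an assumption for illustration, not a measured value from the deployment.

#include <stdio.h>

int main(void) {
    double lifetime_h     = 24.0 * 7.0 * 24.0; /* LIFETIME 24 weeks = 4032 h */
    double battery_mAh    = 2000.0;            /* assumed battery capacity   */
    double sleep_mA       = 0.05;              /* assumed sleep current      */
    double per_sample_mAh = 0.002;             /* assumed sample + xmit cost */

    /* Energy left for sampling after the always-present sleep baseline. */
    double budget_mAh = battery_mAh - sleep_mA * lifetime_h;
    double n_samples  = budget_mAh / per_sample_mAh;
    double period_s   = lifetime_h * 3600.0 / n_samples;

    printf("sample period = %.1f s\n", period_s); /* ~16 s with these numbers */
    return 0;
}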

Operator Ordering: Interleave Sampling + Selection

SELECT light, mag
FROM sensors
WHERE pred1(mag) AND pred2(light)
EPOCH DURATION 1s

• Traditional DBMS: both attributes are already present; σ(pred1) and σ(pred2) are just filters over them
• ACQP: sampling is itself a costly operator to be ordered
  – Costly plan: sample mag, then sample light, then apply both predicates
  – Cheap plan: sample light, apply σ(pred2), and sample mag only for tuples that pass
• The cheap ordering is correct unless pred1 is very selective and pred2 is not
• At 1 sample/sec, total power savings could be as much as 3.5 mW, comparable to the processor!

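A minimal sketch of the cheap ACQP ordering. The sensor hooks and predicate thresholds are hypothetical; the point is that the expensive magnetometer is sampled only when the cheap light predicate passes.

#include <stdbool.h>

extern int sample_light(void);  /* cheap sensor     */
extern int sample_mag(void);    /* expensive sensor */

static bool pred1(int mag)   { return mag > 100; }   /* illustrative */
static bool pred2(int light) { return light > 400; } /* illustrative */

/* Returns true iff the tuple passes both predicates; short-circuits
   acquisition so mag is never sampled when pred2 already fails. */
bool acquire_and_filter(int *light_out, int *mag_out) {
    int light = sample_light();
    if (!pred2(light)) return false;   /* mag not sampled this epoch */
    int mag = sample_mag();
    if (!pred1(mag)) return false;
    *light_out = light;
    *mag_out   = mag;
    return true;
}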
Exemplary Aggregate Pushdown

SELECT WINMAX(light, 8s, 8s)
FROM sensors
WHERE mag > x
EPOCH DURATION 1s

• Traditional DBMS: WINMAX sits above σ(mag > x), so both attributes are sampled every epoch
• ACQP: push the exemplary aggregate below the selection
  – Apply σ(mag > x) only after σ(light > MAX), i.e., only when the new light reading could change the window maximum
  – Mag sampling is the most expensive operation!
• A novel, general pushdown technique
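A hedged sketch of the pushdown. Sensor hooks are hypothetical and the periodic 8 s window reset is omitted for brevity; the point is that mag is sampled only when the new light reading could become the window maximum.

#include <limits.h>

extern int  sample_light(void);  /* cheap     */
extern int  sample_mag(void);    /* expensive */
extern void emit(int light);

static int win_max = INT_MIN;    /* current window maximum */

void per_epoch(int x) {          /* x: threshold from WHERE mag > x */
    int light = sample_light();
    if (light <= win_max) return;    /* cannot change WINMAX: skip mag */
    if (sample_mag() > x) {          /* predicate checked only now     */
        win_max = light;
        emit(light);
    }
}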

Event-Based Processing

• Epochs are synchronous
• Might want to issue queries in response to asynchronous events
  – Avoids unnecessary "polling"

  CREATE TABLE birds (uint16 cnt)
    SIZE 1 CIRCULAR

  ON EVENT bird-enter(…)
    SELECT b.cnt + 1
    FROM birds AS b
    OUTPUT INTO b
    ONCE

• In-network storage
• Placement subject to optimization

Attribute-Driven Network Construction

• Goal: co-acquisition, i.e., sensors that sample together route together
• Observation: queries are often over a limited area
  – Or some other subset of the network, e.g., regions with light value in [10,20]
• Idea: build the network topology so that like-valued nodes route through each other
  – For range queries
  – Over relatively static attributes (e.g., location)
• Maintenance issues

Tracking Co-Acquisition Via Semantic Routing Trees (SRTs)

• Idea: send range queries only to participating nodes
  – Parents maintain the value ranges of their descendants

  SELECT …
  WHERE a > 5 AND a < 12

[Diagram: root node 4 with children 1 (a: [1,10]), 2 (a: [7,15]), and 3 (a: [20,40]). Node 3's range does not overlap the query, so its subtree is excluded from both query broadcast and result collection!]

• Intervals are precomputed, and reported by children as they join

Parent Selection for SRTs

• Idea: a node picks the parent whose ancestors' interval most overlaps its descendants' interval
  – Example: node 4, with interval [3,6], chooses among nodes 1 [1,10], 2 [7,15], and 3 [20,40] under root 0:

    [3,6] ∩ [1,10]  = [3,6]
    [3,6] ∩ [7,15]  = ∅
    [3,6] ∩ [20,40] = ∅

    so node 4 picks node 1 (a toy sketch of both SRT decisions follows)
• Other selection policies in the paper!
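A toy C sketch of both SRT decisions: overlap tests drive query forwarding, and overlap size drives parent selection. The intervals are the ones from these slides; the code itself is illustrative, not TinyDB's implementation. Closed integer intervals are assumed, so WHERE a > 5 AND a < 12 becomes [6,11].

#include <stdio.h>

typedef struct { int lo, hi; } interval_t;

/* Do two closed intervals intersect? */
static int overlaps(interval_t a, interval_t b) {
    return (a.lo > b.lo ? a.lo : b.lo) <= (a.hi < b.hi ? a.hi : b.hi);
}

/* Length of the intersection; -1 marks the empty interval. */
static int overlap_len(interval_t a, interval_t b) {
    int lo = a.lo > b.lo ? a.lo : b.lo;
    int hi = a.hi < b.hi ? a.hi : b.hi;
    return hi >= lo ? hi - lo : -1;
}

int main(void) {
    interval_t parents[3] = { {1,10}, {7,15}, {20,40} }; /* nodes 1, 2, 3 */

    /* Parent selection: node 4's descendants cover [3,6]. */
    interval_t mine = { 3, 6 };
    int best = 0;
    for (int i = 1; i < 3; i++)
        if (overlap_len(mine, parents[i]) > overlap_len(mine, parents[best]))
            best = i;
    printf("node 4 picks parent %d\n", best + 1);        /* parent 1 */

    /* Query dissemination: WHERE a > 5 AND a < 12. */
    interval_t query = { 6, 11 };
    for (int i = 0; i < 3; i++)
        printf("child %d %s\n", i + 1,
               overlaps(parents[i], query) ? "participates" : "is excluded");
    return 0;
}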

Simulation Result

[Chart: active nodes (0-450) vs. query size as a fraction of the value range (0, 0.05, 0.1, 0.2, 0.5, 1), comparing Best Case (Expected), Closest Parent, and All Nodes. Uniform value distribution, 20x20 grid, ideal connectivity to 8 neighbors.]

Outline

• TinyDB
  – And demo!
• Aggregate Queries
• ACQP
• Break
• Adaptive Operator Placement
• …


Adaptive & Decentralized Operator Placement

• IPSN 2003 paper

Main Idea

• Place operators near data sources
  – The greater an input's rate, the closer the operator should sit to it
• For each operator:
  – Explore candidate neighbors
  – Migrate to lower-cost placements
  – Via extra messages
• Proper placement depends on path lengths and the relative rates of the inputs (Rate A vs. Rate B); see the cost sketch below.
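A hedged sketch of the cost model the slide implies, with made-up rates and hop counts: each candidate placement is scored by rate times hop count, summed over the input streams and the output path, and the operator migrates toward the cheapest candidate.

#include <stdio.h>

int main(void) {
    double rate_a = 10.0, rate_b = 2.0;  /* tuples/sec from sources A and B */
    double out_rate = 1.0;               /* assumed join output rate        */
    /* Candidate placements: {hops from A, hops from B, hops to the sink}. */
    int hops[3][3] = { {1, 4, 3}, {3, 2, 3}, {4, 4, 0} };

    int best = 0; double best_cost = 1e18;
    for (int i = 0; i < 3; i++) {
        double cost = rate_a * hops[i][0] + rate_b * hops[i][1]
                    + out_rate * hops[i][2];
        printf("candidate %d: %.1f msgs/sec\n", i, cost);
        if (cost < best_cost) { best_cost = cost; best = i; }
    }
    /* High-rate input A pulls the operator toward itself: candidate 0 wins. */
    printf("migrate toward candidate %d\n", best);
    return 0;
}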

“Adaptivity” in Databases

• Adaptivity: changing query plans on the fly
  – Typically at the physical level:
    » Where the plan runs
    » Ordering of operators
    » Instantiations of operators, e.g., hash join vs. merge join
  – Non-traditional
    » Conventionally, complete plans are built prior to execution, using cost estimates collected from history
  – Important in volatile or long-running environments
    » Where a priori estimates are unlikely to be good, e.g., sensor networks

Adaptivity for Operator Placement

• Adaptivity comes at a cost
  – Extra work on each operator, for each tuple
  – In a DBMS, processing per tuple is small
    » 100s of instructions per operator, unless you have to hit the disk!
• Costs in this case?
  – Extra communication hurts
    » Finding candidate placements (exploration): cost advertisements from the local node, new costs from candidates
    » Moving state (migration): joins, windowed aggregates

Do Benefits Justify Costs?

• Not evaluated!
  – 3x reduction in messages vs. external processing
  – But excluding exploration & migration costs
• Seems somewhat implausible, especially given the added complexity
  – Hard to make the migration protocol work
    » Depends on the ability to reliably quiesce child operators
• What else could you do?
  – Static placement

Summary

• Declarative QP
  – Simplifies data collection in sensornets
  – In-network processing and query optimization for performance
• Acquisitional QP
  – Focuses on the costs associated with sampling data
  – A new challenge of sensornets and other streaming systems?
• Adaptive join placement
  – In-network optimization
  – Some benefit, but practicality unclear
  – Operator pushdown is still a good idea

Open Problems

• Many; a few:
  – In-network storage and operator placement
  – Dealing with heterogeneity
  – Dealing with loss
• Need real implementations of many of these ideas
• See me! ([email protected])

Questions / Discussion

Making TinyDB REALLY Work

• Berkeley Botanical Garden: first "real" deployment
• Requirements:
  – At least 1 month of unattended operation
  – Support for calibrated environmental sensors
  – Multi-hop routing
• What we started with:
  – Limited power management, no time synchronization
  – Motes crashed hard occasionally
  – Limited, relatively untested multihop routing

Power Consumption in Sensornets

• Waking current ~12 mA
  – Fairly evenly spread between sensors, processor, and radio
• Sleeping current 20-100 µA
• Power consumption is dominated by sensing and reception:
  – 1 s power-up on the Mica2Dot sensor board
  – Most mote apps use an "always on" radio
    » Completely unstructured communication
    » Bad for battery life

Why Not Use TDMA?

• CSMA is very flexible: easy for new nodes to join
  – Reasonably scalable (relative to Bluetooth)
• CSMA is implemented and available
  – We wanted to build something that worked

Power Management Approach

• Coarse-grained communication scheduling

[Diagram: each epoch (10s to 100s of seconds) begins with a 2-4 s waking period in which all motes (IDs 1-5) communicate; for the rest of the epoch they sleep ("zzz"). A sketch of the resulting event loop follows.]
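A minimal sketch of that schedule as a mote-side loop. The platform hooks (system_time_ms, deep_sleep_ms, idle_wait_ms, radio on/off) and constants are hypothetical, not TinyDB's implementation.

#include <stdint.h>

#define EPOCH_MS  30000u   /* e.g., a 30 s sample period         */
#define WAKING_MS  3000u   /* 3 s waking period = 10% duty cycle */

extern uint32_t system_time_ms(void);
extern void deep_sleep_ms(uint32_t ms);  /* radio & sensors off */
extern void idle_wait_ms(uint32_t ms);   /* radio stays on      */
extern void radio_on(void);
extern void radio_off(void);
extern void sample_and_send(void);

void epoch_loop(void) {
    for (;;) {
        uint32_t t = system_time_ms() % EPOCH_MS;
        if (t >= WAKING_MS) {                 /* outside waking period:  */
            deep_sleep_ms(EPOCH_MS - t);      /* sleep to epoch boundary */
            continue;
        }
        radio_on();                           /* CSMA inside the period  */
        sample_and_send();
        uint32_t u = system_time_ms() % EPOCH_MS;
        if (u < WAKING_MS)
            idle_wait_ms(WAKING_MS - u);      /* keep listening          */
        radio_off();
    }
}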

Benefits / Drawbacks

• Benefits
  – Can still use CSMA within the waking period
    » No reservation required: new nodes can join easily!
  – Waking period duration is easily tunable, depending on network size
• Drawbacks
  – Longer waking time vs. TDMA?
    » Could stagger slots based on tree depth
  – No "guaranteed" slot reservation
    » Nothing is guaranteed anyway

Challenges

• Time synchronization
• Fault recovery
• Joining, starting, and stopping
• Parameter tuning
  – Waking period: hardcode at 4 s?

Time Synchronization

• All messages include a 5-byte timestamp indicating system time in ms
  – Synchronize (i.e., set local system time to the timestamp) on:
    » Any message from the parent
    » Any new query message (even if not from the parent)
  – Punt on multiple queries
  – Timestamps are written just after the preamble is transmitted
• All nodes agree that the waking period begins when (system time % epoch duration == 0)
  – And lasts for WAKING_PERIOD ms
(A sketch of the synchronization rule follows.)
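A sketch of that rule: adopt the sender's timestamp from any message sent by the parent, or from any new query message. The message layout and setter are hypothetical stand-ins for the real 5-byte wire format.

#include <stdint.h>

extern void set_system_time_ms(uint64_t t);

typedef struct {
    uint16_t sender_id;
    uint8_t  is_new_query;  /* new query announcement?              */
    uint64_t timestamp_ms;  /* 5-byte field in the real wire format */
} msg_t;

void on_receive(const msg_t *m, uint16_t my_parent_id) {
    /* Sync on any parent message, or on any new query message. */
    if (m->sender_id == my_parent_id || m->is_new_query)
        set_system_time_ms(m->timestamp_ms);
    /* The waking period then starts whenever system time % epoch dur == 0. */
}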

Wrinkles

• If a node hasn't heard from its parent for k epochs
  – Switch to "always on" mode for 1 epoch
• If the waking period ends
  – Punt on this epoch
• Query dissemination / deletion via an always-on basestation and viral propagation
  – Data messages serve as advertisements for running queries
  – Explicit requests for missing queries
  – Explicit "dead query" messages for stopped queries
    » Don't request the last dead query

Results

• 30 nodes in the Garden
  – Lasted 21 days at a 10% duty cycle (sample period = 30 s, waking period = 3 s)
  – Next deployment: 1-2% duty cycle, plus clock fixes to reduce baseline power; should last at least 60 days
• Time sync test: 2,500 readings
  – 5 readings "out of sync"
  – 15 s epoch duration, 3 s waking period

Garden Data

Sensor placement by height (tree top ≈ 36 m): 33 m: node 111; 32 m: 110; 30 m: 109, 108, 107; 20 m: 106, 105, 104; 10 m: 103, 102, 101.

[Charts: humidity vs. time and temperature vs. time for nodes 101, 104, 109, 110, and 111, from 7/7/03 9:40 through 7/9/03 10:03.]

Data Quality

[Chart: sensor id vs. number of results delivered, summarized by parent sensor. Per-node delivery ranged from 3% to 100% of expected results, with most nodes between 35% and 75%.]