19  Sensor Fusion Architectures

In 60 Seconds

Sensor fusion architectures define how and where data from multiple sensors is combined. The three main patterns – centralized, distributed, and hierarchical – offer different trade-offs in accuracy, scalability, and fault tolerance. The Dasarathy taxonomy classifies fusion systems by their input/output abstraction levels, from raw data combination to decision-level voting.

Learning Objectives

After completing this chapter, you will be able to:

  • Design centralized, distributed, and hierarchical fusion architectures
  • Apply the Dasarathy taxonomy for fusion classification
  • Choose appropriate fusion architectures for IoT applications
  • Evaluate trade-offs between architecture options

Key Concepts

  • Centralised fusion: All raw sensor data is transmitted to a central node for processing — simple to implement but creates a bandwidth bottleneck and single point of failure.
  • Decentralised fusion: Each sensor or node performs local processing and shares only estimates or features with neighbours, reducing communication overhead and improving fault tolerance.
  • Hierarchical fusion: A tree-structured architecture where low-level nodes fuse nearby sensors, intermediate nodes fuse local estimates, and a central node produces the final global estimate — balancing communication efficiency with accuracy.
  • Distributed fusion: A peer-to-peer architecture where nodes share estimates with neighbours and iteratively converge to a global consensus without any central coordinator.
  • Track-to-track fusion: Combining state estimates (tracks) from multiple independent trackers rather than raw observations, reducing communication bandwidth at the cost of some information loss.
  • JDL (Joint Directors of Laboratories) model: A standard five-level data fusion reference model: Level 0 (signal processing), Level 1 (object refinement), Level 2 (situation refinement), Level 3 (threat refinement), Level 4 (process refinement).

Imagine you have several sensors in a room – a thermometer, a motion detector, and a light sensor. Each one gives you a piece of the puzzle about what is happening. Sensor fusion is the process of combining these readings to get a more complete and accurate picture. The key question is: where do you combine them? You could send all raw data to one powerful computer (centralized), let each sensor make its own decision and then vote (distributed), or organize sensors into teams that summarize their findings before passing them up to a coordinator (hierarchical). Each approach has trade-offs in speed, reliability, and accuracy.

19.1 Fusion Architecture Types

Diagram comparing centralized, distributed, and hierarchical sensor fusion architectures showing data flow patterns and processing nodes

19.1.1 Architecture Comparison

Centralized

Pros: Optimal fusion and the simplest processing pipeline.

Cons: Single point of failure with the highest bandwidth demand.

Best fit: Small-scale systems with dependable backhaul.

Distributed

Pros: Scalable, fault-tolerant, and efficient on the network.

Cons: More coordination logic and less globally optimal fusion.

Best fit: Large IoT networks spread across many nodes.

Hierarchical

Pros: Balanced accuracy, modular deployment, and manageable bandwidth.

Cons: Moderate system complexity across tiers.

Best fit: Smart buildings, campuses, and smart city rollouts.

Worked Example: Choosing a Fusion Architecture for a Smart Campus

A university deploys 800 environmental sensors across 12 buildings to monitor HVAC efficiency. Each sensor (BME680) reports temperature, humidity, pressure, and VOC every 30 seconds. The campus facilities team needs fused “comfort index” values per room, per floor, and campus-wide.

Data volume calculation:

  • Per sensor per reading: 4 values x 4 bytes + 12 bytes metadata = 28 bytes
  • Per sensor per day: 28 bytes x 2,880 readings = 80.6 KB/day
  • Total raw data: 800 sensors x 80.6 KB = 64.5 MB/day

Architecture A: Centralized – All 800 sensors send raw data to one campus server.

Metric Value
Network bandwidth 64.5 MB/day (0.75 KB/s sustained)
Server compute 2,304,000 readings/day, single Kalman filter instance
Latency (sensor to decision) 2-5 seconds (network + queue + processing)
Failure impact Server down = zero monitoring for all 12 buildings
Infrastructure cost 1 server ($3,000) + campus-wide network backhaul

Architecture B: Distributed – Each sensor independently computes its own comfort index and broadcasts its decision.

Metric Value
Network bandwidth 9.2 MB/day (each sensor sends a 4-byte decision packet instead of 28 raw bytes: 86% reduction)
Edge compute Each sensor runs local threshold logic (fits in 64 KB RAM)
Latency (sensor to decision) <100 ms (local processing)
Failure impact 1 sensor fails = 1 room unmonitored, rest unaffected
Infrastructure cost $0 additional (runs on sensor MCU)

Problem: No cross-room correlation. Cannot detect that Building A’s HVAC is pulling cold air into Building B through connected ductwork because each sensor only sees its own room.

Architecture C: Hierarchical – Sensors fuse per-room (4-8 sensors each), room nodes fuse per-floor, floor nodes fuse per-building.

Metric Value
Network bandwidth 10.8 MB/day (room-level aggregates sent up, 83% reduction from centralized)
Compute distribution Room: simple averaging (MCU). Floor: anomaly detection (Raspberry Pi). Building: cross-floor optimization (edge server)
Latency (sensor to room decision) <200 ms
Latency (sensor to campus decision) 3-8 seconds
Failure impact Room node fails = 1 room dark, floor/building still functional
Infrastructure cost 100 room gateways ($50 each = $5,000) + 12 building servers ($500 each = $6,000) = $11,000

Decision: The campus chose Architecture C (Hierarchical) despite higher infrastructure cost because:

  1. Cross-building HVAC correlation required floor-level fusion (ruled out pure distributed)
  2. A single-server failure blacking out all 12 buildings was unacceptable (ruled out centralized)
  3. Hierarchical allowed incremental deployment – they started with 3 buildings and added 9 more over 18 months

6-month result: Hierarchical fusion detected that Buildings 3 and 7 shared a return air plenum, causing temperature oscillations (Building 3’s heater triggered Building 7’s cooler in a feedback loop). Fixing the plenum damper saved $14,200/year in energy waste – an insight impossible with per-sensor distributed fusion and discovered within 2 weeks of enabling cross-building correlation at the campus level.

The bandwidth savings from hierarchical aggregation are substantial. Raw data transmission (Architecture A):

\[\text{Bandwidth} = 800 \text{ sensors} \times 28 \text{ bytes/reading} \times 2 \text{ readings/min} = 44,800 \text{ bytes/min}\]

Daily bandwidth: \(44,800 \times 1440 = 64.5\) MB/day or 23.5 GB/year.

Hierarchical aggregation (Architecture C) reduces this through three stages:

  • Room level: 4-8 sensors → 1 avg value per room = 6× reduction
  • Floor level: 10 rooms → 5 zone averages = 2× reduction
  • Building level: 12 buildings → 1 campus value = No reduction (need all buildings)

Total bandwidth after aggregation: \(64.5 \text{ MB} \div 6 \approx 10.8\) MB/day. Over 3 years, the bandwidth saving is:

\[\text{Saved} = (23.5 - 3.9) \times 3 = 58.8 \text{ GB}\]

At $0.09/GB for cellular IoT data plans, hierarchical fusion saves $1.76/year in bandwidth alone – modest, but the real payoff is in analytics. The $11,000 infrastructure investment pays for itself in under 10 months solely from the $14,200 annual HVAC energy savings, not counting bandwidth.

Latency budget: Room decision in <200ms. Campus decision in 3-8 seconds. The 5-second difference allows for sequential processing: Room → Floor (1s) → Building (2s) → Campus (5s total). Each level has time to apply Kalman filtering with 10 samples (200ms × 10 = 2s window) before aggregating upward.

19.2 Data Fusion Taxonomy (Dasarathy Classification)

The Dasarathy taxonomy provides a formal framework for classifying sensor fusion systems based on their input and output abstraction levels.

Dasarathy taxonomy diagram showing five fusion categories from Data-In Data-Out to Decision-In Decision-Out with input and output abstraction levels

19.2.1 The Five Dasarathy Categories

DAI-DAO

Flow: Data -> Data

Raw sensor values are combined into a refined raw estimate.

Example: Two temperature sensors averaged together.

DAI-FEO

Flow: Data -> Feature

Raw readings are transformed into a higher-level feature.

Example: Accelerometer and gyroscope data become motion features.

FEI-FEO

Flow: Feature -> Feature

Multiple feature sets are merged into a richer description.

Example: Wi-Fi RSSI and BLE features combined.

FEI-DEO

Flow: Feature -> Decision

Features are classified into a discrete state or label.

Example: Heart rate and motion -> “User sleeping”.

DEI-DEO

Flow: Decision -> Decision

Independent decisions are fused into one final outcome.

Example: Camera and radar agree to BRAKE.

19.2.2 When to Use Each Category

  • DAI-DAO: Need single “best” raw measurement from redundant sensors
  • DAI-FEO: Raw data needs transformation into meaningful metrics
  • FEI-FEO: Combining complementary features from different sensor types
  • FEI-DEO: Features need classification into categorical outputs
  • DEI-DEO: Multiple independent classifiers vote on final decision

19.3 Smart Home Multi-Sensor Example

A smart home illustrates how the Dasarathy categories apply in practice. Temperature and humidity sensors fuse at the data level (DAI-DAO) to produce a comfort index. Motion, light, and door sensors extract occupancy features (DAI-FEO), which are then classified into states like “home,” “away,” or “sleeping” (FEI-DEO). Finally, the comfort system and security system each make independent decisions that are combined at the decision level (DEI-DEO) to determine actions like adjusting the thermostat or arming the alarm.

Smart home multi-sensor fusion diagram showing temperature and humidity becoming a comfort index, motion, light, and door readings becoming occupancy features, then combined decisions driving thermostat, lighting, and alarm actions

19.4 Architecture Trade-offs

Tradeoff: Early Fusion vs Late Fusion

Information preservation

Early fusion: Maximum signal detail is retained.

Late fusion: Minimal detail remains once only decisions are shared.

Computational cost

Early fusion: Very high because raw streams must be aligned and processed together.

Late fusion: Low because each node publishes a compact result.

Sensor heterogeneity and timing

Early fusion: Difficult with mixed sensors and strict synchronization requirements.

Late fusion: Easier to mix sensors and tolerate asynchronous updates.

Failure handling

Early fusion: Harder because one bad stream can disturb the shared pipeline.

Late fusion: Easier because nodes can fail independently and degrade gracefully.

Choose Early Fusion when:

  • Sensors measure complementary aspects (RGB + depth)
  • Cross-sensor correlations are important
  • Maximum accuracy is required

Choose Late Fusion when:

  • Sensors are heterogeneous with different rates
  • Graceful degradation is required
  • Computational resources are limited
Tradeoff: Sensor-Level vs Central Fusion

Latency

Sensor-level fusion: Lower because decisions stay local.

Central fusion: Higher because data must cross the network first.

Bandwidth

Sensor-level fusion: Lower by shipping compressed features or decisions.

Central fusion: Higher when raw sensor streams are transmitted.

Accuracy

Sensor-level fusion: Can be limited by local context and compute.

Central fusion: Benefits from the full dataset and richer cross-sensor correlation.

Reliability

Sensor-level fusion: Graceful degradation when one node drops out.

Central fusion: A single central failure can halt the whole service.

Choose Sensor-Level Fusion when:

  • Network bandwidth is constrained
  • Local latency requirements are strict
  • Privacy requires data to stay on-device

Choose Central Fusion when:

  • Sensors are heterogeneous and require complex correlation
  • Historical context is needed
  • Computational resources at edge are limited

19.5 Worked Example: Autonomous Forklift Safety System

Scenario: Design fusion architecture for autonomous forklift pedestrian detection.

Requirements:

  • <200ms latency from detection to brake
  • Recall above 99.9%
  • <1 false alarm per hour

Sensors:

  • 2D LIDAR (10 Hz, 360 deg, 30m range)
  • Stereo camera (30 Hz, 90 deg FOV)
  • Ultrasonic array (20 Hz, 180 deg, 5m range)

Architecture Decision: Cascaded fusion

  1. Stage 1 (LIDAR, 40ms): Fast point cloud clustering, flag all potential pedestrians
  2. Stage 2 (Camera, 80ms): Run CNN only on LIDAR-flagged regions
  3. Stage 3 (Ultrasonic, 20ms): Confirm close-range detections

Fusion Logic:

Alert if:
  (LIDAR AND Camera) OR
  (Ultrasonic AND Camera) OR
  (LIDAR confidence > 0.95)

Result: 99.92% recall, 0.7 false alarms/hr, 155ms latency – meeting all three requirements.

Key Insight: Cascaded fusion leverages each sensor’s strengths – LIDAR for fast spatial detection, camera for visual classification, ultrasonic for close-range confirmation.

19.6 Code Example: Kalman Filter for Sensor Fusion

The Kalman filter is the most widely used algorithm for combining noisy sensor data. Here is a simplified implementation fusing a GPS and an accelerometer for position tracking:

predicted_pos = pos + velocity * dt + 0.5 * accel * dt**2
predicted_var = pos_var + process_noise

innovation = gps_pos - predicted_pos
kalman_gain = predicted_var / (predicted_var + gps_noise)

fused_pos = predicted_pos + kalman_gain * innovation
updated_var = (1 - kalman_gain) * predicted_var

Why Kalman filtering matters for IoT:

Delivery drone position

Without fusion: GPS jumps 5-10 m between readings.

With Kalman fusion: Smooth trajectory with less than 1 m drift between GPS updates.

Fitness tracker steps

Without fusion: Accelerometer-only counts miss 15-25% of steps.

With Kalman fusion: Gyroscope-aided fusion keeps the error below 5%.

Indoor-outdoor transition

Without fusion: GPS loss means the position becomes unknown.

With Kalman fusion: The IMU carries the estimate until GPS recovers.

19.6.1 Explore: Fusion Architecture Bandwidth Calculator

Use this calculator to compare bandwidth requirements across the three fusion architectures for your own IoT deployment scenario.

19.7 Database Selection for Fusion Systems

Sensor fusion systems generate data at different rates and require different storage strategies:

InfluxDB

Best for: Time-series sensor data.

Write speed: 1M+ points/sec.

Query pattern: Time-range queries and downsampling.

Example: Raw sensor readings and aggregated metrics.

TimescaleDB

Best for: Time-series data with SQL joins.

Write speed: 500K+ points/sec.

Query pattern: Relational queries across sensor types.

Example: Fusion pipelines with metadata context.

MongoDB

Best for: Heterogeneous sensor documents.

Write speed: 100K+ docs/sec.

Query pattern: Flexible schema queries.

Example: Multi-format fusion results.

SQLite

Best for: Edge-local storage.

Write speed: 10K+ inserts/sec.

Query pattern: Simple on-device queries.

Example: Local buffering before cloud sync.

Redis

Best for: Real-time state caches.

Write speed: 1M+ ops/sec.

Query pattern: Fast key-value lookups.

Example: Current sensor states and fusion intermediates.

Storage capacity planning – adjust the parameters below to estimate storage needs for your fusion system:

Concept Relationships

Builds On:

Enables:

Architecture Patterns:

Related Topics:

Decision Support:

Try It Yourself

Experiment: Compare Fusion Architectures with Simulated Sensors

Implement all three fusion architectures and measure their characteristics:

Setup:

  • Simulate 10 temperature sensors in a building
  • Inject realistic noise (±0.5°C) and occasional sensor failures
  • True temperature drifts ±2°C over 1 hour

Architecture 1: Centralized

# All sensors → single fusion node
fused_temp = kalman_filter([sensor1, sensor2, ..., sensor10])

Architecture 2: Distributed

# Each sensor fuses locally, broadcasts decision
decisions = [sensor.classify_temperature() for sensor in sensors]
vote = majority_vote(decisions)

Architecture 3: Hierarchical

# 3 zones (3-4 sensors each) → building level
zone1 = avg([sensor1, sensor2, sensor3])
zone2 = avg([sensor4, sensor5, sensor6, sensor7])
zone3 = avg([sensor8, sensor9, sensor10])
building = weighted_avg([zone1, zone2, zone3])

Measurements:

  • Accuracy: RMS error vs ground truth
  • Latency: Time from sensor reading to fused output
  • Bandwidth: Data transmitted over network
  • Fault tolerance: Accuracy when 3 sensors fail

Expected Results:

Centralized

Best accuracy, highest latency and bandwidth, weakest fault tolerance.

Distributed

Moderate accuracy, lowest latency and bandwidth, strongest fault tolerance.

Hierarchical

Good accuracy with medium latency, medium bandwidth, and solid resilience.

Common Pitfalls

Centralised fusion requires transmitting all raw sensor data to a single node — in large-scale IoT deployments this communication overhead can dwarf the processing cost. Calculate the communication budget before choosing centralised vs distributed fusion.

Each tier in a hierarchical fusion system reduces information as it aggregates. Verify that the accuracy achieved at the top tier is adequate for the application after all intermediate aggregation steps, not just in the best-case scenario.

In a distributed fusion network, the failure of a node that aggregates data from 50 sensors silently removes all those sensors from the system. Design explicit failure detection and isolation so that node failures are visible and handled gracefully.

Distributed fusion architectures require time-synchronised measurements from all participating nodes. The synchronisation protocol overhead (NTP, PTP) can consume a significant fraction of the communication budget. Include it in resource budgeting.

19.8 Summary

Sensor fusion architectures determine system performance:

  • Centralized: Optimal but single point of failure
  • Distributed: Scalable and fault-tolerant
  • Hierarchical: Balanced approach for complex systems
  • Dasarathy taxonomy: Classifies fusion by input/output abstraction levels
  • Trade-offs: Early vs late fusion, sensor-level vs central

Sensor fusion architectures are like different ways to organize a team of detectives solving a mystery together!

19.8.1 The Sensor Squad Adventure: Three Ways to Solve a Mystery

The Sensor Squad had a big mystery to solve: “Who left the window open in the Smart School?” Principal Processor gave them three plans to choose from.

Plan A – The Big Meeting (Centralized): Sammy the Sensor, Lila the LED, Max the Microcontroller, and Bella the Battery all bring their clues to ONE big meeting room. Sammy says, “I felt cold air at 3 PM!” Lila adds, “I saw extra light from outside!” Max reports, “I heard the wind sensor go crazy!” Principal Processor looks at ALL the clues together and figures it out. This works great for a small team, but what if Principal Processor gets sick? Nobody can solve the mystery!

Plan B – Neighborhood Watch (Distributed): Each squad member investigates their OWN hallway. Sammy checks the science wing, Lila checks the art room, Max checks the gym, and Bella checks the library. They each make their own guess about which window is open, then they VOTE on the answer. If Sammy gets confused, the others can still solve it!

Plan C – Team Captains (Hierarchical): The squad splits into two sub-teams. Team Temperature (Sammy and a thermometer friend) figures out WHERE it is cold. Team Light (Lila and a brightness buddy) figures out WHERE extra light is coming in. Then Team Captain Max combines the sub-team reports. This is balanced – not too slow, not too risky!

Principal Processor smiled: “Choose your architecture based on the mission! Small mission? Use Plan A. Big school? Use Plan B. Medium? Plan C!”

19.8.2 Key Words for Kids

Word What It Means
Centralized Everyone sends data to ONE boss who makes the decision
Distributed Everyone makes their own decision, then they vote
Hierarchical Small teams decide first, then team captains combine results
Architecture The plan for how a system is organized
Key Takeaway

Fusion architecture choice determines system reliability, scalability, and accuracy. Centralized fusion provides optimal accuracy but creates a single point of failure. Distributed fusion scales well and tolerates failures but may sacrifice accuracy. Hierarchical fusion balances both concerns and is the most common choice for production IoT systems. Use the Dasarathy taxonomy to classify what level of abstraction your fusion operates at.

19.9 What’s Next

Study concrete fusion applications built on these architectures

Data Fusion Applications

Learn the Kalman filter algorithm used in many architectures

Kalman Filter for IoT

Explore particle filters for non-linear fusion

Particle Filters

Understand best practices for architecture selection

Data Fusion Best Practices

Return to the module overview

Data Fusion Introduction