Define edge data acquisition and explain why processing at the network periphery is essential for IoT systems
Classify IoT device categories and match acquisition strategies to device capabilities
Apply sampling and compression techniques to reduce data volume while preserving analytical value
Calculate power budgets for edge devices using duty cycling formulas
Design gateway architectures that bridge constrained devices to cloud infrastructure
Key Concepts
Analog-to-Digital Conversion (ADC): The process of converting continuous analog sensor signals (voltage, current) to discrete digital values with a resolution determined by the ADC bit depth (e.g., 12-bit gives 4096 levels).
Sampling rate: The frequency at which sensor readings are taken; must be at least twice the highest frequency of interest (Nyquist criterion) to avoid aliasing artefacts in the sampled signal.
Anti-aliasing filter: An analog low-pass filter applied before ADC conversion to remove signal components above the Nyquist frequency, preventing them from appearing as false low-frequency artefacts in the digital signal.
Signal conditioning: Hardware circuits (amplifiers, level shifters, filters) that adapt the raw sensor output signal to the acceptable input range of the ADC or microcontroller.
Timestamp accuracy: The precision and synchronisation of timestamps attached to sensor readings; for multi-sensor fusion and anomaly detection, timestamps must be accurate to within the relevant time resolution of the phenomena being measured.
Data pipeline latency: The total time from a physical event occurring to the event being available for analytics — composed of sensor response time, ADC conversion time, buffering delay, transmission latency, and processing time.
In 60 Seconds
Edge data acquisition is the foundation of every IoT system — it defines how raw physical phenomena are converted to digital readings, buffered, validated, and prepared for transmission or local processing. Getting acquisition right (correct sampling rate, noise filtering, timestamp accuracy) determines the quality ceiling for all downstream analytics.
Minimum Viable Understanding
Edge data acquisition collects sensor data at the network periphery and transmits only what is needed – 90% of raw IoT data is never analyzed, so local filtering and aggregation are critical.
Three device categories (Big Things, Small IP Things, Non-IP Things) require fundamentally different acquisition strategies based on their compute power, connectivity, and energy constraints.
Transmission dominates power budgets – batching data and maximizing sleep time through duty cycling can extend battery life from days to years.
Sensor Squad: Edge Data Acquisition
Sammy the Sensor is stationed in a greenhouse monitoring temperature every second. But sending every single reading to the cloud would be like Sammy calling headquarters 86,400 times a day!
Instead, Lila the Logic Chip helps Sammy be smarter: “Hey Sammy, if the temperature hasn’t changed much, just keep a running average. Only call headquarters when something interesting happens – like the temperature suddenly spiking!”
Max the Messenger (the radio module) is relieved. “I use a LOT of energy every time I transmit,” Max explains. “If Lila filters the data first, I only wake up a few times an hour instead of every second. My battery lasts months instead of days!”
Bella the Base Station (the gateway) collects messages from dozens of sensors like Sammy. She translates their different languages (protocols) into one the cloud understands, and keeps a backup in case the internet goes down.
The lesson: Smart sensors don’t just collect data – they decide what’s worth sending!
For Beginners: What is Edge Data Acquisition?
Think of edge data acquisition like a local newspaper reporter versus a national news network.
A local reporter (edge device) collects news from the neighborhood and decides what’s important enough to send to the national headquarters (cloud). They don’t send everything – just the highlights. This saves time, money, and keeps headquarters from being overwhelmed.
The “Edge” is simply where your sensors live:
Location
Example
Why “Edge”?
Your thermostat
Living room wall
At the edge of your network
Factory sensor
On a machine
Far from the central servers
Traffic camera
Roadside pole
Collecting data at the source
Why not just send everything to the cloud?
Challenge
Without Edge Processing
With Edge Processing
Speed
Wait for cloud response (100-500 ms)
Instant local decisions (<10 ms)
Battery
Constant transmission drains battery
Send only summaries, save power
Bandwidth
Network gets clogged
Only important data travels
Privacy
All data goes to remote servers
Sensitive data stays local
Real-world example: A security camera generates 1 GB of video per hour. Instead of sending all of that to the cloud, edge processing detects motion and only uploads the 5-second clips that matter – reducing data by 95%.
45.2 Prerequisites
Before starting this section, you should be familiar with:
Basic electrical concepts: Current, voltage, and power calculations for battery life analysis
45.3 Overview
Edge data acquisition is the process of collecting, processing, and transmitting sensor data at the network periphery – where physical devices meet the digital infrastructure. This section covers the fundamental concepts and techniques for efficient data collection at the IoT edge.
Key Principle
Collect raw data at the edge, but only transmit what’s needed – 90% of IoT data is never analyzed. Edge processing reduces latency, saves bandwidth, extends battery life, and protects privacy.
Interactive: Sensor-to-Network Data Pipeline
45.4 Edge Data Acquisition Architecture
The following diagram illustrates the end-to-end flow of data from sensors at the edge through gateways to the cloud, highlighting where filtering, aggregation, and compression occur.
45.5 IoT Device Categories
Different device types require fundamentally different acquisition strategies. The following diagram classifies the three main categories and their characteristics.
Putting Numbers to It
Consider three devices collecting air quality data:
Big Thing (IP camera with AI): 1920×1080 RGB frames at 30 fps generate: \[\text{Data rate} = 1920 \times 1080 \times 3\text{ bytes} \times 30\text{ fps} = 186{,}624{,}000\text{ bytes/s} \approx 178\text{ MB/s}\]
Even with H.264 compression (50:1), this is 3.56 MB/s – requires Wi-Fi or Ethernet.
Small IP Thing (ESP32 temperature sensor): 16-bit reading every 60s: \[\text{Daily data} = 2\text{ bytes} \times 1{,}440\text{ samples} = 2{,}880\text{ bytes} = 2.8\text{ KB/day}\]
Works on cellular (NB-IoT) or LoRaWAN with multi-year battery life.
Runs 5+ years on coin cell, but needs gateway for IP translation. Device category determines connectivity path, power budget, and data handling strategy.
45.6 Chapter Contents
This topic is organized into three focused chapters:
Understanding the foundation of edge data acquisition:
IoT Device Categories: Big Things, Small IP Things, and Non-IP Things
Connectivity Paths: Direct connection vs. gateway-mediated access
Data Generation Patterns: How different device types produce vastly different data volumes
Power Budget Framework: Decision tree for selecting acquisition strategies
Key takeaway: Match your acquisition strategy to device capabilities – cameras need compression, temperature sensors need aggregation, non-IP devices need protocol translation.
Algorithm Selection: Decision tree for choosing compression based on analytics requirements
Key takeaway: Apply the 90% rule – aggregate “normal” readings locally, transmit only summaries and anomalies. This extends battery life from days to years.
Missing Data Handling: Imputation strategies and health monitoring
Key takeaway: Transmission dominates power budget – batch data and maximize sleep time to extend battery life from months to years.
45.7 Data Reduction Decision Framework
Use the following decision tree to select the right data reduction strategy for your edge device.
45.8 Worked Example: Smart Agriculture Sensor Node
Real-World Scenario: Vineyard Soil Monitoring
Context: A vineyard deploys 200 soil moisture sensors across 50 hectares to optimize irrigation. Each sensor runs on a CR2032 coin cell battery (225 mAh) and uses LoRaWAN to communicate with a gateway.
Requirements:
Measure soil moisture every 15 minutes
Detect rapid moisture drops (e.g., irrigation failure) within 5 minutes
Battery life target: at least 2 years
Data must reach the cloud for trend analysis
Step 1: Estimate Raw Data Volume
Each reading is 4 bytes (moisture %) + 4 bytes (timestamp) + 2 bytes (battery level) = 10 bytes per sample.
At 15-minute intervals: 10 bytes x 4 per hour x 24 hours = 960 bytes/day per sensor.
Across 200 sensors: 960 x 200 = 192 KB/day raw data (manageable for LoRaWAN).
Step 2: Apply Edge Reduction
Instead of transmitting every reading, each sensor applies:
Delta encoding: Only send if moisture changes by more than 2% from last transmitted value
Hourly aggregation: Send min/max/avg summary every hour (30 bytes instead of 40 bytes)
Event-based alerts: Immediate transmission if moisture drops more than 10% in 30 minutes
Result: Average transmissions drop from 96/day to 28/day (71% reduction).
Step 3: Calculate Battery Life
Mode
Current Draw
Duration per Cycle
Sleep
2 uA
~14 min 59.7 sec (remainder of 15-min cycle)
Sense
5 mA
100 ms
Transmit
44 mA
200 ms (LoRa SF7, including radio wake-up)
Average current with 28 transmissions/day:
Sleep: 2 uA x (86,400 - 28 x 0.3) / 86,400 = ~2.0 uA
Sense-only (96 - 28 = 68 readings without TX): 5 mA x 0.1s x 68 / 86,400 = ~0.4 uA
Sense + Transmit (28 readings with TX): (5 mA x 0.1s + 44 mA x 0.2s) x 28 / 86,400 = ~3.0 uA
Total average: ~5.4 uA
Battery life: 225 mAh / 0.0054 mA = 41,667 hours = ~4.8 years (theoretical maximum)
Accounting for self-discharge (~2%/year) and temperature variation: realistic battery life of 2.5-3.5 years, exceeding the 2-year target.
Step 4: Gateway Design
One LoRaWAN gateway covers the entire 50-hectare vineyard (range: ~2 km line-of-sight). The gateway:
Receives packets from all 200 sensors
Buffers up to 24 hours of data in case of internet outage
Forwards to The Things Network (TTN) via 4G cellular backhaul
Runs on solar power with battery backup
Outcome: The vineyard achieves real-time irrigation monitoring with multi-year battery life, at a total data cost of under 6 MB/month for the entire deployment.
45.8.1 Interactive: Edge Sensor Battery Life Calculator
Use this calculator to explore how duty cycling parameters affect battery life. Adjust the sleep current, sensing current, transmit current, and number of daily transmissions to see the impact on estimated battery life.
The following diagram shows how power consumption varies across the duty cycle phases of a typical edge sensor node.
Common Pitfalls in Edge Data Acquisition
1. Over-sampling without purpose Collecting data at higher rates than needed wastes power and storage. A soil moisture sensor sampling every second when conditions change over hours is burning 99.9% of its energy for no analytical benefit. Always ask: What is the fastest physically meaningful change rate?
2. Ignoring transmission costs Engineers often optimize the sensing stage but forget that a single LoRa transmission at SF12 costs as much energy as 10 hours of sleep. Design around minimizing transmissions, not just minimizing sensing.
3. No local buffering or retry logic If the gateway is temporarily unreachable, unbuffered sensors simply lose data. Always implement store-and-forward with at least 24 hours of local buffer capacity.
4. Treating all data equally Sending routine “everything is normal” readings at the same priority as alarm conditions wastes bandwidth and delays critical alerts. Implement tiered priority: anomalies get immediate transmission, routine data gets batched.
5. Forgetting clock drift Edge devices with cheap oscillators can drift several seconds per day. Without periodic time synchronization (e.g., via gateway beacons), timestamps become unreliable and data fusion across sensors breaks down.
6. Underestimating battery self-discharge Theoretical battery life calculations assume full capacity. In practice, coin cells lose 1-3% per year from self-discharge, and extreme temperatures can halve capacity. Always apply a 50% safety margin to calculated battery life.
45.9.1 The Cost of Skipping Edge Processing: Exposed Wireless
A cautionary example comes from a large-scale smart building project in Singapore (2018). A property management company deployed 12,000 environmental sensors (temperature, humidity, CO2, occupancy) across 8 commercial buildings, each sensor transmitting raw readings every 10 seconds over Wi-Fi to a central cloud platform.
Metric
Design Assumption
Actual Result
Daily data volume
“Manageable” (not calculated)
82 GB/day across all buildings
Cloud ingestion cost
Budgeted S$800/month
S$6,200/month (AWS IoT Core + S3)
Wi-Fi access points
240 existing APs sufficient
Needed 380 additional APs (S$152K)
Sensor battery life
2 years (AA lithium)
4.2 months (constant Wi-Fi = 85 mA average)
Battery replacement labor
Budgeted S$24K/year
S$172K/year (technician visits every 4 months to 12K sensors)
Useful data analyzed
100%
3.1% (the rest was “22.3C, 22.3C, 22.3C…” noise)
The fix was straightforward: a firmware update added 5-minute window aggregation and delta encoding (transmit only on >0.5C change). Data volume dropped 94%, cloud costs fell to S$480/month, and battery life extended to 2.8 years. The lesson: the project’s total overspend before the fix (6 months of deployment) was S$247K – more than the entire sensor hardware budget of S$192K. Edge processing is not an optimization; it is a requirement.
45.10 Knowledge Checks
Test your understanding of edge data acquisition concepts:
Key Insight: Edge acquisition is not just data collection – it is intelligent filtering and preprocessing. As the Singapore smart building case study above illustrates, skipping edge processing can cost more than the hardware itself.