16 Error Detection: Checksums and CRC

In 60 Seconds

Error detection adds a calculated value (checksum or CRC) to packets so receivers can verify data integrity. Simple checksums (add all bytes) are fast but weak – they miss transposed bytes. CRC uses polynomial division to catch 99.9999% of errors, making it the standard choice for Ethernet, USB, LoRaWAN, and safety-critical IoT systems.

16.1 Learning Objectives

By the end of this chapter, you will be able to:

Calculate checksums and CRC values: Compute 8-bit checksums by hand and trace the XOR-based polynomial division used by CRC-16
Differentiate checksum from CRC: Classify error types (single-bit, burst, transposition) that each method detects or misses
Evaluate detection strength quantitatively: Compare undetected-error probabilities for 8-bit checksum, CRC-16, and CRC-32 in a given deployment scenario
Select an error-detection scheme: Justify the choice of checksum, CRC-16, or CRC-32 based on channel noise, safety requirements, and device constraints
Diagnose corrupted packets: Parse a hex dump, verify the CRC or checksum, and pinpoint the likely corruption source

For Beginners: Error Detection

When data travels wirelessly, bits can get flipped by interference – like static on a phone call garbling words. Error detection adds a small mathematical “fingerprint” to each message so the receiver can check if anything got corrupted along the way. It is similar to how a cashier adds up your items and checks the total – if the numbers do not match, something went wrong. This chapter covers the two main techniques: simple checksums and the more powerful CRC used in nearly all modern networks.

Related Chapters

Prerequisites (Read These First):

Packet Anatomy - Headers, payloads, and trailers fundamentals
Data Representation - Binary and hexadecimal for calculations

Companion Chapters (Packet Structure Series):

Packet Structure Overview - Index of all packet structure topics
Frame Delimiters and Boundaries - How receivers detect packet boundaries
Protocol Overhead - Header comparison and encapsulation

Security:

Encryption Architecture - How encryption affects packet structure
Threats and Attacks - Packet sniffing, replay attacks

16.2 Error Detection: Checksums and CRC

Time: ~9 min | Difficulty: Intermediate | Unit: P02.C02.U03

Key Concepts

Integrity check: The sender calculates a value from the payload and trailer fields so the receiver can recompute it and reject corrupted frames.
Key metric: Detection strength is measured by what kinds of corruption go unnoticed, especially burst errors, bit flips, and byte transpositions.
Main trade-off: Simple checksums are cheap to compute but weak against structured errors, while CRCs cost more bits and logic but catch far more real channel faults.
Protocol pattern: Lightweight protocols may use addition-based checksums, but most modern link layers and industrial buses rely on CRC-16 or CRC-32.
Deployment consideration: The value of stronger error detection rises quickly when retransmissions are expensive, safety matters, or the channel is noisy.
Design checkpoint: Choose the weakest method only after checking both the error model and the cost of a missed corruption event.

Problem: Noise, interference, or hardware faults can corrupt data during transmission.

Solution: Add a calculated value in the trailer that the receiver can verify.

16.2.1 Simple Checksum

Add all bytes, take lowest 8 bits:

Payload bytes: [0x45, 0x3F, 0x12]
Byte sum: 0x45 + 0x3F + 0x12 = 0x96
Checksum (low 8 bits): 0x96
Trailer: [0x96]

Pros: Simple, fast Cons: Weak error detection (can miss burst errors)

Interactive: Checksum Calculator

Try calculating checksums for different byte sequences and see how transposition affects the result.

Show code

viewof checksumInput = Inputs.textarea({
  label: "Enter hex bytes (space-separated, e.g., 45 3F 12):",
  value: "45 3F 12",
  width: 400,
  rows: 2
})

checksumBytes = {
  const hexValues = checksumInput.trim().split(/\s+/);
  return hexValues.map(h => parseInt(h, 16)).filter(n => !isNaN(n) && n >= 0 && n <= 255);
}

checksumResult = {
  if (checksumBytes.length === 0) return 0;
  const sum = checksumBytes.reduce((acc, val) => acc + val, 0);
  return sum & 0xFF;
}

html`<div style="background: #f8f9fa; padding: 16px; border-radius: 8px; border-left: 4px solid #16A085; margin-top: 12px;">
  <div style="font-family: monospace; font-size: 14px; color: #2C3E50;">
    <div><strong>Input bytes:</strong> [${checksumBytes.map(b => '0x' + b.toString(16).toUpperCase().padStart(2, '0')).join(', ')}]</div>
    <div style="margin-top: 8px;"><strong>Sum:</strong> ${checksumBytes.reduce((a,b) => a+b, 0)} (0x${checksumBytes.reduce((a,b) => a+b, 0).toString(16).toUpperCase()})</div>
    <div style="margin-top: 8px;"><strong>Checksum (8-bit):</strong> <span style="color: #E67E22; font-weight: bold;">0x${checksumResult.toString(16).toUpperCase().padStart(2, '0')}</span></div>
  </div>
</div>`

Show code

viewof swapDemo = Inputs.button("Swap First Two Bytes", {
  value: null,
  reduce: () => {
    const bytes = checksumInput.trim().split(/\s+/);
    if (bytes.length >= 2) {
      [bytes[0], bytes[1]] = [bytes[1], bytes[0]];
      checksumInput.value = bytes.join(' ');
      checksumInput.dispatchEvent(new Event('input', {bubbles: true}));
    }
  }
})

md`**Observation**: Swapping bytes produces the **same checksum** — this is why checksums are weak for detecting transposition errors!`

Quick Check: Checksum Limitations Quick Check

Concept: Why simple checksums are insufficient for reliable communication.

16.2.2 Cyclic Redundancy Check (CRC)

Uses polynomial division for robust error detection: - CRC-16: 16-bit value, detects all single-bit and double-bit errors - CRC-32: 32-bit value, detects 99.9999% of errors - Used by: Ethernet, USB, LoRaWAN, Modbus

Example: Ethernet Frame Check Sequence (FCS) uses CRC-32

16.3 How CRC Works

CRC treats the data as a polynomial and divides it by a generator polynomial. The remainder becomes the CRC value:

Data as polynomial: Each bit position represents a coefficient (e.g., 0x45 = 0b01000101 = x^6 + x^2 + x^0)
Generator polynomial: Standardized for each CRC variant (CRC-32 uses 0x04C11DB7)
Division: XOR-based polynomial division (no carry, just XOR)
Remainder: The final remainder is appended as the CRC

Why CRC is better than checksum:

Single bit flip: 8-bit checksum: 100% CRC-16: 100% CRC-32: 100%
Two bit flips: 8-bit checksum: High but not guaranteed CRC-16: 100% CRC-32: 100%
Transposed bytes: 8-bit checksum: 0% (undetected!) CRC-16: 100% CRC-32: 100%
Burst < 16 bits: 8-bit checksum: Poor (~50%) CRC-16: 100% CRC-32: 100%
Burst < 32 bits: 8-bit checksum: Poor (~50%) CRC-16: 99.998% CRC-32: 100%
Random multi-bit: 8-bit checksum: ~99.6% (1 - 1/256) CRC-16: 99.998% CRC-32: 99.9999%

Interactive: CRC Error Detection Demonstrator

Explore how CRC detects different types of errors that checksums miss.

Show code

viewof testData = Inputs.text({
  label: "Original data (hex bytes):",
  value: "45 3F 12",
  width: 400
})

viewof errorType = Inputs.select(
  ["None", "Flip 1 bit", "Flip 2 bits", "Swap bytes 0↔1", "Burst error (4 bits)"],
  {
    label: "Inject error:",
    value: "None"
  }
)

// Simple CRC-16 CCITT implementation
function crc16(bytes) {
  let crc = 0xFFFF;
  const poly = 0x1021;

  for (let byte of bytes) {
    crc ^= (byte << 8);
    for (let i = 0; i < 8; i++) {
      if (crc & 0x8000) {
        crc = ((crc << 1) ^ poly) & 0xFFFF;
      } else {
        crc = (crc << 1) & 0xFFFF;
      }
    }
  }
  return crc;
}

function simpleChecksum(bytes) {
  return bytes.reduce((sum, b) => sum + b, 0) & 0xFF;
}

function applyError(bytes, errorType) {
  const result = [...bytes];
  switch(errorType) {
    case "Flip 1 bit":
      if (result.length > 0) result[0] ^= 0x01;
      break;
    case "Flip 2 bits":
      if (result.length > 0) result[0] ^= 0x01;
      if (result.length > 1) result[1] ^= 0x80;
      break;
    case "Swap bytes 0↔1":
      if (result.length >= 2) [result[0], result[1]] = [result[1], result[0]];
      break;
    case "Burst error (4 bits)":
      if (result.length > 0) result[0] ^= 0x0F;
      break;
  }
  return result;
}

originalBytes = testData.trim().split(/\s+/).map(h => parseInt(h, 16)).filter(n => !isNaN(n) && n <= 255)

corruptedBytes = applyError(originalBytes, errorType)

checksumOriginal = simpleChecksum(originalBytes)
checksumCorrupted = simpleChecksum(corruptedBytes)
crcOriginal = crc16(originalBytes)
crcCorrupted = crc16(corruptedBytes)

checksumDetected = checksumOriginal !== checksumCorrupted
crcDetected = crcOriginal !== crcCorrupted

html`<div style="background: #f8f9fa; padding: 20px; border-radius: 8px; margin-top: 12px;">
  <div style="display: grid; grid-template-columns: 1fr 1fr; gap: 20px;">

    <!-- Original Data -->
    <div>
      <h5 style="margin: 0 0 12px 0; color: #2C3E50; border-bottom: 2px solid #16A085; padding-bottom: 6px;">Original Data</h5>
      <div style="font-family: monospace; font-size: 13px; background: white; padding: 12px; border-radius: 6px;">
        <div><strong>Bytes:</strong> [${originalBytes.map(b => '0x' + b.toString(16).toUpperCase().padStart(2,'0')).join(', ')}]</div>
        <div style="margin-top: 8px; color: #7F8C8D;"><strong>Checksum:</strong> 0x${checksumOriginal.toString(16).toUpperCase().padStart(2,'0')}</div>
        <div style="margin-top: 4px; color: #7F8C8D;"><strong>CRC-16:</strong> 0x${crcOriginal.toString(16).toUpperCase().padStart(4,'0')}</div>
      </div>
    </div>

    <!-- Corrupted Data -->
    <div>
      <h5 style="margin: 0 0 12px 0; color: #2C3E50; border-bottom: 2px solid #E67E22; padding-bottom: 6px;">After Error Injection</h5>
      <div style="font-family: monospace; font-size: 13px; background: white; padding: 12px; border-radius: 6px;">
        <div><strong>Bytes:</strong> [${corruptedBytes.map(b => '0x' + b.toString(16).toUpperCase().padStart(2,'0')).join(', ')}]</div>
        <div style="margin-top: 8px; color: ${checksumDetected ? '#E74C3C' : '#2ECC71'};"><strong>Checksum:</strong> 0x${checksumCorrupted.toString(16).toUpperCase().padStart(2,'0')}</div>
        <div style="margin-top: 4px; color: ${crcDetected ? '#E74C3C' : '#2ECC71'};"><strong>CRC-16:</strong> 0x${crcCorrupted.toString(16).toUpperCase().padStart(4,'0')}</div>
      </div>
    </div>
  </div>

  <!-- Detection Results -->
  <div style="margin-top: 20px; padding: 16px; background: white; border-radius: 8px; border-left: 4px solid ${errorType === 'None' ? '#16A085' : (checksumDetected || crcDetected ? '#E67E22' : '#E74C3C')};">
    <h5 style="margin: 0 0 12px 0; color: #2C3E50;">Detection Results</h5>
    <div style="display: grid; grid-template-columns: 1fr 1fr; gap: 16px;">
      <div style="padding: 12px; background: ${checksumDetected ? 'rgba(46, 204, 113, 0.1)' : 'rgba(231, 76, 60, 0.1)'}; border-radius: 6px;">
        <div style="font-size: 12px; color: #7F8C8D; margin-bottom: 4px;">Checksum</div>
        <div style="font-size: 18px; font-weight: bold; color: ${checksumDetected ? '#2ECC71' : '#E74C3C'};">
          ${checksumDetected ? '✓ DETECTED' : '✗ MISSED'}
        </div>
      </div>
      <div style="padding: 12px; background: ${crcDetected ? 'rgba(46, 204, 113, 0.1)' : 'rgba(231, 76, 60, 0.1)'}; border-radius: 6px;">
        <div style="font-size: 12px; color: #7F8C8D; margin-bottom: 4px;">CRC-16</div>
        <div style="font-size: 18px; font-weight: bold; color: ${crcDetected ? '#2ECC71' : '#E74C3C'};">
          ${crcDetected ? '✓ DETECTED' : '✗ MISSED'}
        </div>
      </div>
    </div>
  </div>
</div>`

Show code

md`**Key Observation**: ${errorType === "Swap bytes 0↔1" ? "Notice how checksum **FAILS** to detect byte transposition — both original and swapped have the same checksum!" : errorType === "None" ? "Select an error type to see detection in action." : crcDetected && !checksumDetected ? "CRC detected the error while checksum missed it!" : crcDetected && checksumDetected ? "Both methods detected this error (CRC is still more reliable overall)." : "No error injected."}`

Putting Numbers to It

Let’s quantify how CRC-32’s superior error detection translates to real-world IoT reliability.

Scenario: A smart city deploys 10,000 parking sensors, each transmitting occupancy status every 5 minutes.

Annual packet volume: \[N_{\text{packets/year}} = 10{,}000 \text{ sensors} \times \frac{60 \text{ min}}{5 \text{ min}} \times 24 \times 365 = 1{,}051{,}200{,}000 \text{ packets/year}\]

Bit Error Rate (BER) in urban RF environment: Typical \(\text{BER} = 10^{-5}\) (1 bit error per 100,000 bits transmitted)

Payload size: 32 bytes = 256 bits per packet

Expected corrupted packets per year: \[N_{\text{corrupted}} = 1{,}051{,}200{,}000 \times (256 \times 10^{-5}) = 2{,}691{,}072 \text{ corrupted packets/year}\]

Undetected errors with 8-bit checksum (\(2^8 = 256\) possible values): \[N_{\text{undetected, checksum}} = \frac{2{,}691{,}072}{256} \approx 10{,}512 \text{ bad packets accepted/year}\]

Undetected errors with CRC-32 (\(2^{32} = 4{,}294{,}967{,}296\) possible values): \[N_{\text{undetected, CRC-32}} = \frac{2{,}691{,}072}{4{,}294{,}967{,}296} \approx 0.000627 \text{ bad packets/year}\]

Key insight: CRC-32 reduces undetected errors by 16,777,216x (factor of \(2^{24}\)) compared to 8-bit checksums. Over the system’s 10-year lifetime, the checksum approach would accept roughly 105,120 corrupted parking occupancy readings, potentially causing billing disputes or incorrect navigation guidance.

Interactive: Error Rate Impact Calculator

Calculate the impact of bit error rates on your IoT deployment.

Show code

viewof numSensors = Inputs.range([100, 100000], {
  label: "Number of sensors:",
  value: 10000,
  step: 100,
  width: 400
})

viewof txInterval = Inputs.range([1, 60], {
  label: "Transmission interval (minutes):",
  value: 5,
  step: 1,
  width: 400
})

viewof payloadSize = Inputs.range([8, 256], {
  label: "Payload size (bytes):",
  value: 32,
  step: 8,
  width: 400
})

viewof berExponent = Inputs.range([-7, -3], {
  label: "Bit Error Rate (BER) exponent:",
  value: -5,
  step: 1,
  width: 400,
  format: x => `10^${x} (${x === -3 ? 'very noisy' : x === -4 ? 'noisy' : x === -5 ? 'typical urban' : x === -6 ? 'good' : 'excellent'})`
})

errorCalc = {
  const ber = Math.pow(10, berExponent);
  const txPerYear = numSensors * (60 / txInterval) * 24 * 365;
  const bitsPerPacket = payloadSize * 8;
  const corruptedPackets = txPerYear * bitsPerPacket * ber;
  const undetectedChecksum = corruptedPackets / 256;
  const undetectedCRC32 = corruptedPackets / 4294967296;

  return {
    txPerYear: Math.round(txPerYear),
    corruptedPackets: Math.round(corruptedPackets),
    undetectedChecksum: Math.round(undetectedChecksum),
    undetectedCRC32: undetectedCRC32
  };
}

html`<div style="background: linear-gradient(135deg, #2C3E50 0%, #34495e 100%); padding: 20px; border-radius: 8px; color: white; margin-top: 12px;">
  <h4 style="margin-top: 0; color: #16A085;">Annual Error Analysis</h4>

  <div style="display: grid; grid-template-columns: 1fr 1fr; gap: 16px; margin-top: 16px;">
    <div style="background: rgba(255,255,255,0.1); padding: 12px; border-radius: 6px;">
      <div style="font-size: 12px; opacity: 0.8;">Packets/Year</div>
      <div style="font-size: 24px; font-weight: bold; color: #16A085;">${errorCalc.txPerYear.toLocaleString()}</div>
    </div>

    <div style="background: rgba(255,255,255,0.1); padding: 12px; border-radius: 6px;">
      <div style="font-size: 12px; opacity: 0.8;">Corrupted Packets</div>
      <div style="font-size: 24px; font-weight: bold; color: #E67E22;">${errorCalc.corruptedPackets.toLocaleString()}</div>
    </div>
  </div>

  <div style="margin-top: 20px; padding-top: 16px; border-top: 1px solid rgba(255,255,255,0.2);">
    <h5 style="margin: 0 0 12px 0; color: #16A085;">Undetected Errors (Bad Data Accepted)</h5>

    <div style="background: rgba(230, 126, 34, 0.2); padding: 12px; border-radius: 6px; border-left: 4px solid #E67E22; margin-bottom: 12px;">
      <div style="font-size: 12px; opacity: 0.9;">8-bit Checksum</div>
      <div style="font-size: 28px; font-weight: bold; color: #E67E22;">${errorCalc.undetectedChecksum.toLocaleString()}</div>
      <div style="font-size: 11px; margin-top: 4px; opacity: 0.8;">bad packets/year</div>
    </div>

    <div style="background: rgba(22, 160, 133, 0.2); padding: 12px; border-radius: 6px; border-left: 4px solid #16A085;">
      <div style="font-size: 12px; opacity: 0.9;">CRC-32</div>
      <div style="font-size: 28px; font-weight: bold; color: #16A085;">${errorCalc.undetectedCRC32 < 0.001 ? '<0.001' : errorCalc.undetectedCRC32.toFixed(3)}</div>
      <div style="font-size: 11px; margin-top: 4px; opacity: 0.8;">bad packets/year</div>
    </div>
  </div>

  <div style="margin-top: 16px; padding: 12px; background: rgba(22, 160, 133, 0.15); border-radius: 6px; font-size: 13px;">
    <strong style="color: #16A085;">Improvement Factor:</strong> CRC-32 is <strong>${(errorCalc.undetectedChecksum / Math.max(errorCalc.undetectedCRC32, 0.0001)).toFixed(0)}×</strong> more reliable than simple checksums
  </div>
</div>`

Figure 16.1: Comparison view showing how checksum, CRC-16, and CRC-32 process the same bytes and what each method can reliably catch. The diagram pairs a simple additive checksum workflow with a polynomial remainder workflow and a capability summary for transpositions, burst errors, and common protocol uses.

Mobile Figure Summary: Checksum vs CRC

Checksum workflow

Input bytes: 45 3F 12
Add the bytes: 0x45 + 0x3F + 0x12 = 0x96
Append trailer byte 0x96
Fast, but weak against structured corruption

CRC workflow

Divide the data bitstream by generator polynomial 0x1021
Append the remainder as the trailer, for example 0x7F82
Costs more computation, but catches far more real transmission faults

Detection coverage

Single-bit flips: checksum Yes, CRC-16 Yes, CRC-32 Yes
Byte swaps: checksum No, CRC-16 Yes, CRC-32 Yes
Burst errors under 16 bits: checksum No, CRC-16 Yes, CRC-32 Yes
Burst errors under 32 bits: checksum No, CRC-16 No, CRC-32 Yes

Figure 16.2: Timeline view of CRC in action. A transmitter computes the integrity trailer, the channel corrupts one byte, the receiver recalculates and detects the mismatch, and a retransmission succeeds when the recomputed CRC finally matches the received trailer.

Mobile Figure Summary: CRC Retransmission Timeline

Sender builds a frame with payload 45 3F 12 and CRC-16 7F82.
Noise corrupts one byte so the receiver sees 44 3F 12 while the trailer still says 7F82.
Receiver recalculates the CRC and gets A1C4, which does not match the received trailer.
Receiver rejects the frame and sends a NACK requesting retransmission.
Sender retransmits the original frame, the receiver recalculates 7F82, and the clean packet is accepted.

16.4 Common CRC Polynomials

CRC-8: polynomial 0x07, size 1 byte, used by I2C and ATM
CRC-16-CCITT: polynomial 0x1021, size 2 bytes, used by Bluetooth and X.25
CRC-16-Modbus: polynomial 0x8005, size 2 bytes, used by Modbus RTU
CRC-32: polynomial 0x04C11DB7, size 4 bytes, used by Ethernet, USB, and Zip
CRC-32C: polynomial 0x1EDC6F41, size 4 bytes, used by iSCSI and SCTP

16.5 Concept Relationships

Checksum Builds on: Binary addition and modulo arithmetic Leads to: Simple integrity checks Contrasts with: CRC (polynomial-based)
CRC (Cyclic Redundancy Check) Builds on: Polynomial division and Galois field math Leads to: Robust error detection Contrasts with: Cryptographic hashes (SHA, MD5)
Error Detection Builds on: Digital transmission theory Leads to: Forward Error Correction (FEC) and ARQ protocols Contrasts with: Error Correction (rebuilds data)
Burst Error Detection Builds on: Signal processing and noise patterns Leads to: Reed-Solomon codes Contrasts with: Single-bit error detection
FCS (Frame Check Sequence) Builds on: CRC-32 Leads to: Ethernet MAC layer Contrasts with: Application-layer checksums

16.6 Error Detection vs. Error Correction

Error Detection (this chapter): Identifies that an error occurred, triggers retransmission

Error Correction (Forward Error Correction): Fixes errors without retransmission

Detection + Retransmit: overhead Low (2-4 bytes), latency Variable, best for Wi-Fi, TCP, and most IoT deployments
FEC (Reed-Solomon): overhead High (10-30%), latency Fixed, best for Satellite links and the LoRa physical layer
Hybrid ARQ: overhead Medium, latency Medium, best for LTE and 5G

For most IoT applications, error detection with retransmission is preferred because: 1. Errors are rare (< 1% on good links) 2. FEC overhead is expensive for constrained devices 3. Retransmission latency is acceptable for sensor data

16.7 Knowledge Check: Error Detection

Knowledge Check: Error Detection Methods Quick Check

Concept: Comparing checksum and CRC error detection.

16.8 Scenario-Based Practice

Scenario: Designing a Custom Protocol for Industrial Sensors

Situation: You’re designing a communication protocol for 1,000 pressure sensors in an oil refinery. Requirements: - Each sensor sends: Sensor ID (16-bit), pressure (32-bit float), temperature (16-bit), timestamp (32-bit) - Transmission medium: RS-485 serial bus (noisy industrial environment) - Messages must be detectable even if receiver joins mid-transmission - Critical safety system: undetected errors could cause explosions

Question: Design the packet structure including header, payload, and trailer. Justify your choice of framing method and error detection mechanism.

Answer

Recommended Packet Structure:

Start delimiter: 2 bytes set to 0x55 0xAA so receivers can resynchronize mid-stream.
Length: 1 byte storing the total payload length (12 bytes for this example).
Sensor ID: 2 bytes for the 16-bit device identifier.
Pressure: 4 bytes as an IEEE 754 float.
Temperature: 2 bytes as a signed integer in C x 100.
Timestamp: 4 bytes as Unix epoch seconds.
CRC-32: 4 bytes using polynomial 0x04C11DB7.
End delimiter: 1 byte set to 0x7E.
Total frame size: 20 bytes.

Error Detection: CRC-32 (not just checksum)

Why CRC-32 for safety-critical systems: - Checksum weakness: Can miss errors where bytes are transposed (0x45 0x32 vs 0x32 0x45 have same sum) - CRC-32 detects: All single-bit errors, all double-bit errors, all odd-bit errors, all burst errors < 32 bits - Safety margin: For random errors, undetected corruption is about 1 in 4.3 billion (about 1/2^32)

Real-world consideration: Many industrial protocols (Modbus RTU, CAN) use CRC-16, which is often sufficient. CRC-32 adds 2 bytes of overhead but provides extra safety margin for explosion-risk environments.

Scenario: Debugging a Corrupted Packet

Situation: Your smart home gateway received this hex dump from a Zigbee temperature sensor, but the reading seems wrong (showing 500C instead of expected 25C):

61 04 00 08 02 01 F4 01 48 2A

The expected packet format is: - Frame Control: 2 bytes - Sequence: 1 byte - Cluster ID: 2 bytes - Attribute ID: 2 bytes - Data Type: 1 byte - Value: 2 bytes (temperature x 100, little-endian)

Question: Parse the packet byte-by-byte and identify where the error might be. What temperature does the packet actually encode?

Answer

Byte-by-Byte Parsing:

Bytes 0-1 (61 04): Frame Control = 0x0461 (ZCL Global, Server to Client).
Byte 2 (00): Sequence number = message #0.
Bytes 3-4 (08 02): Cluster ID = 0x0208, but it should be 0x0402 for Temperature Measurement.
Bytes 5-6 (01 F4): Attribute ID = 0xF401, but it should be 0x0000 for MeasuredValue.
Byte 7 (01): Data Type = 0x01, likely wrong because an int16 reading should use 0x29.
Bytes 8-9 (48 2A): Value = 0x2A48 = 10,824 / 100 = 108.24C.

The actual temperature value:

Looking at bytes 8-9: 48 2A - Little-endian: 0x2A48 = 10,824 - As signed int16: 10,824 / 100 = 108.24C (still wrong!)

Root cause found:

The bytes should be: 09 C4 for 25.0C (2500 in hex = 0x09C4)

But we have: 48 2A (0x2A48 = 10824 = 108.24C)

Likely causes:

Sensor malfunction - reading garbage
Byte corruption - single bit flip in transmission
Wrong sensor type - maybe it’s humidity (0-100%) encoded differently

Debug steps:

Check CRC/FCS (not shown in dump) - was it valid?
Request retransmission
Check sensor wiring and calibration

16.9 Code: Implementing Checksums and CRC in Python

Python: Checksum vs CRC-16 Implementation

def simple_checksum(data: bytes) -> int:
    """Simple 8-bit checksum: sum all bytes, keep lowest 8 bits."""
    return sum(data) & 0xFF

def crc16_ccitt(data: bytes, poly=0x1021, init=0xFFFF) -> int:
    """CRC-16/CCITT used by Bluetooth, X.25, and many IoT protocols."""
    crc = init
    for byte in data:
        crc ^= byte << 8
        for _ in range(8):
            if crc & 0x8000:
                crc = (crc << 1) ^ poly
            else:
                crc = crc << 1
            crc &= 0xFFFF  # Keep 16-bit
    return crc

# --- Demo: checksum weakness ---
packet_a = bytes([0x45, 0x3F, 0x12])  # Original
packet_b = bytes([0x3F, 0x45, 0x12])  # Bytes 0 and 1 swapped

print("=== Checksum (weak) ===")
print(f"Original:  {packet_a.hex()} -> checksum = 0x{simple_checksum(packet_a):02X}")
print(f"Swapped:   {packet_b.hex()} -> checksum = 0x{simple_checksum(packet_b):02X}")
print(f"Same checksum? {simple_checksum(packet_a) == simple_checksum(packet_b)}")
# Output: Both = 0x96. Checksum MISSES the transposition error!

print("\n=== CRC-16 (robust) ===")
print(f"Original:  {packet_a.hex()} -> CRC-16 = 0x{crc16_ccitt(packet_a):04X}")
print(f"Swapped:   {packet_b.hex()} -> CRC-16 = 0x{crc16_ccitt(packet_b):04X}")
print(f"Same CRC?  {crc16_ccitt(packet_a) == crc16_ccitt(packet_b)}")
# Output: Different CRCs. CRC DETECTS the transposition error.

# --- Demo: single bit flip detection ---
print("\n=== Single bit flip ===")
corrupted = bytes([0x45, 0x3F, 0x13])  # Last byte: 0x12 -> 0x13 (1 bit flip)
print(f"Original:   CRC = 0x{crc16_ccitt(packet_a):04X}")
print(f"Corrupted:  CRC = 0x{crc16_ccitt(corrupted):04X}")
print(f"Detected?   {crc16_ccitt(packet_a) != crc16_ccitt(corrupted)}")

What to observe: Run this code to see that the simple checksum produces identical values for [0x45, 0x3F, 0x12] and [0x3F, 0x45, 0x12] (transposed bytes), while CRC-16 catches the error. This is exactly why CRC is required for reliable IoT communication.

16.10 Worked Example: Debugging a Corrupted LoRaWAN Packet

Scenario: Temperature Sensor Reporting Impossible Values

Situation: A LoRaWAN temperature sensor on a building roof reports 847C. The sensor (SHT31) has a range of -40 to 125C. What happened?

Received payload (hex): 03 4F 01 A2

Expected format: [msg_type(1B)] [temp_x100(2B, big-endian, signed)] [humidity(1B)]

Parsing the corrupt payload:

msg_type = 0x03: sensor reading type, so the first byte looks valid.
temp_raw = 0x4F01: big-endian decode gives 20,225 / 100 = 202.25C, which is still impossible.
temp_raw = 0x014F: little-endian decode gives 335 / 100 = 3.35C, which is plausible for a roof sensor.

Root cause: The sensor firmware was updated from big-endian to little-endian encoding, but the server decoder was not updated. The bytes 4F 01 were decoded as big-endian (0x4F01 = 20,225) instead of little-endian (0x014F = 335 = 3.35C).

But what about the 847C report? That was a different packet where the CRC check passed but a framing error shifted the payload bytes by one position. The humidity byte (0x64 = 100% RH) was interpreted as the high byte of temperature.

Lesson: CRC detects bit-level corruption, but it cannot detect application-layer framing errors where bytes are valid but misinterpreted. Always include a message type or version byte so decoders can validate the packet structure.

16.11 Review Exercises

Common Pitfalls

1. Treating a Checksum as if It Were a CRC

Checksums and CRCs are not interchangeable just because both live in the trailer. Addition-based checksums miss structured errors such as byte transpositions that CRCs catch reliably, so calling them equivalent leads to silent corruption in real deployments.

2. Verifying the Wrong Bytes

Integrity checks only work if sender and receiver run the algorithm over the exact same byte sequence in the exact same order. A wrong initial value, reflected bit order, omitted header byte, or endian mistake makes good packets fail and bad debugging assumptions spread quickly.

3. Stopping at Detection Without Planning Recovery

Detecting corruption is only half the system behavior. The real protocol must still decide whether to drop, retransmit, request a new sample, or raise an alarm. If recovery behavior is undefined, a strong CRC only tells you that the packet is bad, not what the system should do next.

Label the Diagram

16.12 Summary

Error detection ensures data integrity across noisy networks:

Checksums: Simple addition-based method, fast but weak detection
CRC: Polynomial-based method, detects 99.9999% of errors
CRC-16/CRC-32: Standard choices for IoT protocols
Trade-offs: More robust detection requires more computation and bytes

Key Takeaways:

CRC is much more reliable than simple checksums
Checksums can miss transposed bytes that CRC catches
Safety-critical systems should use CRC-32 or better
Error detection enables retransmission; error correction avoids it

16.13 What’s Next

Protocol Overhead: Compare CRC and header overhead across Ethernet, LoRaWAN, Zigbee, and more.
Frame Delimiters and Boundaries: See how CRC interacts with start/end delimiters and byte stuffing.
TCP Reliability: Learn how TCP combines checksums with retransmission for end-to-end reliability.
LoRaWAN Security and Joining: Explore the AES-CMAC Message Integrity Code that goes beyond CRC.
Encryption Security Properties: Understand HMAC and CMAC, which authenticate the sender as well as checking integrity.

For Kids: Meet the Sensor Squad!

Sammy the Sensor sends a message: “Temperature is 25 degrees!” But oh no – a noisy radio wave garbles it to “Temperature is 95 degrees!”

Max the Microcontroller explains: “This is why we need ERROR DETECTION. It’s like adding a secret check to every message!”

Lila the LED shows two methods:

Method 1 – Checksum (Simple): “Add up all the numbers in your message. 2+5 = 7. Send ‘25’ plus the check ‘7’. The receiver adds 2+5 and checks: does it equal 7? YES! Message is good!”

“But,” Lila warns, “if the message changes from ‘25’ to ‘52’ (numbers swapped), the checksum is still 7! Oops – we missed the error!”

Method 2 – CRC (Super Smart): “CRC is like a magic math puzzle. It does fancy polynomial division (don’t worry, the computer does it automatically!) and catches almost EVERY error – even swapped numbers!”

Bella the Battery asks: “But doesn’t CRC use more energy?”

Max nods: “A little more math, but it catches 99.9999% of errors. For a sensor in a hospital or factory, that’s worth it! We don’t want wrong readings causing problems!”

The Squad’s Rule: Always add a check to your messages! Checksum is quick and easy. CRC is stronger and catches almost everything. For important data, always use CRC!