Scenario: E.ON deploys 3 million smart electricity meters across Germany. Under GDPR and the German Metering Point Operation Act (MsbG), individual consumption data is personal data requiring strict privacy controls. The utility needs 15-minute consumption data for billing and grid balancing but must minimize personal data exposure.
Given:
- 3 million meters, 1 reading per 15 minutes = 288 million readings/day
- Each reading: meter ID (personal identifier), timestamp, kWh, voltage, power factor
- GDPR Article 5(1)(c): Data minimization – only collect what is necessary
- GDPR Article 17: Right to erasure – customers can request deletion of historical data
- MsbG requirement: Smart meter gateways must encrypt data end-to-end (BSI TR-03109 standard)
- Retention: Billing data for 10 years (tax law), raw readings for 13 months (regulatory)
- Use cases: Monthly billing (needs meter-level data), grid balancing (needs aggregate only), consumption analytics (opt-in only)
Step 1 – Apply data minimization at each pipeline stage:
| Smart meter gateway |
Raw 1-second readings |
Aggregate to 15-minute totals locally |
15-min kWh + max voltage |
| Network transport |
15-min readings with meter ID |
End-to-end TLS 1.3 encryption (BSI certified) |
Encrypted payload |
| Ingestion (Kafka) |
Decrypted 15-min readings |
Pseudonymize meter ID -> hash(meter_id + salt) |
Pseudonymized readings |
| Billing database |
Pseudonymized readings |
Retain only billing-relevant fields |
kWh per 15-min period |
| Grid analytics |
Pseudonymized readings |
Aggregate to postcode level (anonymize) |
Postcode-level demand |
| Customer portal |
Full personal data (opt-in) |
Customer controls data sharing preferences |
Per customer consent |
Step 2 – Implement right to erasure (Article 17):
When a customer requests data deletion:
| Customer portal |
Delete account and all personal data |
24 hours |
Low |
| Billing database |
Pseudonymize remaining invoices (replace name with “DELETED”) |
72 hours |
Medium |
| Kafka topics |
Data expires naturally (13-month retention) |
Up to 13 months |
None (automatic) |
| Grid analytics (aggregated) |
No action needed – already anonymized |
N/A |
None |
| Backups |
Flag for exclusion in next backup cycle |
30 days |
High |
Total erasure compliance time: 72 hours for active systems, 30 days for backup propagation (compliant with GDPR’s “without undue delay” requirement).
Step 3 – Calculate privacy vs. utility tradeoff:
| Monthly billing |
Meter-level, 15-min resolution |
Yes (pseudonymized) |
No (contractual basis) |
| Grid load forecasting |
Postcode-level, 1-hour aggregates |
No (anonymized) |
No |
| Consumption tips for customer |
Meter-level, daily resolution |
Yes (direct identifier) |
Yes (opt-in) |
| Demand response programs |
Meter-level, real-time |
Yes (direct identifier) |
Yes (explicit consent) |
| Third-party energy broker |
Meter-level, 15-min, 12 months |
Yes (direct identifier) |
Yes (explicit, revocable) |
Step 4 – Security controls per CSA top threats:
| Data breach |
AES-256 encryption at rest, TLS 1.3 in transit |
EUR 180,000 (key management infrastructure) |
| Weak identity |
mTLS certificates per meter, OAuth2 for staff |
EUR 95,000 (PKI infrastructure) |
| Insecure API |
API gateway with rate limiting, WAF |
EUR 45,000 |
| Insider threat |
Role-based access, audit logging, 4-eyes principle for bulk data access |
EUR 60,000 |
| Total security spend |
|
EUR 380,000/year |
Per-meter security cost: EUR 380,000 / 3,000,000 = EUR 0.13/meter/year
Result: The privacy-preserving pipeline achieves full GDPR compliance while maintaining all utility use cases. Key architectural decisions: pseudonymization at ingestion (not at the edge, to avoid key management on 3 million devices), aggregation-based anonymization for grid analytics (no re-identification risk), and consent-gated access for customer-facing features. Security costs of EUR 0.13/meter/year are negligible compared to potential GDPR fines (up to 4% of annual turnover = EUR 1.6 billion for E.ON).
Key Insight: Privacy-by-design in IoT data pipelines is not just a legal requirement – it simplifies architecture. By anonymizing grid analytics data at the aggregation stage, the entire downstream analytics platform is exempt from GDPR scope, reducing compliance overhead for data scientists. The principle: minimize personal data as early in the pipeline as possible, so downstream systems never need to handle it.