# Data engineering tips for cyclists

Bad data creates fake confidence and fake concern.

## Minimal standards
- Deduplicate by source activity ID
- Normalize timezone and day boundaries
- Separate planned and completed sessions

## Reliability checks
Track missingness and outliers routinely. Better input quality means better coaching decisions and cleaner trend interpretation.
