Production ML Pipeline
Objective: Get the plant disease identification ML pipeline to full production readiness with real model inference, proper class mapping, and production-grade error handling.
Status legend: [ ] todo, [~] in-progress, [x] done
Tasks
- 01 — PlantVillage class inventory and knowledge base mapping →
01-plantvillage-class-inventory.md - 02 — Label mapping layer implementation →
02-label-mapping-implementation.md - 03 — TensorFlow.js model loading verification and fixes →
03-model-loading-verification.md - 04 — Confidence calibration for PlantVillage model →
04-confidence-calibration.md - 05 — Real model integration into identification pipeline →
05-pipeline-integration.md - 06 — Plant-context-aware identification →
06-plant-context-identification.md - 07 — End-to-end integration testing →
07-end-to-end-testing.md - 08 — Production hardening and observability →
08-production-hardening.md
Dependencies
- 01 → 02 (mapping data feeds label layer)
- 02 → 05 (labels feed pipeline)
- 03 → 05 (verified model loading feeds pipeline)
- 04 → 05 (calibration feeds pipeline)
- 05 → 06 (real model enables plant context)
- 05 → 07 (integrated pipeline enables e2e testing)
- 07 → 08 (tested pipeline enables production hardening)
Exit Criteria
- The feature is complete when:
- Model loads successfully and produces real (non-mock) predictions
- All 38 PlantVillage classes map to valid knowledge base disease IDs
- End-to-end pipeline works: upload image → get real disease diagnoses with calibrated confidence
- Confidence scores are meaningful (high confidence for clear cases, low for ambiguous)
- Plant context optionally boosts relevant predictions
- Full integration test suite passes
- Error handling, logging, and monitoring in place
- No demo mode fallback in production
- Rate limiting and input sanitization active
- Health endpoint reports model status and inference metrics