Test Offline Pipeline (Engineering View)¶
Role¶
The test offline pipelines run trained models on held-out Parquet/CSV data and produce Excel reports, confusion matrices, and diagnostic plots (e.g. size, location, leak series). They answer: “How would this model perform on new cases?”
Why It Matters for Engineering¶
- By-case evaluation: Typically we evaluate per case or per file, so metrics align with the by-case split philosophy (see Train/validation split by case).
- Threshold and metrics: Detection threshold and metric definitions (precision, recall, F1, etc.) are consistent with training and model validation.
- World-class analysis: Reports and plots support the “world-class” analysis goal (see World-class detection analysis).
Configuration and Scripts¶
- Scripts:
run_test_offline_pipeline.py,run_pfm_test_offline_pipeline.py. - Config:
parquet_root,output_report_folder, model paths, feature/window config, sampling options. - Detailed reference: See Test offline in the main Pipelines section.
This Engineering page gives the rationale; the full configuration and output description are in the Pipelines docs.