2024 devtools ● stable

MLEvaluation

Evaluation harness for ML experiments with standardized metrics and reports.

Highlights

  • Unified classification and regression metrics under one runner.
  • Integrated MLflow tracking for run comparison and reproducibility.
  • Generated HTML reports with metric trends and confusion matrices.