mljar/mljar-supervised
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
π Changes
- publish_app() now reuses the last published URL by default (#830)
- Fixed missing mljar_app.json upload in publish_app() (#829)
- Improved numeric widget steps in generated apps (#828)
- Added AI Act transparency messaging in generated apps, reports, and training output (#810)
π Changes
- (#803) automatically generate web app for your automl model
- (#813) add fairness compliance certificate generation
π Changes
- Simplified `report_structured()` default output to compact summary (leaderboard + global feature importance).
- Added on-demand detailed report for a selected model via `report_structured(model_name=...)`.
- Improved selected-model markdown layout: clear section order, cleaner headings, filtered noisy hyperparameters, richer fairness details and fairness explanation text.
- Included fairness columns in summary leaderboard when fairness is enabled.
- Fixed global feature-importance aggregation metadata (`models_present`) and updated examples/tests for the new report workflow.
π¦ mljar-supervised 1.2.0
- Added `AutoML.report_structured(format="markdown"|"dict"|"json", model_details=...)` for LLM-friendly reporting.
- New on-demand artifact: `<results_path>/report_structured.json` (full structured payload).
- Improved markdown report readability (clean run summary, clearer best/model sections, better feature-importance naming).
- Added examples for classification, regression, and fairness structured reports.
- Updated structured-report tests and fixed macOS smoke workflow by installing `libomp`.
π Fixes
- removed unused dependencies, cleaned code -> the installation should be a little faster :) Changes in the #800
Fixes matplotlib backend initialization in notebooks after AutoML training #785
π Changes
- fixed issues with new sklearn API #789, #788, #787
- setup matplotlib backend for AutoML and switch it back to original #785
π Changes
- a lot of warning and bug fixes done in #777, #771, #762, #761, #760, #759, #758, #757, #756, #755, #754, #753, #752, #751, #750, #749, #743, #742, #733
Fix warnings due to packages update.
π Changes
- (#380) disable boost on errors step for custom strategy
- (#728) fix accuracy metric for Lightgbm
π Changes
- (#725) fix styling of AutoML report, apply styles in the `mljar-automl-report` class
π Changes
- fixed problems with `report()` (#714)
π Changes
- fix xgboost warning (#667)
π Fixes
- fix sklearn/scipy warnings (#709)
- fix report display in JupyterLab (#710)
Thanks to @lijm1358 for PR #689, it fixes problems with LightGBM tuning #645, #683.
I've added custom JSON Encoder that can handle numpy types. It fixes #496, #613, #622, #651.
π Changes
- πΌ pandas > 2.0.0
- π xgboost > 2.0.0 (#649)
- π³ dtreeviz > 2.2.2 (#631)
- π shap > 0.42.1
π Fixes π οΈ
- Alrighty, with great power (read: updates) comes great responsibility (read: fixes)! We've rolled up our sleeves to zap those pesky warnings caused by our major package glow-up:
- π Added classes_ for those classy classifiers (#654)
- π Patched up a boo-boo in the calibration plot (#655)
- π§ Tweaked a model type warning that was acting all sassy (#638)
- Keep rocking and happy coding! πΈπ€π
π Changes
- #637 fix problem with font loading for report
π Changes
- #634 fix problem with categorical values in target and nan values for fairness metric
- #635 add tests for fairness feature
- #636 switch off shap exceptions printouts
π Changes
- For implementation details please check issue #612
- For example usage please check article https://mljar.com/blog/fairness-machine-learning/
π Changes
- #595 replace boston example dataset with California housing dataset, replace `mse` metric with `squared_error` for tree based algorithms from sklearn
- #596 change the import method for `dtreeviz` package
π Changes
- #590 dynamically set font in a report, thanks @yairVanti!
Unpin `shap` version #551
π¦ Enhancements
- #523 Add type hints to AutoML class, thank you @DanielR59
- #519 save train&validation index to file in train/test split, thanks @filipsPL @MaciekEO
π Bug fixes
- #496 fix exception in baseline mode, thanks @DanielR59 @moshe-rl
- #522 fixed requirements issue, thanks @DanielR59 @MaciekEO
- #514 remove warning, thanks @MaciekEO
- #511 disable EDA, thanks @MaciekEO
π Changes
- #463 change multiprocessing to Parallel with loky
- #462 handle large data for tree visualization in regression
- #419 remove/hide warnings
- #411 loose dependencies for numpy and scipy
π Changes
- #81 add scatter plot predicted vs target in regression
- #158 add ROC curve for binary classification
- #336 add visualization for Optuna results
- #352 add support for Colab
- #374 update seaborn
- #378 set golden features number
- #379 switch off boost_on_errors step in Optuna mode
- #380 add custom cross validation strategy
- + 8 more
π Changes
- #343 set seed in Optuna
- #344 set eval_metric directly in all algorithms
- #350 add estimated train time in Optuna mode
- #342 add `optuna_verbose` param in `AutoML()`
- #354 add KNN in Optuna
- #356 and Neural Network in Optuna
- #357, #348 use mljar wrapper for Random Forest and Extra Trees
- #358 add `extra_tree` param in LightGBM
- + 6 more
Add support to Python 3.9 (#339) Thanks to @rterbush!
π Changes
- #332 We added Optuna framework for hyperparameters tuning. It can be used by setting `mode="Optuna"` in AutoML. You can read more details at blog post: https://mljar.com/blog/automl-optuna/
π Changes
- #179 add `need_retrain()` method to detect performance decrease
- #226 extract rules from decision tree
- #310 add support for MAPE
- #312 optimize prediction time
- #313 set stacking time threshold depending on best model train time
- #320 search for model with prediction time constraint
- #322 `n_jobs` as a parameter
- #328 disable stacking for small (nrows < 500) datasets
- + 8 more
