AutoML model incorporating tune commands. #1410

seraphimstreets · 2022-08-05T03:20:02Z

As part of the second stage of the GSoC AutoML project defined in #968, this is a preliminary iteration of the AutoML model. The idea is to let users provide a dataset and a list of models they wish to train; DFFML's integrated AutoML model then performs training, tuning, and scoring to select the best model, abstracting much of the ML process behind an easy-to-use API. The current iteration trains and scores using default hyperparameters; tuning is not implemented yet. Community discussion is needed to decide how tuning should occur in the AutoML process: should each model have a default hyperparameter search space, or must the search space be user-defined?
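To make the intended flow concrete, here is a minimal sketch of the select-the-best-model loop. It does not use DFFML's API: `select_best`, `fixed_score`, and the per-model `train_and_score` callables are hypothetical stand-ins for whatever training and scoring the AutoML model performs.

```python
import asyncio
from typing import Awaitable, Callable, Dict, Tuple

# Hypothetical: each candidate is a coroutine function that trains one
# model and returns its score. This is not a DFFML interface.
TrainAndScore = Callable[[], Awaitable[float]]


async def select_best(
    candidates: Dict[str, TrainAndScore], objective: str = "max"
) -> Tuple[str, float]:
    """Train and score every candidate, return the name and score of the best."""
    if objective not in ("min", "max"):
        raise ValueError(f"objective must be 'min' or 'max', got {objective!r}")
    if not candidates:
        raise ValueError("at least one candidate model is required")
    best_name, best_score = None, None
    for name, train_and_score in candidates.items():
        score = await train_and_score()
        if best_score is None:
            better = True
        elif objective == "max":
            better = score > best_score
        else:
            better = score < best_score
        if better:
            best_name, best_score = name, score
    return best_name, best_score


def fixed_score(value: float) -> TrainAndScore:
    """Hypothetical helper: a fake candidate that always reports `value`."""
    async def run() -> float:
        return value
    return run


if __name__ == "__main__":
    fake = {"model_a": fixed_score(0.7), "model_b": fixed_score(0.9)}
    print(asyncio.run(select_best(fake, objective="max")))
```

With tuning added, each `train_and_score` callable would presumably wrap a tuning run rather than a single training pass; the selection logic would stay the same.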


mhash1m

Great work so far. Let's continue the review on the weekend.

mhash1m · 2022-09-01T15:12:13Z

dffml/model/automl.py

+        if self.parent.config.objective == "min":
+            highest_acc = float("inf")
+        elif self.parent.config.objective == "max":
+            highest_acc = -1


Just in case we have a scorer that outputs values below -1, or someone adds one in the future, let's initialize highest_acc to float("-inf") (might want to confirm this syntax).
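A small sketch of the suggestion, assuming the objective is either "min" or "max" as in the hunk above. `initial_best` and `is_better` are hypothetical helper names, not functions from the PR. `float("inf")` and `float("-inf")` are both valid Python.

```python
def initial_best(objective: str) -> float:
    """Return a starting value that any real score will beat."""
    if objective == "min":
        return float("inf")
    if objective == "max":
        return float("-inf")
    raise ValueError(f"objective must be 'min' or 'max', got {objective!r}")


def is_better(score: float, best: float, objective: str) -> bool:
    """True if `score` improves on `best` under the given objective."""
    return score < best if objective == "min" else score > best


if __name__ == "__main__":
    # A scorer that reports -5.0 is never selected when the start value is -1,
    # but is selected when the start value is float("-inf").
    print(is_better(-5.0, -1, "max"))
    print(is_better(-5.0, initial_best("max"), "max"))
```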

mhash1m · 2022-09-01T15:56:26Z

dffml/model/automl.py

+            else:
+                tuner.config.parameters = {}
+
+            val = await tune(model, tuner, scorer, self.parent.config.predict, sources, sources)


Let's not use the same sources for train and validation. It was discussed that we will use a list of sources instead.
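One way to avoid tuning and validating on the same data is to hold out part of the records before calling tune. The sketch below is only an illustration using the standard library; `split_records` is a hypothetical helper operating on a plain sequence, not on DFFML sources, and the 80/20 default is an assumption.

```python
import random
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def split_records(
    records: Sequence[T], validation_fraction: float = 0.2, seed: int = 0
) -> Tuple[List[T], List[T]]:
    """Shuffle a copy of `records` and split it into (train, validation)."""
    if not 0.0 < validation_fraction < 1.0:
        raise ValueError("validation_fraction must be between 0 and 1")
    shuffled = list(records)
    # A seeded Random instance keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    n_validation = max(1, int(len(shuffled) * validation_fraction))
    if n_validation >= len(shuffled):
        raise ValueError("need at least two records to split")
    return shuffled[n_validation:], shuffled[:n_validation]


if __name__ == "__main__":
    train, validation = split_records(list(range(10)))
    print(len(train), len(validation))
```

The two resulting lists would then back separate train and validation sources, so the tuner never scores a model on records it was trained on.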

mhash1m · 2022-09-01T15:58:04Z

dffml/model/automl.py

+            else:
+                tuner.config.parameters = {}
+
+            val = await tune(model, tuner, scorer, self.parent.config.predict, sources, sources)


Let's rename val so it doesn't get confused with validation (val for short).

johnandersen777 · 2022-09-17T04:13:12Z

tuner/bayes_opt_gp/dffml_tuner_bayes_opt_gp/tests/test_regressor_model.py

+
+
+from dffml_model_xgboost.xgbregressor import (
+    XGBRegressorModel,


Was this file left in intentionally? Let's double-check for other additions like this, which might be from directory copy-pasting, etc.

seraphimstreets · 2022-09-19

Yes, you're right. I'll remove it.

seraphimstreets added 12 commits June 22, 2022 16:54

"tune function and CLI command"

68c923e

"tune function and CLI command"

4a7de3a

Merge branch 'tunecli' of https://github.com/seraphimstreets/dffml into tunecli

5623a7d

"unit tests for xgboost, pytorch, spacy"

cef4d3e

"unit test cleaning"

41e4284

"random_search and bayes_opt_gp"

742be25

Minor fixes and documentation

d4ca3b2

Added requested changes

54d54d5

"minor doctest edits"

5a05c86

"First iteration of AutoML model"

a314411

AutoML model iteration 1.5

e397155

"default and user-defined hyperparameters"

8803e1e

mhash1m suggested changes Sep 1, 2022


seraphimstreets added 2 commits September 2, 2022 06:40

"validation set splitting for automl tuning"

abd894e

"removed scikit dependency"

a26424b

johnandersen777 reviewed Sep 17, 2022


"removed extraneous test file"

eaa14bf

johnandersen777 added the "awaiting maintainer" label (The PR is waiting for a maintainer to review it) Feb 9, 2023
