
[API] design for generic optimizer #93

Open
fkiraly opened this issue Sep 20, 2024 · 7 comments
Labels
enhancement New feature or request

Comments

@fkiraly
Contributor

fkiraly commented Sep 20, 2024

From our earlier discussion.

I would design a generic interface as follows:

  • there are two (interface) classes, the BaseOptimizer and the BaseExperiment (or BaseEvaluator etc). Both inherit from skbase BaseObject, so they provide a dataclass-like, sklearn-like composable interface.
    • in particular, __init__ args must always be explicit, never positional-only or **kwargs.
    • the skbase tag system can be used to collect all the tags, e.g., from GFO things like the type of optimizer (particle etc), whether it is computationally expensive, or which soft dependencies it requires.
  • the BaseExperiment has a score method with the same signature as your "model" currently; its __call__ also redirects to score, so it can be used with the current signature. That's the "basic" interface, but we could also add an interface for gradients, to also cover gradient-based optimizers!
    • a subclass of BaseExperiment could, for instance, evaluate an sklearn classifier by cv on a dataset, so it could be SklearnExperiment(my_randomforest, X, y, KFold(5)).
  • the BaseOptimizer has __init__, which passes parameters only, and add_search, which has almost the current signature - it takes a BaseExperiment descendant instance, and one more object which configures the search space. Search behaviour like n_iter would not be passed in add_search, but should be an __init__ arg.
    • to execute the search, I would suggest a fit method, as that would be compliant with multiple API naming choices, though I would not mind run or optimize etc. This method sets attributes on self, ending in _, so they are visible via get_fitted_params.
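The design above could be sketched roughly as follows. All names, signatures, and bodies here are illustrative placeholders, not the actual skbase or Hyperactive API, and the exhaustive loop in fit merely stands in for a real optimization backend:

```python
class BaseExperiment:
    """Interface class: subclasses implement score on a parameter dict."""

    def score(self, params):
        raise NotImplementedError

    def __call__(self, params):
        # __call__ redirects to score, so an experiment can be used
        # wherever the current "model" callable is expected
        return self.score(params)


class BaseOptimizer:
    """Search behaviour (e.g. n_iter) is configured in __init__;
    the experiment and search space are passed via add_search."""

    def __init__(self, n_iter=10):
        self.n_iter = n_iter

    def add_search(self, experiment, search_space):
        self.experiment = experiment
        self.search_space = search_space
        return self

    def fit(self):
        # toy exhaustive search standing in for a real backend;
        # fitted state goes into attributes ending in "_"
        best_score, best_params = float("-inf"), None
        for params in self.search_space[: self.n_iter]:
            s = self.experiment(params)
            if s > best_score:
                best_score, best_params = s, params
        self.best_params_ = best_params
        self.best_score_ = best_score
        return self


class QuadraticExperiment(BaseExperiment):
    """Toy experiment: maximise -(x - 2) ** 2."""

    def score(self, params):
        return -((params["x"] - 2) ** 2)


opt = BaseOptimizer(n_iter=5)
opt.add_search(QuadraticExperiment(), [{"x": v} for v in range(5)])
opt.fit()
print(opt.best_params_, opt.best_score_)  # {'x': 2} 0
```

Any concrete optimizer (grid, particle-based, gradient-based) would only need to override fit, while every experiment subclass remains reusable across backends.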

Thoughts?

@fkiraly fkiraly added the enhancement New feature or request label Sep 20, 2024
@fkiraly
Contributor Author

fkiraly commented Sep 20, 2024

PS: I'm happy to try writing this if you would like me to? Not right now due to being busy, but maybe early Oct.

@SimonBlanke
Owner

Hello @fkiraly,

I took some time to understand the changes you are proposing. I will show you how I interpreted them, so please correct me if I misunderstood something.

It appears that you want to change the API of Hyperactive so that it is possible to use different optimization backends. This also necessitates implementing an interface (Experiment) that is adapted to certain optimizers.
For example: an optimizer that uses gradients also requires an experimental setup that supports gradients.

I would be open to the possibility of optionally selecting other optimization backends for the experiment.

so it could be SklearnExperiment(my_randomforest, X, y, KFold(5))

I do not understand this example, because it would already be covered by the sklearn integration. A separate experiment-class for each package (sklearn, xgboost, pytorch) would heavily decrease the flexibility of the interface.

I would suggest a fit method, as that would be compliant with multiple API naming choices

Hyperactive does not fit an estimator at that point in the API. It runs the optimization setup. The fit method makes sense in the sklearn integration.

@fkiraly
Contributor Author

fkiraly commented Sep 27, 2024

A separate experiment-class for each package (sklearn, xgboost, pytorch) would heavily decrease the flexibility of the interface.

This would be used only for adaptation inside the sklearn adapter. The optimizer optimises the experiment.

You would need at least one experiment per package or unified API, no? But not one per unified API and optimizer.
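As a rough illustration of "one experiment per unified API": a single hypothetical adapter class covers all sklearn estimators, regardless of which optimizer backend consumes it. The class name comes from the discussion above; scoring via sklearn's cross_val_score is my assumption, not the actual implementation:

```python
from sklearn.base import clone
from sklearn.datasets import load_iris
from sklearn.model_selection import KFold, cross_val_score
from sklearn.neighbors import KNeighborsClassifier


class SklearnExperiment:
    """One adapter for the whole sklearn API: any estimator, any
    dataset, any cv splitter - independent of the optimizer backend."""

    def __init__(self, estimator, X, y, cv):
        self.estimator, self.X, self.y, self.cv = estimator, X, y, cv

    def score(self, params):
        # evaluate one parameter configuration by cross-validation
        est = clone(self.estimator).set_params(**params)
        return cross_val_score(est, self.X, self.y, cv=self.cv).mean()

    __call__ = score  # usable as a plain objective callable


X, y = load_iris(return_X_y=True)
exp = SklearnExperiment(KNeighborsClassifier(), X, y, KFold(5))
acc = exp({"n_neighbors": 3})  # mean cv accuracy for this configuration
```

The same instance could then be handed to any optimizer via add_search, so the number of adapter classes grows with the number of unified APIs, not with the number of optimizers.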

Hyperactive does not fit an estimator at that point in the api.

I just mean: why not call it fit instead of optimize? It is just a naming question, since fit is used so often for data ingestion of any kind.

@fkiraly
Contributor Author

fkiraly commented Sep 28, 2024

I think there is a small degree of miscommunication - would you like me to write a design document, or a draft PR (for demo purposes only)?

@SimonBlanke
Owner

I think there is a small degree of miscommunication - would you like me to write a design document, or a draft PR (for demo purposes only)?

That would be great! :-)

@fkiraly
Contributor Author

fkiraly commented Oct 30, 2024

Partially implemented here - feedback appreciated!

#95

@SimonBlanke
Owner

Relevant comment: #85 (comment)
