
Handling of parameters with integer values #308

Closed
MariusCautun opened this issue Feb 27, 2022 · 7 comments

Comments

@MariusCautun

Treating integer-valued parameters as floats can be sub-optimal for the maximization procedure. This is discussed in https://arxiv.org/abs/1706.03673, where the authors show that accounting for the integer nature of some parameters can lead to faster convergence (see for example their figure 3). Given that optimizing machine learning hyperparameters often involves integer parameters, I think this is a worthwhile problem.

The same paper proposes a simple way of dealing with integer parameters with minimal changes to the code. For backward compatibility, the simplest way to implement the changes would be to add an extra dictionary where you specify the integer variables (if no such dictionary is given, then all are assumed to be floats).

Is this a change worth considering? If so, I can give it a try and implement it as a pull request.

@bwheelz36
Collaborator

Have you read item 2 of this example?
https://github.com/fmfn/BayesianOptimization/blob/master/examples/advanced-tour.ipynb

If so, how would what you propose differ? (sorry, I haven't read the paper!)

@MariusCautun
Author

I saw the example in the advanced tour. The idea is that if you know a parameter is an integer, then the Bayesian optimizer can find the maximum faster (i.e. achieve convergence in a smaller number of samples) than if you were to treat that parameter as a float.

The quickest approach would be to have an extra list or dictionary where you specify all the integer parameters. Then, when you calculate the Gaussian Process value for a point in parameter space, you round all the parameters that are defined as integers to integer values. The paper I pointed to claims that this extra step, which has minimal computational overhead, should speed up finding the maximum.

The idea is that if you have an integer parameter and you know the value of your black-box function at n and n+1, then you know the value of that function across the interval [n, n+1]. If you treat that parameter as a float, then even if you know the value of the function at n and n+1, the function still has the freedom to take a different value (within the dispersion of the fitted Gaussian Process) in the (n, n+1) interval. This means you have less information than in the case of an integer parameter, which is why implementing integer parameters can speed up finding the maximum.
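The rounding step described above can be sketched in a few lines. This is only an illustration of the transform from the paper, not the package's actual API; the function name `round_integer_params` and the dimension indices are hypothetical:

```python
import numpy as np

def round_integer_params(x, int_dims):
    """Return a copy of x with the dimensions listed in int_dims rounded.

    This mimics the transform of Garrido-Merchan & Hernandez-Lobato
    (arXiv:1706.03673): candidate points are rounded on the integer
    dimensions *before* being passed to the surrogate model or the
    black-box objective, so the GP never sees non-integer values there.
    """
    x = np.asarray(x, dtype=float).copy()
    x[..., int_dims] = np.round(x[..., int_dims])
    return x

# Example: dimension 1 (say, "n_estimators") is declared integer-valued.
point = np.array([0.37, 127.6])
snapped = round_integer_params(point, int_dims=[1])  # dim 1 snapped to 128.0
```

With this wrapper in place, the rest of the optimization loop is unchanged, which is what makes the minimal-change implementation attractive.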

@bwheelz36
Collaborator

Certainly sounds interesting. I was wondering how this works with the acquisition function, and I think their figure 1 explains that very nicely. I guess you saw my reply on your other proposal that pull requests will likely take quite some time. But I think many people would be interested in the ability to better handle integers, especially if you can replicate some of these results showing faster convergence.

@sanoj2021

I would be interested in an integer feature and faster convergence.

@till-m
Member

till-m commented Sep 5, 2022

I've had a closer look at categorical/integer parameters and it shouldn't be too hard to implement. The only thing that is unclear to me is how the acquisition function will be maximized -- finding extrema of mixed-variable functions is a nontrivial problem. In Garrido-Merchan/Hernandez-Lobato there's no mention of a specific method used to find the maximum, as far as I can tell. If anyone has suggestions as to how to solve this sub-problem, I would be interested.
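One common workaround for this sub-problem (an assumption on my part, not something the paper prescribes) is to optimize the acquisition over the continuous relaxation while snapping the integer dimensions inside the acquisition call, so every evaluated candidate is feasible. Here is a minimal sketch where cheap random search stands in for the usual L-BFGS-with-restarts; `maximize_acquisition` and the toy acquisition are hypothetical:

```python
import numpy as np

def maximize_acquisition(acq, bounds, int_dims, n_samples=10_000, seed=0):
    """Maximize acq over the box `bounds`, rounding integer dims.

    Candidates are drawn uniformly from the continuous relaxation, the
    integer dimensions are rounded before evaluation, and the best
    (already-rounded, hence feasible) candidate is returned.
    """
    rng = np.random.default_rng(seed)
    lo, hi = bounds[:, 0], bounds[:, 1]
    cand = rng.uniform(lo, hi, size=(n_samples, len(lo)))
    cand[:, int_dims] = np.round(cand[:, int_dims])  # snap integer dims
    return cand[np.argmax(acq(cand))]

# Toy acquisition peaked at (0.5, 3.0), with dimension 1 integer-valued.
acq = lambda X: -((X[:, 0] - 0.5) ** 2 + (X[:, 1] - 3.0) ** 2)
bounds = np.array([[0.0, 1.0], [0.0, 10.0]])
x_best = maximize_acquisition(acq, bounds, int_dims=[1])
```

Because the rounding happens before the surrogate is queried, the acquisition is piecewise constant along the integer dimensions, so gradient-based optimizers need restarts or a sampling fallback like the one above.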

@till-m till-m mentioned this issue May 25, 2023
7 tasks
@bwheelz36
Collaborator

@till-m - should we close this for now given the issue in 430?

@till-m
Member

till-m commented Dec 27, 2024

Implemented in #531 and published as a beta release, version 3.0.0b1. If someone is working on problems using typed optimization, I would love to get some feedback from beta testers. :) Documentation for the feature is available here (example usage) and here (API reference).

@till-m till-m closed this as completed Dec 27, 2024