Support categoricals in alternating optimization #2866


Closed · 24 commits

Conversation

TobyBoyne (Contributor)

Motivation

See issue #2864

Have you read the Contributing Guidelines on pull requests?

Yes

Test Plan

There are already tests in place for mixed integer feature spaces. I have created a new test case for mixed categorical feature spaces. Once the overall approach has been reviewed, I will also add tests for the new functions that have been introduced.

Related PRs

None yet - happy to update the docs.

facebook-github-bot added the CLA Signed label · Jun 4, 2025
TobyBoyne (Contributor Author)

Some questions I have about my implementation:

  1. What should be the expected behaviour if the search space is not mixed (e.g. everything is continuous, or everything is categorical)? I have added a few checks that were necessary for spaces that are categorical + continuous with no discrete features, but I'm not sure which cases need to be well defined.
  2. Should I introduce any additional keys to options to allow for more fine-grained control?
  3. What should be the fallback when there are too many categories? For discrete ordinal features, continuous relaxation is in place.
  4. I found some issues when using this with the OneHotToNumeric transform (see the FIXME comment in optimize_mixed). I would expect get_X_baseline to return the untransformed (i.e. one-hot) inputs, but it seems to return the transformed inputs (see the sketch after this list).
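
For context on question 4, here is a minimal round-trip with OneHotToNumeric (the dimensions and tensor values are illustrative, not taken from this PR):

import torch
from botorch.models.transforms.input import OneHotToNumeric

# One continuous feature (column 0) plus a 3-category feature
# one-hot encoded in the trailing columns 1-3 of a 4-dim input.
tf = OneHotToNumeric(dim=4, categorical_features={1: 3})

X_onehot = torch.tensor([[0.5, 0.0, 1.0, 0.0]])
X_numeric = tf.transform(X_onehot)    # tensor([[0.5, 1.0]])
X_back = tf.untransform(X_numeric)    # recovers the one-hot layout

# Question 4 is about which of these two layouts get_X_baseline
# returns when the model carries this transform.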

Balandat (Contributor) left a comment:

> What should be the expected behaviour if the search space is not mixed (e.g. everything is continuous, or everything is categorical)? I have added a few checks that were necessary for spaces that are categorical + continuous with no discrete features, but I'm not sure which cases need to be well defined.

I think if either it's fully continuous or fully discrete we should just raise an error. We have other acquisition optimizers for that.

Balandat (Contributor) left a comment:

> Should I introduce any additional keys to options to allow for more fine-grained control?

Which ones would those be? In general, I think unless we have a need for them we can keep things simple and add them later if needed.

> What should be the fallback when there are too many categories? For discrete ordinal features, continuous relaxation is in place.

Good question. If we can't enumerate them all, then a natural thing to do would be to sample a subset in each dimension in each step rather than computing all of them.
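
A minimal sketch of that idea (the cap and helper name are hypothetical, not the merged implementation):

import torch

MAX_ENUMERATE = 32  # hypothetical cap on categories scored per step


def candidate_categories(cardinality: int) -> torch.Tensor:
    # Enumerate all categories when feasible; otherwise score a random
    # subset, resampled on every alternating step.
    if cardinality <= MAX_ENUMERATE:
        return torch.arange(cardinality)
    return torch.randperm(cardinality)[:MAX_ENUMERATE]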

Comment on lines 477 to 481
# FIXME: get_X_baseline should return untransformed inputs, but when
# OneHotToNumeric is used, it returns the transformed inputs.
X_baseline = opt_inputs.acq_function.model.input_transform.untransform(
    X_baseline
)
Balandat (Contributor):

Hmm interesting. I am not super familiar with this part of the code but it seems that this handling here is not working the way it should: https://github.com/pytorch/botorch/blame/d247a33486c22bc5c6f16107db3e71f1e3bf3acd/botorch/optim/utils/acquisition_utils.py#L126-L129

Looking through the code, it seems that _has_transformed_inputs is used very sparingly. This requires a more thorough audit, I think.

The solution you have here will not work if the model doesn't actually have an input_transform attribute.

TobyBoyne (Contributor Author):

My solution was very much a temporary band-aid for something I was trying to run. I'm happy to remove this line, as I agree that it won't work if there are no input transforms (and I think it might apply some input transforms twice). I will leave a TODO in the code noting that this sometimes breaks with bad input transforms.

Balandat (Contributor):

You should check if opt_inputs.acq_function.model has an input_transform attribute and do nothing if it doesn't.
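
A minimal sketch of that guard (illustrative, not the merged code):

model = opt_inputs.acq_function.model
input_transform = getattr(model, "input_transform", None)
if input_transform is not None:
    # Only undo the transform when the model actually carries one.
    X_baseline = input_transform.untransform(X_baseline)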

Have you tried this with some other input transforms? Is this only an issue with OneHotToNumeric?

TobyBoyne (Contributor Author) commented Jun 5, 2025:

I've investigated further and it seems not to be a problem on BoTorch's end: when I build models purely in BoTorch, I can't reproduce the error with any kind of transform (OneHotToNumeric included). It's only when I'm using external code that I see these issues.

I'll keep looking into this separately, but the issue is not with this PR (or even with BoTorch).

Balandat (Contributor) commented Jun 5, 2025:

cc @saitcakmak who may be interested in this feature

TobyBoyne (Contributor Author):

> I think if either it's fully continuous or fully discrete we should just raise an error. We have other acquisition optimizers for that.

I agree, although making this change does break this existing test, and therefore might break expected behaviour for some BoTorch users?

# Only continuous parameters should fall back to optimize_acqf.
with mock.patch(
    f"{OPT_MODULE}._optimize_acqf", wraps=_optimize_acqf
) as wrapped_optimize:
    optimize_acqf_mixed_alternating(
        acq_function=acqf,
        bounds=bounds,
        discrete_dims=[],
        options=options,
        q=1,
        raw_samples=20,
        num_restarts=2,
    )

Balandat (Contributor) commented Jun 5, 2025:

> I agree, although making this change does break this existing test, and therefore might break expected behaviour for some BoTorch users?

Good point. Let's just fall back to the standard _optimize_acqf in the same way in this case:

if len(discrete_dims) == 0:
    return _optimize_acqf(opt_inputs=opt_inputs)
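
(This preserves the behaviour the existing test above asserts: with discrete_dims=[], the call is forwarded to the standard continuous optimizer rather than raising.)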

TobyBoyne requested a review from Balandat · June 5, 2025 16:05
codecov bot commented Jun 6, 2025:

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (d247a33) to head (0d8e128).
Report is 1 commit behind head on main.

@@            Coverage Diff            @@
##              main     #2866   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files          211       211           
  Lines        19353     19396   +43     
=========================================
+ Hits         19353     19396   +43     


Balandat (Contributor) left a comment:

This looks great, thanks a lot for the high-quality contribution!

I left a few nits inline; the main gap is to add a test for the case of too many categories (where we sample instead).

TobyBoyne (Contributor Author):

I think that should be everything, @Balandat! Thanks for the quick and thorough review!

TobyBoyne requested a review from Balandat · June 6, 2025 11:20
facebook-github-bot (Contributor):

@Balandat has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

saitcakmak (Contributor) left a comment:

Looks great! Thanks for implementing; this has been on my wish list for a while :)

facebook-github-bot (Contributor):

@Balandat merged this pull request in a268631.
