
Initial attempt to incorporate MultiTask GPs #353

Merged
merged 18 commits on Apr 12, 2024

Conversation

jpfolch
Contributor

@jpfolch jpfolch commented Feb 21, 2024

Initial attempt to incorporate MultiTask GPs, with the long-term plan of incorporating the multi-fidelity algorithm described here.

Current issues:

  • When predicting the posterior, BoFire by default returns the posterior mean and a covariance that includes the observation noise (see this line where observation_noise = True). However, BoTorch's MultiTaskGP does not support adding observation noise yet and raises NotImplementedError: Specifying observation noise is not yet supported by MultiTaskGP. (a) A work-around would be to redefine the MultiTaskGPSurrogate to set it to False; however, this would lead to inconsistencies in model usage within BoFire. (b) Maybe a new flag can be incorporated into BoFire to make predicting with observation noise optional. (c) For MultiTaskGPs I could implement the new posterior myself, adding the observation noise to the posterior covariances manually, and then simplify the code once BoTorch starts supporting observation noise.

  • I am unable to use the dump functionality: when saving the BoTorch model using torch.save(self.model, buffer) I get the error AttributeError: Can't pickle local object 'IndexKernel.__init__.<locals>.<lambda>'. I am still not sure whether this is an issue with my local code, with GPyTorch's IndexKernel, or something else entirely.

  • We need some way of specifying which column holds the task_id for each observation. Currently this is done by naming the column 'fid' (for fidelity, but easily changed). We need to decide whether to identify the task_id column by a specific name or by some other method.

  • The LKJ prior (a matrix-valued prior on the inter-task correlations) depends on the number of tasks, which is unknown until the model is initialized. Currently I set a dummy n_tasks = 1 in the prior and then change the attribute value when initializing the surrogate; it works, but perhaps there is a better way of going about it.
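The manual work-around sketched in point one, option (c), amounts to adding the noise variance to the diagonal of the posterior covariance. A minimal stdlib sketch of that idea (the function name and the homoskedastic-noise assumption are mine, not BoFire's; in BoFire this would act on the BoTorch posterior's covariance tensor instead of nested lists):

```python
def add_observation_noise(cov, noise_var):
    """Add homoskedastic observation noise to a posterior covariance matrix.

    Sketch of work-around (c): since MultiTaskGP.posterior(...,
    observation_noise=True) raises NotImplementedError, the noise variance
    can be added to the covariance diagonal by hand. `cov` is a square
    matrix given as a list of lists.
    """
    n = len(cov)
    return [
        [cov[i][j] + (noise_var if i == j else 0.0) for j in range(n)]
        for i in range(n)
    ]

posterior_cov = [[1.0, 0.2], [0.2, 0.5]]
noisy_cov = add_observation_noise(posterior_cov, 0.1)
print(noisy_cov)  # diagonal entries grow by 0.1, off-diagonals unchanged
```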

Future work:

  • Need to incorporate a likelihood with heteroskedastic noise that allows for different noise levels in each task.

  • Once MultiTaskGPs are working, implement a multi-fidelity strategy for querying.

@jduerholt jduerholt self-requested a review February 22, 2024 08:38
Contributor

@jduerholt jduerholt left a comment

Hi @jpfolch,

thank you very much for your PR. MultiTask and MultiFidelity GPs are something I have wanted in BoFire for a long time, but never found the time for!

To start, I am focusing on your third point (how to encode the task info), as I already have an idea for it ;)

My proposal is to introduce a new input feature called TaskInput. This TaskInput should inherit from the CategoricalInput feature and should have an additional attribute called fidelities of type List[int] in which one can assign different fidelity levels to the individual tasks/categories. I would add a validator that checks that one always has a step size of 1 between different fidelities and that it starts with zero.

For example, having a TaskInput with three different tasks/categories named process_1, process_2 and process_3 the following fidelities would be valid:

  • [0,0,0]: all have the same fidelity level
  • [0,0,1]: process_3 has the highest fidelity

Via the allowed attribute of the TaskInput one can then define which tasks can be proposed by the optimizer. As the TaskInput would be a subclass of CategoricalInput, one has to set up the encoding for the GP properly. Currently, CategoricalInputs are always one-hot encoded; either one adds the option of ordinal encoding for TaskInputs, or one keeps the one-hot encoding and uses the OneHotToNumeric input transform for the TaskInput as here: https://github.com/experimental-design/bofire/blob/main/bofire/surrogates/mixed_single_task_gp.py which would be, from my perspective, the most elegant way, but let me know what you think ;)

In addition, we should add a validator for the Inputs object to forbid more than one TaskInput.
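The proposed fidelities validator can be sketched in plain Python (a stand-in for an actual pydantic field_validator; the helper name is hypothetical): the sorted distinct levels must be exactly 0, 1, ..., k.

```python
from typing import List

def validate_fidelities(fidelities: List[int]) -> List[int]:
    """Check that fidelity levels start at zero and increase in steps of one.

    Mirrors the proposed TaskInput validator: the sorted distinct levels
    must be exactly 0, 1, ..., k (duplicates are allowed, gaps are not).
    """
    levels = sorted(set(fidelities))
    if levels != list(range(len(levels))):
        raise ValueError(
            f"fidelities must start at 0 with step size 1, got {fidelities}"
        )
    return fidelities

validate_fidelities([0, 0, 0])  # all tasks share one fidelity level
validate_fidelities([0, 0, 1])  # process_3 has the highest fidelity
```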

What do you think? I will answer your other points over the weekend (hopefully).

Best,

Johannes

@@ -25,3 +29,6 @@
MBO_LENGTHCALE_PRIOR = partial(GammaPrior, concentration=2.0, rate=0.2)
MBO_NOISE_PRIOR = partial(GammaPrior, concentration=2.0, rate=4.0)
MBO_OUTPUTSCALE_PRIOR = partial(GammaPrior, concentration=2.0, rate=4.0)
MBO_LKJ_PRIOR = partial(
Contributor

No need for two types here; if they are the same, the two different types are due to historic reasons. You can just create a single LKJ_PRIOR method.

Contributor

And perhaps we should make n_tasks a non-default argument there ...
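The suggestion can be sketched with functools.partial, binding every hyperparameter except n_tasks so the surrogate must supply the real task count at initialization (LKJCovariancePrior below is a stand-in class for illustration, not the real gpytorch prior, and the eta value is arbitrary):

```python
from functools import partial

class LKJCovariancePrior:
    """Stand-in for gpytorch.priors.LKJCovariancePrior, for illustration only."""
    def __init__(self, n_tasks: int, eta: float = 2.0):
        self.n_tasks = n_tasks
        self.eta = eta

# Bind everything except n_tasks; the surrogate then has to pass the real
# task count at init time instead of patching a dummy n_tasks = 1 afterwards.
LKJ_PRIOR = partial(LKJCovariancePrior, eta=2.0)

prior = LKJ_PRIOR(n_tasks=3)
print(prior.n_tasks)  # 3
```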

@@ -0,0 +1,51 @@
import json

from pydantic import parse_obj_as
Contributor

This should all go into the test suite under botorch.tests

Contributor Author

Yes, the plan is to remove it completely. It is just there for now since it reproduces the errors I wrote about.

@jpfolch
Contributor Author

jpfolch commented Feb 23, 2024

And I really like the TaskInput solution. I'll get started on it and push the changes.

@jduerholt
Contributor

And I really like the TaskInput solution. I'll get started on it and push the changes.

Great, I will think in the meantime about the other open questions.

from bofire.data_models.features.api import DiscreteInput


class TaskInput(DiscreteInput):
Contributor

Short question: why are you basing it on the DiscreteInput and not on the CategoricalInput? I would prefer the CategoricalInput, as it is more expressive and the task is more of a categorical quantity than a discrete one; the DiscreteInput always assumes an ordinal ordering of its values.

Contributor Author

I thought about it and here is my logic:

(a) The task themselves do not have any ordering

(b) But the GP expects an ordering of the tasks corresponding to the location of each specific task in the IndexKernel, i.e. inputs to the kernel are k((x', i), (x, j)), so the task input values should always correspond to indices i, j in [0, ..., n_tasks - 1].

This means that all we need to do to initialize the TaskInput object is to select the number of tasks. I am happy to change this though, and we can just encode the categorical variables before feeding them to the GP.

Contributor Author

It is now using CategoricalInput as a base

n_tasks: int
fidelities: List[int]

@validator("fidelities")
Contributor

This is pydantic 1 style; we are using pydantic 2, so you should use field_validator.


class TaskInput(DiscreteInput):
type: Literal["TaskInput"] = "TaskInput"
n_tasks: int
Contributor

n_tasks is not necessary if you base it on the CategoricalInput as it would be just the number of categories.

@jpfolch
Contributor Author

jpfolch commented Feb 27, 2024

I've now got an implementation of TaskInput inheriting from CategoricalInput, and it seems to work fine when it comes to training and fitting the model. A few issues remain:

  1. Using OneHotToNumeric has to be done outside BoTorch's model, since the first line of the posterior call on MultiTaskGPyTorchModel is includes_task_feature = X.shape[-1] == self.num_non_task_features + 1, which checks whether the task feature is included in X (i.e. it fully expects the task_id to be a single value in one column). The input transform is not applied until after this check, so under one-hot encoding the check fails and the model returns predictions for all tasks for all inputs. Three ways of fixing this: (a) try to fix it within BoTorch and see if they accept the changes, (b) change the transform within BoFire, or (c) apply the transform before calling posterior and before setting the training data (the current code does this).

  2. I am running into issues with pydantic and serialization. When running surrogate_data.model_dump_json() I get the following warning: Expected Union[definition-ref, definition-ref, definition-ref, definition-ref, definition-ref, definition-ref, definition-ref, definition-ref] but got TaskInput - serialized value may not be as expected. I think this is causing the subsequent problem where parse_obj_as(MultiTaskGPSurrogate, json.loads(jspec)) fails to run, as it fails validation by giving the wrong arguments to the wrong classes. Any help on how to fix this would be appreciated.
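The failing shape check in point 1 can be illustrated with one line of arithmetic (d and t below are illustrative sizes, not values from the PR):

```python
# With d non-task features and t tasks, a one-hot-encoded row has width
# d + t, but MultiTaskGPyTorchModel.posterior evaluates
#   includes_task_feature = X.shape[-1] == self.num_non_task_features + 1
# *before* applying input transforms, so the check is False whenever t > 1
# and the model falls back to predicting all tasks for every input.
d, t = 4, 3  # illustrative sizes
includes_task_feature = (d + t) == (d + 1)
print(includes_task_feature)  # False for any t > 1
```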

@jduerholt
Contributor

jduerholt commented Feb 28, 2024

Hi @jpfolch,

as already discussed, you can find a proper implementation for the TaskInput feature here #360 which should be merged soon. Just use this one.

Regarding your point about the one-hot input transform: I did not know this, and it is a pity. Getting a fix into botorch will take some time, and your solution will not work with an optimizer, as the optimizer will just call posterior. For this reason, we have to use an ordinal encoding for the categorical TaskInput.

This is implemented here:

def to_ordinal_encoding(self, values: pd.Series) -> pd.Series:

The actual encoding taken into account by the model is based on the attribute input_preprocessing_specs which is validated/generated here:

@field_validator("input_preprocessing_specs")

For the MultiTaskGP we then have to overwrite it in a way that assigns CategoricalEncodingEnum.ORDINAL to the task feature. If you do this, you should already be able to use the model. For using it in optimization, we will then also have to make some adjustments. But this will not be a problem.

So, I would recommend merging main in and changing the model to ORDINAL encoding for the task feature. Note that standard models should still use ONEHOT. You will also need to adjust the input scaler to ignore the task feature, so that it stays an integer.
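The effect of ordinal encoding for the task feature can be shown with a small stand-alone helper (a stand-in for the pandas-based CategoricalInput.to_ordinal_encoding referenced above): each category maps to its integer index, so the GP sees a single integer task column rather than one-hot columns.

```python
def to_ordinal_encoding(categories, values):
    """Map category labels to integer task ids.

    Stand-in for CategoricalInput.to_ordinal_encoding (which works on pandas
    Series): a MultiTaskGP expects the task column to hold one integer id per
    row, and that column is kept out of input scaling so it stays an integer.
    """
    mapping = {category: i for i, category in enumerate(categories)}
    return [mapping[v] for v in values]

tasks = ["process_1", "process_2", "process_3"]
print(to_ordinal_encoding(tasks, ["process_3", "process_1", "process_1"]))
# [2, 0, 0]
```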

Best,

Johannes

@jpfolch
Contributor Author

jpfolch commented Feb 28, 2024

Perfect, thanks! I will get started on fixing the encoding for TaskInputs.

Regarding not being able to pickle Index kernels, @TobyBoyne found the bug within GPyTorch and submitted a PR which has been approved, so it should be fixed eventually, but we will need a temporary fix for the next few months at least.

@TobyBoyne
Collaborator

Hey, while you're waiting for the patch to hit GPyTorch, you can fix this in BoFire by manually registering the prior. I've demonstrated this in TobyBoyne@f9df6b0; just copy this change over to your code and you will be able to pickle the kernel :)
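The pickling failure itself is easy to reproduce: a lambda defined inside __init__ is a local object and cannot be pickled, while a module-level function can. A minimal sketch (the class names are mine; GPyTorch's IndexKernel stores such a lambda internally, which is why registering the prior with a named function avoids the error):

```python
import pickle

def _identity(x):
    return x

class KernelWithLambda:
    def __init__(self):
        # Mirrors IndexKernel's internal lambda: pickling an instance fails
        # with "Can't pickle local object '...__init__.<locals>.<lambda>'".
        self.param_transform = lambda x: x

class KernelWithFunction:
    def __init__(self):
        # A module-level function pickles fine.
        self.param_transform = _identity

pickle.dumps(KernelWithFunction())  # works
try:
    pickle.dumps(KernelWithLambda())
except (AttributeError, pickle.PicklingError) as err:
    print(type(err).__name__)
```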

@jduerholt
Contributor

Hey @jpfolch,

I was curious to know what the botorch guys think about changing the order of operations regarding the input transforms in the posterior call so I created this issue: pytorch/botorch#2232

Let's see ;)

Best,

Johannes

@jpfolch
Contributor Author

jpfolch commented Mar 5, 2024

The latest commit added the missing validations for the model, and added testing. Overall the multi-task model seems to work now. The only problem I am running into is that the test test_MultiTaskGPModel fails randomly (independent of the test parameters). In particular, the problem happens when calling fit_gpytorch_mll and the error thrown is RuntimeError: Must provide inverse transform to be able to sample from prior. I will try to investigate further in the next few days.

@jduerholt
Contributor

jduerholt commented Mar 5, 2024

The only problem I am running into is that the test test_MultiTaskGPModel fails randomly (independent of the test parameters). In particular, the problem happens when calling fit_gpytorch_mll and the error thrown is RuntimeError: Must provide inverse transform to be able to sample from prior. I will try to investigate further in the next few days.

I have not yet looked at your additions, but do you think this is a bofire or a botorch problem? I will have a look at your changes tomorrow.

@jduerholt
Contributor

Hi @jpfolch,

this looks good overall. I will do a more thorough review as soon as you request it and you know more about the fitting issue. To test serialization and deserialization for the added data models (the new GP and the new prior), please add example configs in tests/bofire/data_models/specs/surrogates.py and tests/bofire/data_models/specs/priors.py. Everything that is registered there is automatically tested for serialization. With add_invalid, you can add invalid specs to test your custom validators.

Best,

Johannes

@jpfolch
Contributor Author

jpfolch commented Mar 11, 2024

Sorry, I haven't been able to work on this due to other commitments, but I should have more time now. Regarding the fit_gpytorch_mll problem: I am still unsure what is causing it, but I think it could be a gpytorch problem, based on the following two issues: pytorch/botorch#1323 and pytorch/botorch#1860.

The best solution I have come up with is to drop the LKJ prior altogether until the issue is fixed, and use MultiTaskGPs without a prior on the task covariances.

@jpfolch
Contributor Author

jpfolch commented Mar 12, 2024

The latest commit adds the example configs to tests/bofire/data_models/specs/surrogates.py and tests/bofire/data_models/specs/priors.py. I have left the LKJ prior in the code, but currently it cannot be used: when used, it defaults to None and throws a warning. Let me know if this behaviour is okay or should be changed.

@jduerholt
Contributor

Thx, will have a look over the course of the week.

@jduerholt jduerholt mentioned this pull request Mar 14, 2024
Contributor

@jduerholt jduerholt left a comment

Hi,

thank you very much, overall this looks really good. Only some minor things regarding tests and validators. If you have problems with the validators, just tell me and I will try to resolve them. I know they can be tricky ...

As soon as this is landed, we have to incorporate it into the optimizer ;)

Best,

Johannes

bofire/data_models/features/tasks.py
bofire/data_models/surrogates/multi_task_gp.py
bofire/benchmarks/single.py
bofire/data_models/features/tasks.py
bofire/priors/mapper.py
tests/bofire/surrogates/test_gps.py
tests/bofire/surrogates/test_gps.py
input_preprocessing_specs={"task_id": CategoricalEncodingEnum.ONE_HOT},
)

# test that if there is no task input, there is an error
Contributor

this can also go into add_invalid

tests/bofire/surrogates/test_gps.py
tests/bofire/surrogates/test_gps.py
@jpfolch
Contributor Author

jpfolch commented Apr 3, 2024

I've made all the requested changes. All tests pass locally :).

@jduerholt
Contributor

Hi @jpfolch,

thank you very much, can you also resolve the merge conflicts by merging the main branch into this one?

Best,

Johannes

@jpfolch
Contributor Author

jpfolch commented Apr 8, 2024

All done, conflicts should be resolved now. I tested, and all tests passed except the EntingStrategy spec, but I don't have entmoot installed, which explains it.

@jduerholt jduerholt marked this pull request as ready for review April 9, 2024 19:23
@jduerholt
Contributor

jduerholt commented Apr 9, 2024

Tests are running through ;), can you fix the linting errors too?

@jpfolch
Contributor Author

jpfolch commented Apr 10, 2024

I think the linting errors should be fixed now

Contributor

@jduerholt jduerholt left a comment

Hi @jpfolch,

thank you very much! And sorry for the long process; as I am currently on parental leave, I am not looking at GitHub every day ;)

Would the next step be the multi-fidelity part, or the incorporation into the strategy?

Best,

Johannes

@jduerholt jduerholt merged commit 612db94 into experimental-design:main Apr 12, 2024
10 checks passed
@jpfolch
Contributor Author

jpfolch commented Apr 25, 2024

Hi @jduerholt,

Thank you very much! I've also been slow as I have been away and juggling other things meanwhile. Happy to see it has been merged.

The MultiTaskGP can be used directly to model the fidelities, so I think the next step would be incorporation into the strategy? That is, assuming the strategies are the part of the code where the BO loop chooses which experiment to do next?

Best,
Jose

@jduerholt
Contributor

Hi @jpfolch,

now I am back from parental leave and have more time for BoFire again.

Yes, incorporation into the strategies would be the next step. The adaptations have to be made here: https://github.com/experimental-design/bofire/blob/main/bofire/strategies/predictives/botorch.py

In particular, we have to adjust how we deal with fixed features, as we currently assume that categoricals are always one-hot encoded. Honestly, this whole part of the code needs to be cleaned up. Are you interested in doing it, or should I make a first draft?

Best,

Johannes

@jduerholt
Contributor

jduerholt commented May 17, 2024

It would be especially important to know how the optimization over the TaskInput should work: do we optimize the ACQF for all allowed tasks and return the candidate with the highest acqf value? Or is the evaluation cost also considered somehow, or is only the highest fidelity queried? Do we need specific ACQFs for this case?

@jpfolch
Contributor Author

jpfolch commented Jun 7, 2024

Hi @jduerholt,

Sorry for the delay in replying, I have been busy finishing up work on my PhD. I am currently writing up, but I should have some time to work on BoFire. Based on my previous work, the way I recommend optimizing over TaskInput is:

  1. We first choose the next point of interest by returning the candidate with the highest acqf value at the target fidelity.

  2. We then choose which fidelity to evaluate based on one of two criteria:
    (a) If the fidelities are strictly ordered, we can simply choose the lowest fidelity until we have reduced the uncertainty enough, then the second lowest, and so on ... the idea being that we want to use the lowest possible fidelity that we believe might still be informative about our target. This criterion is cheap to compute, generally stable, and I found it to work well. However, it does not take cost into account.
    (b) A second criterion, which is more general and does take cost into account, is to consider the information gain of each fidelity about the target fidelity per unit cost. However, it can be expensive to compute accurately and can be numerically unstable. There is a BoTorch implementation of the method, which is probably well optimized (the single-fidelity method is described in this tutorial, but there is a multi-fidelity version in the source code).

Approach (b) would be equivalent to first optimizing any acquisition function at the target fidelity, and then optimizing the task input only with the MF-MES acquisition function. The benefit of this is that it gives you more flexibility in the choice of acquisition functions, and it is computationally cheaper than optimizing MF-MES directly (which is another possible route).
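Criterion (a) can be sketched as a simple loop over fidelities, lowest first, driven by the posterior standard deviation at the chosen candidate (the function name and the threshold rule are my assumptions, not an existing BoFire API):

```python
from typing import List

def select_fidelity(posterior_stds: List[float], threshold: float) -> int:
    """Pick the lowest fidelity that is still expected to be informative.

    posterior_stds holds the model's predictive std at the chosen candidate
    for each fidelity, ordered lowest to highest. If a cheap fidelity's
    uncertainty is still above `threshold`, query it; once every lower level
    is resolved, fall through to the target (highest) fidelity.
    """
    for fidelity, std in enumerate(posterior_stds[:-1]):
        if std > threshold:
            return fidelity
    return len(posterior_stds) - 1  # all lower levels resolved: query target

print(select_fidelity([0.8, 0.5, 0.3], threshold=0.4))  # 0: cheapest is still uncertain
print(select_fidelity([0.1, 0.5, 0.3], threshold=0.4))  # 1: lowest level resolved
print(select_fidelity([0.1, 0.2, 0.3], threshold=0.4))  # 2: query the target fidelity
```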

Let me know which of the two options you would prefer, and I can get started on it.

Best,
Jose

@jpfolch
Contributor Author

jpfolch commented Jun 11, 2024

@bertiqwerty

@jduerholt
Contributor

Hi @jpfolch,

totally forgot this. I will have a detailed look tomorrow. Sorry. Too much to do :(

Best,

Johannes

@jduerholt
Contributor

Hi @jpfolch,

sorry for the delay, the last weeks were crazy. I am not so much into multi-fidelity optimization, so I have a few questions:

In general, I am always for starting with the easiest option ;)

Best,

Johannes
