Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Surrogate Posteriors #309

Merged
merged 7 commits into from
Jul 15, 2024
Merged

Conversation

AdrianSosic
Copy link
Collaborator

@AdrianSosic AdrianSosic commented Jul 9, 2024

This PR is a next step toward a lean surrogate class layout:

  • Surrogate.(_)posterior now returns a Posterior object
  • Surrogate._fit no longer expects the SearchSpace as an argument, which brings us closer to the state that .fit and .posterior operate on the user/dataframe/context-level while _posterior and _fit operate on the purely mathematical level. This means that a user who writes their own surrogate class effectively only needs to implement the corresponding mathematical model in the latter two methods. Optional context information that may be required for this implementation (like the dimension index of the TaskParameter in the passed Tensor object) is now encapsulated into a surrogate-specific context object, that can be arbitrarily populated by the surrogate class, but whose logic is now cleanly separated from the actual fitting logic.
  • Adds a new GaussianSurrogate base class for (most our other) models that come with the implicit Gaussian noise assumption and effectively only implement mean and (co-)variance estimation.
  • Removes current scaling logic --> will be redesigned in upcoming PR
  • Improves and simplifies logic of the catch_constant_target, re-enabling slots for Surrogates
  • Removes register_custom_architecture --> will replaced with SurrogateProtocol in upcoming PR

@AdrianSosic AdrianSosic self-assigned this Jul 9, 2024
@AdrianSosic AdrianSosic changed the base branch from main to dev/surrogates July 9, 2024 11:26
@AdrianSosic AdrianSosic force-pushed the refactor/surrogates/posterior branch 2 times, most recently from 23f121c to e1279bf Compare July 10, 2024 06:40
@AdrianSosic AdrianSosic force-pushed the refactor/surrogates/posterior branch from e1279bf to 9d53245 Compare July 10, 2024 09:11
@AdrianSosic AdrianSosic force-pushed the refactor/surrogates/posterior branch from 9d53245 to 13e3353 Compare July 11, 2024 06:57
@AdrianSosic AdrianSosic force-pushed the refactor/surrogates/posterior branch from 13e3353 to 32e2ef5 Compare July 11, 2024 07:00
@AdrianSosic AdrianSosic merged commit 956d1d4 into dev/surrogates Jul 15, 2024
6 of 10 checks passed
@AdrianSosic AdrianSosic deleted the refactor/surrogates/posterior branch July 15, 2024 06:27
@AdrianSosic AdrianSosic added this to the Surrogate refactoring milestone Jul 24, 2024
@AdrianSosic AdrianSosic mentioned this pull request Aug 9, 2024
AdrianSosic added a commit that referenced this pull request Aug 29, 2024
Completes the surrogate factoring, which extended over #278, #309, #315,
#325, #337.

### Most important changes
* The transition point from experimental to computational representation
has been moved from the recommender to the surrogate. From an
architecture/responsibility perspective, this is reasonable since the
recommend should not have to bother about algorithmic/computational
details.
* The desired consequence is that public `Surrogate` methods like
`posterior` and `fit` can now operate on dataframes in experimental
representation, meaning they can also be exposed directly to the user.
* The new posterior methods now all return a general `Posterior` object
instead of implicitly assuming Gaussian distributions. This paves the
way for arbitrary surrogate extensions, such as Bernoulli/Categorical
surrogates, etc. At the moment, this introduces an explicit coupling to
botorch, which is fine because botorch remains a core dependency and the
only backend used for complex surrogate modeling. In the future, this
can be further abstracted by introducing our own `Posterior` class.
* The `Surrogate` layout has been refined such that the extracted
`SurrogateProtocol`, which now defines the formal interface for all
surrogates, imposes minimal requirements to the user.
* Scaling has been completely redesigned, offering the possibility to
configure input/output scaling down to the level of individual
parameters and targets. The configuration is currently class-specific,
but can be extended to allow surrogate instance specific rules in the
future.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants