Version 2.0.0a1 #875

renesass · 2022-07-19T12:12:14Z

Big pull-request to make SMAC more user-friendly.

Documentation

https://automl.github.io/SMAC3/development-2.0/

Todo

Required for release candidate:

HB/SH seems to be stuck when using more than 1 worker and also ignores walltime limit (just add workers to 2/1 example) -> Seems like it's a mac problem
Multi-Tasking für Intensifier?
Integrate BOinG + TurBO again
Integrate HydraFacade

Discussions / Postponed improvements

Continue runs.
- Many components are depending on states. Idea: Save/load states s.t. the optimization can pick-off where it stopped.
- Have the option to rerun crashed ones.
- HyperparameterFacade starts a local search when continuing a run. However, the limit already was exceeded.
- HB/SH does not work because an incumbent is given and it's stuck in the first stage.
Ask-and-Tell Interface
- Call asks multiple times before calling tell
- Find a way to incorporate the trials from the user when only using tell. Partially works for intensifier already.
- It does not make sense to tell SMAC trials in advance when using SH. Reason: It's heavily depending on a budget+instance combination and even if the user provides it, SMAC have to wait till the other trials have been finished too.
Constraints instead of imputation?
Facade: Build the facade automatically based on the scenario inputs (like if budgets are defined use successivehalving e.g.) -> AutoFacade? -> Log which components are used
Random_design.check should not be in acquition but in SMBO main loop, since it obleviates the necessity of computing epm and acquition to begin with?
Ask and intensification inversion. Currently, the ask method is passed to the intensifier. But ideally, the ask method defines what configuration to be evaluated exactly on what problem instance. This implies the inverse relation: the intensifier should be called from within ask?
Intensification is one way of defining a fidelity (as number of problem instances to evaluate on) but it shouldn’t be at the heart of SMAC, since nowadays the multiple dataset optimization is no longer as prominent.
10000 challengers: Never touch the surrogate model again? This usually happens due to the intensification percentage being at 0.5, the model fitting taking quite long, and the functions taking no time. If these three things don't apply, this is indeed an issue
_collect_data in smbo.py: Training only on the highest fidelity or mixed fidelity? -> Docs
Problem with _get_x_best and instances: Only the config with the lowest cost is used?!
Tools to visualize things.

Findings

I tried hard to find the reasons why SMAC is not reproducible: The reason is because of the method _get_timebound_for_intensification (influences the time_bound from the process_results method from the intensifier) in base_smbo.py as the time for the calculation is never 100% the same. Setting it to a fixed number results in reproducible results.

Changelog

Big Changes

We redesigned the scenario class completely. The scenario is implemented as a dataclass now and holds only environment variables (like limitations or save directory). Everything else was moved to the components directly.
We removed runtime optimization completely (no adaptive capping or imputing anymore).
We removed the command-line interface and restructured everything alongside. Since SMAC was building upon the command-line interface (especially in combination with the scenario), it was complicated to understand the behavior or find specific implementations. With the removal, we re-wrote everything in python and re-implemented the feature of using scripts as target functions.
Introducing trials: Each config/seed/budget/instance calculation is a trial.
The configuration chooser is integrated into the SMBO object now. Therefore, SMBO finally implements an ask-tell interface now.
Facades are redesigned so that they accept instantiated components directly. If a component is not passed, a default component is used, which is specified for each facade individually in the form of static methods. You can use those static methods directly to adapt a component to your choice.
A lot of API changes and renamings (e.g., RandomConfigurationChooser -> RandomDesign, Runhistory2EPM -> RunHistoryEncoder).
Ambiguous variables are renamed and unified across files.
Dependencies of modules are reduced drastically.
We incorporated Pynisher 1.0, which ensures limitations cross-platform.
We incorporated ConfigSpace 0.6, which simplified our examples.
Examples and documentation are completely reworked. Examples use the new ConfigSpace, and the documentation is adapted to version 2.0.
Transparent target function signatures: SMAC checks now explicitly if an argument is available (the required arguments are now specified in the intensifier). If there are more arguments that are not passed by SMAC, a warning is raised.
Components implement a meta property now, all of which describe the initial state of SMAC. The facade collects all metadata and saves the initial state of the scenario.
Improved multi-objective in general: RunHistory (in addition to RunHistoryEncoder) both incorporates the multi-objective algorithm. In other words, if the multi-objective algorithm changes the output, it directly affects the optimization process.
Configspace is saved in json only
StatusType is saved as integer and not as dict anymore
We changed the behavior of continuing a run:
- SMAC automatically checks if a scenario was saved earlier. If there exists a scenario and the initial state is the same, SMAC automatically loads the previous data. However, continuing from that run is not possible yet.
- If there was a scenario earlier, but the initial state is different, then the user is asked to overwrite the run or to still continue the run although the state is different (Note that this only can happen if the name specified in the scenario is the same). Alternatively, an old to the old run is added (e.g., the name was test, it becomes test-old).
- The initial state of the SMAC run also specifies the name (if no name in the scenario is specified). If the user changes something in the code base or in the scenario, the name and, therefore, the save location automatically changes.

New Features

Added a new termination feature: Use terminate_cost_threshold in the scenario to stop the optimization after a configuration was evaluated with a cost lower than the threshold.
Callbacks are completely redesigned. Added callbacks to the facade are called in different positions in the Bayesian optimization loop.
The multi-objective algorithm MeanAggregationStrategy supports objective weights now.
RunHistory got more methods like get_incumbent or get_pareto_front.

Fixes

You ever noticed that the third configuration has no origin? It's fixed now.
We fixed ParEGO (it updates every time training is performed now).

Optimization Changes

Changed initial design behavior
- You can add additional configurations now.
- max_ratio will limit both n_configs and n_configs_per_hyperparameter but not additional configurations
- Reduced default max_ratio to 0.1.

Code Related

Converted all unittests to pytests.
Instances, seeds, and budgets can be set to none now. However, mixing none and non-none will throw an exception.

…t-2.0

…o development-2.0

changelog.md

This was linked to issues Aug 4, 2022

Remodel TrajLogger #870

Closed

Incorporate Pynisher 1.0 #853

Closed

Pynisher isn't required unless limiting resources #822

Closed

Remove Scenario #860

Closed

This was linked to issues Aug 12, 2022

Alternative termination features other than max iteration? #877

Closed

[Doc] Document the callback signature better in docs #823

Closed

renesass added 9 commits August 15, 2022 16:12

A lot of work on intensifier

b3da515

Fix type?

cde6f70

Finally finished test_intensify

fafe612

Importing and isort

4508a08

Small user improvements

c89f8ba

Moved some files

0668714

Testing parallel scheduler successful

e5152b1

Progress on SH tests

435ec87

Implemented ASK-TELL interface

988cc45

renesass linked an issue Aug 16, 2022 that may be closed by this pull request

Question: is there an ask-tell interface? #814

Closed

renesass added 7 commits August 16, 2022 15:13

Expanded SH tests

01a99db

Slightly changed ask-tell structure

8be9453

Slow progress on SH testing

2f3178f

Added meta methods

728e9e0

Shifted old examples

d51ec3a

Added categories in examples

dd70faf

Docs renders again

e555202

renesass linked an issue Aug 17, 2022 that may be closed by this pull request

Improve pipelining #863

Closed

2 tasks

dengdifan and others added 6 commits August 18, 2022 11:02

move boing and turbo to smbo

e22526b

typos

ba3be97

Merge remote-tracking branch 'origin/development-2.0' into developmen…

a3618ec

…t-2.0

boing facade

a065bf1

Added option to pass evaluated configs

6a31077

MO algo does not return none anymore

e83a53d

renesass changed the title ~~Development 2.0~~ Version 2.0.0a1 Oct 11, 2022

renesass requested review from dengdifan and timruhkopf October 11, 2022 07:09

renesass and others added 11 commits October 11, 2022 09:26

Updated readme

b7f3bfa

Merged main

53dc526

Update README.md

0a1e7eb

Update README.md

dae994c

Updated text

f1c8ed6

Updated versions

3fbb179

Merge branch 'development-2.0' of https://github.com/automl/SMAC3 int…

cfbba16

…o development-2.0

Fix typo

546fa11

Merge branch 'development-2.0' of https://github.com/automl/SMAC3 int…

dad62af

…o development-2.0

Fix typo

d129c7c

Update README.md

7f31f5c

benjamc approved these changes Oct 11, 2022

View reviewed changes

Update facade name

823b5b9

dengdifan reviewed Oct 11, 2022

View reviewed changes

changelog.md Outdated Show resolved Hide resolved

dengdifan reviewed Oct 11, 2022

View reviewed changes

changelog.md Outdated Show resolved Hide resolved

dengdifan reviewed Oct 11, 2022

View reviewed changes

changelog.md Show resolved Hide resolved

dengdifan reviewed Oct 11, 2022

View reviewed changes

changelog.md Outdated Show resolved Hide resolved

Updated changelog

1b75cec

dengdifan approved these changes Oct 12, 2022

View reviewed changes

renesass merged commit ca4ffba into main Oct 12, 2022

renesass deleted the development-2.0 branch October 12, 2022 10:31

github-actions bot pushed a commit that referenced this pull request Oct 12, 2022

René Sass: Version 2.0.0a1 (#875)

351fb38

renesass restored the development-2.0 branch October 12, 2022 11:53

This was referenced Oct 17, 2022

Dead link on github.io-page #882

Closed

Rename classes #864

Closed

Increase algorithm transparency #862

Closed

Wrapper for CLI #861

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Version 2.0.0a1 #875

Version 2.0.0a1 #875

renesass commented Jul 19, 2022 •

edited

Loading

Version 2.0.0a1 #875

Version 2.0.0a1 #875

Conversation

renesass commented Jul 19, 2022 • edited Loading

Documentation

Todo

Discussions / Postponed improvements

Findings

Changelog

Big Changes

New Features

Fixes

Optimization Changes

Code Related

renesass commented Jul 19, 2022 •

edited

Loading