
CANDID DAC: Leveraging Coupled Action Dimensions with Importance Differences in DAC

Overview

This work addresses challenges in dynamic algorithm configuration (DAC) by simulating high-dimensional action spaces with interdependencies and varying importance between action dimensions. We propose sequential policies to effectively manage these properties, significantly outperforming independent learning of factorized policies and overcoming scalability limitations. Read the full paper at tbd.

Repository Structure

  • DACBench/: Contains our fork of DACBench, extended with the Piecewise Linear benchmark; we will replace this with a git submodule after the double-blind review.
  • analysis/: Contains scripts and notebooks for extracting data from wandb and for generating plots.
  • scripts/: Main script to run all our experiments and the conf subdirectories to set up experiments with Hydra.
  • src/candid_dac/: Implementation of the algorithms and policies to evaluate on the Piecewise Linear benchmark.
  • setup.py: Script for installing the package and its dependencies.

Getting Started

Prerequisites

Ensure you have the following installed:

  • Python 3.8 or higher
  • Conda

Installation

  1. Clone the repository:
    git clone https://github.com/PhilippBordne/candidDAC.git
    cd candidDAC
  2. Get the DACBench submodule, which contains the benchmarks:
    git submodule update --init --recursive
  3. Create and activate a conda environment with Python 3.10:
    conda create -n candid python=3.10
    conda activate candid
  4. Install the package and its dependencies (make sure the DACBench/ directory is at the repository root; a quick import check follows after this list):
    pip install -e .
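
If the editable install succeeded, importing the package should work. A minimal sanity check, assuming the import names candid_dac and dacbench match the src/candid_dac/ and DACBench/ directories (a hedged check, not an official test):

    python -c "import candid_dac, dacbench; print('install OK')"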

Reproducing the Experiments

Note: We use wandb to track all the metrics in our experiments. Without wandb, our current implementation does not log any metrics and only prints the training reward to the console.
To track metrics, please specify the wandb project to log to in the Hydra config you plan to run (under scripts/conf); an illustrative sketch follows below.
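
As an illustration only, the wandb entry in such a config might look like the sketch below; the key names (wandb, project, entity) are assumptions, so check the actual files under scripts/conf for the exact schema:

    # hypothetical excerpt from a config under scripts/conf/ (key names are assumptions)
    wandb:
      project: my-candid-dac-project  # wandb project the metrics are logged to
      entity: my-wandb-user           # optional: your wandb username or team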

To reproduce the experiments, follow these steps:

  1. Navigate to the scripts directory:

    cd scripts
  2. To run a simple example of the SAQL algorithm on the piecewise_linear_2d benchmark, use the following command:

    python dqn_factorized_policies.py --config-name=simple_example
          
  3. To reproduce a specific experiment, select which algorithm to run on which benchmark setup. You can also select the hyperparameters to use for the run; we always used the best_*.yaml configs (e.g. best_saql below). You also have to specify the seed to run the experiment on. This example runs SAQL on the 10D Piecewise Linear benchmark using seed 0:

    python dqn_factorized_policies.py --config-name=config +benchmark=piecewise_linear_10d +algorithm=saql +hyperparameters=best_saql +seed=0

    Replace saql, piecewise_linear_10d, and best_saql with the algorithm, benchmark, and hyperparameters of your choice. To repeat a run across several seeds, see the loop sketch after this list.

  4. To sweep over different hyperparameter settings, you can sample a random configuration from the search spaces specified in scripts/dqn_factorized_policies.py. To do so, run the following command:

    python dqn_factorized_policies.py --config-name=config +benchmark=sigmoid_5d +algorithm=saql +hyperparameters=random_config +hyperparameters.seed=123 +seed=321

    This samples a random configuration using seed 123 (identified as config_id 123 in your wandb project). Seed 321 is then used to run the experiment with the sampled configuration (and is identified as seed in your wandb project). Note that we used the 5D Sigmoid benchmark to identify our hyperparameters.
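
To repeat the reproduction command from step 3 across several seeds, wrap it in a small shell loop. This is a sketch that only reuses the Hydra overrides shown above:

    # run SAQL on the 10D Piecewise Linear benchmark for seeds 0 through 4
    for seed in 0 1 2 3 4; do
        python dqn_factorized_policies.py --config-name=config +benchmark=piecewise_linear_10d +algorithm=saql +hyperparameters=best_saql +seed=$seed
    done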

Configuration Files

The configuration files for the experiments are located in the scripts/conf/ directory. These files contain the settings for the main experiment design choices: benchmark, algorithm, and hyperparameter configuration.
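
Judging from the Hydra override groups used above (+benchmark, +algorithm, +hyperparameters), the layout of scripts/conf/ presumably resembles the sketch below; file names beyond those appearing in the commands above are assumptions:

    scripts/conf/
    ├── config.yaml         # base config, selected via --config-name=config
    ├── simple_example.yaml # config for the simple example in step 2
    ├── benchmark/          # e.g. piecewise_linear_10d.yaml, sigmoid_5d.yaml
    ├── algorithm/          # e.g. saql.yaml
    └── hyperparameters/    # e.g. best_saql.yaml, random_config.yaml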

Results

Results from the experiments, including logs and output data, will be stored in the results/ directory. You can analyze these results using the scripts and notebooks provided in the analysis/ directory. We provide the configurations and metrics for the experiments presented in the paper in analysis/run_data/.

Acknowledgements
