
[examples] Add a top-level README file #468

Merged
ChrisCummins merged 7 commits into facebookresearch:development from examples-docs on Oct 20, 2021

Conversation

ChrisCummins
Contributor

@ChrisCummins ChrisCummins commented Oct 14, 2021

This adds a top-level examples/README.md file which provides an overview of the various scripts and subdirectories.

This also updates a few implementation details in the example scripts.

Here is a preview of the new README:


CompilerGym Examples

This directory contains code samples for everything from implementing simple
RL agents to adding support for entirely new compilers. Is there an example that
you think is missing? If so, please contribute!

Table of contents:

  • Autotuning
  • Reinforcement learning
  • Extending CompilerGym
  • Miscellaneous

Autotuning

Performing a random walk of an environment

The random_walk.py script runs a single episode of a
CompilerGym environment, logging the action taken and reward received at each
step. Example usage:

$ python random_walk.py --env=llvm-v0 --step_min=100 --step_max=100 \
      --benchmark=cbench-v1/dijkstra --reward=IrInstructionCount

=== Step 1 ===
Action:       -lower-constant-intrinsics (changed=False)
Reward:       0.0
Step time:    805.6us

=== Step 2 ===
Action:       -forceattrs (changed=False)
Reward:       0.0
Step time:    229.8us

...

=== Step 100 ===
Action:       -globaldce (changed=False)
Reward:       0.0
Step time:    213.9us

Completed 100 steps in 91.6ms (1091.3 steps / sec).
Total reward: 161.0
Max reward:   111.0 (+68.94% at step 31)

For further details run: python random_walk.py --help.
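For orientation, the core of such a random walk is only a few lines against CompilerGym's gym-style Python API. The following is a minimal sketch, not the script itself; the benchmark and episode length are illustrative:

```python
import compiler_gym

with compiler_gym.make(
    "llvm-v0",
    benchmark="cbench-v1/dijkstra",
    reward_space="IrInstructionCount",
) as env:
    env.reset()
    total_reward = 0.0
    for step in range(100):
        action = env.action_space.sample()              # pick a random action
        observation, reward, done, info = env.step(action)
        total_reward += reward
        if done:                                        # episode ended early
            break
    print(f"Total reward: {total_reward}")
```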

GCC Autotuning (genetic algorithms, hill climbing, + more)

The gcc_search.py script contains implementations of several
autotuning techniques for the GCC environment. It was used to produce the
results for the GCC experiments in the CompilerGym whitepaper. For further
details run:
python gcc_search.py --help.
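To give a flavor of the techniques, here is a hedged sketch of hill climbing over a fixed-length action sequence. It is written against the LLVM environment for simplicity and uses only the generic gym API; gcc_search.py applies the same idea to the GCC environment, with different details:

```python
import random

import compiler_gym

def episode_reward(env, actions):
    """Replay a fixed action sequence from a fresh state; return its total reward."""
    env.reset()
    return sum(env.step(a)[1] for a in actions)

with compiler_gym.make("llvm-ic-v0", benchmark="cbench-v1/dijkstra") as env:
    n = env.action_space.n                              # discrete action space size
    best = [random.randrange(n) for _ in range(20)]     # random starting sequence
    best_reward = episode_reward(env, best)
    for _ in range(100):                # hill climbing: keep improving mutations
        candidate = list(best)
        candidate[random.randrange(len(candidate))] = random.randrange(n)
        reward = episode_reward(env, candidate)
        if reward > best_reward:
            best, best_reward = candidate, reward
    print(f"Best reward: {best_reward:.4f}")
```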

Makefile integration

The makefile_integration directory demonstrates a
simple integration of CompilerGym into a C++ Makefile config. For details see
the Makefile.

Random search using the LLVM C++ API

While not intended for the majority of users, it is entirely straightforward to
skip CompilerGym's Python frontend and interact with the C++ APIs directly. The
RandomSearch.cc file demonstrates a simple parallelized
random search implemented for the LLVM compiler service. Run it using:

bazel run -c opt //examples:RandomSearch -- --benchmark=benchmark://cbench-v1/crc32

For further details run: bazel run -c opt //examples:RandomSearch -- --help

Reinforcement learning

PPO and integration with RLlib


The rllib.ipynb notebook, which can be run in Colab, demonstrates integrating
CompilerGym with the popular RLlib reinforcement
learning library. The notebook covers registering a custom environment that
uses a constrained subset of the LLVM environment's action space and a finite
time horizon, and training a PPO agent using separate train/val/test datasets.
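In outline, the environment setup in the notebook looks something like the sketch below. It assumes the ConstrainedCommandline and TimeLimit wrappers from compiler_gym.wrappers and Ray's 1.x-era tune API; the flag list and hyperparameters are illustrative only:

```python
import compiler_gym
from compiler_gym.wrappers import ConstrainedCommandline, TimeLimit
from ray import tune
from ray.rllib.agents.ppo import PPOTrainer  # Ray 1.x import path

def make_env(env_config):
    env = compiler_gym.make(
        "llvm-v0",
        observation_space="Autophase",
        reward_space="IrInstructionCountOz",
    )
    # Constrain the agent to a small subset of the LLVM action space.
    env = ConstrainedCommandline(env, flags=["-sroa", "-mem2reg", "-newgvn"])
    # Impose a finite time horizon on the otherwise unbounded episodes.
    env = TimeLimit(env, max_episode_steps=5)
    return env

tune.register_env("compiler_gym", make_env)
tune.run(PPOTrainer, config={"env": "compiler_gym", "framework": "torch"})
```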

Actor-critic

The actor_critic script contains a simple actor-critic
example using PyTorch. The objective is to minimize the size of a benchmark
(program) using LLVM compiler passes. At each step there is a choice of which
pass to pick next and an episode consists of a sequence of such choices,
yielding the number of saved instructions as the overall reward. To simplify
the learning task, only a (configurable) subset of LLVM passes is considered
and every episode has the same (configurable) length.
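
At its core, such an agent pairs a policy head (the actor) with a value head (the critic) and weights the policy gradient by the advantage. A condensed PyTorch sketch follows, assuming the 56-dimensional Autophase observation and a 16-pass action subset (both configurable in the real script):

```python
import torch
import torch.nn as nn

class ActorCritic(nn.Module):
    """Shared trunk with separate policy (actor) and value (critic) heads."""

    def __init__(self, obs_dim: int, n_actions: int):
        super().__init__()
        self.trunk = nn.Sequential(nn.Linear(obs_dim, 128), nn.ReLU())
        self.policy = nn.Linear(128, n_actions)  # action logits
        self.value = nn.Linear(128, 1)           # state-value estimate

    def forward(self, obs):
        h = self.trunk(obs)
        return self.policy(h), self.value(h)

def actor_critic_loss(logits, value, action, episode_return):
    """Advantage-weighted policy gradient plus a value regression term."""
    dist = torch.distributions.Categorical(logits=logits)
    advantage = episode_return - value.detach()  # no backprop through the baseline
    policy_loss = -dist.log_prob(action) * advantage
    value_loss = (episode_return - value) ** 2
    return (policy_loss + value_loss).mean()

model = ActorCritic(obs_dim=56, n_actions=16)
```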

For further details run: python actor_critic.py --help.

Tabular Q learning

The tabular_q script contains a simple tabular Q-learning
example for the LLVM environment. Using selected features from the Autophase
observation space, and a given training program as the gym environment, it
finds the best action sequence using online Q-learning.
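
The essence of the approach fits in a short sketch. The state key, feature indices, and hyperparameters below are illustrative, not the script's actual choices:

```python
import random
from collections import defaultdict

import compiler_gym

ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.2  # learning rate, discount, exploration
EPISODE_LENGTH = 10                    # fixed-length episodes

Q = defaultdict(float)                 # maps (state, action) -> estimated return

def state_key(autophase):
    # Hypothetical feature selection: a handful of Autophase counters.
    return tuple(int(autophase[i]) for i in (0, 5, 11, 21))

# llvm-autophase-ic-v0: Autophase observations, IrInstructionCount rewards.
with compiler_gym.make("llvm-autophase-ic-v0", benchmark="cbench-v1/qsort") as env:
    actions = list(range(env.action_space.n))
    for _ in range(200):               # online Q-learning over repeated episodes
        s = state_key(env.reset())
        for _ in range(EPISODE_LENGTH):
            if random.random() < EPSILON:
                a = env.action_space.sample()            # explore
            else:
                a = max(actions, key=lambda x: Q[s, x])  # exploit
            obs, reward, done, _ = env.step(a)
            s2 = state_key(obs)
            # One-step Q-learning update.
            Q[s, a] += ALPHA * (reward + GAMMA * max(Q[s2, x] for x in actions) - Q[s, a])
            s = s2
            if done:
                break
```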

For further details run: python tabular_q.py --help.

Extending CompilerGym

Example CompilerGym service

The example_compiler_gym_service directory
demonstrates how to extend CompilerGym with support for new compiler problems.
The directory contains bare-bones implementations of backends in Python or C++
that can be used as the basis for adding new compiler environments. See the
README.md file for further details.
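In rough outline, a Python backend is a session class that the service framework drives. The shape below is only an approximation based on the CompilerGym service documentation; take the exact base class, proto types, and method signatures from the directory's README.md and the current API:

```python
from compiler_gym.service import CompilationSession  # base class name per the docs

class MyCompilationSession(CompilationSession):
    """One session is created per environment instance; it owns compiler state."""

    def __init__(self, working_dir, action_space, benchmark):
        super().__init__(working_dir, action_space, benchmark)
        # ... load `benchmark` and initialize compiler state here ...

    def apply_action(self, action):
        # Apply one action. Reports (approximately): whether the episode has
        # ended, an optional new action space, and whether the action was a no-op.
        return False, None, False

    def get_observation(self, observation_space):
        # Compute and return the requested observation for the current state.
        raise NotImplementedError
```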

Example loop unrolling

The example_unrolling_service directory
demonstrates how to implement support for a real compiler problem by integrating
with commandline loop unrolling flags for the LLVM compiler. See the
README.md file for further details.

Miscellaneous

Exhaustive search of bounded action spaces

The brute_force.py script runs a parallelized brute force of
an action space. It enumerates all possible combinations of actions up to a
finite episode length and evaluates them, logging the incremental rewards of
each. Example usage:

$ python brute_force.py --env=llvm-ic-v0 --benchmark=cbench-v1/dijkstra \
      --episode_length=8 --brute_force_action_list=-sroa,-mem2reg,-newgvn

Enumerating all episodes of 3 actions x 8 steps
Started 24 brute force workers for benchmark benchmark://cbench-v1/dijkstra using reward IrInstructionCountOz.
=== Running 6,561 trials ===
Runtime: 8 seconds. Progress: 100.00%. Best reward found: 0.8571428571428572.
Ending jobs ... I1014 12:04:51.671775 3245811 CreateAndRunCompilerGymServiceImpl.h:128] Service "/dev/shm/compiler_gym_cec/s/1014T120451-646797-5770" listening on 37505, PID = 3245811
completed 6,561 of 6,561 trials (100.000%), best sequence -mem2reg -mem2reg -sroa -sroa -mem2reg -sroa -sroa -newgvn

For further details run: python brute_force.py --help.
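Conceptually, the enumeration is a Cartesian product over the action list, evaluated one episode at a time. A serial sketch follows (the script itself fans out across parallel workers; the .flags attribute is an assumption about the LLVM Commandline action space):

```python
import itertools

import compiler_gym

ACTIONS = ["-sroa", "-mem2reg", "-newgvn"]
EPISODE_LENGTH = 8

best_reward, best_sequence = float("-inf"), None
with compiler_gym.make("llvm-ic-v0", benchmark="cbench-v1/dijkstra") as env:
    # Translate flag names to action indices.
    indices = [env.action_space.flags.index(a) for a in ACTIONS]
    # 3 actions ** 8 steps = 6,561 episodes, matching the output above.
    for sequence in itertools.product(indices, repeat=EPISODE_LENGTH):
        env.reset()
        reward = sum(env.step(a)[1] for a in sequence)
        if reward > best_reward:
            best_reward, best_sequence = reward, sequence
print(f"Best reward: {best_reward}")
```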

The explore.py script evaluates all possible combinations of
actions up to a finite limit, but partial sequences of actions that end up in
the same state are deduplicated, sometimes dramatically reducing the size of the
search space. This script can also be configured to do a beam search.

Example usage:

$ python explore.py --env=llvm-ic-v0 --benchmark=cbench-v1/dijkstra \
      --episode_length=8 --explore_actions=-simplifycfg,-instcombine,-mem2reg,-newgvn

...

*** Processing depth 6 of 8 with 11 states and 4 actions.

                                 unpruned     self_pruned    cross_pruned     back_pruned         dropped             sum
        added this depth                0              33               0              11               0              44
   full nodes this depth                0           2,833           1,064             199               0           4,096
     added across depths               69             151              23              34               0             277
full added across depths               69           3,727           1,411             254               0           5,461

Time taken for depth: 0.05 s
Top 3 sequence(s):
  0.9694  -mem2reg, -newgvn, -simplifycfg, -instcombine
  0.9694  -newgvn, -instcombine, -mem2reg, -simplifycfg
  0.9694  -newgvn, -instcombine, -mem2reg, -simplifycfg, -instcombine


*** Processing depth 7 of 8 with 0 states and 4 actions.

There are no more states to process, stopping early.

For further details run: python explore.py --help.
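The pruning idea can be sketched as a breadth-first enumeration that fingerprints the compiler state after each action prefix and skips states it has already seen. This sketch replays each prefix from scratch for simplicity and assumes the LLVM environment's IrSha1 observation is available as a state fingerprint:

```python
import compiler_gym

DEPTH = 4

with compiler_gym.make("llvm-ic-v0", benchmark="cbench-v1/dijkstra") as env:
    seen = set()
    frontier = [[]]                      # action prefixes still worth expanding
    for depth in range(DEPTH):
        next_frontier = []
        for prefix in frontier:
            for action in range(env.action_space.n):
                env.reset()
                for a in prefix + [action]:
                    env.step(a)          # replay the candidate prefix
                state = env.observation["IrSha1"]  # fingerprint of the module
                if state not in seen:    # deduplicate: prune repeated states
                    seen.add(state)
                    next_frontier.append(prefix + [action])
        frontier = next_frontier
```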

Estimating the immediate and cumulative reward of actions and benchmarks

The sensitivity_analysis directory contains a pair of
scripts for estimating the sensitivity of the reward signal to different
environment parameters:

  • action_sensitivity_analysis.py:
    This script estimates the immediate reward of applying a specific action by
    running trials. A trial is a random episode that ends with the chosen action
    (see the sketch after this list).
  • benchmark_sensitivity_analysis.py:
    This script estimates the cumulative reward of a random episode on a
    benchmark by running trials. A trial is an episode in which a random number
    of random actions are performed and the total cumulative reward is recorded.
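
As referenced in the list above, a single trial of the action-sensitivity estimate can be sketched as follows; the prefix length, trial count, and the .flags attribute on the action space are assumptions:

```python
import random
import statistics

import compiler_gym

def action_reward_trial(env, action, max_prefix=10):
    """Run a random episode that ends with `action`; return that action's reward."""
    env.reset()
    for _ in range(random.randrange(max_prefix)):
        env.step(env.action_space.sample())  # random warm-up prefix
    _, reward, _, _ = env.step(action)       # immediate reward of the target action
    return reward

with compiler_gym.make("llvm-ic-v0", benchmark="cbench-v1/qsort") as env:
    action = env.action_space.flags.index("-mem2reg")
    rewards = [action_reward_trial(env, action) for _ in range(25)]
    print(f"mean={statistics.mean(rewards):.4f} stdev={statistics.stdev(rewards):.4f}")
```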

@mostafaelhoushi
Contributor

LGTM

@codecov-commenter

codecov-commenter commented Oct 14, 2021

Codecov Report

Merging #468 (6a5a020) into development (00ae8c0) will decrease coverage by 18.49%.
The diff coverage is 50.00%.


@@               Coverage Diff                @@
##           development     #468       +/-   ##
================================================
- Coverage        87.55%   69.05%   -18.50%     
================================================
  Files              110      110               
  Lines             6121     6123        +2     
================================================
- Hits              5359     4228     -1131     
- Misses             762     1895     +1133     
Impacted Files Coverage Δ
compiler_gym/util/shell_format.py 94.44% <50.00%> (-5.56%) ⬇️
compiler_gym/util/statistics.py 0.00% <0.00%> (-100.00%) ⬇️
compiler_gym/leaderboard/__init__.py 0.00% <0.00%> (-100.00%) ⬇️
compiler_gym/util/temporary_working_directory.py 0.00% <0.00%> (-100.00%) ⬇️
compiler_gym/leaderboard/llvm_instcount.py 0.00% <0.00%> (-92.79%) ⬇️
compiler_gym/util/minimize_trajectory.py 0.00% <0.00%> (-92.63%) ⬇️
compiler_gym/bin/validate.py 0.00% <0.00%> (-87.10%) ⬇️
compiler_gym/bin/manual_env.py 0.00% <0.00%> (-80.80%) ⬇️
compiler_gym/bin/service.py 0.00% <0.00%> (-76.28%) ⬇️
compiler_gym/random_search.py 17.83% <0.00%> (-73.52%) ⬇️
... and 39 more

Continue to review full report at Codecov.

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 00ae8c0...6a5a020. Read the comment docs.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 14, 2021
@ChrisCummins ChrisCummins merged commit 165fa83 into facebookresearch:development Oct 20, 2021
@ChrisCummins ChrisCummins deleted the examples-docs branch October 20, 2021 20:28