CHANGELOG

0.2.1 (October 05, 2020)

PyTorch Elastic

NOTE: As of torch-1.7 and torchelastic-0.2.1, torchelastic will be bundled into the main pytorch docker image. torchelastic/examples will be available after the torch-1.7 release, since its base image will now be pytorch/pytorch.

  • Torchelastic agent:

    • run_id available to workers as TORCHELASTIC_RUN_ID environment variable
    • Allow max_restarts=0
    • Worker exit barrier added to torchelastic agent to protect against variances in worker finish times
    • Improvements to error handling and propagation from torchelastic agent
    • Enable fault handlers on worker processes to get torch C++ stack traces
  • torchelastic.distributed.launch CLI:

    • New option --role to allow users to set worker role name
    • CLI options can now be set via environment variables (e.g. PET_NNODES="1:2")
  • Project:

    • Upgraded to Python 3.8
    • Tests moved to test directory within the respective modules
    • Adopted Pyre for static type checking
  • Deprecated:

  • Experimental:
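
To illustrate two of the 0.2.1 items above (the TORCHELASTIC_RUN_ID environment variable and CLI options set through PET_* environment variables), here is a minimal sketch of how a worker script might read these values. The helper names `get_run_id` and `parse_nnodes` are hypothetical and not part of torchelastic. The `MIN:MAX` reading of `PET_NNODES="1:2"` is an assumption based on the example value in this changelog.

```python
import os
from typing import Tuple


def get_run_id(default: str = "unknown") -> str:
    """Return the run id the agent exposes to workers, or a fallback.

    Hypothetical helper: it only reads the TORCHELASTIC_RUN_ID variable
    named in this changelog and falls back when it is not set.
    """
    return os.environ.get("TORCHELASTIC_RUN_ID", default)


def parse_nnodes(value: str) -> Tuple[int, int]:
    """Parse "N" or "MIN:MAX" into a (min_nodes, max_nodes) pair.

    Hypothetical helper for illustration only; this is not torchelastic's
    own parser, and the MIN:MAX interpretation is an assumption.
    """
    parts = value.split(":")
    if len(parts) == 1:
        n = int(parts[0])
        return n, n
    if len(parts) == 2:
        return int(parts[0]), int(parts[1])
    raise ValueError(f"expected N or MIN:MAX, got {value!r}")


if __name__ == "__main__":
    # Outside a torchelastic launch neither variable is set, so the
    # fallbacks ("unknown" and a single node) are used.
    print("run id:", get_run_id())
    print("nnodes:", parse_nnodes(os.environ.get("PET_NNODES", "1")))
```

Reading the run id through a helper with a fallback keeps the same script usable both under the agent and in a plain local run.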

0.2.0 (April 29, 2020)

PyTorch Elastic

  • Separated infrastructure-related work from the user script (see DesignDoc)
  • Events API

0.1.0rc1 (December 06, 2019)

PyTorch Elastic

  • First release of torchelastic, v0.1.0rc1 (experimental)