Skip to content

Actions: microsoft/DeepSpeed

nv-accelerate-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
5,541 workflow runs
5,541 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

DeepNVMe perf tuning
nv-accelerate-v100 #11309: Pull request #6560 synchronize by tjruwase
September 21, 2024 20:15 Queued olruwase/dnvme_docs
September 21, 2024 20:15 Queued
DeepNVMe perf tuning
nv-accelerate-v100 #11308: Pull request #6560 opened by tjruwase
September 21, 2024 18:29 1h 45m 48s olruwase/dnvme_docs
September 21, 2024 18:29 1h 45m 48s
Add APIs to offload states of model, optimizer, and engine
nv-accelerate-v100 #11307: Pull request #6011 synchronize by tohtana
September 21, 2024 01:14 Queued tohtana/offload_zero_buffers
September 21, 2024 01:14 Queued
Clean up prefetched parameters
nv-accelerate-v100 #11306: Pull request #6557 opened by tohtana
September 21, 2024 01:03 Queued tohtana/clean_up_prefetch_param
September 21, 2024 01:03 Queued
nv-accelerate-v100
nv-accelerate-v100 #11305: Scheduled
September 21, 2024 00:06 14h 37m 17s master
September 21, 2024 00:06 14h 37m 17s
add bfloat16 to inference support dtypes
nv-accelerate-v100 #11304: Pull request #6528 synchronize by tjruwase
September 20, 2024 21:18 9h 28m 32s nelyahu:add_bf16_to_inference_engine
September 20, 2024 21:18 9h 28m 32s
Fix training of pipeline based peft's lora model
nv-accelerate-v100 #11303: Pull request #5477 synchronize by tohtana
September 20, 2024 20:13 6h 3m 17s xuanhua:axu/fix-pipeline-with-lora
September 20, 2024 20:13 6h 3m 17s
nv-accelerate-v100
nv-accelerate-v100 #11301: Scheduled
September 20, 2024 00:06 12m 21s master
September 20, 2024 00:06 12m 21s
add option to disable logger while compiling to avoid graph breaks
nv-accelerate-v100 #11300: Pull request #6496 synchronize by ShellyNR
September 19, 2024 13:15 Action required ShellyNR:disable_logger_while_compiling
September 19, 2024 13:15 Action required
Rearrange inference OPS and stop using builder.load
nv-accelerate-v100 #11299: Pull request #5490 synchronize by oelayan7
September 19, 2024 06:46 15h 9m 55s oelayan7:rearrange_ops
September 19, 2024 06:46 15h 9m 55s
nv-accelerate-v100
nv-accelerate-v100 #11298: Scheduled
September 19, 2024 00:06 12h 35m 20s master
September 19, 2024 00:06 12h 35m 20s
Improve consistency of zero_grad
nv-accelerate-v100 #11297: Pull request #6554 synchronize by tohtana
September 18, 2024 21:56 10h 28m 35s tohtana/consistent_zero_grad
September 18, 2024 21:56 10h 28m 35s
Improve consistency of zero_grad
nv-accelerate-v100 #11296: Pull request #6554 synchronize by tohtana
September 18, 2024 21:04 51m 14s tohtana/consistent_zero_grad
September 18, 2024 21:04 51m 14s
Improve consistency of zero_grad
nv-accelerate-v100 #11295: Pull request #6554 synchronize by tohtana
September 18, 2024 20:59 5m 12s tohtana/consistent_zero_grad
September 18, 2024 20:59 5m 12s
Improve consistency of zero_grad
nv-accelerate-v100 #11294: Pull request #6554 synchronize by tohtana
September 18, 2024 20:55 4m 42s tohtana/consistent_zero_grad
September 18, 2024 20:55 4m 42s
Improve consistency of zero_grad
nv-accelerate-v100 #11293: Pull request #6554 opened by tohtana
September 18, 2024 20:27 27m 47s tohtana/consistent_zero_grad
September 18, 2024 20:27 27m 47s
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
nv-accelerate-v100 #11292: Pull request #6553 opened by gyou2021
September 18, 2024 12:25 Action required gyou2021:configurable_autoTP
September 18, 2024 12:25 Action required
Enabled Qwen2-MoE Tensor Parallelism (TP) inference
nv-accelerate-v100 #11291: Pull request #6551 opened by gyou2021
September 18, 2024 10:16 Action required gyou2021:qwen2-moe
September 18, 2024 10:16 Action required
Fix gradient accumulation for Z2+offload
nv-accelerate-v100 #11290: Pull request #6550 synchronize by tjruwase
September 18, 2024 09:57 22h 15m 21s tohtana:tohtana/fix_grad_acc_z2_offload
September 18, 2024 09:57 22h 15m 21s
Fix gradient accumulation for Z2+offload
nv-accelerate-v100 #11289: Pull request #6550 opened by tohtana
September 18, 2024 08:12 1h 45m 41s tohtana:tohtana/fix_grad_acc_z2_offload
September 18, 2024 08:12 1h 45m 41s
Rearrange inference OPS and stop using builder.load
nv-accelerate-v100 #11288: Pull request #5490 synchronize by oelayan7
September 18, 2024 07:25 17h 10m 40s oelayan7:rearrange_ops
September 18, 2024 07:25 17h 10m 40s
Rearrange inference OPS and stop using builder.load
nv-accelerate-v100 #11287: Pull request #5490 synchronize by oelayan7
September 18, 2024 07:04 20m 15s oelayan7:rearrange_ops
September 18, 2024 07:04 20m 15s
Fix expert grad scaling problem with ZeRO optimizer
nv-accelerate-v100 #11286: Pull request #6546 synchronize by wyooyw
September 18, 2024 07:01 4h 34m 41s wyooyw:fix_expert_weight_grad_with_zero
September 18, 2024 07:01 4h 34m 41s
Fix expert grad scaling problem with ZeRO optimizer
nv-accelerate-v100 #11285: Pull request #6546 synchronize by wyooyw
September 18, 2024 06:59 Action required wyooyw:fix_expert_weight_grad_with_zero
September 18, 2024 06:59 Action required