Fix TD3 DDPG Implementation: Move Sampling Inside Gradient Step Loop #144

alessandroassirelli98 · 2024-04-03T09:31:37Z

This pull request addresses a discrepancy between the algorithms described in the original TD3 and DDPG papers and the current implementation in the repository. Specifically, the current implementation performs the sampling step outside the gradient step loop, which diverges from the procedure outlined in the papers. We have corrected this by moving the sampling inside the gradient step loop, so that the implementation matches the algorithm as described in the original papers and in the SpinningUp description.
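To make the difference concrete, here is a minimal sketch of the two loop structures. It is only an illustration: `ReplayBuffer`, `update_sampling_outside`, `update_sampling_inside`, and the `gradient_step` callback are hypothetical names invented for this example and are not the repository's actual classes or methods.

```python
import random


class ReplayBuffer:
    """Toy replay buffer (hypothetical, not the repository's memory class)."""

    def __init__(self, transitions, seed=0):
        self.transitions = list(transitions)
        self.rng = random.Random(seed)
        self.sample_calls = 0  # counts how often a batch is drawn

    def sample(self, batch_size):
        self.sample_calls += 1
        return self.rng.sample(self.transitions, batch_size)


def update_sampling_outside(buffer, gradient_steps, batch_size, gradient_step):
    # Structure described as the discrepancy: the batch is drawn before
    # the loop, so every gradient step reuses the same batch.
    batch = buffer.sample(batch_size)
    for _ in range(gradient_steps):
        gradient_step(batch)


def update_sampling_inside(buffer, gradient_steps, batch_size, gradient_step):
    # Structure proposed here: a new batch is drawn for each gradient step.
    for _ in range(gradient_steps):
        batch = buffer.sample(batch_size)
        gradient_step(batch)


# Record the batches each variant passes to the (stand-in) gradient step.
seen_outside, seen_inside = [], []
buf_a = ReplayBuffer(range(1000))
buf_b = ReplayBuffer(range(1000))
update_sampling_outside(buf_a, 4, 8, seen_outside.append)
update_sampling_inside(buf_b, 4, 8, seen_inside.append)
```

With `gradient_steps=4`, the first variant calls `sample` once and the second calls it four times; the two only coincide when a single gradient step is performed per update.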

Toni-SM · 2024-04-15T01:00:13Z

Hi @alessandroassirelli98

Could you please update the PR target branch to the develop branch?
The main branch is updated from the develop branch only when a new release is made :)

alessandroassirelli98 · 2024-04-15T17:08:47Z

OK, I have created another pull request targeting the develop branch.

Toni-SM · 2024-04-16T14:34:01Z

Nice... This PR will be closed then in favor of #147

Fix paper and implementation mismatch

40f5bad

Toni-SM closed this Apr 16, 2024
