The total return (environment rewards, only used for evaluation):
The episode length and reward from discriminator:
The loss and accuracy of the discriminator:
Legend:
The total return (environment rewards, only used for evaluation):
The episode length and reward from discriminator:
The loss and accuracy of the discriminator:
Legend:
The total return (environment rewards, only used for evaluation):
The episode length and reward from discriminator:
The loss and accuracy of the discriminator:
Legend: