-
-
Notifications
You must be signed in to change notification settings - Fork 43
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Proposal] let Minari Dataset's attribute ref_min/max score be attribute of environment #141
Comments
@rodrigodelazcano What are your thoughts on this? |
Hello @im-Kitsch, we don't change/rewrite the environments as it is done in D4RL, so we cannot add this attributes as they did. I agree, tho, that it should be an attribute of the environment. Unfortunately, they don't even represent a theoretical max/min, so their meaning is a bit vague for me. I remember we had a discussion on officially supporting them, but we did for backward compatibility of D4RL datasets. What do you mean with "dictionary of minari package"? |
Hi, @younik In this case we could maintain a dictionary of min/max score of different environments. For unkown enviorment raises warning. The main purpose is to have a unified min/max metirc for same environment so that we could compare different algorithm based on different datasets. Currently we use the function Actually mostly different datasets' min/max score are same for same base enviornment. I think the min/max is theoretical worst score and best score, i.e. random policy/expert policy, (it's discussed here Farama-Foundation/D4RL#48 (comment)). So I would say it's better to replay Similary to d4RL, I think we could made a dictionary like D4RL, i.e. make a file like Another choice I think we could do is that to given function call So in conclusion, we remove the metadata of ref_min_score and ref_max_score of MinariDataset and maintain it independently of dataset. |
This can work, but what would you do if the dataset has no associated environment? |
good question, but anyway, Recall that the main purpose is to have an common normalizing evaluation across different dataset quality. This could be helpful in research for comparison across different env/dataset/algorithm. And currently there are two APIs for creating dataset: Lines 305 to 309 in dd8406e
Lines 457 to 460 in dd8406e
Also, looks currently |
They will not need an environment soon, see #137 |
Hi, hmm, I think it makes sense, so let's keep current implementation. |
Hi,
would it be better to let "ref_max_score" and "ref_min_score" be attribute of environment but not dataset? So that different datasets have a uniform normalization metric?
in d4RL implementation it also actually do so, for different dataset's corresponding environemnt, like "env-xxx-xxx", their ref_max_score and ref_min_score comes from same macro definition. https://github.com/Farama-Foundation/D4RL/blob/71a9549f2091accff93eeff68f1f3ab2c0e0a288/d4rl/gym_mujoco/__init__.py#L23-L31
So why not we directly make it as attribute of environemnt or dictionary of minari package?
The text was updated successfully, but these errors were encountered: