Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scenario gaming #1637

Merged
merged 9 commits into from
Feb 15, 2022
Merged

Scenario gaming #1637

merged 9 commits into from
Feb 15, 2022

Conversation

miguelgfierro
Copy link
Collaborator

@miguelgfierro miguelgfierro commented Feb 10, 2022

Description

Add scenario for gaming industry

Related Issues

Checklist:

  • I have followed the contribution guidelines and code style for this project.
  • I have added tests covering my contributions.
  • I have updated the documentation accordingly.
  • This PR is being made to staging branch and not to main branch.


### Next best action prediction

An interesting scenario is next best action prediction. In this scenario, what is recommended is the most beneficial next action for the player. From the technical point of view, this can be implemented using collaborative filtering algorithms, such as [SAR](../../examples/00_quick_start/sar_movielens.ipynb), [BPR](../../examples/02_model_collaborative_filtering/cornac_bpr_deep_dive.ipynb), and [NCF](../../examples/00_quick_start/ncf_movielens.ipynb).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Isn't this task closer to reinforcement learning rather than CF?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good point, there is a way to do next best action in RL, a lot of people using MCTS. This is done in AlphaGo, AlphaZero, etc. But in that case, the selection of the optimal next action is automatically taken. If we pose the problem as a reco system, where actions are treated like items in the typical CF setup, then the user can choose among a set of actions.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I meant something about the state i.e. in CF there is no state. But here is the current state of the game relevant for which next actions to recommend? "Next" implies that you have at least some info about what has happened before. With CF you would always recommend the same action, wouldn't you?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The way I see it is like a session based recommender, but with actions instead of items. The info that has happened before are the previous actions, the next recommended action is an option that the user chooses among a list of actions available.

miguelgfierro and others added 3 commits February 11, 2022 10:23
Co-authored-by: angusrtaylor <anta@microsoft.com>
Co-authored-by: angusrtaylor <anta@microsoft.com>
Co-authored-by: angusrtaylor <anta@microsoft.com>
scenarios/README.md Outdated Show resolved Hide resolved
@miguelgfierro
Copy link
Collaborator Author

I'll merge this now, we can iterate over the content if needed

@miguelgfierro miguelgfierro merged commit 41d928e into staging Feb 15, 2022
@miguelgfierro miguelgfierro deleted the scenario_gaming branch February 15, 2022 12:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants