Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add LLM sample for DirectML #1082

Merged
merged 57 commits into from
Apr 18, 2024
Merged

Add LLM sample for DirectML #1082

merged 57 commits into from
Apr 18, 2024

Conversation

PatriceVignola
Copy link
Contributor

@PatriceVignola PatriceVignola commented Apr 15, 2024

This adds an LLM sample for DirectML that can convert and quantize a bunch of LLMs from HuggingFace.

The Dolly, Phi and LLaMA 2 folders were removed and replaced with a more generic LLM example that supports a large number of LLMs, including but not limited to Phi-2, Mistral, LLaMA 2

PatriceVignola and others added 30 commits December 7, 2023 01:27
## Describe your changes
Added the option to convert and run Falcon-7b-instruct using the llm.py
script and the flag --model_type=falcon

## Checklist before requesting a review
- [ ] Add unit tests for this change.
- [ ] Make sure all tests can pass.
- [ ] Update documents if necessary.
- [ ] Lint and apply fixes to your code by running `lintrunner -a`
- [ ] Is this a user-facing change? If yes, give a description of this
change to be included in the release notes.

## (Optional) Issue link
jambayk
jambayk previously approved these changes Apr 15, 2024
@guotuofeng
Copy link
Collaborator

This adds an LLM sample for DirectML that can convert and quantize a bunch of LLMs from HuggingFace.

could we elaborate what's kinds of changes in this PR? for examples, what model is removed? what kinds of models are added?

@guotuofeng
Copy link
Collaborator

You can refer to https://github.com/microsoft/Olive/blob/main/examples/mistral/user_script.py#L54, which I remove some unused code before.

@guotuofeng guotuofeng merged commit d068cf9 into main Apr 18, 2024
35 checks passed
@guotuofeng guotuofeng deleted the user/pavignol/llm-combined-clean branch April 18, 2024 02:46
DavitGrigoryan132 pushed a commit to DavitGrigoryan132/Olive that referenced this pull request Aug 14, 2024
This adds an LLM sample for DirectML that can convert and quantize a
bunch of LLMs from HuggingFace.

The Dolly, Phi and LLaMA 2 folders were removed and replaced with a more
generic LLM example that supports a large number of LLMs, including but
not limited to Phi-2, Mistral, LLaMA 2

---------

Co-authored-by: aamajumder <150728138+aamajumder@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants