Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ORT 1.19.0 Release: Cherry-Pick Round 0 #21609

Merged
merged 2 commits into from
Aug 4, 2024

Conversation

prathikr
Copy link
Contributor

@prathikr prathikr commented Aug 3, 2024

Description

Critical changes required for an external developer (GeekBench)

Motivation and Context

ORT 1.19.0 Release Preparation

### Description
Masks off top 4-bits of INT4 weights, improving accuracy.



### Motivation and Context
This is a workaround as the QNN docs state masking is not required.
### Description
- Supports quantized Conv + Activation on the HTP backend:
- Translates `DQs -> Conv -> Relu/Clip -> Q` into a single QNN Conv
operator if the Relu (or Clip) are redundant.



### Motivation and Context
Expands support for QDQ models created with tools that do not wrap Relu
or Clip with QDQ nodes.

This PR introduces the `IQnnNodeGroup` class. In the same way that a
`NodeUnit` represents a collection of `Nodes`, a `IQnnNodeGroup` can
represent one or more `NodeUnits` that are translated into a QNN
operator. QNN EP parses the ONNX graph to create a list of
`IQnnNodeGroup` objects, each representing a single `NodeUnit` or a
fusion of multiple `NodeUnits`.
@prathikr prathikr changed the title Prathikrao/cherry pick round0 ORT 1.19.0 Release: Cherry-Pick Round 0 Aug 3, 2024
@jywu-msft jywu-msft merged commit ee2fe87 into rel-1.19.0 Aug 4, 2024
113 of 114 checks passed
@jywu-msft jywu-msft deleted the prathikrao/cherry-pick-round0 branch August 4, 2024 05:05
@MaanavD MaanavD added the release:1.19.0 Cherry pick to ORT 1.19 label Aug 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
release:1.19.0 Cherry pick to ORT 1.19
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants