[skip ci] ViT TTNN Tech Report #12800

mbahnasTT · 2024-09-18T01:23:04Z

Ticket

Link to Github Issue

Problem description

Provide context for the problem.

What's changed

Describe the approach used to solve the problem.
Summarize the changes made and their impact.

Checklist

Post commit CI passes
Blackhole Post commit (if applicable)
Model regression CI testing passes (if applicable)
Device performance regression CI testing passes (if applicable)
New/Existing tests provide coverage for changes

bbradelTT · 2024-09-18T17:38:09Z

tech_reports/ViT-TTNN/vit.md

+
+` seqL × head_count × head_size` 
+
+This step aggregates the outputs from the different heads into a single vector representation for each position in the sequence. The followin step is the Linear OP to calculate the self output, which is the output of the self multi-head attention module.


followin -> following
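
For readers following the quoted hunk, the head-concatenation step can be sketched in NumPy. This is only an illustration under assumptions, not the TT-NN code from the report: the input layout `(batch, head_count, seq_len, head_size)`, the small sizes, and the names `concat_heads`, `w_o`, and `b_o` are all hypothetical.

```python
import numpy as np

def concat_heads(context):
    """Merge per-head outputs into one vector per sequence position.

    Assumed input layout: (batch, head_count, seq_len, head_size).
    Returns: (batch, seq_len, head_count * head_size).
    """
    batch, head_count, seq_len, head_size = context.shape
    # Reorder to seqL x head_count x head_size per batch entry, then flatten the heads.
    return context.transpose(0, 2, 1, 3).reshape(batch, seq_len, head_count * head_size)

rng = np.random.default_rng(0)
batch, head_count, seq_len, head_size = 2, 3, 4, 5  # toy sizes, not ViT's
context = rng.standard_normal((batch, head_count, seq_len, head_size))

merged = concat_heads(context)

# The following Linear op (self output); weights here are random placeholders.
hidden = head_count * head_size
w_o = rng.standard_normal((hidden, hidden))
b_o = rng.standard_normal(hidden)
self_output = merged @ w_o + b_o
```

After the merge, columns `5:10` of each position hold the output of head 1, so the heads sit side by side in the hidden dimension before the Linear op mixes them.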

bbradelTT · 2024-09-18T17:39:16Z

tech_reports/ViT-TTNN/vit.md

+```
+
+#### 4.4.1 Q,K,V Generation using the Fused Linear OP
+The encoder input is matrix-mutiplied by the Q,K,V weights to generate the individual Query, Key, Value tensors. In the TT-NN implementation, the input is multipled by the pre-fused weights to generate the merged 3 tensors that will be split in a following step. The fused linear operation obective is to maximize the utilization by increasing the workload that is computed simultaneously on the Tensic core grid.


fix: multipled, obective
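
The fused Linear op described in the quoted hunk can be checked numerically with a small NumPy sketch. It assumes the simplest fusion layout, with the Q, K, V weight matrices concatenated along the output dimension; the actual TT-NN weight layout may differ (for example, interleaved per head), and all names and sizes below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
seq_len, hidden = 4, 6  # toy sizes, not ViT's

x = rng.standard_normal((seq_len, hidden))   # encoder input
w_q = rng.standard_normal((hidden, hidden))  # separate Q, K, V weights
w_k = rng.standard_normal((hidden, hidden))
w_v = rng.standard_normal((hidden, hidden))

# Pre-fuse the weights once (assumed layout: [Q | K | V] along the output axis).
w_qkv = np.concatenate([w_q, w_k, w_v], axis=1)

# One larger matmul replaces three smaller ones.
qkv = x @ w_qkv

# Split the merged tensor back into Query, Key, Value in a following step.
q, k, v = np.split(qkv, 3, axis=1)
```

The split results match the three separate projections, so fusion changes only how the work is scheduled: a single matmul with three times the output width gives the core grid more work to run at once.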

vshenoyTT and others added 13 commits September 13, 2024 21:38

#0: Add ViT Tech Report

27254c8

#0: Add ViT Tech Report

85cf695

#0: ViT Tech report

752173f

#0: ViT Tech report

758bdc5

#0: ViT Tech report

fb8f62a

#0: ViT Tech report

b45837a

#0: ViT Tech report

a1b4bd8

#0: ViT Tech report

9da9822

#0: ViT Tech report

bfc3683

#0: ViT Tech report

3f83763

#0: ViT Tech report

83422b6

#0: ViT tech report

c3f3cc6

#0: ViT tech report

77da226

mbahnasTT requested review from mywoodstock, davorchap, tt-aho, TT-BrianLiu and bbradelTT September 18, 2024 01:23

#0: ViT tech report

aa0e0a1

mbahnasTT force-pushed the mbahnas/vit_tech_report branch from 9552a04 to aa0e0a1 Compare September 18, 2024 03:25

#0: ViT tech report

666080c

mbahnasTT force-pushed the mbahnas/vit_tech_report branch from 3ba1089 to 666080c Compare September 18, 2024 17:05

#0: ViT tech report

1a1971a

bbradelTT reviewed Sep 18, 2024


mbahnasTT added 4 commits September 18, 2024 16:11

#0: ViT tech report

51d9b39

#0: ViT tech report

77c437d

#0: ViT tech report

b5bc95a

#0: edits

c37acc1

mbahnasTT force-pushed the mbahnas/vit_tech_report branch from 3a6811d to c37acc1 Compare September 22, 2024 20:18

mbahnasTT and others added 3 commits September 22, 2024 14:31

#0: edits

68f40c7

#0: edits

4d2e258

Merge branch 'main' into mbahnas/vit_tech_report

22e9337

mbahnasTT changed the title ~~ViT TTNN Tech Report~~ [skip ci] ViT TTNN Tech Report Sep 22, 2024

mbahnasTT merged commit 44eef17 into main Sep 22, 2024
6 checks passed

mbahnasTT deleted the mbahnas/vit_tech_report branch September 22, 2024 21:47
