
[WIP] Release code of MixFormer (CVPR2022, Oral) #1820

Open
wants to merge 9 commits into base: develop

Conversation

chensnathan

MixFormer: Mixing Features across Windows and Dimensions

Pre-trained models will be added in the next few days.

@paddle-bot-old

paddle-bot-old bot commented Apr 8, 2022

Thanks for your contribution!

@Seperendity

Seperendity commented May 20, 2022

Hello, I'd like to ask: at line 229 of ppcls/arch/backbone/model_zoo/mixformer.py, how is `v = v * x_cnn2v` computed? The last two dimensions of the two operands are (1, C // self.num_heads) and (N, C // self.num_heads) respectively; for a matrix product, wouldn't the column dimension of one fail to match the row dimension of the other?

@chensnathan
Author

@Seperendity Hi, this is an elementwise multiplication achieved through broadcasting.
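For readers unfamiliar with broadcasting, here is a minimal NumPy sketch (with made-up small sizes; `head_dim` stands in for C // self.num_heads) showing how a (1, head_dim) tensor multiplies an (N, head_dim) tensor elementwise, the weight row being repeated along the token axis:

```python
import numpy as np

# Hypothetical small sizes for illustration only.
N, head_dim = 4, 8  # head_dim = C // num_heads

x_cnn2v = np.random.rand(1, head_dim)  # shape (1, head_dim)
v = np.random.rand(N, head_dim)        # shape (N, head_dim)

# Elementwise product: x_cnn2v is broadcast along axis 0 to (N, head_dim).
out = v * x_cnn2v
assert out.shape == (N, head_dim)
```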

@Seperendity

@chensnathan Thank you very much for the explanation! I understand now that broadcasting is used. But I'm still unsure why, after broadcasting, the weights end up multiplied along the num_windows and token-count dimensions; from the paper I had assumed the weights are applied to the channel dimension. In the code: `x_cnn2v = torch.sigmoid(channel_interaction).reshape([-1, 1, self.num_heads, 1, C // self.num_heads])` and `v = v.reshape([x_cnn2v.shape[0], -1, self.num_heads, N, C // self.num_heads])`. What is the reason for multiplying this way? Intuitively it does not seem to assign the learned weights to the channel dimension. I would greatly appreciate your explanation.

@chensnathan
Author

chensnathan commented May 22, 2022

@Seperendity Hi, this is done to match the shape of v. An example may help: suppose v has shape [B, C, H, W] and x_cnn2v has shape [B, C, 1, 1]; then `v = v * x_cnn2v` is a simple channel attention. However, at line 223 of the code, because window-based self-attention comes next, v has shape [B*(H/win)*(W/win), win*win, num_heads, C/num_heads] while x_cnn2v still has shape [B, C, 1, 1], so channel attention cannot be applied directly. There are different ways to implement it:

  1. You could reshape v back to [B, C, H, W], apply channel attention there, then reshape it back to [B*(H/win)*(W/win), win*win, num_heads, C/num_heads] before entering the following self-attention.
  2. What I chose instead is to reshape v from [B*(H/win)*(W/win), win*win, num_heads, C/num_heads] to [B, (H/win)*(W/win), win*win, num_heads, C/num_heads] and x_cnn2v to [B, 1, 1, num_heads, C/num_heads], multiply them, then reshape back to [B*(H/win)*(W/win), win*win, num_heads, C/num_heads] before entering the following self-attention.
     The two are essentially the same.
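The equivalence of the two options above can be checked with a small NumPy sketch. The sizes are illustrative, and the window un-partition assumes a row-major window tiling; this is an assumption for the sketch, not an exact reproduction of the repository code:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: B batches, an H x W feature map split into win x win
# windows, num_heads heads of head_dim channels each (C = num_heads * head_dim).
B, H, W, win, num_heads, head_dim = 2, 4, 4, 2, 2, 3
C = num_heads * head_dim
n_win = (H // win) * (W // win)
N = win * win

# v after window partition: [B*n_win, N, num_heads, head_dim]
v = rng.random((B * n_win, N, num_heads, head_dim))
# Channel-attention weights from the conv branch: [B, C, 1, 1]
w = rng.random((B, C, 1, 1))

# Option 2 (the shape trick from the comment): reshape both so they broadcast.
v5 = v.reshape(B, n_win, N, num_heads, head_dim)
w5 = w.reshape(B, 1, 1, num_heads, head_dim)
out2 = (v5 * w5).reshape(B * n_win, N, num_heads, head_dim)

# Option 1: undo the window partition (row-major tiling assumed), apply plain
# channel attention on [B, C, H, W], then re-partition into windows.
v_img = (v.reshape(B, H // win, W // win, win, win, C)
          .transpose(0, 5, 1, 3, 2, 4)
          .reshape(B, C, H, W))
out1_img = v_img * w  # [B, C, H, W] * [B, C, 1, 1] -> channel attention
out1 = (out1_img.reshape(B, C, H // win, win, W // win, win)
                .transpose(0, 2, 4, 3, 5, 1)
                .reshape(B * n_win, N, num_heads, head_dim))

assert np.allclose(out1, out2)  # both routes weight channels identically
```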

@Seperendity

@chensnathan I see what you mean now. Thank you very much for the patient explanation! Very interesting work.

@cxz1276316542

MixFormer: Mixing Features across Windows and Dimensions

Pre-trained models will be added in the next few days.

Hello, have the pre-trained models been released yet? Where can they be downloaded?

This was referenced Sep 23, 2022
docs/en/models/MixFormer_en.md
docs/zh_CN/models/ImageNet1k/MixFormer.md
ppcls/arch/backbone/__init__.py
@CLAassistant

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.

6 participants