Skip to content

Comments

support squeezeformer model causal convolution and new versition Multi-head Attention#87

Open
NLPvv wants to merge 7 commits intothu-spmi:masterfrom
NLPvv:master
Open

support squeezeformer model causal convolution and new versition Multi-head Attention#87
NLPvv wants to merge 7 commits intothu-spmi:masterfrom
NLPvv:master

Conversation

@NLPvv
Copy link
Contributor

@NLPvv NLPvv commented May 29, 2024

  • support squeezeformer model link succeed to CAT
  • squeezeformer: squeezeformer implementation, please refer paper

@NLPvv
Copy link
Contributor Author

NLPvv commented Jul 12, 2024

  • support Causal Convolution
  • support New versition Multi-head Attention

RTF Comparison:

exp RTF (FP32) RTF (ORT C++ FP32) RTF (ORT C++ INT8)
atten-ori 0.078 0.049 0.043
atten-new 0.062 0.033 0.019

@NLPvv NLPvv changed the title support squeezeforme model and related experiments for aishell support squeezeformer model causal convolution and new versition Multi-head Attention Jul 16, 2024
@OU-Zhijian
Copy link
Contributor

see above.

@OU-Zhijian
Copy link
Contributor

需要把 squeezeformer 引入后的实验结果,报告一下

MultiHeadedSelfAttentionModule 修改需要更多的说明,

RTF Comparison:
exp RTF (FP32) RTF (ORT C++ FP32) RTF (ORT C++ INT8)
atten-ori 0.078 0.049 0.043
atten-new 0.062 0.033 0.019
这张表,需要贴到readme.md,并补充实验条件。否则不知道,什么条件下的实验。

@thu-spmi
Copy link
Owner

Add squeezeformer.md to https://github.com/thu-spmi/CAT/tree/master/docs
to document the new method and experiment result

Add one line to whatsnew.md, which links to squeezeformer.md
https://github.com/thu-spmi/CAT/blob/master/docs/whatsnew.md

OZJ

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants