add flash_attention on model chatglm_v2 #9296
Conversation
Thanks for your contribution!

huxinye seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account. You have signed the CLA already but the status is still pending? Let us recheck it.
Codecov Report
Attention: Patch coverage is …

Additional details and impacted files

@@            Coverage Diff            @@
##           develop    #9296    +/-   ##
===========================================
- Coverage    53.11%   52.92%   -0.19%
===========================================
  Files          665      660       -5
  Lines       109041   106857    -2184
===========================================
- Hits         57918    56555    -1363
+ Misses       51123    50302     -821

☔ View full report in Codecov by Sentry.
        self.hidden_size_per_attention_head,
    ]
)
This reshape logic should not be added here: it breaks the original non-FA2 code path, and the sequence_parallel support further down performs the reshape again anyway. (A hedged sketch of the intended pattern follows this exchange.)
Modified as requested.
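For readers skimming the thread, here is a minimal sketch of the pattern the reviewer is asking for: the reshape that the FA2 kernel needs is done only inside the `use_flash_attention` branch, so the original non-FA2 logic (and the later sequence_parallel reshape) stays untouched. The function and variable names are hypothetical stand-ins for those in the chatglm_v2 attention module, not the PR's actual code.

```python
import paddle.nn.functional as F

def core_attention(query_layer, key_layer, value_layer, config,
                   num_heads, hidden_size_per_attention_head):
    # Hypothetical sketch; inputs are assumed to arrive as
    # [batch, seq_len, num_heads * head_dim].
    if config.use_flash_attention:
        # Reshape to [batch, seq_len, num_heads, head_dim] only on the FA2
        # path (0 keeps the corresponding input dimension in Paddle reshape).
        q = query_layer.reshape([0, 0, num_heads, hidden_size_per_attention_head])
        k = key_layer.reshape([0, 0, num_heads, hidden_size_per_attention_head])
        v = value_layer.reshape([0, 0, num_heads, hidden_size_per_attention_head])
        # Paddle dispatches this to a flash attention kernel when available.
        out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
        # Merge heads back before the output projection.
        return out.reshape([0, 0, num_heads * hidden_size_per_attention_head])
    # Original non-FA2 attention continues here, unchanged (elided).
    ...
```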
)
version_check = False
if self.config.use_flash_attention and version_check:
    attention_mask = attention_mask
The QKV reshape can go under `if config.use_flash_attention`, and sequence parallel also needs to be taken into account. (See the sketch after this exchange.)
Modified as requested.
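To illustrate the second point, here is a sketch of gating the QKV reshape on `config.use_flash_attention` while keeping `sequence_parallel` in mind. The helper name, the config attribute, and the assumed activation layouts are illustrative assumptions, not PaddleNLP's actual implementation.

```python
import paddle

def split_heads_for_fa2(hidden: paddle.Tensor, config,
                        num_heads: int, head_dim: int) -> paddle.Tensor:
    """Hypothetical helper: reshape a projected q/k/v tensor for the flash
    attention kernel only when the FA2 path is active."""
    if not config.use_flash_attention:
        # The non-FA2 path keeps its original layout untouched.
        return hidden
    if getattr(config, "sequence_parallel", False):
        # Under sequence parallel the activations are assumed to arrive
        # flattened as [seq_chunk * batch, hidden], so batch/seq cannot be
        # kept with 0-placeholders; fold everything except the head dims
        # into one leading axis and restore it after the sequence is gathered.
        return hidden.reshape([-1, num_heads, head_dim])
    # Plain layout: [batch, seq_len, hidden] -> [batch, seq_len, heads, head_dim].
    return hidden.reshape([0, 0, num_heads, head_dim])
```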
lugimzzz left a comment
LGTM
PR types
others
PR changes
models
Description
Add flash_attention support to the chatglm_v2 model.
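As a usage note (not part of the PR diff), the new path would presumably be switched on through the model config. The checkpoint name, the `dtype` choice, and the exact loading call below follow PaddleNLP's usual AutoModel pattern and are assumptions rather than instructions from this PR.

```python
from paddlenlp.transformers import AutoConfig, AutoModelForCausalLM

model_name = "THUDM/chatglm2-6b"  # assumed ChatGLMv2 checkpoint identifier

config = AutoConfig.from_pretrained(model_name)
config.use_flash_attention = True  # take the flash attention path discussed above

# Flash attention kernels generally require half precision on GPU.
model = AutoModelForCausalLM.from_pretrained(model_name, config=config, dtype="float16")
```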