-
Notifications
You must be signed in to change notification settings - Fork 59
Transformers version 4.55 upgrade, Update PyTorch to 2.7.0+cpu, Torchvision to 0.22.0+cpu, and Python Requirement to >=3.9 #542
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
d36c124
to
a514d36
Compare
e15d548
to
3643fee
Compare
69ec2a4
to
6ad267b
Compare
dd8b38e
to
940dfcf
Compare
940dfcf
to
4f44dd4
Compare
Signed-off-by: Mamta Singh <mamtsing@qti.qualcomm.com>
4f44dd4
to
d8cf0a1
Compare
# Apply the attention mask | ||
attn_weights = torch.where(attention_mask, mask_value, attn_weights) | ||
|
||
attn_weights = attn_weights / self.scale_attn |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why it has been moved from line 51?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It was made equivalent to new TF code; they have moved it down since its placement whether at line 50 or 58, won't affect the performance. Should I move it back to 50?
|
||
EXTERNAL_MODELS = { | ||
"hpcai-tech/grok-1", | ||
"hpcai-tech/grok-1": { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: Do we need this?
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
f4ade46
to
8217cb5
Compare
Signed-off-by: Mamta Singh <mamtsing@qti.qualcomm.com>
8217cb5
to
7bf2298
Compare
Update Transformers to 4.55.0
Update PyTorch to 2.7.0+cpu
Torchvision to 0.22.0+cpu
and Python Requirement to >=3.9
Updated modeling files and Cache Utils for transformers 4.55.0
Updated models :