Conversation

@quic-mamta quic-mamta commented Aug 19, 2025

Update Transformers to 4.55.0
Update PyTorch to 2.7.0+cpu
Update Torchvision to 0.22.0+cpu
Update the Python requirement to >=3.9

Updated modeling files and Cache Utils for transformers 4.55.0

Updated models:

  1. codegen
  2. falcon
  3. gemma
  4. gemma2
  5. gptj
  6. gpt2
  7. granite
  8. granite_moe
  9. grok1
  10. llama
  11. llama_swiftkv
  12. mistral
  13. mixtral_moe
  14. mpt
  15. phi
  16. phi3
  17. qwen2
  18. starcoder2
  19. gpt_bigcode
  20. internvl
  21. llava
  22. llava_next
  23. whisper
  24. gemma3
  25. llama4
  26. mllama
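
As a quick sanity check for the version bumps described above, a minimal sketch along these lines (a plain Python script, assuming a standard environment; the pins come straight from this PR description) can confirm that the installed packages match the new requirements:

import sys

import torch
import torchvision
import transformers

# Expected pins from this PR: Transformers 4.55.0, PyTorch 2.7.0+cpu,
# Torchvision 0.22.0+cpu, and Python >= 3.9.
assert sys.version_info >= (3, 9), f"Python >= 3.9 required, found {sys.version}"
assert transformers.__version__ == "4.55.0", transformers.__version__
assert torch.__version__.startswith("2.7.0"), torch.__version__
assert torchvision.__version__.startswith("0.22.0"), torchvision.__version__

print("Environment matches the upgraded requirements.")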

@quic-mamta quic-mamta changed the title Tf version 4.55 upgrade Transformers version 4.55 upgrade Aug 19, 2025
@quic-mamta quic-mamta marked this pull request as draft August 19, 2025 19:42
@asmigosw asmigosw force-pushed the TF_version_4.55_upgrade branch from d36c124 to a514d36 Compare September 2, 2025 08:29
@quic-mamta quic-mamta force-pushed the TF_version_4.55_upgrade branch 2 times, most recently from e15d548 to 3643fee Compare September 23, 2025 08:45
@quic-mamta quic-mamta marked this pull request as ready for review September 24, 2025 05:27
@quic-mamta quic-mamta requested a review from vbaddi September 24, 2025 05:27
@quic-mamta quic-mamta marked this pull request as draft September 24, 2025 19:31
@quic-mamta quic-mamta force-pushed the TF_version_4.55_upgrade branch from 69ec2a4 to 6ad267b Compare September 25, 2025 07:46
@quic-mamta quic-mamta changed the title Transformers version 4.55 upgrade Transformers version 4.55 upgrade, Update PyTorch to 2.7.0+cpu, Torchvision to 0.22.0+cpu, and Python Requirement to >=3.9 Sep 25, 2025
@quic-mamta quic-mamta force-pushed the TF_version_4.55_upgrade branch 3 times, most recently from dd8b38e to 940dfcf Compare September 26, 2025 11:44
@quic-mamta quic-mamta marked this pull request as ready for review September 26, 2025 11:44
@quic-mamta quic-mamta force-pushed the TF_version_4.55_upgrade branch from 940dfcf to 4f44dd4 Compare September 26, 2025 19:03
Signed-off-by: Mamta Singh <mamtsing@qti.qualcomm.com>
@quic-mamta quic-mamta force-pushed the TF_version_4.55_upgrade branch from 4f44dd4 to d8cf0a1 Compare September 28, 2025 13:10
# Apply the attention mask
attn_weights = torch.where(attention_mask, mask_value, attn_weights)

attn_weights = attn_weights / self.scale_attn

Why has it been moved from line 51?

It was made equivalent to the new TF code; they have moved it down, since its placement (whether at line 50 or 58) won't affect performance. Should I move it back to line 50?
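
For context, here is a small runnable sketch of the pattern in question. The names attention_mask, mask_value, and scale_attn mirror the snippet above, while the shapes and values are purely illustrative; in this convention the mask is True at positions to suppress:

import torch

# Illustrative shapes: (batch, heads, query_len, key_len)
attn_weights = torch.randn(1, 2, 4, 4)
# Boolean mask: True marks positions to suppress (e.g. future tokens in a causal mask).
attention_mask = torch.triu(torch.ones(4, 4, dtype=torch.bool), diagonal=1)
# Large negative fill so masked logits vanish after softmax.
mask_value = torch.full([], torch.finfo(attn_weights.dtype).min)
scale_attn = torch.full([], 8.0)  # stand-in for self.scale_attn

# Same ordering as the modeling code: apply the mask, then scale.
attn_weights = torch.where(attention_mask, mask_value, attn_weights)
attn_weights = attn_weights / scale_attn

probs = torch.softmax(attn_weights, dim=-1)
print(probs[0, 0])  # masked (upper-triangular) entries get ~zero probability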


EXTERNAL_MODELS = {
"hpcai-tech/grok-1",
"hpcai-tech/grok-1": {

nit: Do we need this?
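
For reference, the diff above appears to change EXTERNAL_MODELS from a plain set of model IDs to a dict keyed by model ID. A hypothetical sketch of the two shapes follows; the per-model value below is an illustrative assumption, not taken from this PR:

# Before: a set of model IDs treated as external.
EXTERNAL_MODELS = {
    "hpcai-tech/grok-1",
}

# After (shape implied by the diff): a dict keyed by model ID so each entry can
# carry extra metadata. The value contents here are purely illustrative.
EXTERNAL_MODELS = {
    "hpcai-tech/grok-1": {
        "revision": "main",  # hypothetical field, not from the PR
    },
}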

Signed-off-by: Asmita Goswami <asmigosw@qti.qualcomm.com>
@quic-mamta quic-mamta force-pushed the TF_version_4.55_upgrade branch 2 times, most recently from f4ade46 to 8217cb5 Compare October 8, 2025 10:48
Signed-off-by: Mamta Singh <mamtsing@qti.qualcomm.com>
@quic-mamta quic-mamta force-pushed the TF_version_4.55_upgrade branch from 8217cb5 to 7bf2298 Compare October 8, 2025 10:49