
Conversation

@GAD-cell commented Sep 2, 2025

Small patch to support LFM2 with vLLM.
Since LFM2 doesn’t support prefix caching with vLLM, I had to add enable_prefix_caching to both VLLMModelConfig and VLLMModel._create_auto_model to make it work.
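
For context, a minimal sketch of what such a change could look like, assuming a dataclass-style VLLMModelConfig and a _create_auto_model that forwards engine arguments to vLLM's LLM constructor (the real lighteval classes differ in detail):

```python
# Hypothetical sketch only; everything beyond enable_prefix_caching is a
# simplified stand-in for lighteval's actual VLLMModelConfig / VLLMModel.
from dataclasses import dataclass

from vllm import LLM


@dataclass
class VLLMModelConfig:
    model_name: str
    # Exposed so models such as LFM2, which do not support prefix caching
    # in vLLM, can turn it off explicitly.
    enable_prefix_caching: bool = True


class VLLMModel:
    def _create_auto_model(self, config: VLLMModelConfig) -> LLM:
        # Forward the flag to vLLM's engine arguments.
        return LLM(
            model=config.model_name,
            enable_prefix_caching=config.enable_prefix_caching,
        )
```

With that in place, an LFM2 checkpoint could be loaded with something like `VLLMModelConfig(model_name="LiquidAI/LFM2-1.2B", enable_prefix_caching=False)` (model id given only as an example).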

@HuggingFaceDocBuilderDev (Collaborator)

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@GAD-cell (Author) commented Sep 8, 2025

@NathanHB Seems like a CUDA compilation error, do you have any clue why?

@NathanHB (Member) commented Sep 9, 2025

Hmm, not sure why. I launched it again, but if it does not work, can you try setting the default value of enable_prefix_caching to None?

@GAD-cell (Author)

> Hmm, not sure why. I launched it again, but if it does not work, can you try setting the default value of enable_prefix_caching to None?

OK, I've changed the default value to None.
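
For reference, the suggested default change would look roughly like this (same sketch as above, only the field's default differs):

```python
from dataclasses import dataclass


@dataclass
class VLLMModelConfig:
    model_name: str
    # None defers to vLLM's own default; pass False explicitly for LFM2.
    enable_prefix_caching: bool | None = None
```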
