Pickle error when downloading DeepSeek model #38476

Open
andrewsykim opened this issue May 29, 2025 · 2 comments

@andrewsykim

System Info

I'm consistently running into this pickle error when trying to run DeepSeek R1 models (both R1 and R1-0528):

Can't pickle <class 'transformers_modules.deepseek-ai.DeepSeek-R1-0528.4236a6af538feda4548eca9ab308586007567f52.configuration_deepseek.DeepseekV3Config'>: it's not the same object as transformers_modules.deepseek-ai.DeepSeek-R1-0528.4236a6af538feda4548eca9ab308586007567f52.configuration_deepseek.DeepseekV3Config
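
For context, this class of message is what Python's pickle raises when the class being pickled is no longer the object found under the same module path and name. The sketch below shows the general mechanism only and is not a reproduction of the vLLM or transformers setup: `fake_dynamic_module`, `Config`, and `load_module` are hypothetical stand-ins, and it assumes the module is loaded twice so that an older class object is still held somewhere.

```python
import pickle
import sys
import types

# Hypothetical stand-in for a dynamically loaded module; not a real package.
MODULE_NAME = "fake_dynamic_module"


def load_module():
    """Build a fresh module object defining Config and register it in sys.modules."""
    module = types.ModuleType(MODULE_NAME)
    exec("class Config:\n    pass\n", module.__dict__)
    sys.modules[MODULE_NAME] = module
    return module


# First load: keep a reference to the class, as a long-lived object might.
stale_class = load_module().Config

# Second load: sys.modules now points at a new module with a new Config class.
fresh_class = load_module().Config

# pickle looks the class up again by module and name, finds a different object,
# and refuses to serialize the stale one.
try:
    pickle.dumps(stale_class)
    error_message = None
except pickle.PicklingError as exc:
    error_message = str(exc)

print(error_message)
```

If something similar is happening here, the two `DeepseekV3Config` objects would come from separate loads of `configuration_deepseek.py`, but that is an assumption until there is a reproduction.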

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

Deploy DeepSeek R1 using vLLM

Expected behavior

No pickle error

@andrewsykim
Author

I suspect something changed between the configuration_deepseek.py file uploaded to Hugging Face and the one in the repo.

@Rocketknight1
Member

Hi @andrewsykim, can you give us some code that reproduces the issue?
