Skip to content

Support memory stat querying API with SPMD #9022

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
lsy323 opened this issue Apr 22, 2025 · 3 comments
Open

Support memory stat querying API with SPMD #9022

lsy323 opened this issue Apr 22, 2025 · 3 comments
Assignees
Labels
distributed SPMD and other distributed things. enhancement New feature or request triaged This issue has been reviewed by the triage team and the appropriate priority assigned.

Comments

@lsy323
Copy link
Collaborator

lsy323 commented Apr 22, 2025

🚀 Feature

Support xm.get_memory_info with SPMD, currently, it will hit the assertion in https://github.com/pytorch/xla/blob/master/torch_xla/csrc/runtime/pjrt_computation_client.cc#L1005-L1006

@lsy323 lsy323 added the distributed SPMD and other distributed things. label Apr 22, 2025
@miladm
Copy link
Collaborator

miladm commented Apr 22, 2025

thanks @lsy323 - are you planning to own this bug or should someone else pick it up?

cc @pgmoka for viz

@ysiraichi ysiraichi added the enhancement New feature or request label Apr 23, 2025
@lsy323
Copy link
Collaborator Author

lsy323 commented May 9, 2025

@miladm I don't have plan to work on this yet, it'd be great if someone else can pick this up, can be a good ramp up task.

@miladm miladm assigned miladm and haifeng-jin and unassigned miladm May 9, 2025
@miladm miladm added the triaged This issue has been reviewed by the triage team and the appropriate priority assigned. label May 9, 2025
@miladm
Copy link
Collaborator

miladm commented May 9, 2025

thanks for the bug - cc @haifeng-jin to assist

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
distributed SPMD and other distributed things. enhancement New feature or request triaged This issue has been reviewed by the triage team and the appropriate priority assigned.
Projects
None yet
Development

No branches or pull requests

4 participants