Open
Description
Currently AReaL has too many levels of wrappers, which prevents potential users and contributors to read and understand the code.
We aim to clean up the code in two different levels:
- the experiment launch procedure
- the interaction between master and model workers
We don't have a concrete plan yet. I open this issue to share the current code structure and would like to discuss with the community how to do the refactor.
Readers can also check this amazing deepwiki page to understand the code structure.
Old documentation for ReaLHF: https://openpsi-project.github.io/ReaLHF/
The current launch procedure
The interaction between master and model workers
todo
Metadata
Metadata
Assignees
Labels
No labels