Open-source Project with 100+ Hand-drawn Original Algorithm Diagrams: Visualizing the Obscure Abbreviations of LLM Training and Reinforcement Learning
Can't connect the dots when looking at strings of abbreviations like SFT and RLHF in LLM training? This 4.3k-star open-source project uses 100+ hand-drawn original algorithm diagrams to visualize the entire workflow of large language models and reinforcement learning from pre-training to alignment. It's available in both Chinese and English under the MIT license, and also provides editable vector graphics. Perfect for AI practitioners and students.
Many people who work with LLM training or reinforcement learning struggle to piece together a complete process from text alone when faced with abbreviations like SFT, RLHF, DPO, and GRPO. Scattered bits of knowledge end up like loose building blocks that never form a complete system.
A GitHub project called LLM-RL-Visualized solves this problem perfectly. Boasting 4.3k stars so far, it is maintained by Yu Changye, author of *Large Language Model Algorithms*. The project is available in both Chinese and English under the MIT license. Its core content is more than 100 original hand-drawn algorithm diagrams by the author, which clearly lay out the full logic of large language models and reinforcement learning from pre-training to alignment.
Core features of the project:
- Full process diagramming: All diagrams are hand-drawn following the logic of knowledge, from basic LLM architecture and complete training workflows to core reinforcement learning algorithms. Any logic points that are hard to untangle with text can be understood at a glance with the diagrams;
- Coverage of mainstream training and alignment methods: Common industry alignment methods including SFT, DPO, RLHF, and GRPO are all broken down with dedicated diagrams, no need to search through scattered technical blogs anymore;
- SVG vector graphic format: All diagrams are available in SVG version, which can be infinitely zoomed without blurring, and you can directly select text in the diagrams for note-taking or secondary citation;
- Specialized reinforcement learning content: There are more than 50 detailed diagrams dedicated to reinforcement learning, plus coverage of extended topics including inference optimization, MCTS, knowledge distillation, Constitutional AI, and more.
Using the project is very simple: just clone the repository to your local machine, and the experience is best when viewed alongside the documentation. Diagrams are provided in both PNG and SVG formats: PNG is for direct preview, while SVG is suitable for secondary processing when making courseware or technical presentations. The project's GitHub URL: https://github.com/changyeyu/LLM-RL-Visualized
Below are some screenshots of the project:




This project is particularly suitable for three groups of people: first, practitioners who are researching LLM training, reinforcement learning theory or model alignment, it can help you quickly organize your knowledge system; second, AI beginners and students, you no longer have to feel overwhelmed by full screens of formulas and abbreviations; third, people who need to prepare technical presentations or courseware, the vector graphics can be used directly, saving you a lot of time.
发布时间: 2026-05-22 22:52