sequence-parallelism
Here are 6 public repositories matching this topic...
Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
-
Updated
Sep 24, 2025 - Python
InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.
-
Updated
Aug 21, 2025 - Python
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
-
Updated
Dec 14, 2023 - Python
Fast and easy distributed model training examples.
-
Updated
Nov 26, 2024 - Python
Minimal tensor, sequence, and pipeline parallelism for LLaMA.
-
Updated
May 14, 2026 - Python
Implements sequence parallelism for Transformer layers using MPI4Python and multi-GPU acceleration, exploring scalability trade-offs and communication bottlenecks in distributed attention.
-
Updated
Dec 11, 2025 - Python
Improve this page
Add a description, image, and links to the sequence-parallelism topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the sequence-parallelism topic, visit your repo's landing page and select "manage topics."