Popular repositories Loading
-
TransformerCompression
TransformerCompression PublicForked from microsoft/TransformerCompression
For releasing code related to compression methods for transformers, accompanying our publications
Python
-
-
Megatron-LM
Megatron-LM PublicForked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Python
-
TransformerEngine
TransformerEngine PublicForked from NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
Python
-
DeepEP
DeepEP PublicForked from deepseek-ai/DeepEP
DeepEP: an efficient expert-parallel communication library
Cuda
-
cutlass
cutlass PublicForked from NVIDIA/cutlass
CUDA Templates and Python DSLs for High-Performance Linear Algebra
C++
If the problem persists, check the GitHub status page or contact support.