nvlink

GPU-native agent-swarm orchestration for the NVIDIA AI stack — NeMo, NIM, Triton, DCGM, NGC, NIXL, OpenShell. Spawn GPU-pinned agent teams across DGX/HGX nodes with NVLink-aware scheduling, task DAGs, adaptive scheduling, and full observability.

python cli nim hpc gpu slurm orchestration nvidia hyperparameter-optimization triton nemo ai-agents mlops dgx nvlink llm agent-swarm dcgm

Updated Mar 28, 2026
Python

manishklach / gb300-rl-runtime

Star

Close-to-metal C/CUDA lab for RL inference fast paths: persistent GPU workers, hugepage KV arenas, cacheline-aware command rings, and async reward handoff. Goal: remove page faults, malloc/free, scheduler wakeups, CPU round-trips, and KV migration from the per-token path.

reinforcement-learning hpc cuda lock-free spsc-queue nvlink gpu-inference gb300 ai-infrastructure close-to-metal

Updated Jun 1, 2026
C

josepselga / NVIDIA-FabricManager-partition-tool

Star

C++ command-line tool for managing NVIDIA Fabric Manager partitions. Supports non-interactive mode and advanced partition operations.

gpu data-center nvidia nvlink fabric-manager nvswitch

Updated Oct 23, 2025
C++

momentics / NeuralTower

Star

Open hardware desktop AI node: 4× Tesla V100, 128GB HBM2, PCIe/NVLink topology and V-Core liquid/air cooling.

Updated Jun 3, 2026
Python

JuhoArtturiHemminki / V-AXION-512

Star

V-AXION-512: Dual-Tier Post-Entropic Framework. I. PROTOCOL: SR-512, GS-512 & G-STORM-512 for O(1) deterministic state recovery. II. ECOSYSTEM: NEPTUNE-PHX, PHX-BUSLINK, KALMAN-ANCHOR, PHX-GENESIS, DIRECT-FABRIC & AETERNA-FLUX for Sigma-H energy harvesting. — Juho Artturi Hemminki (2026)

Updated Apr 1, 2026
SystemVerilog

framsouza / inference-at-scale-on-kubernetes

Star

What to consider when running AI Inference at scale on Kubernetes

kubernetes ai gpu inference nvidia decode prefill nvlink kv-cache pagedattention

Updated May 21, 2026

milyas2001 / cortex-gpu-scheduler

Star

CORTEX - Hardware-Aware AI Workload Scheduler

kubernetes ai dpdk gpu scheduler nvlink

Updated Feb 17, 2026
Rust

SiliconLanguage / model-explorer-open-llm

Star

A hybrid testbed for evaluating top open-source LLMs (like gpt-oss-20b and Llama 3.3) on local, cloud GPUs, and AWS Inferentia2/Trainium instances, focusing on vLLM optimization, capacity management, kernel bypass, hardware-software co-design, as well as supporting infrastructure such as NCCL, RDMA, NVMeoF.

aws gpu rdma nvme kernel-bypass nccl gpudirect nvlink nvmeof llm vllm trainium vllm-serve inferentia2 software-hardware-co-design aws-ofi-nccl

Updated Apr 21, 2026
Python

ziash / nvidia-nca-aiio-study-guide

Star

Comprehensive NCA-AIIO exam prep: study notes, diagrams, screenshots, and field experience for the NVIDIA Certified Associate: AI Infrastructure and Operations certification.

gpu nvidia infiniband certification study-guide datacenter dpu nvlink ai-infrastructure nca-aiio

Updated May 14, 2026

waynehacking8 / nccl-collectives-bench

Star

NCCL collective benchmarks on an 8×H100 NVSwitch host — busbw vs link budget, NVLS/Ring/Tree, small-message latency floors (eager vs CUDA Graph vs symmetric memory), and the TP-decode comms ceiling they imply. Includes a quiet-box rerun methodology for attribution.

benchmark gpu distributed-training nccl nvlink tensor-parallelism llm-inference h100

Updated Jun 2, 2026
Python

sbouhrour / mgpu-cg-stencil-solver

Star

Open-source stencil-aware multi-GPU Conjugate Gradient solver on 8× A100 NVLink. 2.07× SpMV vs cuSPARSE · 1.44× above NVIDIA AmgX · 93.5% strong scaling efficiency. Profiled with Nsight Systems & Nsight Compute.

performance-engineering hpc stencil mpi cuda conjugate-gradient multi-gpu nsight nvlink a100

Updated Jun 3, 2026
Cuda

DarkSliceYT / ai-infra-index

Star

Provide open-source access to detailed AI hardware specs, benchmarks, and infrastructure data for informed decision-making and analysis.

ai mcp inference data-center nvidia gemini infiniband vectorization tpu nvlink groq ai-assistant ai-accelerators qdrant cerebras ollama ai-hardware codebase-analysis

Updated Jun 3, 2026
HTML

Improve this page

Add a description, image, and links to the nvlink topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the nvlink topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nvlink

Here are 18 public repositories matching this topic...

uuudown / Tartan

c3sr / comm_scope

kmycode / kmy-keiba

Beuth-Erdelt / prometheus_nvlink_exporter

BITS08SATHYA / ares-scheduler

YconquestY / ncclAllReduce

alokemajumder / nemospawn

manishklach / gb300-rl-runtime

josepselga / NVIDIA-FabricManager-partition-tool

momentics / NeuralTower

JuhoArtturiHemminki / V-AXION-512

framsouza / inference-at-scale-on-kubernetes

milyas2001 / cortex-gpu-scheduler

SiliconLanguage / model-explorer-open-llm

ziash / nvidia-nca-aiio-study-guide

waynehacking8 / nccl-collectives-bench

sbouhrour / mgpu-cg-stencil-solver

DarkSliceYT / ai-infra-index

Improve this page

Add this topic to your repo