fep(sig-framework): add PyTorch-Plugin-FL v0.1.0 CUDA backend dispatch proposal by Hchnr · Pull Request #25 · flagos-ai/community

Hchnr · 2026-05-28T02:21:11Z

No description provided.

buzhengjing · 2026-06-03T08:22:38Z

Thanks for the proposal.

From a reviewer perspective, the current FEP provides a good architectural overview, but it does not yet contain sufficient implementation and validation details for reproducible verification.

In particular, the proposal does not currently specify:

Validation environment (image, dependencies, versions)
Installation and configuration procedures
Supported operators in the initial scope
Detailed test cases
Expected execution results
Backend selection verification methodology
Native CUDA vs. FlagGems comparison criteria
CI/regression validation strategy

Without these details, it is difficult for reviewers to reproduce the proposed workflow or assess the completeness of the implementation plan.

Could you consider adding a dedicated "Implementation and Validation Plan" section covering environment setup, test procedures, sample commands, expected outputs, and acceptance criteria?
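As one illustration of what the requested "Native CUDA vs. FlagGems comparison criteria" could look like, here is a minimal sketch of an elementwise tolerance check between a reference result and a candidate result. The function name `outputs_match` and both tolerance defaults are assumptions made for this example, not values from the FEP; a real test would more likely compare tensors with `torch.testing.assert_close`.

```python
import math


def outputs_match(reference, candidate, rtol=1e-3, atol=1e-5):
    """Return True if every candidate value is within tolerance of the reference.

    Applies the common |c - r| <= atol + rtol * |r| rule per element.
    rtol and atol are illustrative defaults, not values from the proposal.
    """
    if len(reference) != len(candidate):
        return False
    for r, c in zip(reference, candidate):
        if math.isnan(r) or math.isnan(c):
            # NaN only matches NaN.
            if not (math.isnan(r) and math.isnan(c)):
                return False
            continue
        if math.isinf(r) or math.isinf(c):
            # Infinities must agree exactly, including sign.
            if r != c:
                return False
            continue
        if abs(c - r) > atol + rtol * abs(r):
            return False
    return True


# Stand-in values; in practice these would come from the two backends.
native_result = [1.0, 2.0, 3.0]
flaggems_result = [1.0, 2.0005, 3.0]
print(outputs_match(native_result, flaggems_result))
```

Stating the tolerances explicitly per dtype in the FEP would let reviewers tell an acceptable numerical difference from a regression.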

Hchnr · 2026-06-09T02:46:01Z

Update: expand CUDA dispatch to multi-backend architecture with full verification plan

Major updates to the PyTorch-Plugin-FL v0.1.0 FEP:

Rename from "CUDA Backend" to "Multi-Backend Operator Dispatch"
Add Ascend (Huawei) native kernel support alongside CUDA
Introduce Dispatcher<FnPtr> template-based routing mechanism
Support three dispatch paths: native CUDA, native Ascend, FlagGems Triton (C++ and Python)
Define 32 first-phase operators with cross-platform implementations
Add detailed architecture diagrams and registration flow
Provide complete testing strategy with per-operator and end-to-end tests (Qwen3-0.6B)
Document full verification environments for both CUDA (A800) and Ascend (910B) platforms
Include step-by-step installation, test procedures, and expected outputs
Add CI/CD integration and regression testing guidelines

Scope: Expands from CUDA-only prototype to production-ready multi-backend framework with comprehensive validation plan.
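To make the routing idea in the list above concrete, here is a rough Python sketch of a per-operator dispatch table with fallback. It is not the FEP's implementation: the proposal describes a C++ template, and this thread does not spell out its semantics. The class layout, the backend labels (`cuda_native`, `ascend_native`, `flaggems_triton`), and the preference-order fallback are all hypothetical.

```python
from typing import Callable, Dict, List, Optional


class Dispatcher:
    """Hypothetical Python analogue of a per-operator dispatch table."""

    def __init__(self, op_name: str) -> None:
        self.op_name = op_name
        self._impls: Dict[str, Callable] = {}
        # Recorded on every call so a test can verify which backend ran.
        self.last_backend: Optional[str] = None

    def register(self, backend: str, fn: Callable) -> None:
        """Attach an implementation for one backend."""
        self._impls[backend] = fn

    def __call__(self, preference: List[str], *args):
        """Run the first registered implementation in preference order."""
        for backend in preference:
            fn = self._impls.get(backend)
            if fn is not None:
                self.last_backend = backend
                return fn(*args)
        raise NotImplementedError(
            f"{self.op_name}: no implementation for any of {preference}"
        )


# Plain Python functions stand in for real kernels.
add = Dispatcher("add")
add.register("cuda_native", lambda a, b: a + b)
add.register("flaggems_triton", lambda a, b: a + b)

# No Ascend kernel is registered here, so the call falls through to FlagGems.
result = add(["ascend_native", "flaggems_triton"], 2, 3)
print(result, add.last_backend)
```

Recording the chosen backend, as `last_backend` does here, is one simple way to support the backend selection verification the reviewer asked for.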

buzhengjing · 2026-06-09T09:26:02Z

+| GPU | NVIDIA A800-SXM4-80GB |
+| Driver | 535.154.05 |
+| CUDA Toolkit | 12.8 |
+| Conda Env | `pytorch` (Python 3.12.13) |


Could you share the full Docker image pytorch2.11.0_cuda12.8_triton3.6.0_flaggems5.0.2?

docker pull harbor.baai.ac.cn/flagscale/cuda12.8.1-cudnn9.15.1-python3.12-torch2.7.1-train:2512031616

fep: add PyTorch-Plugin-FL CUDA backend dispatch proposal

3e8d4f8

buzhengjing mentioned this pull request Jun 4, 2026

FEP Missing Implementation and Validation Plan for Reproducible Verification flagos-ai/PyTorch-Plugin-FL#9

Closed

buzhengjing reviewed Jun 9, 2026

buzhengjing mentioned this pull request Jun 10, 2026

[Bug] torch_fl and torch_npu conflict: duplicate PrivateUse1 backend fallback registration on Ascend platform flagos-ai/PyTorch-Plugin-FL#11

Closed

Hchnr added 4 commits June 10, 2026 17:26

update: install steps

8ead76f

update: install steps

b77155f

update: add step (patch triton-ascend)

fdd2ff3

update: ascend flaggems install

f590835

Hchnr wants to merge 6 commits into flagos-ai:main from Hchnr:pytorch_plugin_fl_v0.1.0
