-
Notifications
You must be signed in to change notification settings - Fork 122
Pull requests: sgl-project/sgl-kernel-npu
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix of current build.sh cannot handle multiple CANN installation
#460
opened May 4, 2026 by
Sawyer117
Loading…
Adaptation of the Deepep A5 normal and low-latency operators.
#458
opened Apr 30, 2026 by
oagniqgnat
Contributor
Loading…
Add prebuilt metadata support and tests for chunk operations
#454
opened Apr 29, 2026 by
AndyLi429
Contributor
Loading…
improve performance for fused gdn gating and solve tril
#450
opened Apr 27, 2026 by
zhaozx-cn
Loading…
support ssd chunk scan triton & ssd chunk state triton on npu
#448
opened Apr 27, 2026 by
sigama-w
Loading…
add fused_qkvzba_split_reshape_cat_contiguous_kernel
#447
opened Apr 26, 2026 by
McZyWu
Contributor
Loading…
LoRA: Implementing kernels using CUBE computation unit
#432
opened Apr 8, 2026 by
vlserov
Contributor
Loading…
add dispatch_ffn_combine_bf16 kernel for deepep
#410
opened Mar 27, 2026 by
zuje123
Collaborator
Loading…
[WIP] add fuse_deep_moe_no_buffer for enable-torch-compile
#409
opened Mar 27, 2026 by
jiaming1130
Loading…
add fused_deep_moe test for dispatch_ffn_combine
#400
opened Mar 18, 2026 by
zuje123
Collaborator
Loading…
MMLU benchmark for different inverse implementations
#374
opened Feb 11, 2026 by
gioelegott
Loading…
(tri_inv) (pto-isa) implement AIV triangular inverse using pto-isa
#369
opened Feb 6, 2026 by
zouzias
Contributor
Loading…
wrap triton_kernels into callable that can be traced into a graph
#368
opened Feb 5, 2026 by
lawtherWu
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.