Pull requests: sgl-project/sglang
fix: HiRadixCache: fix prefetch completion race
#9397
opened Aug 20, 2025 by
pabloiyu
4 tasks
[NVIDIA] Fix trtllm fp4 moe backend when used in MTP
high priority
#9384
opened Aug 20, 2025 by
kaixih
[Performance] Dynamic Batch Tokenizer
#9382
opened Aug 20, 2025 by
sundar24295s
2 of 4 tasks
fix: InternS1 don't recognize image, updates image token for InternVL processor
#9381
opened Aug 20, 2025 by
JustinTong0323
4 tasks
Fix model loading error when doing weights sync in RL training
#9379
opened Aug 20, 2025 by
rich-junwang
1 of 4 tasks
misc: parse bench_serving result as markdown table
#9377
opened Aug 20, 2025 by
mickqian
4 tasks
feat(hicache): Supports 3fs-hicache compatibility with dp-attention
#9372
opened Aug 20, 2025 by
hzh0425
4 tasks
[router] Add IGW (Inference Gateway) Feature Flag
#9371
opened Aug 20, 2025 by
key4ng
4 tasks
Opt:ascend kv separation; dsv3 support graph; fused dequant+swiglu+quant
#9355
opened Aug 19, 2025 by
chenxu140
4 tasks
Fix FP4 inference corruption issue in glm4.5-air model
#9346
opened Aug 19, 2025 by
Azure-Tang
[sgl-kernel] misc: update deepgemm version for sgl-kernel
high priority
#9340
opened Aug 19, 2025 by
FlamingoPg
3 of 4 tasks
Support trtllm_allreduce_fusion in flashinfer for cuda<12.8
high priority
#9339
opened Aug 19, 2025 by
strgrb
4 tasks
[wip] Fix Speculative Decoding with modelopt_fp4
#9338
opened Aug 19, 2025 by
ch-wan
4 tasks