Skip to content

Pull requests: sgl-project/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: HiRadixCache: fix prefetch completion race
#9397 opened Aug 20, 2025 by pabloiyu Loading…
4 tasks
xeon ci enhancement
#9395 opened Aug 20, 2025 by DiweiSun Loading…
[Bug] Fix w4afp8 moe kernel
#9392 opened Aug 20, 2025 by yuhyao Loading…
1 of 4 tasks
[Performance] Dynamic Batch Tokenizer
#9382 opened Aug 20, 2025 by sundar24295s Loading…
2 of 4 tasks
Fix model loading error when doing weights sync in RL training
#9379 opened Aug 20, 2025 by rich-junwang Loading…
1 of 4 tasks
misc: parse bench_serving result as markdown table
#9377 opened Aug 20, 2025 by mickqian Loading…
4 tasks
[router] Add IGW (Inference Gateway) Feature Flag
#9371 opened Aug 20, 2025 by key4ng Loading…
4 tasks
Log iteration # for prefill and decode
#9366 opened Aug 19, 2025 by nvcastet Loading…
4 tasks
Support DP attention with GPT-OSS
#9359 opened Aug 19, 2025 by nvcastet Loading…
4 tasks
Register Fp4 allgather with NCCL symmetric memory
#9358 opened Aug 19, 2025 by nvcastet Draft
4 tasks
Add support for Qwen3-seq-cls
#9357 opened Aug 19, 2025 by nathanrchn Loading…
4 tasks
[AMD] Remove the deprecated C10_WARP_SIZE
#9356 opened Aug 19, 2025 by hubertlu-tw Loading…
4 tasks
Support ibm_grouped_gemm and unit test
#9352 opened Aug 19, 2025 by yuan-luo Loading…
4 tasks
Add support for GLM 4.5V FP8
#9349 opened Aug 19, 2025 by pakjoeng Loading…
1 of 4 tasks
fix: fix max-new-tokens is none
#9343 opened Aug 19, 2025 by mickqian Draft
4 tasks
[wip] Fix Speculative Decoding with modelopt_fp4
#9338 opened Aug 19, 2025 by ch-wan Loading…
4 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.