-
Notifications
You must be signed in to change notification settings - Fork 12.7k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
vulkan: Support mul_mat_id with f32 accumulators
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#15337
opened Aug 15, 2025 by
jeffbolznv
•
Draft
optimize the rope ops
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#15335
opened Aug 15, 2025 by
YangShuai52
Loading…
vulkan: Add missing bounds checking to scalar/coopmat1 mul_mat_id
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#15334
opened Aug 14, 2025 by
jeffbolznv
Loading…
CANN: fix ggml_cann_rms_norm
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#15331
opened Aug 14, 2025 by
yuchuan-cao
Loading…
ci : move ccache action to ggml-org fork
devops
improvements to build systems and github actions
#15328
opened Aug 14, 2025 by
slaren
Loading…
aLoRA Support
examples
python
python script changes
server
#15327
opened Aug 14, 2025 by
gabe-l-hart
•
Draft
1 task
convert : add bos token for Gemma 3 base models
python
python script changes
#15326
opened Aug 14, 2025 by
danbev
Loading…
ci : fix ios-xcode-build
devops
improvements to build systems and github actions
#15324
opened Aug 14, 2025 by
CISC
Loading…
test-opt: fix backend support check
testing
Everything test related
#15317
opened Aug 14, 2025 by
JohannesGaessler
Loading…
OpenCL: add fused group_norm/norm, mul, add
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
testing
Everything test related
#15314
opened Aug 14, 2025 by
rmatif
Loading…
Add OpenVINO backend
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
64 bit CUDA copy routines via GGML_CUDA_ALLOW_LARGE_TENSORS
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15298
opened Aug 13, 2025 by
createthis
Loading…
ggml: riscv: add riscv spacemit backend
build
Compilation issues
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
#15288
opened Aug 13, 2025 by
alex-spacemit
Loading…
Add comprehensive Copilot instructions with Python environment, server testing, and git clang-format
devops
improvements to build systems and github actions
vulkan.Dockerfile: install vulkan SDK using tarball
devops
improvements to build systems and github actions
#15282
opened Aug 13, 2025 by
yeahdongcn
Loading…
vulkan: optimize rms_norm, and allow the work to spread across multiple SMs
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#15281
opened Aug 13, 2025 by
jeffbolznv
•
Draft
arm64: add i8mm route with SVE ggml_vec_dot_q4_K_q8_K and ggml_vec_dot_q6_K_…
ggml
changes relating to the ggml tensor library for machine learning
#15277
opened Aug 13, 2025 by
fj-y-saito
Loading…
Q6_K - Block Interleaving Implementation for x86 SIMD (AVX512/AVX2)
ggml
changes relating to the ggml tensor library for machine learning
#15275
opened Aug 12, 2025 by
Srihari-mcw
Loading…
opencl: add initial mxfp4 support via mv
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#15270
opened Aug 12, 2025 by
lhez
Loading…
Apple NPU acceleration integrated into llama.cpp, using MiniCPM-V 4.0 as an example.
examples
python
python script changes
#15262
opened Aug 12, 2025 by
tc-mb
Loading…
WIP: ggml-cuda: Add bf16 cuda support to fattn (Flash Attention)
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
#15261
opened Aug 12, 2025 by
eous
Loading…
musa: fix build warnings
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#15258
opened Aug 12, 2025 by
yeahdongcn
Loading…
vulkan: fuse adds
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#15252
opened Aug 11, 2025 by
jeffbolznv
Loading…
ci : Enable pre-built cuda releases on ubuntu (#5106)
devops
improvements to build systems and github actions
#15249
opened Aug 11, 2025 by
michaelgiba
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-07-14.