-
Notifications
You must be signed in to change notification settings - Fork 13.1k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
devops: fix s390x docker release failure
devops
improvements to build systems and github actions
#16231
opened Sep 24, 2025 by
taronaeo
Loading…
metal : extend mat-mat multiplication support
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
vulkan : make the vulkan.hpp dynamic dispatcher instance private
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16224
opened Sep 24, 2025 by
Acly
Loading…
Improve Mobile UI for dialogs and action dropdowns
examples
server
#16222
opened Sep 24, 2025 by
allozaur
Loading…
HIP: Disable ROCWMMA fattn on CDNA when compiled against ROCWMMA 2.0.0
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16221
opened Sep 24, 2025 by
IMbackK
Loading…
metal : fuse NORM + MUL + ADD, support non-multiples of 4
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16220
opened Sep 24, 2025 by
ggerganov
Loading…
metal : restore im2col perf
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#16219
opened Sep 24, 2025 by
ggerganov
Loading…
metal : relax reorder conditions
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#16216
opened Sep 24, 2025 by
ggerganov
Loading…
CUDA: refactor and deduplicate vector FA kernels
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
#16208
opened Sep 23, 2025 by
JohannesGaessler
Loading…
Model: Granite docling + Idefics3 preprocessing (SmolVLM)
examples
python
python script changes
#16206
opened Sep 23, 2025 by
gabe-l-hart
Loading…
vulkan: Add ACC_TYPE_VEC2 implementation
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16203
opened Sep 23, 2025 by
SavicStefan
Loading…
tools/main: llama-cli: prevent spurious assistant token (#13402)
examples
#16202
opened Sep 23, 2025 by
vinkal-chudgar
Loading…
Enhance text file detection logic for file attachments
examples
server/webui
server
#16199
opened Sep 23, 2025 by
allozaur
Loading…
Implement progress bar and multi-connection downloads
#16196
opened Sep 23, 2025 by
ericcurtin
Loading…
ggml-cpu: implement MXFP4 SIMD for s390x
ggml
changes relating to the ggml tensor library for machine learning
#16193
opened Sep 23, 2025 by
taronaeo
Loading…
ggml webgpu: support for rope,div,sub,glu,scale,cont operators
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#16187
opened Sep 23, 2025 by
reeselevine
Loading…
common : use cpp-httplib as a cURL alternative for downloads
#16185
opened Sep 22, 2025 by
angt
Loading…
ci: run the x64 and arm ci on the github machines instead
devops
improvements to build systems and github actions
testing
Everything test related
#16183
opened Sep 22, 2025 by
netrunnereve
Loading…
ggml : add repack testing support
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16182
opened Sep 22, 2025 by
danbev
Loading…
vulkan: handle mat_mul with A matrix > 4GB
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#16176
opened Sep 22, 2025 by
jeffbolznv
Loading…
minor: root cause in error message if loading backend library fails
ggml
changes relating to the ggml tensor library for machine learning
#16172
opened Sep 22, 2025 by
rlewczuk
Loading…
CANN: improve ACL graph matching
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#16166
opened Sep 22, 2025 by
noemotiovon
Loading…
vulkan: support arbitrary KV dimension in flash attention
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16160
opened Sep 21, 2025 by
jeffbolznv
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-08-24.