-
Notifications
You must be signed in to change notification settings - Fork 12.5k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Support streaming delta.reasoning_content in WebUI
examples
server
#15052
opened Aug 3, 2025 by
mostlygeek
Loading…
model : add text-only support for Kimi-VL
python
python script changes
#15051
opened Aug 3, 2025 by
gabriellarson
Loading…
memory : handle kv_unified for hybrid models
bugfix
fixes an issue or bug
#15050
opened Aug 3, 2025 by
compilade
Loading…
Fix: respect localStorage base URL override in Web UI
examples
server
#15048
opened Aug 3, 2025 by
insanerest
Loading…
fix tokenizer for JetBrain Mellum
python
python script changes
#15045
opened Aug 2, 2025 by
csabakecskemeti
Loading…
OpenCL: ensure command queue is finished before profiling
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#15042
opened Aug 2, 2025 by
rmatif
Loading…
Fix: flush partial stop string when <EOG> is reached in /completion endpoint in streaming mode
examples
server
#15007
opened Aug 1, 2025 by
matteoserva
Loading…
fix compile bug when the BASE_CUDA_DEV_CONTAINER is based on Ubuntu 2…
devops
improvements to build systems and github actions
#15005
opened Aug 1, 2025 by
simevo
Loading…
Add support for CogVLM model
examples
python
python script changes
#15002
opened Aug 1, 2025 by
Tianyue-Zhao
Loading…
2 of 4 tasks
OpenCL: add initial FA support
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#14987
opened Jul 31, 2025 by
rmatif
Loading…
CUDA: add set
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14980
opened Jul 31, 2025 by
jeemzz147
Loading…
ggml: WebGPU backend host improvements and style fixing
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
#14978
opened Jul 30, 2025 by
reeselevine
Loading…
ggml: initial IBM zDNN backend
devops
improvements to build systems and github actions
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
#14975
opened Jul 30, 2025 by
taronaeo
Loading…
Optimize l2_norm_f32 op with SIMD
ggml
changes relating to the ggml tensor library for machine learning
#14970
opened Jul 30, 2025 by
TIKki43
Loading…
Implementation of GGML_NUMA_MIRROR for 64% inferencing performance gain on numa systems
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
ggml : fix field name when new ggml_backend
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
OpenCL
Issues specific to the OpenCL backend
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#14944
opened Jul 29, 2025 by
aisk
Loading…
ops: add MUSA
documentation
Improvements or additions to documentation
#14941
opened Jul 29, 2025 by
yeahdongcn
Loading…
mtmd : support home-cooked Mistral Small Omni
examples
#14928
opened Jul 28, 2025 by
ngxson
Loading…
repack : optimize mul_mat_id path
ggml
changes relating to the ggml tensor library for machine learning
#14918
opened Jul 28, 2025 by
ggerganov
Loading…
1 task
opencl: fixed a typo
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#14908
opened Jul 27, 2025 by
l29ah
Loading…
ggml : repack block_iq4_nlx8 (AVX)
ggml
changes relating to the ggml tensor library for machine learning
#14904
opened Jul 27, 2025 by
ggerganov
Loading…
1 task done
Previous Next
ProTip!
no:milestone will show everything without a milestone.