Pull requests: ggml-org/llama.cpp

Pull requests list

model : add text-only support for Kimi-VL
Labels: python
#15051 opened Aug 3, 2025 by gabriellarson
memory : handle kv_unified for hybrid models
Labels: bugfix
#15050 opened Aug 3, 2025 by compilade
fix tokenizer for JetBrain Mellum
Labels: python
#15045 opened Aug 2, 2025 by csabakecskemeti
OpenCL: ensure command queue is finished before profiling
Labels: ggml, OpenCL
#15042 opened Aug 2, 2025 by rmatif
support GLM-4.5 MoE models
Labels: python
#15026 opened Aug 2, 2025 by ddh0 Draft
fix compile bug when the BASE_CUDA_DEV_CONTAINER is based on Ubuntu 2…
Labels: devops
#15005 opened Aug 1, 2025 by simevo
Add support for CogVLM model
Labels: examples, python
#15002 opened Aug 1, 2025 by Tianyue-Zhao
2 of 4 tasks
OpenCL: add initial FA support
Labels: ggml, OpenCL
#14987 opened Jul 31, 2025 by rmatif
CUDA: add set
Labels: ggml, Nvidia GPU
#14980 opened Jul 31, 2025 by jeemzz147
ggml: WebGPU backend host improvements and style fixing
Labels: devops, ggml
#14978 opened Jul 30, 2025 by reeselevine
ggml: initial IBM zDNN backend
Labels: devops, documentation, ggml
#14975 opened Jul 30, 2025 by taronaeo
Optimize l2_norm_f32 op with SIMD
Labels: ggml
#14970 opened Jul 30, 2025 by TIKki43
Implementation of GGML_NUMA_MIRROR for 64% inferencing performance gain on numa systems
Labels: examples, ggml, Nvidia GPU, testing
#14969 opened Jul 30, 2025 by dbsanfte Draft
ggml : fix field name when new ggml_backend
Labels: ggml, Nvidia GPU, OpenCL, SYCL, Vulkan
#14944 opened Jul 29, 2025 by aisk
ops: add MUSA
Labels: documentation
#14941 opened Jul 29, 2025 by yeahdongcn
model: Add support for GLM 4.5 family of models (#14921)
Labels: model, python
#14939 opened Jul 29, 2025 by sammcj
repack : optimize mul_mat_id path
Labels: ggml
#14918 opened Jul 28, 2025 by ggerganov
1 task
opencl: fixed a typo
Labels: ggml, OpenCL
#14908 opened Jul 27, 2025 by l29ah
ggml : repack block_iq4_nlx8 (AVX)
Labels: ggml
#14904 opened Jul 27, 2025 by ggerganov
1 task done