Pull requests: ggml-org/llama.cpp

Pull requests list

[SYCL] support bfloat16 release package
Labels: devops
#17855 opened Dec 8, 2025 by arthw
cuda : add FILL op support
Labels: ggml, Nvidia GPU
#17851 opened Dec 8, 2025 by JayZenith
Add support for R-4B multimodal model
Labels: examples, python
#17840 opened Dec 7, 2025 by infil00p (Draft)
[SYCL] fix softmax for iGPU
Labels: ggml, SYCL
#17838 opened Dec 7, 2025 by NeoZhangJianyu
debug: Adding CPU-side visual trace for hexagon
Labels: ggml, script
#17837 opened Dec 7, 2025 by Ethan-a2
[SYCL] Support gpt-oss by OPs add-id, mul_mat for mxfp4, swiglu_oai
Labels: documentation, ggml, SYCL
#17826 opened Dec 6, 2025 by NeoZhangJianyu
cann : fix ops broken by circular padding guard
Labels: Ascend NPU, ggml
#17825 opened Dec 6, 2025 by CISC
cli: new CLI experience
Labels: devops, examples, script, server, testing
#17824 opened Dec 6, 2025 by ngxson (Draft)
4 of 6 tasks
llama : add token matching support to llama-grammar
Labels: testing
#17816 opened Dec 6, 2025 by aldehir
3 tasks done
CANN: support gated linear attn
Labels: Ascend NPU, ggml
#17814 opened Dec 6, 2025 by YushengZhao
vulkan: faster q6_k matmul
Labels: ggml, Vulkan
#17813 opened Dec 6, 2025 by netrunnereve
model: support Rnj-1
Labels: model, python
#17811 opened Dec 6, 2025 by philip-essential
[DRAFT] CUDA: Improve performance via less synchronizations between token
Labels: ggml, Nvidia GPU
#17795 opened Dec 5, 2025 by aendk (Draft)
Make graph_max_nodes vary by ubatch size
#17794 opened Dec 5, 2025 by pwilkin