chore: bump llama.cpp to b9501 by ilopezluna · Pull Request #955 · docker/model-runner

ilopezluna · 2026-06-04T19:42:12Z

No description provided.

Signed-off-by: Ignacio López Luna <ignasi.lopez.luna@gmail.com>

gemini-code-assist

Code Review

This pull request updates the llama.cpp server version from b8967 to b9501, modifying the .versions file, Dockerfile, and the vendor subproject commit. There are no review comments, and I have no feedback to provide.


sourcery-ai

Hey - I've found 1 issue

Prompt for AI Agents

Please address the comments from this code review:

## Individual Comments

### Comment 1
<location path="Dockerfile" line_range="3-6" />
<code_context>

 ARG GO_VERSION=1.25
-ARG LLAMA_SERVER_VERSION=b8967
+ARG LLAMA_SERVER_VERSION=b9501
 ARG LLAMA_SERVER_VARIANT=cpu
-ARG LLAMA_UPSTREAM_IMAGE=ghcr.io/ggml-org/llama.cpp:server-vulkan-b8967
+ARG LLAMA_UPSTREAM_IMAGE=ghcr.io/ggml-org/llama.cpp:server-vulkan-b9501

 ARG VERSION=dev
</code_context>
<issue_to_address>
**suggestion:** Avoid duplicating the version string by deriving LLAMA_UPSTREAM_IMAGE from LLAMA_SERVER_VERSION.

`b9501` is currently duplicated in both `LLAMA_SERVER_VERSION` and `LLAMA_UPSTREAM_IMAGE`, which makes version bumps error-prone. Instead, compose `LLAMA_UPSTREAM_IMAGE` from `LLAMA_SERVER_VERSION`, e.g.:

```dockerfile
ARG LLAMA_SERVER_VERSION=b9501
ARG LLAMA_SERVER_VARIANT=cpu
ARG LLAMA_UPSTREAM_BASE=ghcr.io/ggml-org/llama.cpp
ARG LLAMA_UPSTREAM_IMAGE=${LLAMA_UPSTREAM_BASE}:server-vulkan-${LLAMA_SERVER_VERSION}
```

This keeps the version in a single place and avoids mismatches on future updates.

```suggestion
ARG GO_VERSION=1.25
ARG LLAMA_SERVER_VERSION=b9501
ARG LLAMA_SERVER_VARIANT=cpu
ARG LLAMA_UPSTREAM_BASE=ghcr.io/ggml-org/llama.cpp
ARG LLAMA_UPSTREAM_IMAGE=${LLAMA_UPSTREAM_BASE}:server-vulkan-${LLAMA_SERVER_VERSION}
```
</issue_to_address>
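Until the image reference is derived from `LLAMA_SERVER_VERSION`, the mismatch this comment warns about could be caught mechanically. Below is a minimal sketch, assuming the Dockerfile keeps plain `ARG NAME=value` lines as shown in the diff above; the helper names `parse_args` and `image_tag_matches_version` are hypothetical and not part of docker/model-runner.

```python
import re

# Matches Dockerfile lines such as `ARG NAME=value` (no quoting or continuations handled).
ARG_LINE = re.compile(r"^ARG\s+([A-Za-z_][A-Za-z0-9_]*)=(\S*)\s*$")


def parse_args(dockerfile_text: str) -> dict[str, str]:
    """Collect `ARG NAME=value` defaults from Dockerfile text (last one wins)."""
    defaults = {}
    for line in dockerfile_text.splitlines():
        match = ARG_LINE.match(line.strip())
        if match:
            defaults[match.group(1)] = match.group(2)
    return defaults


def image_tag_matches_version(dockerfile_text: str) -> bool:
    """Return True when the upstream image tag agrees with LLAMA_SERVER_VERSION."""
    defaults = parse_args(dockerfile_text)
    version = defaults.get("LLAMA_SERVER_VERSION", "")
    image = defaults.get("LLAMA_UPSTREAM_IMAGE", "")
    if not version or not image:
        return False
    # An image reference derived from the version ARG is consistent by construction.
    if "${LLAMA_SERVER_VERSION}" in image:
        return True
    # Otherwise the hardcoded tag must end with the same build identifier.
    return image.endswith("-" + version)
```

A check like this could run in CI against the Dockerfile contents and fail the build when a bump updates one `ARG` but not the other.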


sourcery-ai · 2026-06-04T19:43:08Z

 ARG GO_VERSION=1.25
-ARG LLAMA_SERVER_VERSION=b8967
+ARG LLAMA_SERVER_VERSION=b9501
 ARG LLAMA_SERVER_VARIANT=cpu
-ARG LLAMA_UPSTREAM_IMAGE=ghcr.io/ggml-org/llama.cpp:server-vulkan-b8967
+ARG LLAMA_UPSTREAM_IMAGE=ghcr.io/ggml-org/llama.cpp:server-vulkan-b9501



ericcurtin · 2026-06-04T19:52:40Z

@@ -1 +1 @@
 GO_VERSION=1.25


We should do something about this file; manually editing it really isn't great.
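One way to avoid hand-editing would be a small script that rewrites a single `KEY=value` entry. The following is only a sketch: it assumes `.versions` consists of `KEY=value` lines like the `GO_VERSION=1.25` line in the hunk above and that it contains a `LLAMA_SERVER_VERSION` entry (an assumption, since the hunk shows only the first line). The function name `set_version` is hypothetical.

```python
import re


def set_version(text: str, key: str, value: str) -> str:
    """Rewrite the `KEY=value` line for `key`, leaving every other line untouched."""
    pattern = re.compile(rf"^({re.escape(key)}=).*$", flags=re.MULTILINE)
    # A callable replacement avoids backslash or group-reference surprises in `value`.
    updated, count = pattern.subn(lambda m: m.group(1) + value, text)
    if count != 1:
        # Refuse to guess when the key is missing or appears more than once.
        raise ValueError(f"expected exactly one {key}= line, found {count}")
    return updated
```

For example, `set_version(contents, "LLAMA_SERVER_VERSION", "b9501")` would return the file contents with only that entry changed, and it raises rather than silently doing nothing if the key is absent.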

chore: bump llama.cpp to b9501

09bc107

Signed-off-by: Ignacio López Luna <ignasi.lopez.luna@gmail.com>

gemini-code-assist Bot reviewed Jun 4, 2026


sourcery-ai Bot reviewed Jun 4, 2026


ericcurtin approved these changes Jun 4, 2026


ericcurtin reviewed Jun 4, 2026


ericcurtin merged commit 842fd45 into main Jun 4, 2026
14 checks passed

ericcurtin deleted the bump-llamacpp-b9501 branch June 4, 2026 20:28
