-DCMAKE_C_COMPILER_LAUNCHER=ccache \
-DCMAKE_CXX_COMPILER_LAUNCHER=ccache \
-DLLAMA_BUILD_TESTS=OFF \
- -DGGML_BACKEND_DL=OFF \
-DGGML_NATIVE=OFF \
+ -DGGML_BACKEND_DL=ON \
+ -DGGML_CPU_ALL_VARIANTS=ON \
-DGGML_BLAS=ON \
-DGGML_BLAS_VENDOR=OpenBLAS && \
cmake --build build --config Release -j $(nproc) && \
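Enabling `GGML_BACKEND_DL` together with `GGML_CPU_ALL_VARIANTS` builds each CPU backend variant as a dynamically loadable library and picks the best one for the host at runtime, which is what makes a single image portable across CPU generations. For reference, a minimal standalone sketch of the same configuration outside Docker, assuming a plain checkout (the `RUN` command above is truncated in this hunk):

```sh
# Configure with dynamically loadable CPU variants and OpenBLAS,
# mirroring the flags added in this diff (paths are placeholders).
cmake -B build \
    -DGGML_NATIVE=OFF \
    -DGGML_BACKEND_DL=ON \
    -DGGML_CPU_ALL_VARIANTS=ON \
    -DGGML_BLAS=ON \
    -DGGML_BLAS_VENDOR=OpenBLAS
cmake --build build --config Release -j "$(nproc)"
```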
WORKDIR /llama.cpp/bin
# Copy llama.cpp binaries and libraries
+COPY --from=collector /llama.cpp/bin/*.so /llama.cpp/bin/
COPY --from=collector /llama.cpp/bin/llama-cli /llama.cpp/bin
ENTRYPOINT [ "/llama.cpp/bin/llama-cli" ]
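Because the entrypoint is `llama-cli`, any arguments given to `docker run` after the image name are passed straight to the binary. A usage sketch, with a placeholder model path and prompt:

```sh
# Mount a host directory containing GGUF models and run a one-off
# generation; /path/to/models and the model filename are placeholders.
docker run -v /path/to/models:/models ghcr.io/ggml-org/llama.cpp:light \
    -m /models/7B/ggml-model-q4_0.gguf \
    -p "Building a website can be done in 10 simple steps:" -n 512
```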
WORKDIR /llama.cpp/bin
# Copy llama.cpp binaries and libraries
+COPY --from=collector /llama.cpp/bin/*.so /llama.cpp/bin/
COPY --from=collector /llama.cpp/bin/llama-server /llama.cpp/bin
EXPOSE 8080
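Only port 8080 is exposed, so reaching the server from the host requires a port mapping and a non-loopback bind address. A sketch with placeholder paths:

```sh
# Publish the container port and bind the server to all interfaces;
# /path/to/models and the model filename are placeholders.
docker run -v /path/to/models:/models -p 8080:8080 ghcr.io/ggml-org/llama.cpp:server \
    -m /models/7B/ggml-model-q4_0.gguf --host 0.0.0.0 --port 8080
```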
## Images
We have three Docker images available for this project:
-1. `ghcr.io/ggml-org/llama.cpp:full`: This image includes both the main executable file and the tools to convert LLaMA models into ggml and convert into 4-bit quantization. (platforms: `linux/amd64`, `linux/arm64`)
-2. `ghcr.io/ggml-org/llama.cpp:light`: This image only includes the main executable file. (platforms: `linux/amd64`, `linux/arm64`)
-3. `ghcr.io/ggml-org/llama.cpp:server`: This image only includes the server executable file. (platforms: `linux/amd64`, `linux/arm64`)
+1. `ghcr.io/ggml-org/llama.cpp:full`: This image includes both the main executable file and the tools to convert LLaMA models into ggml and quantize them to 4-bit. (platforms: `linux/amd64`, `linux/arm64`, `linux/s390x`)
+2. `ghcr.io/ggml-org/llama.cpp:light`: This image only includes the main executable file. (platforms: `linux/amd64`, `linux/arm64`, `linux/s390x`)
+3. `ghcr.io/ggml-org/llama.cpp:server`: This image only includes the server executable file. (platforms: `linux/amd64`, `linux/arm64`, `linux/s390x`)
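All three tags are multi-arch manifests, so Docker resolves the host platform automatically; to pull a specific variant (for example the newly added s390x build) explicitly:

```sh
# Request one platform from the multi-arch manifest.
docker pull --platform linux/s390x ghcr.io/ggml-org/llama.cpp:full
```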
Additionally, there are the following images, similar to the above: