git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
llama : Async DirectIO model loading on Linux (#18012)
author Julius Tischbein <redacted>
Thu, 18 Dec 2025 06:27:19 +0000 (07:27 +0100)
committer GitHub <redacted>
Thu, 18 Dec 2025 06:27:19 +0000 (08:27 +0200)
commit 4d4f4cacd1c18975e5a97f20369fc30a225dc51f
tree f585f492183a915a810b96311614b85cd8dcffcb
parent 0a0bba05e8390ab7e4a54bb8c0ed0a25da64cf62

* Uncached model read

* Removing additional --mmap arg

* Removing trailing whitespace

* Adding fallback when O_DIRECT is not supported

* Remove branching in llama-model-loader.cpp and reduce code duplication in llama-mmap.cpp

* Adding maybe-unused keyword for Mac and Windows

* File seek aligned

* Removing all branches for direct_io in llama-model-loader.cpp

* Always use alignment from llama_file

* use_mmap=true
src/llama-mmap.cpp
src/llama-mmap.h
src/llama-model-loader.cpp