]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-04-18 Chris Thompsongguf-py : GGUF Editor GUI - Python + Qt6 (#12930) gguf-v0.16.1
2025-04-18 Xuan-Son Nguyenserver : use std::move whenever possible (#12936)
2025-04-18 Akarshan BiswasSYCL: Refactor and enable FP16 in binary broadcast...
2025-04-18 Xuan-Son Nguyenmtmd : add methods to access `mtmd_image_tokens` (...
2025-04-18 Radoslav Gerganovrpc : add RPC_CMD_HELLO (#12955)
2025-04-17 Georgi Gerganovgraph : make FA compatible with MLA + add initial Metal...
2025-04-17 Alan Grayggml: Re-enable CUDA graphs in presence of CONT and...
2025-04-17 hipuddingCANN: Add support for async operator submission (#12864)
2025-04-17 Mikko Juolallama : recognize IBM Granite 3.3 FIM tokens (#12988)
2025-04-16 kimminsuopencl: fix incorrect local_size index in profiling...
2025-04-16 Jeff Bolzvulkan: enable coopmat2 FA gqa and split_k optimization...
2025-04-16 Chenguang LiCANN: Add 310P operator support check (#12962)
2025-04-15 lhezopencl: split `ggml-opencl.cl` into multiple files...
2025-04-15 Georgi Gerganovmetal : add FA-vec kernels for head size 96 (#12952)
2025-04-15 hipuddingCANN: Add x86 build ci (#12950)
2025-04-15 David HuangCUDA/HIP: Share the same unified memory allocation...
2025-04-15 Akarshan BiswasSYCL: Add ROPE vision kernel (#12887)
2025-04-15 Juk Armstrongllama : DeepSeek V2/V3 MLA implementation (#12801)
2025-04-15 Srihari-mcwggml : Add AVX512 implementation of GEMM - Q4_Kx8 ...
2025-04-15 Chenguang LiCANN: Opt ROPE optimization (#12865)
2025-04-15 Xinpeng DouCANN: Optimize CANN buffer pool memory management ...
2025-04-14 RussyydsAdd performance print for gemma3 in example (#12929)
2025-04-14 Akarshan BiswasSYCL: Fix im2col (#12910)
2025-04-14 Radoslav Gerganovrpc : use ggml_context_ptr (#12938)
2025-04-14 Neo Zhang Jianyudsiable curl lib check, this action is missed by commit...
2025-04-14 Georgi Gerganovsync : ggml
2025-04-14 cmdr2cpu: fix cpu backend's supports-op for GET_ROWS_BACK...
2025-04-14 SXXggml: use _mm[512/256]_dpbusd[_avx]_epi32 to directly...
2025-04-13 Alan Grayggml: disable CUDA graphs for unsupported DUP and CONT...
2025-04-13 Ed Addarioquantize: Handle user-defined quantization levels for...
2025-04-12 Prajwal B Mehendarkarcommon : Define cache directory on AIX (#12915)
2025-04-12 Jeff Bolzvulkan: use aligned loads for flash attention mask...
2025-04-12 Matt Claytonllava: Fix cpu-only clip image encoding sefault (#12907)
2025-04-11 Georgi Gerganovserver : add VSCode's Github Copilot Chat support ...
2025-04-11 yuri@FreeBSDrpc : Set cache directory in rpc-server.cpp on FreeBSD...
2025-04-11 Olivier Chafik`tool-call`: fix non-tool-calling grammar crashes w...
2025-04-11 yuri@FreeBSDcommon : Define cache directory on FreeBSD (#12892)
2025-04-11 Ewan Crawfordsycl: Support sycl_ext_oneapi_limited_graph (#12873)
2025-04-11 tastelikefeetcontrib: support modelscope community (#12664)
2025-04-11 Yuxuan Zhangllama-model : add Glm4Model implementation for GLM...
2025-04-11 Xuan-Son Nguyenclip : use smart pointer (⚠️ breaking change) (#12869)
2025-04-11 Akarshan BiswasSYCL: Add fp16 type support to unary op kernels (#12788)
2025-04-11 Daniel Hanconvert : Llama4 RoPE fix (#12889)
2025-04-11 R0CKSTARci : Replace freediskspace to free_disk_space in docker...
2025-04-11 Daniel Beveniusxcf : add check for visionos build version (#12854)
2025-04-11 Xuan-Son Nguyenconvert : proper tensor name mapping for llama4 (#12870)
2025-04-11 Xuan-Son Nguyenllama : correct rms norm for llama 4 (#12882)
2025-04-11 Aaron Teoggml: fix compilation error s390x (#12848)
2025-04-10 Georgi Gerganovsync : ggml
2025-04-10 Georgi Gerganovtests : fix init order (#0)
2025-04-10 Georgi Gerganovsync : ggml
2025-04-10 cmdr2ggml: don't include arm_neon.h when using CUDA 12 with...
2025-04-10 Diego Devesaggml : add bilinear upscale support (ggml/1185)
2025-04-10 Diego Devesaggml : add more generic custom op, remove deprecated...
2025-04-10 Georgi Gerganovscripts : fix sync-ggml-am.sh
2025-04-10 Xuan-Son Nguyenllava : introduce libmtmd (#12849)
2025-04-10 Xuan-Son Nguyenconvert : ability to lazy-load safetensors remotely...
2025-04-10 Chenguang LiCANN: Support more ops (#12841)
2025-04-09 Prajwal B MehendarkarFixes #12823 (#12830)
2025-04-09 Rudi Servodocker : added all CPU to GPU images (#12749)
2025-04-09 Piotr Kubajggml-cpu-impl.h: do not redefine bool on POWER9 (#12856)
2025-04-09 Piotr Kubajggml-impl.h: fix build on POWER9 (#12855)
2025-04-09 Bo Zhengllama : Support Qwen3 and Qwen3MoE (#12828)
2025-04-09 R0CKSTARmusa: enable freediskspace for docker image build ...
2025-04-09 Romain Biessysycl: update documentation to use -no-cnv (#12845)
2025-04-09 Plamen Minevci: detach common from the library (#12827)
2025-04-09 Xuan-Son Nguyenclip : do not print ftype (#12832)
2025-04-09 Georgi Gerganovreadme : add rpc backend (#12842)
2025-04-09 Chenguang LiCANN: Support Opt CONV_TRANSPOSE_1D and ELU (#12786)
2025-04-09 Jeff Bolzvulkan: In coopmat2 mmq, load q4_k/q5_k scales through...
2025-04-09 Jeff Bolzvulkan: Use fp16 for the flash attention P*V multiplica...
2025-04-08 Sigbjørn Skjæretcuda : add f32 to bf16 copy op (#12806)
2025-04-08 Matt Claytonllava: improve clip_ctx destructor to not memleak load_...
2025-04-08 Georgi Gerganovllama : fix FA when KV cache is not used (i.e. embeddin...
2025-04-08 Xuan-Son Nguyenserver : fix thread.join() on exit (#12831)
2025-04-08 dm4llava: add more helper functions to check projector...
2025-04-08 Prajwal B Mehendarkararg : Including limits file on AIX (#12822)
2025-04-08 characharmserver : webui : Improve Chat Input with Auto-Sizing...
2025-04-08 Neo Zhang JianyuRevert "sycl:remove redundant memcopy in function ggml_...
2025-04-08 compiladegguf-py : support lazy tensor splitting (#12809)
2025-04-07 Xuan-Son Nguyenllama : Support llama 4 text-only (#12791)
2025-04-07 lhezopencl: better identify Adreno GPU (#12760)
2025-04-07 stduhpfhellaswag: display estimated score confidence interval...
2025-04-07 Georgi Gerganovcuda : fix HIP and MUSA BF16 (#0)
2025-04-07 Georgi Gerganovsync : ggml
2025-04-07 Georgi Gerganovggml : simplify Arm fp16 CPU logic (ggml/1177)
2025-04-07 Sigbjørn SkjæretCUDA: don't convert BF16 weights to FP32 (ggml/1174)
2025-04-07 cmdr2cpu: move all the operators into a separate c++ file...
2025-04-07 zhouwgsycl: remove redundant memcopy in function ggml_backend...
2025-04-07 Xuan-Son Nguyenci : no curl on ggml-ci (#12796)
2025-04-07 Xuan-Son Nguyencmake : enable curl by default (#12761)
2025-04-07 zhouwgCANN: fix typo in ggml-cann (#12733)
2025-04-07 hipuddingCANN: Refactor to reduce duplicate code (#12731)
2025-04-06 R0CKSTARmusa: fix compilation warnings in mp_22/31 (#12780)
2025-04-06 Jeff Bolzvulkan: fix NaN issue in flash attention shader (#12776)
2025-04-06 Jeff Bolzvulkan: Use unclamped loads for flash attention mask...
2025-04-05 0cc4mVulkan: Tune Vulkan mmq int dot shader for performance...
2025-04-05 Sergey Fedorovcommon : fix includes in arg.cpp and gemma3-cli.cpp...
2025-04-05 Xuan-Son Nguyenclip : refactor clip_init, add tests (#12757)
2025-04-05 エシュナヴァリシアcommon: custom hf endpoint support (#12769)
next