git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-03-13  Georgi Gerganov  llama : refactor llama_context, llama_kv_cache, llm_bui...
2025-03-13  Ishaan Gandhi  server : fix crash when using verbose output with input...
2025-03-12  Oscar Barenys  Update build.yml for Windows Vulkan builder to use...
2025-03-12  Daniel Bevenius  llama.swiftui : fix xcframework dir in README [no ci...
2025-03-12  Alberto Cabrera...  sycl : variable sg_size support for mmvq kernels (...
2025-03-12  uvos  CUDA/HIP: Fix fattn-vec-* when device warp size is...
2025-03-12  Xuan-Son Nguyen  llama : Add Gemma 3 support (+ experimental vision...
2025-03-12  Jeff Bolz  vulkan: fix bug in coopmat1 mul_mat_id (#12316)
2025-03-11  uvos  CUDA/HIP: refractor mmqv to unify the calculation of...
2025-03-11  jklincn  ggml-backend : fix backend search path (#12330)
2025-03-11  BB-fat  metal : Cache the Metal library at the device context...
2025-03-11  Xuan-Son Nguyen  clip : bring back GPU support (#12322)
2025-03-10  Eve  mat vec double buffer (#12188)
2025-03-10  R0CKSTAR  musa: support new arch mp_31 and update doc (#12296)
2025-03-10  Henry Linjamäki  opencl: use OpenCL C standard supported by the device...
2025-03-10  John Bean  readme: added Sidekick to available UIs (#12311)
2025-03-10  Georgi Gerganov  tests : fix test-quantize-fns to init the CPU backend...
2025-03-10  marcoStocchi  common : refactor '-o' option (#12278)
2025-03-10  Olivier Chafik  `server`: extract <think> tags from qwq outputs (#12297)
2025-03-10  Olivier Chafik  `tool-call`: ensure there's always a non-empty tool...
2025-03-10  Olivier Chafik  allow missing content in message if tool_calls provided...
2025-03-10  Olivier Chafik  `sampler`: fixes trigger tokens + lazy grammars (fix...
2025-03-10  tc-mb  llava : fix bug in minicpm-v code (#11513)
2025-03-09  Georgi Gerganov  server : add speculative decoding presets for FIM ...
2025-03-08  Georgi Gerganov  authors : update (#12271)
2025-03-08  Jason C.H  ggml-backend : make path_str compatible with C++20...
2025-03-07  Georgi Gerganov  server : infill gen ends on new line (#12254)
2025-03-07  Daniel Bevenius  ggml : skip intermediate .air file when compiling ...
2025-03-07  Georgi Gerganov  sync : ggml upstream/0.0.4853
2025-03-07  vmobilis  ggml : ggml_compute_forward_concat() for arbitrary...
2025-03-07  Rémy O  ggml-cpu: faster AVX2 variant for IQ1_M (#12216)
2025-03-07  Georgi Gerganov  ci : fix save-load test invocations (#12245)
2025-03-07  Sigbjørn Skjæret  server : Log original chat template parsing error ...
2025-03-07  Olivier Chafik  sync: minja - support QwQ-32B (#12235)
2025-03-07  BB-fat  metal : simplify kernel arguments using a struct (...
2025-03-07  David Huang  HIP: fix rocWMMA build flags under Windows (#12230)
2025-03-07  Daniel Bevenius  metal : fix default.metallib build (#12224)
2025-03-07  lhez  opencl: Noncontiguous `norm`, `rms_norm`, disable ...
2025-03-06  xiaofei  cmake : fix undefined reference errors for std::filesys...
2025-03-06  Lucas Moura...  readme : update bindings (#12229)
2025-03-06  Johannes Gäßler  CUDA: fix FA logic for PTX 7.0 and CC >= 7.5 (#12222)
2025-03-06  David Huang  HIP: rocWMMA documentation and enabling in workflow...
2025-03-06  Olivier Chafik  update function-calling.md w/ template override for...
2025-03-06  Aaron Teo  llava: add big-endian conversion for image encoder...
2025-03-06  uvos  HIP/CUDA: set the paramerter value in maintain_cuda_gra...
2025-03-06  Han Yin  android : fix KV cache log message condition (#12212)
2025-03-06  Henry Linjamäki  opencl : fix buffer alignment (#12197)
2025-03-06  Henry Linjamäki  opencl : fix `ulong` kernel args were set from `int...
2025-03-06  simon886212  opencl : fix profile-related errors (#12095)
2025-03-06  Rémy O  ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2...
2025-03-05  Akarshan Biswas  SYCL: Disable f16 Unary OPs as not supported by the...
2025-03-05  Plamen Minev  ggml : fix GGMLMetalClass ODR (#12200)
2025-03-05  Daniel Bevenius  ci : add fetch-depth to xcframework upload (#12195)
2025-03-05  Olivier Chafik  `tool-call`: fix Qwen 2.5 Coder support, add micro...
2025-03-05  Daniel Bevenius  ci : fix xcframework artifact tag (#12191)
2025-03-05  Daniel Bevenius  ci : remove xframework upload (#12190)
2025-03-05  Clauszy  server : fix cache reuse logic (#12161)
2025-03-05  Daniel Bevenius  llama : add xcframework build script (#11996)
2025-03-04  mgroeber9110  ggml : portability fixes for VS 2017 (#12150)
2025-03-04  Georgi Gerganov  readme : fix roadmap link (#12185)
2025-03-04  Sigbjørn Skjæret  main: allow preloading conversation with -p and add...
2025-03-04  Olivier Chafik  `server`: fix deadly typo in response_format.json_schem...
2025-03-03  David Huang  HIP: implement FlashAttention via rocWMMA for CDNA...
2025-03-03  Georgi Gerganov  sync : ggml
2025-03-03  cmdr2  cuda: unary ops as float + de-duplicate (ggml/1130)
2025-03-03  Georgi Gerganov  sync : ggml
2025-03-03  cmdr2  cuda/vulkan: specify fp32-only support for some operati...
2025-03-03  Georgi Gerganov  sync : ggml
2025-03-03  cmdr2  cuda/cpu: Increase support for fp16 unary operations...
2025-03-03  Diego Devesa  whisper : support GGML_BACKEND_DL (whisper/2843)
2025-03-03  midnight  cmake : fix compile assumptions for power9/etc (whisper...
2025-03-03  petterreinholdtsen  Told cmake to install ggml-cpp.h as a public header...
2025-03-03  cmdr2  Support pure float16 add/sub/mul/div operations in...
2025-03-03  Georgi Gerganov  scripts : sync-ggml-am.sh fix
2025-03-03  Daniel Bevenius  ci : set GITHUB_ACTION env var for server tests (#12162)
2025-03-03  dm4  tts: add speaker file support (#12048)
2025-03-03  Diego Devesa  test-backend-ops : add option -p to filter by op params...
2025-03-03  ag2s20150909  ggml : fix kleidiai build (#12159)
2025-03-03  Eric Curtin  Adding UTF-8 support to llama.cpp (#12111)
2025-03-03  Xuan-Son Nguyen  webui : add ?m=... and ?q=... params (#12148)
2025-03-03  Akarshan Biswas  SYCL: Move CPY kernels to a separate file and add few...
2025-03-02  Diego Devesa  ggml-backend : keep paths in native string type when...
2025-03-02  Sigbjørn Skjæret  main: use jinja chat template system prompt by default...
2025-03-01  Sigbjørn Skjæret  main: update outdated system prompt message (followup...
2025-03-01  Sigbjørn Skjæret  common : add --system-prompt parameter, replace behavio...
2025-03-01  Erik Scholz  CUDA: compress mode option and default to size (#12029)
2025-03-01  Vivian  webui : minor typo fixes (#12116)
2025-02-28  Xuan-Son Nguyen  convert : fix Norway problem when parsing YAML (#12114)
2025-02-28  William Tambellini  ggml : upgrade init_tensor API to return a ggml_status...
2025-02-28  Xuan-Son Nguyen  llama : add Phi-4-mini support (supersede #12099) ...
2025-02-28  Alex Brooks  Update granite vision docs for 3.2 model (#12105)
2025-02-28  Rémy O  vulkan: add specific MMV kernels for IQ2 and IQ3 quants...
2025-02-28  Johannes Gäßler  CUDA: fix logic for V100 + GGML_CUDA_FORCE_MMQ (#12098)
2025-02-28  Prashant Vithule  ggml: aarch64: implement SVE kernels for q2_k_q8_k...
2025-02-28  hipudding  CANN: Fix build error with GCC 13 (#11990)
2025-02-28  Eve  vulkan: matmul dequantization improvements (#12015)
2025-02-28  Daniele  vulkan: improve im2col (#11826)
2025-02-27  Vladimir Vuksanovic  cmake: Fix ggml backend dependencies and installation...
2025-02-26  Ting Lou  llava : add struct for FFI bindgen (#12079)
2025-02-26  Sigbjørn Skjæret  Refactor gguf scripts to improve metadata handling... gguf-v0.16.0