git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-03-18 Georgi Gerganov  context : always use non-causal attention for encoder...
2025-03-18 Łukasz Ślusarczyk  SYCL: using graphs is configurable by environment varia...
2025-03-18 Georgi Gerganov  server : fix warmup draft cache type (#12446)
2025-03-18 Prajwal B Mehendarkar  cmake : fix PowerPC build (#12241)
2025-03-18 fj-y-saito  ggml : add SVE support for q6_K_q8_K (#12361)
2025-03-18 0cc4m  Vulkan: Default to 1GB allocations instead of 4GB to...
2025-03-18 Łukasz Ślusarczyk  fixed compilation warnings in ggml-sycl (#12424)
2025-03-17 Molly Sophia  llama: Add support for RWKV v7 architecture (#12412)
2025-03-17 Sigbjørn Skjæret  docs : bring llama-cli conversation/template docs up...
2025-03-17 Gaurav Garg  cuda : enable CUDA Graph on CUDA Toolkit < 12.x (#12394)
2025-03-17 Guus Waals  ggml-vulkan: remove unused find_program(glslc) (#12416)
2025-03-17 Jeff Bolz  vulkan: Add N/2 and N/4 optimized paths in coopmat2...
2025-03-17 Daniele  vulkan: subgroup size tuning (#12087)
2025-03-17 Jeff Bolz  vulkan: use fp32 in coopmat2 q4_k dequant function...
2025-03-17 Jeff Bolz  vulkan: Pad N dimension of B matrix for coopmat2 perf...
2025-03-17 Jeff Bolz  vulkan: Adjust coopmat2 tile sizes and selection heuris...
2025-03-17 Christian Kastner  cmake : enable building llama.cpp using system libggml...
2025-03-17 Akarshan Biswas  SYCL: set extras only on GGML_TYPE_Q4_0 (#12366)
2025-03-16 Sigbjørn Skjæret  llama : fix OLMo-2-0325-32B-Instruct K-norm size (...
2025-03-16 Georgi Gerganov  context : fix init of n_outputs (#12397)
2025-03-16 Daniel Bevenius  ci : add --symlinks to xcframework zip command (#12409)
2025-03-15 marcoStocchi  llama-tts : add '-o' option (#12398)
2025-03-15 aubreyli  SYCL: Delete redundant plus sign and space (#12391)
2025-03-15 fairydreaming  SYCL : support non-contiguous tensors in binary ops...
2025-03-15 Chenguang Li  [CANN]MUL_MAT optimization (#12382)
2025-03-14 Eric Curtin  Add CLI arg to llama-run to adjust the number of thread...
2025-03-14 Sigbjørn Skjæret  main : add -sysf / --system-prompt-file (#12249) (...
2025-03-14 fairydreaming  Load all MoE experts during warmup (#11571)
2025-03-14 Victor  server: fix "--grammar-file" parameter (#12285)
2025-03-14 Georgi Gerganov  graph : simplify attn input build for unified KV cache...
2025-03-14 Georgi Gerganov  hparams : add SWA rope parameters (#12374)
2025-03-13 Georgi Gerganov  llama : fix Gemma3 SWA KV cache shift (#12373)
2025-03-13 Xuan-Son Nguyen  arg : no n_predict = -2 for examples except for main...
2025-03-13 Georgi Gerganov  llama : refactor llama_context, llama_kv_cache, llm_bui...
2025-03-13 Ishaan Gandhi  server : fix crash when using verbose output with input...
2025-03-12 Oscar Barenys  Update build.yml for Windows Vulkan builder to use...
2025-03-12 Daniel Bevenius  llama.swiftui : fix xcframework dir in README [no ci...
2025-03-12 Alberto Cabrera...  sycl : variable sg_size support for mmvq kernels (...
2025-03-12 uvos  CUDA/HIP: Fix fattn-vec-* when device warp size is...
2025-03-12 Xuan-Son Nguyen  llama : Add Gemma 3 support (+ experimental vision...
2025-03-12 Jeff Bolz  vulkan: fix bug in coopmat1 mul_mat_id (#12316)
2025-03-11 uvos  CUDA/HIP: refractor mmqv to unify the calculation of...
2025-03-11 jklincn  ggml-backend : fix backend search path (#12330)
2025-03-11 BB-fat  metal : Cache the Metal library at the device context...
2025-03-11 Xuan-Son Nguyen  clip : bring back GPU support (#12322)
2025-03-10 Eve  mat vec double buffer (#12188)
2025-03-10 R0CKSTAR  musa: support new arch mp_31 and update doc (#12296)
2025-03-10 Henry Linjamäki  opencl: use OpenCL C standard supported by the device...
2025-03-10 John Bean  readme: added Sidekick to available UIs (#12311)
2025-03-10 Georgi Gerganov  tests : fix test-quantize-fns to init the CPU backend...
2025-03-10 marcoStocchi  common : refactor '-o' option (#12278)
2025-03-10 Olivier Chafik  `server`: extract <think> tags from qwq outputs (#12297)
2025-03-10 Olivier Chafik  `tool-call`: ensure there's always a non-empty tool...
2025-03-10 Olivier Chafik  allow missing content in message if tool_calls provided...
2025-03-10 Olivier Chafik  `sampler`: fixes trigger tokens + lazy grammars (fix...
2025-03-10 tc-mb  llava : fix bug in minicpm-v code (#11513)
2025-03-09 Georgi Gerganov  server : add speculative decoding presets for FIM ...
2025-03-08 Georgi Gerganov  authors : update (#12271)
2025-03-08 Jason C.H  ggml-backend : make path_str compatible with C++20...
2025-03-07 Georgi Gerganov  server : infill gen ends on new line (#12254)
2025-03-07 Daniel Bevenius  ggml : skip intermediate .air file when compiling ...
2025-03-07 Georgi Gerganov  sync : ggml upstream/0.0.4853
2025-03-07 vmobilis  ggml : ggml_compute_forward_concat() for arbitrary...
2025-03-07 Rémy O  ggml-cpu: faster AVX2 variant for IQ1_M (#12216)
2025-03-07 Georgi Gerganov  ci : fix save-load test invocations (#12245)
2025-03-07 Sigbjørn Skjæret  server : Log original chat template parsing error ...
2025-03-07 Olivier Chafik  sync: minja - support QwQ-32B (#12235)
2025-03-07 BB-fat  metal : simplify kernel arguments using a struct (...
2025-03-07 David Huang  HIP: fix rocWMMA build flags under Windows (#12230)
2025-03-07 Daniel Bevenius  metal : fix default.metallib build (#12224)
2025-03-07 lhez  opencl: Noncontiguous `norm`, `rms_norm`, disable ...
2025-03-06 xiaofei  cmake : fix undefined reference errors for std::filesys...
2025-03-06 Lucas Moura...  readme : update bindings (#12229)
2025-03-06 Johannes Gäßler  CUDA: fix FA logic for PTX 7.0 and CC >= 7.5 (#12222)
2025-03-06 David Huang  HIP: rocWMMA documentation and enabling in workflow...
2025-03-06 Olivier Chafik  update function-calling.md w/ template override for...
2025-03-06 Aaron Teo  llava: add big-endian conversion for image encoder...
2025-03-06 uvos  HIP/CUDA: set the paramerter value in maintain_cuda_gra...
2025-03-06 Han Yin  android : fix KV cache log message condition (#12212)
2025-03-06 Henry Linjamäki  opencl : fix buffer alignment (#12197)
2025-03-06 Henry Linjamäki  opencl : fix `ulong` kernel args were set from `int...
2025-03-06 simon886212  opencl : fix profile-related errors (#12095)
2025-03-06 Rémy O  ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2...
2025-03-05 Akarshan Biswas  SYCL: Disable f16 Unary OPs as not supported by the...
2025-03-05 Plamen Minev  ggml : fix GGMLMetalClass ODR (#12200)
2025-03-05 Daniel Bevenius  ci : add fetch-depth to xcframework upload (#12195)
2025-03-05 Olivier Chafik  `tool-call`: fix Qwen 2.5 Coder support, add micro...
2025-03-05 Daniel Bevenius  ci : fix xcframework artifact tag (#12191)
2025-03-05 Daniel Bevenius  ci : remove xframework upload (#12190)
2025-03-05 Clauszy  server : fix cache reuse logic (#12161)
2025-03-05 Daniel Bevenius  llama : add xcframework build script (#11996)
2025-03-04 mgroeber9110  ggml : portability fixes for VS 2017 (#12150)
2025-03-04 Georgi Gerganov  readme : fix roadmap link (#12185)
2025-03-04 Sigbjørn Skjæret  main: allow preloading conversation with -p and add...
2025-03-04 Olivier Chafik  `server`: fix deadly typo in response_format.json_schem...
2025-03-03 David Huang  HIP: implement FlashAttention via rocWMMA for CDNA...
2025-03-03 Georgi Gerganov  sync : ggml
2025-03-03 cmdr2  cuda: unary ops as float + de-duplicate (ggml/1130)
2025-03-03 Georgi Gerganov  sync : ggml
2025-03-03 cmdr2  cuda/vulkan: specify fp32-only support for some operati...