git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-04-09  Georgi Gerganov  readme : add rpc backend (#12842)
2025-04-09  Chenguang Li  CANN: Support Opt CONV_TRANSPOSE_1D and ELU (#12786)
2025-04-09  Jeff Bolz  vulkan: In coopmat2 mmq, load q4_k/q5_k scales through...
2025-04-09  Jeff Bolz  vulkan: Use fp16 for the flash attention P*V multiplica...
2025-04-08  Sigbjørn Skjæret  cuda : add f32 to bf16 copy op (#12806)
2025-04-08  Matt Clayton  llava: improve clip_ctx destructor to not memleak load_...
2025-04-08  Georgi Gerganov  llama : fix FA when KV cache is not used (i.e. embeddin...
2025-04-08  Xuan-Son Nguyen  server : fix thread.join() on exit (#12831)
2025-04-08  dm4  llava: add more helper functions to check projector...
2025-04-08  Prajwal B Mehendarkar  arg : Including limits file on AIX (#12822)
2025-04-08  characharm  server : webui : Improve Chat Input with Auto-Sizing...
2025-04-08  Neo Zhang Jianyu  Revert "sycl:remove redundant memcopy in function ggml_...
2025-04-08  compilade  gguf-py : support lazy tensor splitting (#12809)
2025-04-07  Xuan-Son Nguyen  llama : Support llama 4 text-only (#12791)
2025-04-07  lhez  opencl: better identify Adreno GPU (#12760)
2025-04-07  stduhpf  hellaswag: display estimated score confidence interval...
2025-04-07  Georgi Gerganov  cuda : fix HIP and MUSA BF16 (#0)
2025-04-07  Georgi Gerganov  sync : ggml
2025-04-07  Georgi Gerganov  ggml : simplify Arm fp16 CPU logic (ggml/1177)
2025-04-07  Sigbjørn Skjæret  CUDA: don't convert BF16 weights to FP32 (ggml/1174)
2025-04-07  cmdr2  cpu: move all the operators into a separate c++ file...
2025-04-07  zhouwg  sycl: remove redundant memcopy in function ggml_backend...
2025-04-07  Xuan-Son Nguyen  ci : no curl on ggml-ci (#12796)
2025-04-07  Xuan-Son Nguyen  cmake : enable curl by default (#12761)
2025-04-07  zhouwg  CANN: fix typo in ggml-cann (#12733)
2025-04-07  hipudding  CANN: Refactor to reduce duplicate code (#12731)
2025-04-06  R0CKSTAR  musa: fix compilation warnings in mp_22/31 (#12780)
2025-04-06  Jeff Bolz  vulkan: fix NaN issue in flash attention shader (#12776)
2025-04-06  Jeff Bolz  vulkan: Use unclamped loads for flash attention mask...
2025-04-05  0cc4m  Vulkan: Tune Vulkan mmq int dot shader for performance...
2025-04-05  Sergey Fedorov  common : fix includes in arg.cpp and gemma3-cli.cpp...
2025-04-05  Xuan-Son Nguyen  clip : refactor clip_init, add tests (#12757)
2025-04-05  エシュナヴァリシア  common: custom hf endpoint support (#12769)
2025-04-04  Olivier Chafik  sync: minja (#12739)
2025-04-04  Georgi Gerganov  kv-cache : simplify + fix warning for recurrent models...
2025-04-04  bandoti  ci: add Linux cross-compile build (#12428)
2025-04-04  Nauful Shaikh  server : webui : Upgrade daisyui, tailwindcss. (#12735)
2025-04-04  nick huang  gguf-split : --merge now respects --dry-run option...
2025-04-04  Nicolò Scipione  sycl: allow ggml-sycl configuration and compilation...
2025-04-04  Ronny Brendel  cmake: fix ggml-shaders-gen compiler paths containing...
2025-04-04  Daniel Bevenius  docs : add XCFramework section to README.md [no ci...
2025-04-04  Jeff Bolz  vulkan: Hybrid waitForFences/getFenceStatus to reduce...
2025-04-04  Jeff Bolz  vulkan: set cmake minimum and project name in vulkan...
2025-04-04  lhez  opencl: update doc for OpenCL (#12702)
2025-04-03  Gaurav Garg  CUDA: Prefer vector flash decoding kernel for Gemma...
2025-04-03  yumeyao  vocab : use string_view::find() to avoid unnecessary...
2025-04-03  Jeff Bolz  vulkan: Fix missing cmake logic for dot product extensi...
2025-04-03  Atharva Dubey  ci : add env variable in ggml-ci and document the same...
2025-04-03  R0CKSTAR  sync : minja (inclusionAI/Ling) and update tests (...
2025-04-03  a3sh  fix MUSA compiler warning (#12704)
2025-04-03  Chenguang Li  CANN: Support operator SIN COS ARGMAX (#12709)
2025-04-03  Alan Gray  Simplify and improve CUDA graphs through use of indirec...
2025-04-03  hipudding  CANN: Fix failed test cases (#12708)
2025-04-03  lhez  opencl: use `max_alloc_size` in backend ctx instead...
2025-04-02  Jeff Bolz  vulkan: Implement split_k for coopmat2 flash attention...
2025-04-02  bandoti  cmake: remove caching from vulkan coopmat checks (...
2025-04-02  Jeff Bolz  vulkan: Implement grouped query attention in the coopma...
2025-04-02  0cc4m  Vulkan: Fix mmq int dot float cache size (#12722)
2025-04-02  Georgi Gerganov  model : print tensor size during load (#12711)
2025-04-02  Diego Devesa  llama : add option to override model tensor buffers...  upstream/0.0.5028
2025-04-02  Georgi Gerganov  llama : refactor kv cache guard (#12695)
2025-04-02  Sigbjørn Skjæret  vocab : BailingMoE : change possessive quantifiers...
2025-04-02  Xuan-Son Nguyen  common : remove json.hpp from common.cpp (#12697)
2025-04-02  Chenguang Li  [CANN] get_rows and dup optimization (#12671)
2025-04-01  Xuan-Son Nguyen  common : refactor downloading system, handle mmproj...
2025-04-01  Junil Kim  opencl : fix memory allocation size (#12649)
2025-04-01  jklincn  llama : use LLM_KV_GENERAL_FILE_TYPE instead of gguf_fi...
2025-04-01  Sigbjørn Skjæret  convert : BailingMoE : fix qkv split when head_dim...
2025-04-01  Georgi Gerganov  metal : use F32 prec in FA kernels (#12688)
2025-04-01  R0CKSTAR  Fix clang warning in gguf_check_reserved_keys (#12686)
2025-04-01  Wagner Bruna  vulkan: fix build when glslc doesn't support coopmat...
2025-04-01  Romain Biessy  SYCL: Rename oneMKL to oneMath (#12192)
2025-04-01  Akarshan Biswas  SYCL: switch to SYCL namespace (#12674)
2025-03-31  Sigbjørn Skjæret  convert : BailingMoE : avoid setting rope_dim to 0...
2025-03-31  Daniel Bevenius  vocab : add special infill tokens for CodeLlama (#11850)
2025-03-31  a3sh  ggml : faster ssm scan (#10558)
2025-03-31  Sigbjørn Skjæret  convert : Qwerky : use lora_rank_tokenshift and lora_ra...
2025-03-31  0cc4m  Vulkan: Add DP4A MMQ and Q8_1 quantization shader ...
2025-03-31  Georgi Gerganov  cmake : fix whitespace (#0)
2025-03-31  Georgi Gerganov  sync : ggml
2025-03-31  Sandro Hanea  cmake: improve Vulkan cooperative matrix support checks...
2025-03-31  Sigbjørn Skjæret  llava : proper description fix (#12668)
2025-03-31  Akarshan Biswas  SYCL: Remove misleading ggml_sycl_op_flatten function...
2025-03-31  Sigbjørn Skjæret  llava : fix clip loading GGUFs with missing description...
2025-03-31  marcoStocchi  tts : remove printfs (#12640)
2025-03-30  Sigbjørn Skjæret  llama : support BailingMoE (Ling) (#12634)
2025-03-30  Georgi Gerganov  metal : use constexpr in FA kernels + fix typedef ...
2025-03-30  Juyoung Suk  llama : add Trillion 7B model support (#12556)
2025-03-30  Sergei Vorobyov  llama-chat : Add Yandex instruct model template support...
2025-03-30  R0CKSTAR  musa: fix all warnings, re-enable `-DLLAMA_FATAL_WARNIN...
2025-03-30  Georgi Gerganov  sync : ggml
2025-03-30  Xuan-Son Nguyen  cpu : rm unused variable (ggml/1166)
2025-03-30  cmdr2  cpu: de-duplicate some of the operators and refactor...
2025-03-30  Daniel Bevenius  ggml : add logging for native build options/vars (whisp...
2025-03-30  Daniel Bevenius  examples : command.wasm updates (whisper/2904)
2025-03-29  Xuan-Son Nguyen  llama : fix non-causal mask for gemma 3 (#12615)
2025-03-29  Djip007  llama : change cpu_buft_list order: ACCEL -> GPU host...
2025-03-29  Jay  cmake : fix ccache conflict (#12522)
2025-03-29  hipudding  CANN : remove clang-format in ggml-cann (#12607)
2025-03-28  Sigbjørn Skjæret  llama : fix incorrect Qwen2Moe ffn_moe_out graph callba...