git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-07-13  Ed Addario  quantize : fix minor logic flaw in --tensor-type (...
2025-07-13  Sigbjørn Skjæret  cuda : add set rows for bf16 (#14664)
2025-07-13  Yavor Ivanov  cuda : add ELU support (#14657)
2025-07-13  Georgi Gerganov  ggml : add build-time message to remind about ggml_set_...
2025-07-13  Yavor Ivanov  metal : Add missing unary ops Metal support (#14660)
2025-07-13  Yavor Ivanov  cmake : Add CMake presets for Linux and GCC (#14656)
2025-07-12  Tarek Dakhran  tests : cover lfm2 cases in test_ssm_conv (#14651)
2025-07-12  Tarek Dakhran  docs : add LFM2 to models section (#14650)
2025-07-12  Aman Gupta  CUDA: add set rows for f32 and f16 (#14551) upstream/0.0.5882
2025-07-12  Georgi Gerganov  sync : ggml
2025-07-12  Georgi Gerganov  vulkan : remove unused vars (#0)
2025-07-12  Georgi Gerganov  sync : ggml
2025-07-12  Acly  vulkan : implement bilinear interpolation (ggml/1291)
2025-07-12  Acly  vulkan : implement ggml_roll (ggml/1290)
2025-07-12  Douglas Hanley  server : fix pooled embedding output (#14645)
2025-07-12  Jeff Bolz  vulkan: support SET_ROWS (#14587)
2025-07-12  Jeff Bolz  vulkan: optimizations for deepseek prompt processing...
2025-07-11  Tarek Dakhran  model : support LiquidAI LFM2 hybrid family (#14620)
2025-07-11  Slobodan Josic  HIP : Add HIP 7.0+ compatibility for hipBLAS compute...
2025-07-11  Georgi Gerganov  readme : add hot PRs (#14636)
2025-07-11  Georgi Gerganov  llama : move enum llama_vocab_pre_type to implementatio...
2025-07-11  Dowon  vocab : add midm-2.0 model pre-tokenizer (#14626)
2025-07-11  Gabe Goodhart  model : Granite Four (#13550)
2025-07-10  rmatif  opencl: add tiled mul_mat_f16_f32 (#14535)
2025-07-10  lhez  opencl: add `set_rows` for `f16` and `f32` (#14547)
2025-07-10  Ryan Mangeno  Smoldocling support (#14597)
2025-07-10  Aman Gupta  Docs: script to auto-generate ggml operations docs...
2025-07-10  Eric Zhang  cmake : do not search for curl libraries by ourselves...
2025-07-10  Akarshan Biswas  SYCL: Initial set_rows kernel implementation (#14562)
2025-07-10  Xuan-Son Nguyen  llama : minor coding style fix for smollm3 (#14605)
2025-07-10  Eric Zhang  cmake : bump llguidance version to v1.0.1 (#14609)
2025-07-10  Eric Zhang  cmake : llguidance build parser library only (#14608)
2025-07-10  compilade  cuda : support Falcon-H1 state size for SSM_SCAN (...
2025-07-09  Xuan-Son Nguyen  llama : remove llm_graph_input_one (#14603)
2025-07-09  compilade  llama : support Jamba hybrid Transformer-Mamba models...
2025-07-09  Xuan-Son Nguyen  ggml : add ggml_scale_bias (#14417)
2025-07-09  Miaoqian Lin  ggml : prevent integer overflow in gguf tensor size...
2025-07-09  Dowon  model : add skt/A.X-4.0 model vocabulary (#14589)
2025-07-09  Sigbjørn Skjæret  llama : remove unintended whitespace (#14592)
2025-07-09  ibrahim khadraoui  model : add support for Falcon-H1 family (#14534)
2025-07-09  Xuan-Son Nguyen  convert : fix smollm3 jinja template (#14586)
2025-07-08  Jeff Bolz  vulkan: optimize flash attention split_k_reduce (#14554)
2025-07-08  stevenkuang  model : fix hunyuan moe chat template (#14584)
2025-07-08  Xuan-Son Nguyen  model : add SmolLM3 (#14581)
2025-07-08  compilade  memory : fix broken batch splits for recurrent cache...
2025-07-08  Jeff Bolz  vulkan : fix rope with partial rotation and non-cont...
2025-07-08  Alawode Oluwandabira  server: Add ability to mount server at prefix (#14544)
2025-07-08  Xuan-Son Nguyen  model : add hunyuan moe (#14425)
2025-07-08  Jeff Bolz  vulkan: increase timeout for CI (#14574)
2025-07-08  Georgi Gerganov  cuda : fix rope with partial rotation and non-cont...
2025-07-08  Aman Gupta  CUDA: add bilinear interpolation for upscale (#14563)
2025-07-07  R0CKSTAR  musa: fix build warnings (unused variable) (#14561)
2025-07-07  Sigbjørn Skjæret  llama : fix incorrect minicpm3 v_states shape (#14571)
2025-07-07  Sigbjørn Skjæret  llama : remove ggml_cont where possible (#14568)
2025-07-07  Aman Gupta  CUDA: add bf16 and i32 to getrows (#14529)
2025-07-06  Eve  vulkan: increase LOAD_VEC_A to 8 (IQ1/IQ2) or 4 (IQ3...
2025-07-06  Jeff Bolz  vulkan: fix rms_norm+mul fusion (#14545)
2025-07-05  Jeff Bolz  vulkan: Handle updated FA dim2/3 definition (#14518)
2025-07-05  Sigbjørn Skjæret  server : fix assistant prefilling when content is an...
2025-07-05  Sigbjørn Skjæret  opencl: add GELU_ERF (#14476)
2025-07-05  Georgi Gerganov  eval-callback : check for empty input (#14539)
2025-07-05  R0CKSTAR  test-backend-ops: add support for specifying output...
2025-07-04  Georgi Gerganov  metal : disable fast math in all quantize kernels ...
2025-07-04  Georgi Gerganov  batch : add optional for sequential equal split (#14511)
2025-07-04  Georgi Gerganov  graph : prepare for 4D mask (#14515)
2025-07-04  Georgi Gerganov  batch : add n_used count (#14512)
2025-07-04  luyhcsu  CANN: Replace aclrtMemsetSync with aclnnInplaceZero...
2025-07-03  Sigbjørn Skjæret  ggml : implement GEGLU_ERF and GEGLU_QUICK ops (#14445)
2025-07-03  lhez  opencl : broadcast for soft_max (#14510)
2025-07-03  Jeff Bolz  vulkan: support mixed/deepseekR1 FA head sizes (#14509)
2025-07-03  Johannes Gäßler  ggml: backward pass for split swiglu (#14483)
2025-07-03  Nicolò Scipione  Fix conditional enabling following arch checks for...
2025-07-03  Xuan-Son Nguyen  convert : correct gemma 3n conversion (#14450)
2025-07-03  Georgi Gerganov  kv-cache : use ggml_set_rows (#14285)
2025-07-03  Georgi Gerganov  ggml : fix FA mask dim 2 and 3 (#14505)
2025-07-03  Georgi Gerganov  ggml : remove kompute backend (#14501)
2025-07-02  Aman Gupta  CUDA: add dynamic shared mem to softmax, refactor gener...
2025-07-02  Sigbjørn Skjæret  gguf-py : add support for chat template jinja files...
2025-07-02  compilade  llama : initial Mamba-2 support (#9126)
2025-07-02  Georgi Gerganov  sync : ggml
2025-07-02  Daniel Bevenius  ggml : add version function to get lib version (ggml...
2025-07-02  Rotem Dan  Set RPATH to "@loader_path" / "$ORIGIN" to ensure execu...
2025-07-02  Aman Gupta  CUDA: add softmax broadcast (#14475)
2025-07-02  Johannes Gäßler  CUDA: broadcasting for FlashAttention mask (#14500)
2025-07-02  Jeff Bolz  vulkan: support softmax/FA batch and broadcast (#14449)
2025-07-02  Georgi Gerganov  ggml : support bcast ggml_soft_max_ext, ggml_flash_attn...
2025-07-02  zhouwg  opencl : fix possible buffer overflow in dump_tensor...
2025-07-02  Georgi Gerganov  simple-chat : fix context-exceeded condition (#14494)
2025-07-02  Eric Zhang  opencl : skip empty nodes on cgraph compute (#14491)
2025-07-02  lhez  opencl : update upscale to support align corners (...
2025-07-02  Sigbjørn Skjæret  ci : add OpenCL to labeler workflow (#14496)
2025-07-02  Eric Zhang  github : add OpenCL backend to issue templates (#14492)
2025-07-02  Björn Ganster  ggml : Callback before abort (#14481)
2025-07-01  Georgi Gerganov  ci : disable fast-math for Metal GHA CI (#14478)
2025-07-01  Grzegorz Grasza  Add Vulkan images to docker.md (#14472)
2025-07-01  Chenguang Li  CANN: update aclnnGroupedMatmulV2 to aclnnGroupedMatmul...
2025-07-01  Jeff Bolz  vulkan: Split large mul_mat_id to fit in shared memory...
2025-07-01  Sigbjørn Skjæret  add GELU_ERF (#14455)
2025-07-01  Georgi Gerganov  ggml : remove trailing whitespace (#0)
2025-07-01  Georgi Gerganov  sync : ggml