git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-07-19  compilade  imatrix : use GGUF to store importance matrices (#9400)
2025-07-19  Peter0x44  vulkan: Add logging for bf16 features to ggml_vk_print_...
2025-07-19  0cc4m  Vulkan: Fix fprintf format-security warning (#14770)
2025-07-19  rspOverflow  Documentation: Update build.md's Vulkan section (#14736)
2025-07-19  Georgi Gerganov  sync : ggml
2025-07-18  Georgi Gerganov  metal : fuse add, mul + add tests (#14596)
2025-07-18  Georgi Gerganov  graph : fix graph reuse reset of params (#14760)
2025-07-18  Georgi Gerganov  parallel : add option for different RNG seeds (#14757)
2025-07-18  Oliver Simons  cuda : Fix Gemma3n not executed as CUDA_GRAPH on NVGPUs...
2025-07-18  Georgi Gerganov  graph : avoid huge warm-up graphs for MoE models (...
2025-07-18  Georgi Gerganov  model : fix build after merge conflict (#14754)
2025-07-18  lgai-exaone  model : add EXAONE 4.0 support (#14630)
2025-07-18  Aman Gupta  CUDA: set_rows + cpy.cu refactor (#14712)
2025-07-18  Georgi Gerganov  graph : refactor context to not pass gf explicitly...
2025-07-18  Nexes the Elder  graph : Pass the graph placeholder message in debug...
2025-07-18  Neo Zhang Jianyu  use max work group size for device to replace the magic...
2025-07-17  Piotr Wilkin...  convert : fix Ernie4.5 MoE without shared experts ...
2025-07-17  Wroclaw  nix : use optionalAttrs for env mkDerivation attrset...
2025-07-17  Piotr Wilkin...  model: add Ernie 4.5 MoE support (#14658)
2025-07-17  Georgi Gerganov  kv-cache : fix k-shift for multiple streams (#14742)
2025-07-17  Georgi Gerganov  llama : reuse compute graphs (#14482)
2025-07-17  Tarek Dakhran  llama : fix parallel processing for lfm2 (#14705)
2025-07-17  Georgi Gerganov  kv-cache : opt mask set input (#14600)
2025-07-17  Georgi Gerganov  batch : fix uninitialized has_cpl flag (#14733)
2025-07-16  Sigbjørn Skjæret  ci : disable failing vulkan crossbuilds (#14723)
2025-07-16  Sigbjørn Skjæret  convert : make hf token optional (#14717)
2025-07-16  Diner Burger  llama : fix parameter order for hybrid memory initializ...
2025-07-16  Reese Levine  ggml: Add initial WebGPU backend (#14521)
2025-07-16  tempstudio  model : support output bias for qwen2 (#14711)
2025-07-16  Georgi Gerganov  llama : add high-throughput mode (#14363)
2025-07-16  Aman Gupta  Support diffusion models: Add Dream 7B (#14644)
2025-07-16  Georgi Gerganov  ggml : add asserts (#14720)
2025-07-16  Georgi Gerganov  server : pre-calculate EOG logit biases (#14721)
2025-07-16  Shunta Saito  llama : fix parallel processing for plamo2 (#14716)
2025-07-16  Georgi Gerganov  server : fix handling of the ignore_eos flag (#14710)
2025-07-16  Johannes Gäßler  scripts: synthetic prompt mode for server-bench.py...
2025-07-16  Sigbjørn Skjæret  convert : only check for tokenizer folder if we need...
2025-07-16  Sigbjørn Skjæret  convert : add pre-computed hashes first to prevent...
2025-07-16  Min-Hua  llama: add LLAMA_API to deprecated llama_kv_self_seq_di...
2025-07-15  Ed Addario  gguf-py : dump bpw per layer and model in markdown...
2025-07-15  Gabriel Larson  model : add Kimi-K2 support (#14654)
2025-07-15  Jeff Bolz  vulkan: fix noncontig check for mat_mul_id splitting...
2025-07-15  Jeff Bolz  vulkan: add RTE variants for glu/add/sub/mul/div (...
2025-07-15  Shunta Saito  model : add PLaMo-2 support (#14560)
2025-07-15  R0CKSTAR  cuda: fix build warnings in set-rows.cu (unused variabl...
2025-07-14  Anton Mitkov  sycl: Hotfix for non dnnl codepath (#14677)
2025-07-14  shalinib-ibm  ggml : refactor llamafile_sgemm PPC code (#14673)
2025-07-14  Aman Gupta  llama-context: add ability to get logits (#14672)
2025-07-14  Johannes Gäßler  scripts: benchmark for HTTP server throughput (#14668)
2025-07-14  Akarshan Biswas  SYCL: use 1D kernel for set_rows (#14618)
2025-07-14  Anton Mitkov  sycl: Batched mulmat rework for oneDNN dispatch (#14617)
2025-07-13  Molly Sophia  llama : add jinja template for rwkv-world (#14665)
2025-07-13  Ed Addario  quantize : fix minor logic flaw in --tensor-type (...
2025-07-13  Sigbjørn Skjæret  cuda : add set rows for bf16 (#14664)
2025-07-13  Yavor Ivanov  cuda : add ELU support (#14657)
2025-07-13  Georgi Gerganov  ggml : add build-time message to remind about ggml_set_...
2025-07-13  Yavor Ivanov  metal : Add missing unary ops Metal support (#14660)
2025-07-13  Yavor Ivanov  cmake : Add CMake presets for Linux and GCC (#14656)
2025-07-12  Tarek Dakhran  tests : cover lfm2 cases in test_ssm_conv (#14651)
2025-07-12  Tarek Dakhran  docs : add LFM2 to models section (#14650)
2025-07-12  Aman Gupta  CUDA: add set rows for f32 and f16 (#14551)  upstream/0.0.5882
2025-07-12  Georgi Gerganov  sync : ggml
2025-07-12  Georgi Gerganov  vulkan : remove unused vars (#0)
2025-07-12  Georgi Gerganov  sync : ggml
2025-07-12  Acly  vulkan : implement bilinear interpolation (ggml/1291)
2025-07-12  Acly  vulkan : implement ggml_roll (ggml/1290)
2025-07-12  Douglas Hanley  server : fix pooled embedding output (#14645)
2025-07-12  Jeff Bolz  vulkan: support SET_ROWS (#14587)
2025-07-12  Jeff Bolz  vulkan: optimizations for deepseek prompt processing...
2025-07-11  Tarek Dakhran  model : support LiquidAI LFM2 hybrid family (#14620)
2025-07-11  Slobodan Josic  HIP : Add HIP 7.0+ compatibility for hipBLAS compute...
2025-07-11  Georgi Gerganov  readme : add hot PRs (#14636)
2025-07-11  Georgi Gerganov  llama : move enum llama_vocab_pre_type to implementatio...
2025-07-11  Dowon  vocab : add midm-2.0 model pre-tokenizer (#14626)
2025-07-11  Gabe Goodhart  model : Granite Four (#13550)
2025-07-10  rmatif  opencl: add tiled mul_mat_f16_f32 (#14535)
2025-07-10  lhez  opencl: add `set_rows` for `f16` and `f32` (#14547)
2025-07-10  Ryan Mangeno  Smoldocling support (#14597)
2025-07-10  Aman Gupta  Docs: script to auto-generate ggml operations docs...
2025-07-10  Eric Zhang  cmake : do not search for curl libraries by ourselves...
2025-07-10  Akarshan Biswas  SYCL: Initial set_rows kernel implementation (#14562)
2025-07-10  Xuan-Son Nguyen  llama : minor coding style fix for smollm3 (#14605)
2025-07-10  Eric Zhang  cmake : bump llguidance version to v1.0.1 (#14609)
2025-07-10  Eric Zhang  cmake : llguidance build parser library only (#14608)
2025-07-10  compilade  cuda : support Falcon-H1 state size for SSM_SCAN (...
2025-07-09  Xuan-Son Nguyen  llama : remove llm_graph_input_one (#14603)
2025-07-09  compilade  llama : support Jamba hybrid Transformer-Mamba models...
2025-07-09  Xuan-Son Nguyen  ggml : add ggml_scale_bias (#14417)
2025-07-09  Miaoqian Lin  ggml : prevent integer overflow in gguf tensor size...
2025-07-09  Dowon  model : add skt/A.X-4.0 model vocabulary (#14589)
2025-07-09  Sigbjørn Skjæret  llama : remove unintended whitespace (#14592)
2025-07-09  ibrahim khadraoui  model : add support for Falcon-H1 family (#14534)
2025-07-09  Xuan-Son Nguyen  convert : fix smollm3 jinja template (#14586)
2025-07-08  Jeff Bolz  vulkan: optimize flash attention split_k_reduce (#14554)
2025-07-08  stevenkuang  model : fix hunyuan moe chat template (#14584)
2025-07-08  Xuan-Son Nguyen  model : add SmolLM3 (#14581)
2025-07-08  compilade  memory : fix broken batch splits for recurrent cache...
2025-07-08  Jeff Bolz  vulkan : fix rope with partial rotation and non-cont...
2025-07-08  Alawode Oluwandabira  server: Add ability to mount server at prefix (#14544)
2025-07-08  Xuan-Son Nguyen  model : add hunyuan moe (#14425)