]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-09-18 Georgi Gerganovmetal : use function constants for mul_mv_ext kernels...
2025-09-18 Sigbjørn Skjæretcuda : add missing F32<->I32 entries in ggml_cuda_cpy_f...
2025-09-18 Radoslav Gerganovserver : include usage statistics only when user reques...
2025-09-18 Georgi Gerganovllama : bump max seq limit from 64 to 256 (#15916)
2025-09-18 Georgi Gerganovmetal : improve F32, F16 and BF16 mat-vec multiplicatio...
2025-09-18 Jhen-Jie Hongmetal : avoid call free for non-owned buffer (#16067)
2025-09-18 Georgi Gerganovmetal : handle nil cv during pipeline creation (#16065)
2025-09-18 Chenguang LiCANN: Remove print (#16044)
2025-09-17 Reese LevineGGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS...
2025-09-17 Georgi Gerganovmetal : refactor + optimize v2 (#15995)
2025-09-17 Aleksander... SvelteKit-based WebUI (#14839)
2025-09-17 Xuan-Son Nguyenconvert : add Llama4ForCausalLM (#16042)
2025-09-17 Johannes GäßlerCUDA: fix FA occupancy, optimize tile kernel (#15982)
2025-09-17 David Ribeiro... common : Fix corrupted memory error on json grammar...
2025-09-17 Evevulkan: automatically remove unsupported devices (...
2025-09-17 Daniel Beveniusci : revert back to macos-13 for macOS-latest-cmake...
2025-09-17 Jie Fu (傅杰)llama-quant : fix the verification of attention layers...
2025-09-17 Jie Fu (傅杰)examples : support encoder-decoder models in the simple...
2025-09-17 Shane Amodel : add OLMo3 support (#16015)
2025-09-17 Chenguang LiCANN: Optimize ggml_cann_set_device (#15935)
2025-09-16 jacekpoplawskillama-bench: add --n-cpu-moe support (#15952)
2025-09-16 Daniel Beveniusci : use macos-latest for arm64 webgpu build (#16029)
2025-09-16 Daniel Beveniusggml : fix padding in timestep embedding kernels (...
2025-09-16 Daniel Beveniusci : upload xcframework artifact from ios-xcode-build...
2025-09-16 Bowen Hanfix: apply clang-format to CUDA macros (#16017)
2025-09-16 Daniel Beveniusci : update macos-latest* jobs to use macos-latest...
2025-09-16 Yuri Khrustalevcmake : Do not install tools on iOS targets (#15903)
2025-09-16 Aman GuptaAdd LLaDA-7b-MoE diffusion model (#16003)
2025-09-15 Jake KarnesCUDA: fix im2col_3d to respect non-contiguous inputs...
2025-09-15 Diego Devesadocker : enable rocWMMA in ROCm images, add gfx1151...
2025-09-15 Diego Devesareleases : switch to rocWMMA develop branch, add gfx115...
2025-09-15 yael-worksSYCL: Add COUNT_EQUAL operator support (#15991)
2025-09-15 Nikolay Popovllama-run: Fix model download on Windows (#15988)
2025-09-15 Aman GuptaCUDA: some micro-optimizations in mmf.cuh for mul_mat_i...
2025-09-15 ddh0fix KLD percentile output (#15999)
2025-09-14 Sigbjørn Skjæretmodel : add grok-2 support (#15539)
2025-09-14 Sigbjørn Skjæretserver : only attempt to enable thinking if using jinja...
2025-09-14 Georgi Gerganovmetal : remove memory pools (#15966)
2025-09-14 Adamrocm.Dockerfile: added gfx1200,gfx1201 architectures...
2025-09-14 Ruben OrtlamVulkan: Clean up mul_mm shader (#15987)
2025-09-14 lcybuild: fix the build failures of Windows HIP release...
2025-09-14 Georgi Gerganovmetal : fix kernel requirements (#15983)
2025-09-14 Radoslav Gerganovrpc : fix regression when --device is used (#15981)
2025-09-14 Diego Devesareleases : update ROCM, add gfx1200, gfx1201, gfx1151...
2025-09-14 Radoslav Gerganovdoc : update documentation for --tensor-split (#15980)
2025-09-14 Aaron Teoggml-zdnn: rm user mapped buffers (#15965)
2025-09-13 Jeff Bolzvulkan: fix failing dequant shaders (#15862)
2025-09-13 Jeff Bolzvulkan: initialize vulkan-hpp to allow using extension...
2025-09-13 Diego Devesallama : allow using iGPUs with --device (#15951)
2025-09-13 Georgi Gerganovmetal : refactor kernel loading (#15964)
2025-09-13 Georgi Gerganovmetal : allow ops to run concurrently (#15929)
2025-09-13 Georgi Gerganovmetal : fix memory leaks (#15962)
2025-09-12 Aaron Teoggml-zdnn: fix #15414, activate FP16 and BF16 accelerat...
2025-09-12 Eric CurtinAdd docker protocol support for llama-server model...
2025-09-12 Haiyue Wangcontext : remove redundant explicit casting to the...
2025-09-12 Georgi Gerganovserver : adjust prompt similarity thold + add logs...
2025-09-12 Ruben OrtlamVulkan iGPU device selection overhaul and PCI ID API...
2025-09-12 Mathieu Baudiervulkan: Make device memory check more portable (#15939)
2025-09-12 Neo Zhang JianyuRevert "sycl: add usage of enqueue_functions extension...
2025-09-11 Diego Devesaggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device...
2025-09-11 Johannes GäßlerCUDA: larger SRAM reads for tile FA, AMD FP16 dot ...
2025-09-11 ddh0nitpick : correct MB to MiB (#15934)
2025-09-11 Daniel Beveniusggml-cpu : add check for ARM MATMUL_INT8/i8mm support...
2025-09-11 Charles Xukleidiai: fix GGML_ASSERT(*cur_backend_id != -1) failed...
2025-09-11 hipuddingCANN: Disable acl_graph for prefill stage (#15933)
2025-09-10 Oliver SimonsCUDA: Add `fastdiv` to `k_bin_bcast*`, giving 1-3%...
2025-09-10 Jie Fu (傅杰)llama : support T5 models with unequal number of encode...
2025-09-10 Sigbjørn Skjæretgraph : support non-contiguous Q in build_attn_mha...
2025-09-10 Daniel Beveniusggml-cpu : fix padding in ggml_timestep_embedding ...
2025-09-10 Georgi Gerganovmetal : make the backend async (#15906)
2025-09-10 Daniel Beveniusci : add caching for ROCm installation in release workf...
2025-09-10 Daniel Beveniustests : filter out no-ops from coverage report (#15900)
2025-09-10 j-kmedia : add transparent icon svg and png [no ci] (...
2025-09-10 Jessegitignore : Ignore vim swap files in tests (#15901)
2025-09-10 Chenguang LiCANN: Add ROPE sin/cos cache for reuse (#15912)
2025-09-10 Chenguang LiCANN: implement LRU cache for ACL graphs (#15814)
2025-09-10 Daniel Beveniusllama : check returned fn ptrs from ggml_backend_reg_ge...
2025-09-10 Daniel Beveniusci : cache ROCm installation in windows-latest-cmake...
2025-09-09 Ruben Ortlamvulkan: throw the oom error instead of no memory type...
2025-09-09 Jeff Bolzvulkan: Fix OOB accesses in soft_max_back (#15861)
2025-09-09 Johannes GäßlerHIP: use v_dot2_f32_f16 instruction for FA (#15884)
2025-09-09 lksj92hsWorkaround for subgroup arithmetic failing on MoltenVK...
2025-09-09 Aman GuptaCUDA: Add mul_mat_id support for the mmf kernel (#15767)
2025-09-09 Johannes GäßlerCUDA: fix GET_ROWS for large tensors (#15882)
2025-09-09 Georgi Gerganovcontrib : add notes about merging PRs (#15881)
2025-09-09 Daniel Beveniusrequirements : update transformers/torch for Embedding...
2025-09-09 Piotr Wilkin... model-conversion : add extra debugging support for...
2025-09-08 Aldehir Rojasjson : support `enum` values within `allOf` (#15830)
2025-09-08 j-kmedia : add llama1 icon (#15878)
2025-09-08 Jeff Bolzvulkan: sort graph to allow more parallel execution...
2025-09-08 Aman GuptaCUDA: generate_cu_files.py - add missing mxfp4 (#15880)
2025-09-08 Jessechat : Deepseek V3.1 reasoning and tool calling support...
2025-09-08 Xuan-Son Nguyenserver : bring back timings_per_token (#15879)
2025-09-08 Georgi Gerganovcuda : fix supports_op condition for get_rows when...
2025-09-08 Georgi Gerganovmetal : refactor + optimize (#15857)
2025-09-08 Xuan-Son Nguyenggml: allow casting between f32 and i32 (#15783)
2025-09-08 Sigbjørn SkjæretCUDA: non-contiguous src0 not supported for PAD (#15869)
2025-09-08 Daniel Beveniusconvert : force setting sliding_window from original...
2025-09-08 Georgi Gerganovbatched-bench : fix llama_synchronize usage during...
2025-09-08 Georgi Gerganovcontext : fix n_outputs during reserve (#15858)
next