git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-02-05  Georgi Gerganov  metal : adjust support conditions for norm operators...
2025-02-05  Johannes Gäßler  CUDA: support for mat. mul. with ne03 != ne13 (#11656)
2025-02-05  SAMI  llava: add quantization for the visual projector LLAVA...
2025-02-05  Olivier Chafik  `sync`: minja (#11641)
2025-02-04  Johannes Gäßler  CUDA: non-contiguous (RMS) norm support (#11659)
2025-02-04  fxzjshm  HIP: force max threads per block to be 1024 (#11621)
2025-02-04  Xuan-Son Nguyen  server : add try..catch to places not covered by set_ex...
2025-02-04  Radoslav Gerganov  arg : list RPC devices first when using --list-devices...
2025-02-04  Olivier Chafik  `tool-call`: command r7b fix for normal responses ...
2025-02-04  Shelby Jenkins  readme : add llm_client Rust crate to readme bindings...
2025-02-04  Jhen-Jie Hong  swift : fix llama-vocab api usage (#11645)
2025-02-04  Jhen-Jie Hong  metal : use residency set for other platforms (#11648)
2025-02-04  Georgi Gerganov  authors : update
2025-02-04  Georgi Gerganov  sync : ggml upstream/0.0.4631
2025-02-04  Christian Kastner  cmake: Add ability to pass in GGML_BUILD_NUMBER (ggml...
2025-02-04  Georgi Gerganov  ci : do not stale-close roadmap issues
2025-02-03  Olivier Chafik  `tool-call`: allow `--chat-template chatml` w/ `--jinja...
2025-02-03  Xuan-Son Nguyen  server : (webui) revert hacky solution from #11626...
2025-02-03  Woof Dog  server : (webui) allow typing and submitting during...
2025-02-03  Daniel Bevenius  server : remove CPPHTTPLIB_NO_EXCEPTIONS define (#11622)
2025-02-03  Georgi Gerganov  sync : ggml
2025-02-03  Johannes Gäßler  CUDA: fix Volta FlashAttention logic (#11615)
2025-02-03  mashdragon  server : (webui) Fix Shift+Enter handling (#11609)
2025-02-02  Johannes Gäßler  HIP: fix flash_attn_stream_k_fixup warning (#11604)
2025-02-02  uvos  CUDA/HIP: add support for selectable warp size to mmv...
2025-02-02  uvos  HIP: add GGML_CUDA_CC_IS_* for amd familys as increasin...
2025-02-02  Olivier Chafik  nit: more informative crash when grammar sampler fails...
2025-02-02  Johannes Gäßler  CUDA: use mma PTX instructions for FlashAttention ...
2025-02-02  Eric Curtin  Name colors (#11573)
2025-02-02  Olivier Chafik  `tool-call`: support Command R7B (+ return tool_plan...
2025-02-02  Olivier Chafik  Fix exotic ci env that lacks ostringstream::str (#11581)
2025-02-02  Michał Moskal  sampling : support for llguidance grammars (#10224)
2025-02-02  piDack  llama : add support for GLM-Edge and GLM-Edge-V series...
2025-02-01  Olivier Chafik  ci: use sccache on windows HIP jobs (#11553)
2025-02-01  Olivier Chafik  `sync`: minja (https://github.com/google/minja/commit...
2025-02-01  Eric Curtin  Implement s3:// protocol (#11511)
2025-02-01  Olivier Chafik  ci: simplify cmake build commands (#11548)
2025-01-31  Olivier Chafik  `ci`: use sccache on windows instead of ccache (#11545)
2025-01-31  Olivier Chafik  `tool-call`: fix llama 3.x and functionary 3.2, play...
2025-01-31  Olivier Chafik  fix stop regression (#11543)
2025-01-31  Olivier Chafik  Fix chatml fallback for unsupported builtin templates...
2025-01-31  Olivier Chafik  server : fix --jinja when there's no tools or schema...
2025-01-31  Steve Grubb  common: Add missing va_end (#11529)
2025-01-31  Daniel Bevenius  server : update help metrics processing/deferred (...
2025-01-30  Olivier Chafik  `ci`: ccache for all github worfklows (#11516)
2025-01-30  Olivier Chafik  Tool call support (generic + native for Llama, Function...
2025-01-30  uvos  HIP: require at least HIP 5.5
2025-01-30  uvos  HIP: Prepare reduction operators for wave 64
2025-01-30  uvos  CUDA/HIP: add warp_size to cuda_device_info
2025-01-30  Olivier Chafik  sync: minja (#11499)
2025-01-30  mgroeber9110  vocab : correctly identify LF token for GPT-2 style...
2025-01-30  Daniel Bevenius  server : use lambda instead of std::bind (#11507)
2025-01-30  Isaac McFadyen  server : (docs) added response format for /apply-templa...
2025-01-30  Guspan Tanadi  readme : reference examples relative links (#11505)
2025-01-30  Daniel Bevenius  server : update json snippets in README.md [no ci]...
2025-01-29  Nigel Bosch  server : add /apply-template endpoint for additional...
2025-01-29  Rémy Oudompheng  vulkan: implement initial support for IQ2 and IQ3 quant...
2025-01-29  Daniel Bevenius  server : update auto gen files comments [no ci] (#11484)
2025-01-29  Jeff Bolz  vulkan: Catch pipeline creation failure and print an...
2025-01-29  Eric Curtin  Parse https://ollama.com/library/ syntax (#11480)
2025-01-29  Georgi Gerganov  sync : ggml
2025-01-29  William Tambellini  ggml : add option to not print stack on abort (ggml...
2025-01-29  issixx  ggml-cpu : fix ggml_graph_compute_thread did not termin...
2025-01-29  Daniel Bevenius  embedding : enable --no-warmup option (#11475)
2025-01-29  Molly Sophia  llama: fix missing k_cache store for rwkv6qwen2 (#11445)
2025-01-28  Emreerdog  cmake: add hints for locating ggml on Windows using...
2025-01-28  peidaqi  server : Fixed wrong function name in llamacpp server...
2025-01-28  Xuan-Son Nguyen  ci : fix build CPU arm64 (#11472)
2025-01-28  uvos  HIP: Supress transformation warning in softmax.cu
2025-01-28  Nikita Sarychev  HIP: Only call rocblas_initialize on rocblas versions...
2025-01-28  Eric Curtin  Add github protocol pulling and http:// (#11465)
2025-01-28  Nuno  docker: allow installing pip packages system-wide ...
2025-01-28  someone13574  cmake : don't fail on `GGML_CPU=OFF` (#11457)
2025-01-28  Nuno  docker: add perplexity and bench commands to full image...
2025-01-28  Akarshan Biswas  SYCL : SOFTMAX F16 mask support and other fixes (#11261)
2025-01-28  Michael Engel  Handle missing model in CLI parameters for llama-run...
2025-01-27  Eric Curtin  Add new hf protocol for ollama (#11449)
2025-01-27  Haus1  AMD: parse the architecture as supplied by gcnArchName...
2025-01-27  lexasub  llama : minor fixes for up llama load model speed ...
2025-01-27  Johannes Gäßler  llama: refactor llama_decode_impl (#11381)
2025-01-27  Ihar Hrachyshka  metal: Handle null returned from MTLCreateSystemDefault...
2025-01-26  Xuan Son Nguyen  docker : fix ARM build and Vulkan build (#11434)
2025-01-26  Georgi Gerganov  metal : use residency sets (#11427)
2025-01-26  Nuno  docker: add missing vulkan library to base layer and...
2025-01-26  bandoti  cmake: add ggml find package (#11369)
2025-01-26  Frank Mai  rpc: fix register position (#11424)
2025-01-26  Georgi Gerganov  readme : update hot topics
2025-01-26  Jeff Bolz  build: apply MSVC /bigobj option to c/cpp files only...
2025-01-25  Jeff Bolz  vulkan: compile shaders on-demand (#11406)
2025-01-25  uvos  Hip: disable VMM on hip as it seams that it dosent...
2025-01-25  Jeff Bolz  build: add /bigobj to MSVC build (#11407)
2025-01-25  Diego Devesa  docker : add GGML_CPU_ARM_ARCH arg to select ARM archit...
2025-01-25  Xuan Son Nguyen  server : fix cleaning up stream task (#11418)
2025-01-25  Diego Devesa  docker : fix CPU ARM build (#11403)
2025-01-25  Georgi Gerganov  ci : fix line breaks on windows builds (#11409)
2025-01-24  jiahao su  CANN: Add Ascend CANN build ci (#10217)
2025-01-24  uvos  hip : Add hipGraph and VMM support to ROCM (#11362)
2025-01-24  Johannes Gäßler  CUDA: fix FP16 cuBLAS GEMM (#11396)
2025-01-24  uvos  rocBLAS: Avoid fp32->fp16->fp32 conversion on cdna...
2025-01-24  Georgi Gerganov  release : pack /lib in the packages (#11392)