git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2024-09-15  Michael Podvitskiy  cmake : try to fix sycl+intel build (#9487)
2024-09-14  Yuri Khrustalev  ggml : ggml_type_name return "NONE" for invalid values...
2024-09-14  VoidIsVoid  server: add data: [DONE] to /chat/completions stream...
2024-09-14  Georgi Gerganov  cmake : use list(APPEND ...) instead of set() + dedup...
2024-09-14  Daniel Bevenius  llama : make cell_id const in inp_s_mask block (#9470)
2024-09-13  Xuan Son Nguyen  server : add loading html page while model is loading...
2024-09-13  Georgi Gerganov  llama : llama_perf + option to disable timings during...
2024-09-13  Gilad S.  feat: remove a sampler from a chain (#9445)
2024-09-12  Mathijs Henquet  server : Add option to return token pieces in /tokenize...
2024-09-12  Dou Xinpeng  cann: Add host buffer type for Ascend NPU (#9406)
2024-09-12  fengerhu1  llava : fix the script error in MobileVLM README (...
2024-09-12  Xuan Son Nguyen  lora : raise error if lm_head is ignored (#9103)
2024-09-12  Michael Podvitskiy  cmake : fix for builds without `GGML_CDEF_PUBLIC` ...
2024-09-12  Huang Qi  ci : update HIP SDK to 24.Q3 (ROCm 6.1) (#9329)
2024-09-12  daminho  py : add Phi-1.5/Phi-2 tokenizer (#9361)
2024-09-12  Trivikram Kamat  ci : bump actions/checkout to v4 (#9377)
2024-09-12  Michael Podvitskiy  cmake : fixed the order of linking libraries for llama...
2024-09-12  Molly Sophia  py : add special tokens in hf_converter for RWKV v6...
2024-09-12  Ahmad Tameem  riscv : modify Makefile and add a RISCV_VECT to print...
2024-09-12  Georgi Gerganov  ggml : hide ggml_object, ggml_cgraph, ggml_hash_set...
2024-09-12  Neo Zhang Jianyu  enhance run script to be easy to change the parameters...
2024-09-12  Xinpeng Dou  cann: Fix error when running a non-exist op (#9424)
2024-09-12  Faisal Zaghloul  Add Jais to list of supported models (#9439)
2024-09-11  slaren  llama : skip token bounds check when evaluating embeddi...
2024-09-11  Pavel Zloi  py : support converting local models (#7547)
2024-09-11  Xuan Son Nguyen  llava : correct args for minicpmv-cli (#9429)
2024-09-11  Xuan Son Nguyen  files : remove accidentally added `lora_test` submodule...
2024-09-11  Farbod Bijary  feat: Implements retrying logic for downloading models...
2024-09-11  Johannes Gäßler  CUDA: fix --split-mode row race condition (#9413)
2024-09-11  Georgi Gerganov  batched-bench : remove unused code (#9305)
2024-09-11  R0CKSTAR  musa: remove Clang builtins mapping (#9421)
2024-09-11  Alberto Cabrera...  sycl : update support conditions (#9394)
2024-09-10  Georgi Gerganov  flake.lock: Update (#9360)
2024-09-10  Xuan Son Nguyen  arg : bring back missing ifdef (#9411)
2024-09-10  matteo  enable --special arg for llama-server (#9419)
2024-09-10  slaren  llama : move random seed generation to the samplers...
2024-09-10  Georgi Gerganov  metal : fix compile warning with GGML_METAL_NDEBUG...
2024-09-10  Daniel Bevenius  llama : update llm_build_copy_mask_state comment [no...
2024-09-10  Molly Sophia  RWKV v6: Add time_mix_decay_w1/w2 in quant exclusion...
2024-09-10  slaren  make : do not run llama-gen-docs when building (#9399)
2024-09-09  Xuan Son Nguyen  common : move arg parser code to `arg.cpp` (#9388)
2024-09-09  Radoslav Gerganov  rpc : fix segfault with nkvo (#9389)
2024-09-09  Prashant Vithule  ggml : vector length agnostic SVE support (#9290)
2024-09-09  slaren  llama : minor sampling refactor (2) (#9386)
2024-09-09  Georgi Gerganov  readme : update hot topics
2024-09-09  Johannes Gäßler  CUDA: fix variable name conflict for Windows build...
2024-09-09  Antonis Makropoulos  readme : add LLMUnity to UI projects (#9381)
2024-09-09  Radoslav Gerganov  rpc : update README [no ci] (#9320)
2024-09-09  Dan Johansson  Arm AArch64: Documentation updates (#9321)
2024-09-08  Markus Tavenrath  Overlap cmdbuffer creation and cmdbuffer execution...
2024-09-08  Georgi Gerganov  cuda : fix FA Q src index (1 -> 0) (#9374)
2024-09-08  Xuan Son Nguyen  common : bring back missing args, add env var duplicati...
2024-09-08  slaren  common : restore --n-gpu-layers (#9371)
2024-09-08  slaren  llama : refactor samplers internal implementation ...
2024-09-08  Neo Zhang Jianyu  [SYCL] add check malloc result on device (#9346)
2024-09-08  slaren  llama : sanitize tokens in the upper bound (#9359)
2024-09-08  Xuan Son Nguyen  imatrix : fix arg parser for imatrix (#9366)
2024-09-08  Georgi Gerganov  metal : update support condition for im2col + fix warni...
2024-09-08  Georgi Gerganov  sync : ggml
2024-09-08  Georgi Gerganov  scripts : option to increase git patch context
2024-09-08  Salvatore Mesoraca  vulkan: add dryrun support to sin and cos ops (ggml...
2024-09-08  Salvatore Mesoraca  vulkan: correctly report support for OP_CONT (ggml...
2024-09-08  Johannes Gäßler  tests: add gradient tests for all backends (ggml/932)
2024-09-08  Johannes Gäßler  ggml: fix ggml_graph_cpy undefined behavior (ggml/943)
2024-09-08  Georgi Gerganov  cann : fix doxy (ggml/0)
2024-09-08  Mengqing Cao  cann : add Ascend NPU support (whisper/2336)
2024-09-08  Georgi Gerganov  cuda : mark BF16 CONT as unsupported
2024-09-08  Salvatore Mesoraca  ggml : fix cont with transposed tensors when one dimens...
2024-09-08  Kevin Gibbons  llama : set attrs of mislabelled EOT/EOM tokens (#9348)
2024-09-07  Georgi Gerganov  llama.android : fix build (#9350)
2024-09-07  Georgi Gerganov  llama : fix empty ring buffer push (#9358)
2024-09-07  Georgi Gerganov  llama : sanitize invalid tokens (#9357)
2024-09-07  Eve  llamafile : disable sgemm for batch-size 1 (#9330)
2024-09-07  Xuan Son Nguyen  common : refactor arg parser (#9308)
2024-09-07  slaren  ggml : always check bounds on get_rows operations ...
2024-09-07  Georgi Gerganov  llama : refactor sampling v2 (#9294)
2024-09-07  Xuan Son Nguyen  ggml : fix missing `cpu_set_t` on emscripten (#9336)
2024-09-07  slaren  ci : disable rocm image creation (#9340)
2024-09-06  Xuan Son Nguyen  server : simplify state machine for slot (#9283)
2024-09-06  Aarni Koskela  llama-bench : log benchmark progress (#9287)
2024-09-06  Aarni Koskela  batched-bench : add `--output-format jsonl` option...
2024-09-06  Changyeon Kim  ggml : fix build break for the vulkan-debug (#9265)
2024-09-06  Xuan Son Nguyen  server : fix missing lock (#9334)
2024-09-06  Markus Tavenrath  Improve Vulkan shader build system (#9239)
2024-09-06  compilade  ggml-quants : ternary packing for TriLMs and BitNet...
2024-09-05  awatuna  Update build.yml (#9184)
2024-09-05  Michael Podvitskiy  CMake fix: host for msvc compiler can only be x86 or...
2024-09-05  slaren  cuda : fix defrag with quantized KV (#9319)
2024-09-05  slaren  llama-bench : fix NUL terminators in CPU name (#9313)
2024-09-04  Srihari-mcw  ggml : AVX2 support for Q4_0_8_8 (#8713)
2024-09-04  Ouadie EL FAROUKI  [SYCL] Fix DMMV dequantization (#9279)
2024-09-04  杨朱 · Kiki  Fix broken links in docker.md (#9306)
2024-09-04  Radoslav Gerganov  rpc : make RPC servers come first in the device list...
2024-09-04  Pascal Patry  readme : rename result_format to response_format (...
2024-09-03  Georgi Gerganov  flake.lock: Update (#9261)
2024-09-03  Aarni Koskela  llama-bench : add JSONL (NDJSON) output mode (#9288)
2024-09-03  Georgi Gerganov  readme : refactor API section + remove old hot topics
2024-09-02  Xuan Son Nguyen  server : test script : add timeout for all requests...
2024-09-02  Zhenwei Jin  src: make tail invalid when kv cell is intersection...
2024-09-02  slaren  docker : fix missing binaries in full-cuda image (...