]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-09-12 Xinpeng Doucann: Fix error when running a non-exist op (#9424)
2024-09-12 Faisal ZaghloulAdd Jais to list of supported models (#9439)
2024-09-11 slarenllama : skip token bounds check when evaluating embeddi...
2024-09-11 Pavel Zloipy : support converting local models (#7547)
2024-09-11 Xuan Son Nguyenllava : correct args for minicpmv-cli (#9429)
2024-09-11 Xuan Son Nguyenfiles : remove accidentally added `lora_test` submodule...
2024-09-11 Farbod Bijaryfeat: Implements retrying logic for downloading models...
2024-09-11 Johannes GäßlerCUDA: fix --split-mode row race condition (#9413)
2024-09-11 Georgi Gerganovbatched-bench : remove unused code (#9305)
2024-09-11 R0CKSTARmusa: remove Clang builtins mapping (#9421)
2024-09-11 Alberto Cabrera... sycl : update support conditions (#9394)
2024-09-10 Georgi Gerganovflake.lock: Update (#9360)
2024-09-10 Xuan Son Nguyenarg : bring back missing ifdef (#9411)
2024-09-10 matteoenable --special arg for llama-server (#9419)
2024-09-10 slarenllama : move random seed generation to the samplers...
2024-09-10 Georgi Gerganovmetal : fix compile warning with GGML_METAL_NDEBUG...
2024-09-10 Daniel Beveniusllama : update llm_build_copy_mask_state comment [no...
2024-09-10 Molly SophiaRWKV v6: Add time_mix_decay_w1/w2 in quant exclusion...
2024-09-10 slarenmake : do not run llama-gen-docs when building (#9399)
2024-09-09 Xuan Son Nguyencommon : move arg parser code to `arg.cpp` (#9388)
2024-09-09 Radoslav Gerganovrpc : fix segfault with nkvo (#9389)
2024-09-09 Prashant Vithuleggml : vector length agnostic SVE support (#9290)
2024-09-09 slarenllama : minor sampling refactor (2) (#9386)
2024-09-09 Georgi Gerganovreadme : update hot topics
2024-09-09 Johannes GäßlerCUDA: fix variable name conflict for Windows build...
2024-09-09 Antonis Makropoulosreadme : add LLMUnity to UI projects (#9381)
2024-09-09 Radoslav Gerganovrpc : update README [no ci] (#9320)
2024-09-09 Dan JohanssonArm AArch64: Documentation updates (#9321)
2024-09-08 Markus TavenrathOverlap cmdbuffer creation and cmdbuffer execution...
2024-09-08 Georgi Gerganovcuda : fix FA Q src index (1 -> 0) (#9374)
2024-09-08 Xuan Son Nguyencommon : bring back missing args, add env var duplicati...
2024-09-08 slarencommon : restore --n-gpu-layers (#9371)
2024-09-08 slarenllama : refactor samplers internal implementation ...
2024-09-08 Neo Zhang Jianyu[SYCL] add check malloc result on device (#9346)
2024-09-08 slarenllama : sanitize tokens in the upper bound (#9359)
2024-09-08 Xuan Son Nguyenimatrix : fix arg parser for imatrix (#9366)
2024-09-08 Georgi Gerganovmetal : update support condition for im2col + fix warni...
2024-09-08 Georgi Gerganovsync : ggml
2024-09-08 Georgi Gerganovscripts : option to increase git patch context
2024-09-08 Salvatore Mesoracavulkan: add dryrun support to sin and cos ops (ggml...
2024-09-08 Salvatore Mesoracavulkan: correctly report support for OP_CONT (ggml...
2024-09-08 Johannes Gäßlertests: add gradient tests for all backends (ggml/932)
2024-09-08 Johannes Gäßlerggml: fix ggml_graph_cpy undefined behavior (ggml/943)
2024-09-08 Georgi Gerganovcann : fix doxy (ggml/0)
2024-09-08 Mengqing Caocann : add Ascend NPU support (whisper/2336)
2024-09-08 Georgi Gerganovcuda : mark BF16 CONT as unsupported
2024-09-08 Salvatore Mesoracaggml : fix cont with transposed tensors when one dimens...
2024-09-08 Kevin Gibbonsllama : set attrs of mislabelled EOT/EOM tokens (#9348)
2024-09-07 Georgi Gerganovllama.android : fix build (#9350)
2024-09-07 Georgi Gerganovllama : fix empty ring buffer push (#9358)
2024-09-07 Georgi Gerganovllama : sanitize invalid tokens (#9357)
2024-09-07 Evellamafile : disable sgemm for batch-size 1 (#9330)
2024-09-07 Xuan Son Nguyencommon : refactor arg parser (#9308)
2024-09-07 slarenggml : always check bounds on get_rows operations ...
2024-09-07 Georgi Gerganovllama : refactor sampling v2 (#9294)
2024-09-07 Xuan Son Nguyenggml : fix missing `cpu_set_t` on emscripten (#9336)
2024-09-07 slarenci : disable rocm image creation (#9340)
2024-09-06 Xuan Son Nguyenserver : simplify state machine for slot (#9283)
2024-09-06 Aarni Koskelallama-bench : log benchmark progress (#9287)
2024-09-06 Aarni Koskelabatched-bench : add `--output-format jsonl` option...
2024-09-06 Changyeon Kimggml : fix build break for the vulkan-debug (#9265)
2024-09-06 Xuan Son Nguyenserver : fix missing lock (#9334)
2024-09-06 Markus TavenrathImprove Vulkan shader build system (#9239)
2024-09-06 compiladeggml-quants : ternary packing for TriLMs and BitNet...
2024-09-05 awatunaUpdate build.yml (#9184)
2024-09-05 Michael PodvitskiyCMake fix: host for msvc compiler can only be x86 or...
2024-09-05 slarencuda : fix defrag with quantized KV (#9319)
2024-09-05 slarenllama-bench : fix NUL terminators in CPU name (#9313)
2024-09-04 Srihari-mcwggml : AVX2 support for Q4_0_8_8 (#8713)
2024-09-04 Ouadie EL FAROUKI[SYCL] Fix DMMV dequantization (#9279)
2024-09-04 杨朱 · KikiFix broken links in docker.md (#9306)
2024-09-04 Radoslav Gerganovrpc : make RPC servers come first in the device list...
2024-09-04 Pascal Patryreadme : rename result_format to response_format (...
2024-09-03 Georgi Gerganovflake.lock: Update (#9261)
2024-09-03 Aarni Koskelallama-bench : add JSONL (NDJSON) output mode (#9288)
2024-09-03 Georgi Gerganovreadme : refactor API section + remove old hot topics
2024-09-02 Xuan Son Nguyenserver : test script : add timeout for all requests...
2024-09-02 Zhenwei Jinsrc: make tail invalid when kv cell is intersection...
2024-09-02 slarendocker : fix missing binaries in full-cuda image (...
2024-09-02 yuri@FreeBSDggml : add pthread includes on FreeBSD (#9258)
2024-09-02 Xuan Son Nguyenserver : refactor multitask handling (#9274)
2024-09-02 Guoliang Huallama-cli : remove duplicated log message (#9275)
2024-09-02 Tusharbuild(nix): Package gguf-py (#5664)
2024-09-02 Georgi Gerganovllama : minor style
2024-09-01 Molly Sophiallama : support RWKV v6 models (#8980)
2024-08-31 Echo Nolannix: fix CUDA build - replace deprecated autoAddOpenGLR...
2024-08-31 Srihari-mcwsgemm : improved Q4_0 and Q8_0 performance via 4xN...
2024-08-31 Daniel Beveniusllama : fix typo in xcda_array_view comment [no ci...
2024-08-30 Sutou Kouheillama : fix llama_split_mode enum values in main_gpu...
2024-08-30 蕭澧邦Correct typo run_llama2.sh > run-llama2.sh (#9149)
2024-08-30 tc-mbllava : the function "clip" should be int (#9237)
2024-08-29 Faisal ZaghloulThreadpool: take 2 (#8672)
2024-08-29 Jan Boonserver : fix crash when error handler dumps invalid...
2024-08-29 Georgi Gerganovflake.lock: Update (#9162)
2024-08-28 slarendocker : build images only once (#9225)
2024-08-28 slarendocker : update CUDA images (#9213)
2024-08-27 Georgi Gerganovvulkan : fix build (#0)
2024-08-27 Georgi Gerganovsync : ggml
2024-08-27 Xie YanboFix minicpm example directory (#9111)
2024-08-27 compiladellama : fix qs.n_attention_wv for DeepSeek-V2 (#9156)
next