git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2024-09-08  Georgi Gerganov  scripts : option to increase git patch context
2024-09-08  Salvatore Mesoraca  vulkan: add dryrun support to sin and cos ops (ggml...
2024-09-08  Salvatore Mesoraca  vulkan: correctly report support for OP_CONT (ggml...
2024-09-08  Johannes Gäßler  tests: add gradient tests for all backends (ggml/932)
2024-09-08  Johannes Gäßler  ggml: fix ggml_graph_cpy undefined behavior (ggml/943)
2024-09-08  Georgi Gerganov  cann : fix doxy (ggml/0)
2024-09-08  Mengqing Cao  cann : add Ascend NPU support (whisper/2336)
2024-09-08  Georgi Gerganov  cuda : mark BF16 CONT as unsupported
2024-09-08  Salvatore Mesoraca  ggml : fix cont with transposed tensors when one dimens...
2024-09-08  Kevin Gibbons  llama : set attrs of mislabelled EOT/EOM tokens (#9348)
2024-09-07  Georgi Gerganov  llama.android : fix build (#9350)
2024-09-07  Georgi Gerganov  llama : fix empty ring buffer push (#9358)
2024-09-07  Georgi Gerganov  llama : sanitize invalid tokens (#9357)
2024-09-07  Eve  llamafile : disable sgemm for batch-size 1 (#9330)
2024-09-07  Xuan Son Nguyen  common : refactor arg parser (#9308)
2024-09-07  slaren  ggml : always check bounds on get_rows operations ...
2024-09-07  Georgi Gerganov  llama : refactor sampling v2 (#9294)
2024-09-07  Xuan Son Nguyen  ggml : fix missing `cpu_set_t` on emscripten (#9336)
2024-09-07  slaren  ci : disable rocm image creation (#9340)
2024-09-06  Xuan Son Nguyen  server : simplify state machine for slot (#9283)
2024-09-06  Aarni Koskela  llama-bench : log benchmark progress (#9287)
2024-09-06  Aarni Koskela  batched-bench : add `--output-format jsonl` option...
2024-09-06  Changyeon Kim  ggml : fix build break for the vulkan-debug (#9265)
2024-09-06  Xuan Son Nguyen  server : fix missing lock (#9334)
2024-09-06  Markus Tavenrath  Improve Vulkan shader build system (#9239)
2024-09-06  compilade  ggml-quants : ternary packing for TriLMs and BitNet...
2024-09-05  awatuna  Update build.yml (#9184)
2024-09-05  Michael Podvitskiy  CMake fix: host for msvc compiler can only be x86 or...
2024-09-05  slaren  cuda : fix defrag with quantized KV (#9319)
2024-09-05  slaren  llama-bench : fix NUL terminators in CPU name (#9313)
2024-09-04  Srihari-mcw  ggml : AVX2 support for Q4_0_8_8 (#8713)
2024-09-04  Ouadie EL FAROUKI  [SYCL] Fix DMMV dequantization (#9279)
2024-09-04  杨朱 · Kiki  Fix broken links in docker.md (#9306)
2024-09-04  Radoslav Gerganov  rpc : make RPC servers come first in the device list...
2024-09-04  Pascal Patry  readme : rename result_format to response_format (...
2024-09-03  Georgi Gerganov  flake.lock: Update (#9261)
2024-09-03  Aarni Koskela  llama-bench : add JSONL (NDJSON) output mode (#9288)
2024-09-03  Georgi Gerganov  readme : refactor API section + remove old hot topics
2024-09-02  Xuan Son Nguyen  server : test script : add timeout for all requests...
2024-09-02  Zhenwei Jin  src: make tail invalid when kv cell is intersection...
2024-09-02  slaren  docker : fix missing binaries in full-cuda image (...
2024-09-02  yuri@FreeBSD  ggml : add pthread includes on FreeBSD (#9258)
2024-09-02  Xuan Son Nguyen  server : refactor multitask handling (#9274)
2024-09-02  Guoliang Hua  llama-cli : remove duplicated log message (#9275)
2024-09-02  Tushar  build(nix): Package gguf-py (#5664)
2024-09-02  Georgi Gerganov  llama : minor style
2024-09-01  Molly Sophia  llama : support RWKV v6 models (#8980)
2024-08-31  Echo Nolan  nix: fix CUDA build - replace deprecated autoAddOpenGLR...
2024-08-31  Srihari-mcw  sgemm : improved Q4_0 and Q8_0 performance via 4xN...
2024-08-31  Daniel Bevenius  llama : fix typo in xcda_array_view comment [no ci...
2024-08-30  Sutou Kouhei  llama : fix llama_split_mode enum values in main_gpu...
2024-08-30  蕭澧邦  Correct typo run_llama2.sh > run-llama2.sh (#9149)
2024-08-30  tc-mb  llava : the function "clip" should be int (#9237)
2024-08-29  Faisal Zaghloul  Threadpool: take 2 (#8672)
2024-08-29  Jan Boon  server : fix crash when error handler dumps invalid...
2024-08-29  Georgi Gerganov  flake.lock: Update (#9162)
2024-08-28  slaren  docker : build images only once (#9225)
2024-08-28  slaren  docker : update CUDA images (#9213)
2024-08-27  Georgi Gerganov  vulkan : fix build (#0)
2024-08-27  Georgi Gerganov  sync : ggml
2024-08-27  Xie Yanbo  Fix minicpm example directory (#9111)
2024-08-27  compilade  llama : fix qs.n_attention_wv for DeepSeek-V2 (#9156)
2024-08-27  Xuan Son Nguyen  server : add some missing env variables (#9116)
2024-08-27  CausalLM  llama : fix ChatGLM4 wrong shape (#9194)
2024-08-27  Carsten Kragelund...  llama : fix llama3.1 rope_freqs not respecting custom...
2024-08-27  arch-btw  common : Update stb_image.h to latest version (#9161)
2024-08-26  slaren  ggml : do not crash when quantizing q4_x_x with an...
2024-08-26  Georgi Gerganov  metal : separate scale and mask from QKT in FA kernel...
2024-08-26  Georgi Gerganov  ggml : add SSM Metal kernels (#8546)
2024-08-26  Georgi Gerganov  tests : fix compile warnings for unreachable code ...
2024-08-26  Georgi Gerganov  ci : add VULKAN support to ggml-ci (#9055)
2024-08-26  Georgi Gerganov  server : update deps (#9183)
2024-08-26  slaren  metal : gemma2 flash attention support (#9159)
2024-08-26  slaren  ggml-ci : try to improve build time (#9160)
2024-08-26  Justine Tunney  llama : fix time complexity of string replacement ...
2024-08-25  Herman Semenov  common: fixed not working find argument --n-gpu-layers...
2024-08-25  Johannes Gäßler  CUDA: fix Gemma 2 numerical issues for FA (#9166)
2024-08-24  Johannes Gäßler  CPU/CUDA: Gemma 2 FlashAttention support (#8542)
2024-08-24  João Dinis...  quantize : fix typo in usage help of `quantize.cpp...
2024-08-23  Xuan Son Nguyen  lora : fix llama conversion script with ROPE_FREQS...
2024-08-23  piDack  llama : use F32 precision in GLM4 attention and no...
2024-08-22  Akarshan Biswas  [SYCL] Add a space to supress a cmake warning (#9133)
2024-08-22  luoyu-intel  [SYCL] Add oneDNN primitive support (#9091)
2024-08-21  compilade  llama : simplify Mamba with advanced batch splits ...
2024-08-21  Xuan Son Nguyen  server : support reading arguments from environment...
2024-08-21  Younes Belkada  llama : support for `falcon-mamba` architecture (#9074)
2024-08-21  fairydreaming  llava : zero-initialize clip_ctx structure fields with...
2024-08-21  Daniel Bevenius  llama : std::move llm_bigram_bpe from work_queue (...
2024-08-20  Changyeon Kim  llava: Add ACC OP for GPU acceleration to the Vulkan...
2024-08-20  Meng, Hengyu  [SYCL] fallback mmvq (#9088)
2024-08-20  zhentaoyu  [SYCL] Fix SYCL `im2col` and `convert` Overflow with...
2024-08-20  fairydreaming  tests : add missing comma in grammar integration tests...
2024-08-19  wangshuai09  cann: add doc for cann backend (#8867)
2024-08-19  Radoslav Gerganov  rpc : print error message when failed to connect endpoi...
2024-08-19  Radoslav Gerganov  rpc : prevent crashes on invalid input (#9040)
2024-08-18  Georgi Gerganov  flake.lock: Update (#9068)
2024-08-18  ltoniazzi  tests : add integration test for lora adapters (#8957)
2024-08-17  Yoshi Suhara  Fix incorrect use of ctx_split for bias tensors (#9063)
2024-08-16  Xuan Son Nguyen  server : refactor middleware and /health endpoint ...
2024-08-16  tc-mb  llava : support MiniCPM-V-2.6 (#8967)