git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-06-07 | woodx | server : do not get prompt in infill mode (#7286)
2024-06-07 | pengxin99 | [SYCL] fix softmax r2r result wrong issue (#7811)
2024-06-07 | slaren | check for nans in imatrix and quantize (#7807)
2024-06-06 | Georgi Gerganov | server : fix --threads-http arg (#7801)
2024-06-06 | Georgi Gerganov | imatrix : migrate to gpt_params (#7771)
2024-06-06 | Clint Herron | Added support for . (any character) token in grammar...
2024-06-06 | Mattheus Chediak | README minor fixes (#7798) [no ci]
2024-06-06 | Olivier Chafik | grammars: x{min,max} repetition operator (#6640)
2024-06-06 | Joan Fontanals | llama : add jina v2 base code (#7596)
2024-06-06 | slaren | docker : build only main and server in their images...
2024-06-06 | slaren | docker : add openmp lib (#7780)
2024-06-05 | Galunid | Fix encoding in python scripts (#7733)
2024-06-05 | Johannes Gäßler | CUDA: refactor mmq, dmmv, mmvq (#7716)
2024-06-05 | Georgi Gerganov | ggml : refactor rope norm/neox (#7634)
2024-06-05 | arch-btw | readme : remove -ins (#7759)
2024-06-04 | jaime-m-p | Fix per token atrributes bits (#7749)
2024-06-04 | agray3 | Allow number of nodes in CUDA graph to change (#7738)
2024-06-04 | Georgi Gerganov | common : refactor cli arg parsing (#7675)
2024-06-04 | Georgi Gerganov | ggml : remove OpenCL (#7735)
2024-06-04 | Georgi Gerganov | llama : remove beam search (#7736)
2024-06-04 | Georgi Gerganov | readme : remove obsolete Zig instructions (#7471)
2024-06-04 | slaren | llama-bench : allow using a different printer for stder...
2024-06-04 | Daniele | Improve hipBLAS support in CMake (#7696)
2024-06-04 | zhouwg | refine .gitignore (#7688)
2024-06-04 | jaime-m-p | Per token attributes (#7685)
2024-06-04 | Georgi Gerganov | ggml : prevent builds with -ffinite-math-only (#7726)
2024-06-03 | Radoslav Gerganov | llama : offload to RPC in addition to other backends...
2024-06-03 | Masaya, Kato | ggml : use OpenMP as a thread pool (#7606)
2024-06-03 | Johannes Gäßler | make: fix debug options not being applied to NVCC ...
2024-06-03 | 0cc4m | Vulkan Mixture of Experts (MoE) support (#7628)
2024-06-03 | Andy Tai | cmake : add pkg-config spec file for llama.cpp (#7702)
2024-06-03 | zhangkaihuo | llama : MiniCPM support tied embeddings (#7664)
2024-06-03 | Georgi Gerganov | llama : avoid double token-to-piece cache (#7654)
2024-06-03 | woachk | kompute : implement op_getrows_f32 (#6403)
2024-06-02 | Dave Airlie | fix bug introduced in using calloc (#7701)
2024-06-02 | Georgi Gerganov | flake.lock: Update (#7686)
2024-06-02 | Austin | chore : add ignore rule for generated server themes...
2024-06-02 | nickp27 | [SYCL] Update rpc-server.cpp to include SYCL backend...
2024-06-01 | Johannes Gäßler | Fix FlashAttention debug test, FP32 assert (#7684)
2024-06-01 | Yazan Agha... | server : new UI (#7633)
2024-06-01 | HanishKVC | SimpleChat: Simple histogram/repeatMatching driven...
2024-06-01 | Johannes Gäßler | CUDA: fix Pascal FA, deq. KV to FP16 for batch > 8...
2024-06-01 | Johannes Gäßler | CUDA: quantized KV support for FA vec (#7527)
2024-05-31 | Georgi Gerganov | server : update js (#7670)
2024-05-31 | Galunid | convert-hf : Handle NotImplementedError in convert...
2024-05-31 | Johannes Gäßler | scripts: update compare_llama_bench.py [no ci] (#7673)
2024-05-31 | Daniele | Improve HIP compatibility (#7672)
2024-05-31 | Georgi Gerganov | readme : link homebrew discussion
2024-05-31 | Georgi Gerganov | ggml : fix loongson compile warnings (#7537)
2024-05-31 | Galunid | Somehow '**' got lost (#7663)
2024-05-31 | Galunid | Add convert.py removal to hot topics (#7662)
2024-05-30 | Sertaç Özercan | [no ci] docs: add aikit to readme (#7650)
2024-05-30 | JohnnyB | Fixed painfully slow single process builds. (#7326)
2024-05-30 | Georgi Gerganov | llama : cache llama_token_to_piece (#7587)
2024-05-30 | Martin Delille | Fix conan badge display [no ci] (#7645)
2024-05-30 | Manuel | Add brew installation instruction to README [no ci...
2024-05-30 | Martin Delille | readme : add Conan badge (#7638)
2024-05-30 | Brian | github: add contact links to issues and convert questio...
2024-05-30 | Galunid | Move convert.py to examples/convert-legacy-llama.py...
2024-05-30 | Chris Elrod | faster avx512 exp implementation (#7551)
2024-05-30 | junchao-loongson | ggml : fix loongarch build (O2 issue) (#7636)
2024-05-30 | Johannes Gäßler | README: explain parallel build [no ci] (#7618)
2024-05-30 | Meng, Hengyu | [SYCL] fix intel docker (#7630)
2024-05-30 | Galunid | gguf-py : Add tokenizer.ggml.pre to gguf-new-metadata...
2024-05-29 | Georgi Gerganov | metal : remove invalid asserts (#7617)
2024-05-29 | Georgi Gerganov | metal : add missing asserts (#7617)
2024-05-29 | Georgi Gerganov | ggml : fix YARN + add tests + add asserts (#7617)
2024-05-29 | Georgi Gerganov | cuda : non-cont concat support (#7610)
2024-05-29 | Radoslav Gerganov | llama-bench : add support for the RPC backend (#7435)
2024-05-29 | slaren | ggml : use atomic_flag for critical section (#7598)
2024-05-29 | Georgi Gerganov | scripts : remove mpi remnants
2024-05-29 | Georgi Gerganov | sync : ggml
2024-05-29 | Georgi Gerganov | ggml : restore ggml_rope_xpos_inplace (ggml/0)
2024-05-29 | Akarshan Biswas | Add Arc A750 and Arch linux to readme-sycl.md as verifi...
2024-05-29 | zhouwg | ggml : fix typo in ggml.c (#7603)
2024-05-28 | Meng, Hengyu | [SYCL] Align GEMM dispatch (#7566)
2024-05-28 | jaime-m-p | Tokenizer WPM fixes (#7500)
2024-05-28 | Georgi Gerganov | sycl : fix assert (#7563)
2024-05-28 | Giuseppe Scrivano | llama : support small Granite models (#7481)
2024-05-28 | k.h.lai | vulkan: properly initialize vulkan devices for LLAMA_SP...
2024-05-28 | Radoslav Gerganov | rpc : resource management rework (#7562)
2024-05-28 | fairydreaming | Add support for DeepseekV2ForCausalLM (#7519)
2024-05-28 | Georgi Gerganov | tests : fix test-tokenizer-0.sh
2024-05-28 | Georgi Gerganov | llama : handle unknown utf8 bytes (#7588)
2024-05-28 | Brian | github: add refactor to issue template (#7561)
2024-05-28 | Neo Zhang | [SYCL]fix ggml_sycl_mul_mat_id() to match the change...
2024-05-28 | Georgi Gerganov | ggml : generalize GGML_OP_CONCAT (#7563)
2024-05-28 | mgroeber9110 | server: do not remove whitespace at the start of a...
2024-05-28 | Nathan Epstein | Markdownish code block fix (#7571)
2024-05-28 | Ikko Eltociear... | llava : update clip.h (#7580)
2024-05-27 | Djip007 | update HIP_UMA #7399 (#7414)
2024-05-27 | kunnis | adding in x64 targets to cmake presets (#7574)
2024-05-27 | Johannes Gäßler | make: add --device-debug to NVCC debug flags (#7542)
2024-05-27 | agray3 | Allow multiple copy function pointers for CUDA graph...
2024-05-27 | AidanBeltonS | Fix q_xxs using mul_mat_q (#7459)
2024-05-27 | AidanBeltonS | Add freq factors (#7495)
2024-05-27 | Georgi Gerganov | metal : add GGML_OP_REPEAT kernels (#7557)
2024-05-27 | Georgi Gerganov | metal : disable FA kernel for HS=256 (#7556)
2024-05-27 | Georgi Gerganov | llama : add comments about experimental flags (#7544)
2024-05-27 | Brian | github: add self sorted issue ticket forms (#7543)