]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-07-07 compiladepy : type-check all Python scripts with Pyright (#8341)
2024-07-07 Denis SpasyukUpdate llama-cli documentation (#8315)
2024-07-07 Alex Tuddenhamci : add checks for cmake,make and ctest in ci/run...
2024-07-07 Andy Taireadme : update bindings list (#8222)
2024-07-07 Briangguf-hash: model wide and per tensor hashing using...
2024-07-07 toyerllama : support glm3 and glm4 (#8031)
2024-07-07 Georgi Gerganovllama : fix n_rot default (#8348)
2024-07-07 compiladepy : use cpu-only torch in requirements.txt (#8335)
2024-07-07 standby24x7finetune: Rename command name in README.md (#8343)
2024-07-07 standby24x7finetune: Rename an old command name in finetune.sh...
2024-07-07 Bjarke Viksøeserver: Retrieve prompt template in /props (#8337)
2024-07-06 Derrick T.... added support for Authorization Bearer tokens when...
2024-07-06 Xuan Son Nguyenupdate main readme (#8333)
2024-07-06 Daniel Beveniusllama : add early return for empty range (#8327)
2024-07-05 jaime-m-pDetokenizer fixes (#8039)
2024-07-05 Xuan Son NguyenReorganize documentation pages (#8325)
2024-07-05 Georgi Gerganovllama : fix compile warning (#8304)
2024-07-05 Natsucmake : add GGML_BUILD and GGML_SHARED macro definition...
2024-07-05 Ouadie EL FAROUKIEnabled more data types for oneMKL gemm_batch (#8236)
2024-07-05 Georgi Gerganovconvert : remove AWQ remnants (#8320)
2024-07-05 Georgi Gerganovllama : minor indentation during tensor loading (#8304)
2024-07-05 Johannes GäßlerCUDA: MMQ support for iq4_nl, iq4_xs (#8278)
2024-07-05 DanieleCUDA: revert part of the RDNA1 optimizations (#8309)
2024-07-05 Douglas Hanleyllama : streamline embeddings from "non-embedding"...
2024-07-05 Johannes GäßlerCUDA: fix MMQ stream-k rounding if ne00 % 128 != 0...
2024-07-05 Pieter Ouwerkerkreadme : fix minor typos [no ci] (#8314)
2024-07-05 Daniel Beveniuspasskey : add short intro to README.md [no-ci] (#8317)
2024-07-05 Georgi Gerganovllama : prefer n_ over num_ prefix (#8308)
2024-07-05 Georgi Gerganovcontributing : update guidelines (#8316)
2024-07-05 luoyu-intel[SYCL] Fix WARP_SIZE=16 bug of Intel GPU (#8266)
2024-07-05 Georgi Gerganovpy : switch to snake_case (#8305)
2024-07-05 Neo Zhang Jianyurm get_work_group_size() by local cache for performance...
2024-07-04 Xuan Son Nguyencli: add EOT when user hit Ctrl+C (#8296)
2024-07-04 Icecream95llama : add OpenELM support (#7359)
2024-07-04 Daniel Beveniustokenize : add --show-count (token) option (#8299)
2024-07-04 ditsukebuild: Export hf-to-gguf as snakecase
2024-07-04 ditsukedoc: Add context for why we add an explicit pytorch...
2024-07-04 ditsukechore: Remove rebase artifacts
2024-07-04 ditsukechore: Fixup requirements and build
2024-07-04 ditsukechore: ignore all __pychache__
2024-07-04 ditsukefix: Update script paths in CI scripts
2024-07-04 ditsukefix: Actually include scripts in build
2024-07-04 ditsukebuild(python): Package scripts with pip-0517 compliance
2024-07-04 fairydreamingInference support for T5 and FLAN-T5 model families...
2024-07-04 Daniel Beveniustests : add _CRT_SECURE_NO_WARNINGS for WIN32 (#8231)
2024-07-04 Daniel Beveniusllama : suppress unref var in Windows MSVC (#8150)
2024-07-04 Georgi Gerganovconvert : fix gemma v1 tokenizer convert (#8248)
2024-07-04 AidanBeltonS[SYCL] Remove unneeded semicolons (#8280)
2024-07-03 DanieleDefine and optimize RDNA1 (#8085)
2024-07-03 slarenppl : fix n_seq_max for perplexity (#8277)
2024-07-03 Xuan Son Nguyenfix phi 3 conversion (#8262)
2024-07-03 Juddfix typo (#8267)
2024-07-03 AidanBeltonSDequant improvements rebase (#8255)
2024-07-02 MistApproachfix: add missing short command line argument -mli for...
2024-07-02 Clint HerronAdding step to `clean` target to remove legacy binary...
2024-07-02 Clint HerronRemoves multiple newlines at the end of files that...
2024-07-02 Faisal ZaghloulAdd `JAIS` model(s) (#8118)
2024-07-02 Daniel Beveniusconvert-hf : print output file name when completed...
2024-07-02 slarencuda : update supports_op for matrix multiplication...
2024-07-02 luoyu-intel[SYCL] Fix win build conflict of math library (#8230)
2024-07-02 luoyu-intel[SYCL] Fix the sub group size of Intel (#8106)
2024-07-01 Xuan Son NguyenFix gemma2 tokenizer convert (#8244)
2024-07-01 Johannes GäßlerCUDA: refactor and optimize IQ MMVQ (#8215)
2024-07-01 Mateusz Charytoniukreadme: add Paddler to the list of projects (#8239)
2024-07-01 Xuan Son Nguyengemma2: add sliding window mask (#8227)
2024-07-01 Ronireadme : update tool list (#8209)
2024-07-01 Michael Francisnix : enable curl (#8043)
2024-07-01 Georgi Gerganovnix : remove OpenCL remnants (#8235)
2024-07-01 iacoreDocument BERT support. (#8205)
2024-07-01 zhentaoyu[SYCL] Update SYCL-Rope op and Refactor (#8157)
2024-06-30 Georgi Gerganovflake.lock: Update (#8218)
2024-06-30 Xuan Son NguyenFix new line issue with chat template, disable template...
2024-06-30 Andreillama: Add attention and final logit soft-capping,...
2024-06-28 Xuan Son Nguyenfix code typo in llama-cli (#8198)
2024-06-28 Olivier Chafikjson: attempt to skip slow tests when running under...
2024-06-28 Xuan Son NguyenAdd MiniCPM, Deepseek V2 chat template + clean up ...
2024-06-28 Sigbjørn SkjæretAdd SPM infill support (#8016)
2024-06-28 slarencmake : allow user to override default options (#8178)
2024-06-28 Olivier Chafik`json`: restore default additionalProperties to false...
2024-06-28 pcullitonllama: Add support for Gemma2ForCausalLM (#8156)
2024-06-28 Xuan Son NguyenAdd missing items in makefile (#8177)
2024-06-27 Olivier Chafik`json`: update grammars/README w/ examples & note about...
2024-06-27 loonerinCI: fix release build (Ubuntu+Mac) (#8170)
2024-06-27 slarencmake : fix deprecated option names not working (#8171)
2024-06-27 Xuan Son NguyenAdd chatml fallback for cpp `llama_chat_apply_template...
2024-06-27 Georgi Gerganovflake.lock: Update (#8071)
2024-06-27 jukofyorkControl vector loading fixes (#8137)
2024-06-27 Raj Hammeer... Delete examples/llama.android/llama/CMakeLists.txt...
2024-06-27 Sigbjørn SkjæretAdd Qwen2MoE 57B-A14B model identifier (#8158)
2024-06-27 Johannes GäßlerCUDA: fix MMQ stream-k for --split-mode row (#8167)
2024-06-27 kustaayaAdded support for Viking pre-tokenizer (#8135)
2024-06-27 Sigbjørn Skjæretllama : fix CodeLlama FIM token checks (#8144)
2024-06-27 Raj Hammeer... Fix llama-android.cpp for error - "common/common.h...
2024-06-26 Daniel Beveniusclip : suppress unused variable warnings (#8105)
2024-06-26 Georgi Gerganovscripts : fix filename sync
2024-06-26 slarenci : publish new docker images only when the files...
2024-06-26 slarenggml : add GGML_CUDA_USE_GRAPHS option, restore GGML_CU...
2024-06-26 slarenmake : fix missing -O3 (#8143)
2024-06-26 Georgi Gerganovsync : ggml
2024-06-26 Georgi Gerganovauthors : regen
next