git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2024-03-10  Dean  android : fix utf8 decoding error (#5935)
2024-03-10  Georgi Gerganov  readme : update hot topics
2024-03-10  Georgi Gerganov  sync : ggml
2024-03-10  Georgi Gerganov  ggml : try fix 32-bit arm compat (whisper/1938)
2024-03-10  Georgi Gerganov  ggml : remove __constant__ specifier for CUDA tables...
2024-03-10  Pierrick Hymbert  server: ci: windows build and tests (#5968)
2024-03-10  DAN™  llama : add support for GritLM (#5959)
2024-03-10  Clint Herron  grammar : verify parsed state (#5950)
2024-03-10  Georgi Gerganov  nix: update flake.lock (#5969)
2024-03-09  Pierrick Hymbert  server: benchmark: chat/completions scenario and other...
2024-03-09  Georgi Gerganov  server : print chat template info
2024-03-09  slaren  perplexity : support using multiple sequences to allow...
2024-03-09  Georgi Gerganov  readme : update hot topics
2024-03-09  Georgi Gerganov  ggml : fix unnecessary f32 -> f16 -> f32 casts (mmla...
2024-03-09  Georgi Gerganov  server : fix metrics init (#5964)
2024-03-09  Georgi Gerganov  ggml : remove old quantization functions (#5942)
2024-03-09  Georgi Gerganov  server : clarify some items in the readme (#5957)
2024-03-09  SeungWon Jeong  server : normalize embeddings (#5956)
2024-03-09  Georgi Gerganov  tests : gitignore ggml-common.h
2024-03-09  Alexey Parfenov  server : fix passing prompt as tokens (#5955)
2024-03-09  Georgi Gerganov  ggml : add ggml-common.h to deduplicate shared code...
2024-03-09  Georgi Gerganov  server : simplify logic for empty prompts (#5953)
2024-03-09  Xuan Son Nguyen  Server: reorganize some http logic (#5939)
2024-03-09  Gabe Goodhart  server : add SSL support (#5926)
2024-03-09  Pierrick Hymbert  server: tests: add truncated prompt tests, better kv...
2024-03-08  compilade  llama : support Mamba Selective State Space Models...
2024-03-08  compilade  llama : fix quantization of shared token_embd (#5944)
2024-03-08  Pierrick Hymbert  server: metrics: add llamacpp:prompt_seconds_total...
2024-03-08  Don Mahurin  llama : assume tied weights if lm_head/output weights...
2024-03-08  Georgi Gerganov  server : fix EOS token detection with disabled cache...
2024-03-08  UEXTM.com  log : fix MSVC compile errors (#5643)
2024-03-07  Georgi Gerganov  llama-bench : add embeddings option (#5924)
2024-03-07  Neo Zhang Jianyu  Revert "[SYCL] fix error when set main gpu to non-zero...
2024-03-07  Minsoo Cheong  server : add `/v1/completions` endpoint (#5914)
2024-03-07  Georgi Gerganov  server : refactor (#5882)
2024-03-07  Neo Zhang Jianyu  [SYCL] fix error when set main gpu to non-zero (#5901)
2024-03-06  Jared Van Bortel  ggml : use SYS_get_cpu if SYS_getcpu is not defined...
2024-03-06  bobqianic  ggml : use `uint8x16_t` return type for `ggml_vqtbl1q_u...
2024-03-06  Georgi Gerganov  convert : remove AWQ remnants (#5768)
2024-03-06  Neo Zhang Jianyu  add wait() to make code stable (#5895)
2024-03-05  slaren  compare-llama-bench.py : remove mul_mat_q (#5892)
2024-03-05  Jared Van Bortel  quants : use MM256_SET_M128I consistently to fix gcc...
2024-03-05  ExtReMLapin  grammars : blacklists character control set (#5888)
2024-03-05  Georgi Gerganov  Revert "grammars : don't allow to output unescaped...
2024-03-05  ExtReMLapin  grammars : don't allow to output unescaped new line...
2024-03-05  0cc4m  Vulkan Improvements (#5835)
2024-03-05  Neo Zhang Jianyu  [SYCL] fix mul_mat fault in CI/unit-test (#5862)
2024-03-05  Minsoo Cheong  fix editorconfig check break (#5879)
2024-03-05  Jeffrey Quesnelle  fix speculative decoding build on windows (#5874)
2024-03-05  hutli  nix: static build (#5814)
2024-03-04  Georgi Gerganov  llama : fix embeddings (#5796)
2024-03-04  Georgi Gerganov  flake : fix
2024-03-04  Georgi Gerganov  ggml : fix unknown status (#0)
2024-03-04  Georgi Gerganov  sync : ggml
2024-03-04  Michael Podvitskiy  ggml : introduce ggml_status (ggml/750)
2024-03-04  Dane Madsen  cmake : handle cases where git index is not found in...
2024-03-04  Minsoo Cheong  speculative : implement stochastic speculative sampling...
2024-03-04  Xuan Son Nguyen  add alias for chat template (#5858)
2024-03-04  Georgi Gerganov  sync : ggml
2024-03-04  leejet  add some new ops, fix some operators and add batch...
2024-03-04  DAN™  common : use LLAMA_DEFAULT_SEED (#5855)
2024-03-04  DAN™  main : support special tokens as reverse/anti prompt...
2024-03-03  slaren  cuda : fix data race in soft max (#5853)
2024-03-03  Georgi Gerganov  readme : add API changes section
2024-03-03  Douglas Hanley  llama : allow for user specified embedding pooling...
2024-03-03  Nindaleth  gguf-dump : support i-quants (#5841)
2024-03-03  compilade  llama : fix llama_copy_state_data with fragmented KV...
2024-03-03  Pierrick Hymbert  ci : schedule slow server tests only on Release or...
2024-03-03  Pierrick Hymbert  server : init http requests thread pool with --parallel...
2024-03-03  Georgi Gerganov  flake.lock: Update (#5842)
2024-03-02  Pierrick Hymbert  server: tests: passkey challenge / self-extend with...
2024-03-02  Michael Podvitskiy  llama : add abort_callback to interrupt computation...
2024-03-02  Georgi Gerganov  ggml : fix IQ3_S AVX implementation (#5834)
2024-03-02  Jared Van Bortel  convert : automatically fall back to HfVocab if tokeniz...
2024-03-02  Jared Van Bortel  convert-hf : make model class definitions self-containe...
2024-03-02  Kawrakow  ggml : IQ3_S improvements (#5829)
2024-03-02  Georgi Gerganov  scripts : add pod-llama.sh
2024-03-02  Xuan Son Nguyen  llama : refactor internal quantization functions (...
2024-03-02  compilade  llama : fix segfault from unknown model arch name ...
2024-03-02  Neo Zhang Jianyu  Support multiple GPUs (split mode) on SYCL backend...
2024-03-02  crasm  workflows : remove nocleanup arg for check-requirements...
2024-03-01  Tushar  build(nix): Introduce flake.formatter for `nix fmt...
2024-03-01  nold  convert-hf-to-gguf : require einops for InternLM2ForCau...
2024-03-01  Sourab Mangrulkar  llama : add StarCoder2 support (#5795)
2024-03-01  Georgi Gerganov  server : remove api_like_OAI.py proxy script (#5808)
2024-03-01  ddpasa  ggml-vulkan: fix VULKAN_CHECK_RESULTS flag, which was...
2024-03-01  kunal-vaishnavi  gemma : fix bfloat16 -> float16 conversion issue (...
2024-03-01  Miwa / Ensan  common : fix flag `--logits-all` to `--all-logits`...
2024-03-01  Pierrick Hymbert  llama : cleanup unused mmq flags (#5772)
2024-03-01  Douglas Hanley  unicode : switch to multimap based nfd_map (#5799)
2024-03-01  Pierrick Hymbert  server: allow to override threads server pool with...
2024-03-01  Eve  ci : add Ubuntu 22 Vulkan CI run (#5789)
2024-03-01  Georgi Gerganov  server : fix newlines in help (#5785)
2024-03-01  AidanBeltonS  [SYCL] Use batched mul_mat pathway (#5591)
2024-02-29  Xuan Son Nguyen  Server: normalize naming (#5779)
2024-02-29  Marcus Dunn  llama : constified `llama_set_state_data`'s `src` ...
2024-02-28  Georgi Gerganov  ci : reduce 3b ppl chunks to 1 to avoid timeout (#5771)
2024-02-28  Eve  make portability_enumeration_ext apple only (#5757)
2024-02-28  Georgi Gerganov  llama : remove deprecated API (#5770)
2024-02-28  Georgi Gerganov  awq-py : remove (#5768)