]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-10-28 Georgi Gerganovflake.lock: Update (#10063)
2024-10-28 R0CKSTARmusa: workaround for Guilty Lockup in cleaning src0...
2024-10-28 Georgi Gerganovserver : don't overfill the batch during infill (#10018)
2024-10-27 Georgi Gerganovllama : switch KQ multiplication to F32 precision by...
2024-10-26 Georgi Gerganovsync : ggml
2024-10-26 bssrdfincrease cuda_cpy block size (ggml/996)
2024-10-26 Georgi Gerganovscripts : fix amx sync [no ci]
2024-10-25 Georgi Gerganovmetal : support permuted matrix multiplicaions (#10033)
2024-10-25 wwoodsTMllama : add DRY sampler (#9702)
2024-10-25 Michael Podvitskiyllama: string_split fix (#10022)
2024-10-25 Srihari-mcwllamafile : extend sgemm.cpp support for Q5_0 models...
2024-10-25 Georgi Gerganovserver : check that the prompt fits in the slot's conte...
2024-10-24 Xuan Son Nguyenserver : refactor slot input data, move tokenizer to...
2024-10-24 Georgi Gerganovci : fix cmake flags for SYCL
2024-10-24 Johannes GäßlerCUDA: fix insufficient buffer clearing for MMQ (#10032)
2024-10-24 Johannes GäßlerCUDA: fix MMQ for non-contiguous src0, add tests (...
2024-10-23 wwoodsTMserver : samplers accept the prompt correctly (#10019)
2024-10-23 Georgi Gerganovsync : ggml
2024-10-23 Georgi Gerganovllama.vim : bump generation time limit to 3s [no ci]
2024-10-23 Johannes GäßlerCUDA: fix 1D im2col, add tests (ggml/993)
2024-10-23 Daniel Beveniusggml : remove redundant set of contexts used field...
2024-10-23 Michael Coppolallama.vim : add classic vim support (#9995)
2024-10-23 Jun Hee Yoometal : add POOL2D and fix IM2COL (#9943)
2024-10-23 github-actions... flake.lock: Update
2024-10-22 Xuan Son Nguyenllama : fix empty batch causing llama_batch_allocr...
2024-10-22 Daniel Beveniusllama : rename batch to ubatch (#9950)
2024-10-22 Molly SophiaRwkv chat template fix (#10001)
2024-10-22 Xuan Son Nguyenlora : warn user if new token is added in the adapter...
2024-10-22 Molly Sophiallama : add chat template for RWKV-World + fix EOT...
2024-10-22 leo-pony[CANN] Adapt to dynamically loadable backends mechanism...
2024-10-22 Daniel Beveniusarg : fix typo in embeddings argument help [no ci]...
2024-10-21 Georgi Gerganovllama.vim : fix info text display [no ci] (#9787)
2024-10-21 Georgi Gerganovllama.vim : move info to the right of screen [no ci...
2024-10-21 Asghar Ghorbanireadme : update UI list (#9972)
2024-10-21 Daniel Beveniusarg : fix attention non-causal arg value hint (#9985)
2024-10-21 Georgi Gerganovllama.vim : plugin for Neovim (#9787)
2024-10-21 Georgi Gerganovggml : add asserts for type conversion in fattn kernels...
2024-10-21 Radoslav Gerganovrpc : pack only RPC structs (#9959)
2024-10-21 Georgi Gerganovllama : default sampling changes + greedy update (...
2024-10-21 Georgi Gerganovspeculative : fix handling of some input params (#9963)
2024-10-21 Neo Zhang Jianyufix mul_mat_vec_q and *_vec_q error (#9939)
2024-10-20 Loïc Carrèrereadme : update bindings list (#9951)
2024-10-20 icppWorldreadme : update infra list (#9942)
2024-10-18 Xuan Son Nguyenllama : remove all_pos_0, all_pos_1, all_seq_id from...
2024-10-18 Radoslav Gerganovrpc : backend refactoring (#9912)
2024-10-18 Ouadie EL FAROUKI[SYCL] Add SYCL Backend registry, device and Event...
2024-10-18 Ma Mingfeiadd amx kernel for gemm (#8998)
2024-10-18 Georgi Gerganovserver : add n_indent parameter for line indentation...
2024-10-17 Daniel Beveniusllama : rename batch_all to batch (#8881)
2024-10-17 Georgi Gerganovreadme : remove --memory-f32 references (#9925)
2024-10-17 Georgi Gerganovllama : change warning to debug log
2024-10-17 Georgi Gerganovllama : infill sampling handle very long tokens (#9924)
2024-10-17 Tim Wangreadme : update bindings list (#9918)
2024-10-17 Diego Devesavulkan : add backend registry / device interfaces ...
2024-10-16 Gilad S.fix: allocating CPU buffer with size `0` (#9917)
2024-10-16 Gilad S.fix: use `vm_allocate` to allocate CPU backend buffer...
2024-10-16 Daniel Beveniusllama : suppress conversion from 'size_t' to 'int'...
2024-10-16 Daniel Beveniusllava : fix typo in error message [no ci] (#9884)
2024-10-16 Joe Eli McIlvaingrammar : fix JSON Schema for string regex with top...
2024-10-16 Molly Sophiallama : add tensor name for "result_norm" (#9907)
2024-10-16 Alexey Parfenovserver : fix the disappearance of the end of the text...
2024-10-16 Georgi Gerganovsync : ggml
2024-10-16 Daniel Beveniusggml-alloc : remove buffer_id from leaf_alloc (ggml...
2024-10-16 leo-pony[CANN] Fix cann compilation error (#9891)
2024-10-15 Georgi Gerganovllama : add infill sampler (#9896)
2024-10-15 Georgi Gerganovserver : improve infill context reuse (#9894)
2024-10-15 MaggotHATEsampling : add XTC sampler (#9742)
2024-10-15 Georgi Gerganovserver : update preact (#9895)
2024-10-15 Michał Tuszyńskireadme : update bindings list (#9889)
2024-10-14 VoidIsVoidserver : handle "logprobs" field with false value ...
2024-10-14 agray3Vectorize load instructions in dmmv f16 CUDA kernel...
2024-10-13 Georgi Gerganovserver : accept extra_context for the infill endpoint...
2024-10-13 Georgi Gerganovserver : reuse cached context chunks (#9866)
2024-10-13 Georgi Gerganovflake.lock: Update (#9870)
2024-10-12 Georgi Gerganovserver : add option to time limit the generation phase...
2024-10-12 Georgi Gerganovserver : remove self-extend features (#9860)
2024-10-12 Georgi Gerganovserver : remove legacy system_prompt feature (#9857)
2024-10-12 Georgi Gerganovllama : improve infill support and special token detect...
2024-10-12 R0CKSTARmusa : update doc (#9856)
2024-10-11 Diego Devesaggml : move more prints to the ggml log system (#9839)
2024-10-10 Diego Devesacommon : use common_ prefix for common library function...
2024-10-10 Diego Devesarpc : add backend registry / device interfaces (#9812)
2024-10-10 R0CKSTARmusa: add docker image support (#9685)
2024-10-10 Diego Devesaexamples : do not use common library in simple example...
2024-10-09 Diego Devesacmake : do not build common library by default when...
2024-10-09 Georgi Gerganovperplexity : fix integer overflow (#9783)
2024-10-09 Georgi Gerganovexamples : remove llama.vim
2024-10-08 Diego Devesaggml : fix BLAS with unsupported types (#9775)
2024-10-08 Xuan Son Nguyenserver : better security control for public deployments...
2024-10-08 standby24x7scripts : fix spelling typo in messages and comments...
2024-10-07 Diego Devesaggml : add backend registry / device interfaces to...
2024-10-07 Andrew Minh... Update building for Android (#9672)
2024-10-07 Georgi Gerganovflake.lock: Update (#9753)
2024-10-07 Georgi Gerganovggml : add metal backend registry / device (#9713)
2024-10-07 Paul Tsochantarismetal : single allocation of encode_async block (#9747)
2024-10-06 Georgi Gerganovcontrib : simplify + minor edits [no ci]
2024-10-06 Georgi Gerganovreadme : fix typo [no ci]
2024-10-06 Georgi Gerganovsync : llama.cpp
2024-10-06 SRHMorrisvulkan : retry allocation with fallback flags (whisper...
2024-10-05 Georgi Gerganovrerank : use [SEP] token instead of [BOS] (#9737)
next