]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-09-29 Georgi Gerganovggml : fix GGML_MAX_N_THREADS + improve formatting...
2024-09-29 matiaslincommon : ensure llama_batch size does not exceed max...
2024-09-29 nopperlpy : add model class for Chameleon conversion (#9683)
2024-09-29 Georgi Gerganovcontrib : add Resources section (#9675)
2024-09-28 Georgi Gerganovllama : add reranking support (#9510)
2024-09-28 slarentest-backend-ops : use flops for some performance tests...
2024-09-28 Georgi Gerganovllama : add comment about thread-safety [no ci] (#9449)
2024-09-28 Zhenwei Jinvocab : refactor tokenizer to reduce init overhead...
2024-09-28 nopperlllama : add support for Chameleon (#8543)
2024-09-28 Aarni Koskelareadme : add tool (#9655)
2024-09-28 Dan Johanssonggml : add run-time detection of neon, i8mm and sve...
2024-09-28 Markus TavenrathEnable use to the rebar feature to upload buffers to...
2024-09-27 Georgi Gerganovreadme : update hot topics
2024-09-27 Borislav Stanimirovcmake : add option for common library (#9661)
2024-09-26 Neo Zhang Jianyu[SYCL] add missed dll file in package (#9577)
2024-09-26 R0CKSTARmtgpu: enable VMM (#9597)
2024-09-25 Xuan Son Nguyenci : fix docker build number and tag name (#9638)
2024-09-25 Charles Xuggml : remove assert for AArch64 GEMV and GEMM Q4 kerne...
2024-09-25 Xuan Son Nguyenserver : add more env vars, improve gen-docs (#9635)
2024-09-25 Gabe Goodhartllama : add IBM Granite MoE architecture (#9438)
2024-09-25 Dou Xinpengcann: fix crash when llama-bench is running on multiple...
2024-09-24 Eric Zhangggml : add AVX512DQ requirement for AVX512 builds ...
2024-09-24 Georgi Gerganovsync : ggml
2024-09-24 Georgi Gerganovexamples : adapt to ggml.h changes (ggml/0)
2024-09-24 Georgi Gerganovllama : keep track of all EOG tokens in the vocab ...
2024-09-24 Georgi Gerganovlog : add CONT level for continuing previous log entry...
2024-09-24 StrangeBytesDevserver : add newline after chat example (#9616)
2024-09-24 Georgi Gerganovsampling : avoid expensive softmax during greedy sampli...
2024-09-24 Max Krasnyanskythreads: fix msvc build without openmp (#9615)
2024-09-24 Ivancuda: add q8_0->f32 cpy operation (#9571)
2024-09-23 Xuan Son Nguyenserver : add --no-context-shift option (#9607)
2024-09-23 Max Krasnyanskythreads: improve ggml_barrier scaling with large number...
2024-09-23 Riceball LEEreadme : add programmable prompt engine language CLI...
2024-09-23 Georgi Gerganovflake.lock: Update (#9586)
2024-09-23 Srihari-mcwggml : AVX512 gemm for Q4_0_8_8 (#9532)
2024-09-23 Georgi Gerganovperplexity : remove extra new lines after chunks (...
2024-09-23 Georgi Gerganovmetal : use F32 prec for K*Q in vec FA (#9595)
2024-09-23 Akarshan BiswasRevert "[SYCL] fallback mmvq (#9088)" (#9579)
2024-09-22 R0CKSTARmusa: enable building fat binaries, enable unified...
2024-09-22 Molly SophiaFix merge error in #9454 (#9589)
2024-09-22 Johannes GäßlerCUDA: enable Gemma FA for HIP/Pascal (#9581)
2024-09-22 Shankarllama: remove redundant loop when constructing ubatch...
2024-09-22 Molly SophiaRWKV v6: RWKV_WKV op CUDA implementation (#9454)
2024-09-21 slarenggml-alloc : fix list of allocated tensors with GGML_AL...
2024-09-21 agray3Update CUDA graph on scale change plus clear nodes...
2024-09-21 Huang QiCI: Provide prebuilt windows binary for hip (#9467)
2024-09-20 slarenquantize : improve type name parsing (#9570)
2024-09-20 Georgi Gerganovggml : fix builds (#0)
2024-09-20 Georgi Gerganovggml : fix trailing whitespace (#0)
2024-09-20 Georgi Gerganovsync : ggml
2024-09-20 Johannes Gäßlerggml/examples: add backend support for numerical optimi...
2024-09-20 Georgi Gerganovexamples : add null threadpool args where needed (ggml/0)
2024-09-20 Johannes GäßlerCUDA: fix sum.cu compilation for CUDA < 11.7 (#9562)
2024-09-20 Georgi Gerganovexamples : flush log upon ctrl+c (#9559)
2024-09-20 Sigbjørn Skjæretperplexity : do not escape input data by default (...
2024-09-19 Georgi Gerganovserver : clean-up completed tasks from waiting list...
2024-09-19 Sigbjørn Skjæretimatrix : disable prompt escape by default (#9543)
2024-09-18 slarenggml : fix n_threads_cur initialization with one thread...
2024-09-18 Georgi Gerganovscripts : verify py deps at the start of compare (...
2024-09-18 Daniel Beveniusllama : use reserve/emplace_back in sampler_sample...
2024-09-18 Vinesh Janarthananserver : match OAI structured output response (#9527)
2024-09-18 Eric Zhangserver : fix OpenSSL build (remove obsolete `LOG_INFO...
2024-09-18 Neo Zhang Jianyu[SYCL]set context default value to avoid memory issue...
2024-09-17 Michael Podvitskiyllama-bench: correct argument parsing error message...
2024-09-17 Bert Wagnerarg : add env variable for parallel (#9513)
2024-09-17 Michael Podvitskiyllama : fix n_vocab init for 'no_vocab' case (#9511)
2024-09-17 Max Krasnyanskythreadpool : skip polling for unused threads (#9461)
2024-09-17 Yuri Khrustalevunicode : add <algorithm> (#9508)
2024-09-17 Gabe Goodhartllama : support IBM Granite architecture (#9412)
2024-09-17 Michael Podvitskiyllama : add llama_n_head() (#9512)
2024-09-16 slarenggml : move common CPU backend impl to new header ...
2024-09-16 Daniel Beveniusllama : rename n_embed to n_embd in rwkv6_time_mix...
2024-09-16 Michael Podvitskiyggml : link MATH_LIBRARY not by its full path (#9339)
2024-09-16 compiladeconvert : identify missing model files (#9397)
2024-09-16 Georgi Gerganovcmake : do not hide GGML options + rename option (...
2024-09-16 Eveggml : IQ4_NL sgemm + Q4_0 AVX optimization (#9422)
2024-09-16 Shane Allama : support OLMoE (#9462)
2024-09-16 CarryFunllama : support MiniCPM3 (#9322)
2024-09-16 Vinesh Janarthananmain : option to disable context shift (#9484)
2024-09-16 Georgi Gerganovmetal : handle zero-sized allocs (#9466)
2024-09-16 Georgi Gerganovflake.lock: Update (#9488)
2024-09-15 Georgi Gerganovcommon : reimplement logging (#9418)
2024-09-15 slarengguf-split : add basic checks (#9499)
2024-09-15 Michael Podvitskiycmake : correct order of sycl flags (#9497)
2024-09-15 Csaba Kecskemetipy : add "LLaMAForCausalLM" conversion support (#9485)
2024-09-15 OSecretreadme : update tools list (#9475)
2024-09-15 Michael Podvitskiycmake : try to fix sycl+intel build (#9487)
2024-09-14 Yuri Khrustalevggml : ggml_type_name return "NONE" for invalid values...
2024-09-14 VoidIsVoidserver: add data: [DONE] to /chat/completions stream...
2024-09-14 Georgi Gerganovcmake : use list(APPEND ...) instead of set() + dedup...
2024-09-14 Daniel Beveniusllama : make cell_id const in inp_s_mask block (#9470)
2024-09-13 Xuan Son Nguyenserver : add loading html page while model is loading...
2024-09-13 Georgi Gerganovllama : llama_perf + option to disable timings during...
2024-09-13 Gilad S.feat: remove a sampler from a chain (#9445)
2024-09-12 Mathijs Henquetserver : Add option to return token pieces in /tokenize...
2024-09-12 Dou Xinpengcann: Add host buffer type for Ascend NPU (#9406)
2024-09-12 fengerhu1llava : fix the script error in MobileVLM README (...
2024-09-12 Xuan Son Nguyenlora : raise error if lm_head is ignored (#9103)
2024-09-12 Michael Podvitskiycmake : fix for builds without `GGML_CDEF_PUBLIC` ...
2024-09-12 Huang Qici : update HIP SDK to 24.Q3 (ROCm 6.1) (#9329)
next