git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2024-10-03  Johannes Gäßler  ggml/ex: calculate accuracy in graph, adapt MNIST ...
2024-10-03  Johannes Gäßler  ggml: refactor cross entropy loss CPU impl. (ggml/976)
2024-10-03  Jack Mousseau  metal : fix compute pass descriptor autorelease crash...
2024-10-03  Diego Devesa  ggml-backend : add device description to CPU backend...
2024-10-03  bandoti  ggml: unify backend logging mechanism (#9709)
2024-10-03  compilade  convert : handle tokenizer merges format from transform...
2024-10-03  Radoslav Gerganov  rpc : enable vulkan (#9714)
2024-10-03  Ouadie EL FAROUKI  Fixed dequant precision issues in Q4_1 and Q5_1 (#9711)
2024-10-02  Diego Devesa  ggml-backend : add device and backend reg interfaces...
2024-10-02  Xuan Son Nguyen  llama : reduce compile time and binary size (#9712)
2024-10-02  Alberto Cabrera...  [SYCL] Initial cmake support of SYCL for AMD GPUs ...
2024-10-02  Radoslav Gerganov  vulkan : do not use tensor->extra (#9407)
2024-10-02  Zhenwei Jin  gguf-split : improve --split and --merge logic (#9619)
2024-10-02  Georgi Gerganov  examples : remove benchmark (#9704)
2024-10-01  Paweł Wodnicki  Update README.md (#9591)
2024-10-01  Georgi Gerganov  sync : ggml
2024-10-01  Johannes Gäßler  test: fix OPT_STEP_ADAMW for test-backend-ops (ggml...
2024-10-01  Salvatore Mesoraca  vulkan : mul_mat: fix UB with small warps (ggml/952)
2024-10-01  Borislav Stanimirov  ggml : fix ggml_cast (ggml/973)
2024-10-01  Johannes Gäßler  ggml: fix gradient allocation logic (ggml/966)
2024-10-01  Georgi Gerganov  metal : reduce command encoding overhead (#9698)
2024-10-01  Georgi Gerganov  llama : print correct model type for Llama 3.2 1B and 3B
2024-10-01  compilade  convert : refactor rope_freqs generation (#9396)
2024-09-30  serhii-nakon  Fix Docker ROCM builds, use AMDGPU_TARGETS instead...
2024-09-30  compilade  ci : reduce severity of unused Pyright ignore comments...
2024-09-30  vb  py : update transfomers version (#9694)
2024-09-30  Georgi Gerganov  flake.lock: Update (#9680)
2024-09-30  Ruchira Hasaranga  console : utf-8 fix for windows stdin (#9690)
2024-09-29  Georgi Gerganov  ggml : define missing HWCAP flags (#9684)
2024-09-29  Georgi Gerganov  sync : ggml
2024-09-29  Johannes Gäßler  CUDA: remove bad assert (ggml/972)
2024-09-29  Jeff Bolz  vulkan : multithread pipeline creation (ggml/963)
2024-09-29  Jeff Bolz  vulkan : fix build for GGML_VULKAN_RUN_TESTS, add TFLOP...
2024-09-29  Salvatore Mesoraca  vulkan : argsort barriers must be under uniform control...
2024-09-29  Georgi Gerganov  ggml : fix GGML_MAX_N_THREADS + improve formatting...
2024-09-29  matiaslin  common : ensure llama_batch size does not exceed max...
2024-09-29  nopperl  py : add model class for Chameleon conversion (#9683)
2024-09-29  Georgi Gerganov  contrib : add Resources section (#9675)
2024-09-28  Georgi Gerganov  llama : add reranking support (#9510)
2024-09-28  slaren  test-backend-ops : use flops for some performance tests...
2024-09-28  Georgi Gerganov  llama : add comment about thread-safety [no ci] (#9449)
2024-09-28  Zhenwei Jin  vocab : refactor tokenizer to reduce init overhead...
2024-09-28  nopperl  llama : add support for Chameleon (#8543)
2024-09-28  Aarni Koskela  readme : add tool (#9655)
2024-09-28  Dan Johansson  ggml : add run-time detection of neon, i8mm and sve...
2024-09-28  Markus Tavenrath  Enable use to the rebar feature to upload buffers to...
2024-09-27  Georgi Gerganov  readme : update hot topics
2024-09-27  Borislav Stanimirov  cmake : add option for common library (#9661)
2024-09-26  Neo Zhang Jianyu  [SYCL] add missed dll file in package (#9577)
2024-09-26  R0CKSTAR  mtgpu: enable VMM (#9597)
2024-09-25  Xuan Son Nguyen  ci : fix docker build number and tag name (#9638)
2024-09-25  Charles Xu  ggml : remove assert for AArch64 GEMV and GEMM Q4 kerne...
2024-09-25  Xuan Son Nguyen  server : add more env vars, improve gen-docs (#9635)
2024-09-25  Gabe Goodhart  llama : add IBM Granite MoE architecture (#9438)
2024-09-25  Dou Xinpeng  cann: fix crash when llama-bench is running on multiple...
2024-09-24  Eric Zhang  ggml : add AVX512DQ requirement for AVX512 builds ...
2024-09-24  Georgi Gerganov  sync : ggml
2024-09-24  Georgi Gerganov  examples : adapt to ggml.h changes (ggml/0)
2024-09-24  Georgi Gerganov  llama : keep track of all EOG tokens in the vocab ...
2024-09-24  Georgi Gerganov  log : add CONT level for continuing previous log entry...
2024-09-24  StrangeBytesDev  server : add newline after chat example (#9616)
2024-09-24  Georgi Gerganov  sampling : avoid expensive softmax during greedy sampli...
2024-09-24  Max Krasnyansky  threads: fix msvc build without openmp (#9615)
2024-09-24  Ivan  cuda: add q8_0->f32 cpy operation (#9571)
2024-09-23  Xuan Son Nguyen  server : add --no-context-shift option (#9607)
2024-09-23  Max Krasnyansky  threads: improve ggml_barrier scaling with large number...
2024-09-23  Riceball LEE  readme : add programmable prompt engine language CLI...
2024-09-23  Georgi Gerganov  flake.lock: Update (#9586)
2024-09-23  Srihari-mcw  ggml : AVX512 gemm for Q4_0_8_8 (#9532)
2024-09-23  Georgi Gerganov  perplexity : remove extra new lines after chunks (...
2024-09-23  Georgi Gerganov  metal : use F32 prec for K*Q in vec FA (#9595)
2024-09-23  Akarshan Biswas  Revert "[SYCL] fallback mmvq (#9088)" (#9579)
2024-09-22  R0CKSTAR  musa: enable building fat binaries, enable unified...
2024-09-22  Molly Sophia  Fix merge error in #9454 (#9589)
2024-09-22  Johannes Gäßler  CUDA: enable Gemma FA for HIP/Pascal (#9581)
2024-09-22  Shankar  llama: remove redundant loop when constructing ubatch...
2024-09-22  Molly Sophia  RWKV v6: RWKV_WKV op CUDA implementation (#9454)
2024-09-21  slaren  ggml-alloc : fix list of allocated tensors with GGML_AL...
2024-09-21  agray3  Update CUDA graph on scale change plus clear nodes...
2024-09-21  Huang Qi  CI: Provide prebuilt windows binary for hip (#9467)
2024-09-20  slaren  quantize : improve type name parsing (#9570)
2024-09-20  Georgi Gerganov  ggml : fix builds (#0)
2024-09-20  Georgi Gerganov  ggml : fix trailing whitespace (#0)
2024-09-20  Georgi Gerganov  sync : ggml
2024-09-20  Johannes Gäßler  ggml/examples: add backend support for numerical optimi...
2024-09-20  Georgi Gerganov  examples : add null threadpool args where needed (ggml/0)
2024-09-20  Johannes Gäßler  CUDA: fix sum.cu compilation for CUDA < 11.7 (#9562)
2024-09-20  Georgi Gerganov  examples : flush log upon ctrl+c (#9559)
2024-09-20  Sigbjørn Skjæret  perplexity : do not escape input data by default (...
2024-09-19  Georgi Gerganov  server : clean-up completed tasks from waiting list...
2024-09-19  Sigbjørn Skjæret  imatrix : disable prompt escape by default (#9543)
2024-09-18  slaren  ggml : fix n_threads_cur initialization with one thread...
2024-09-18  Georgi Gerganov  scripts : verify py deps at the start of compare (...
2024-09-18  Daniel Bevenius  llama : use reserve/emplace_back in sampler_sample...
2024-09-18  Vinesh Janarthanan  server : match OAI structured output response (#9527)
2024-09-18  Eric Zhang  server : fix OpenSSL build (remove obsolete `LOG_INFO...
2024-09-18  Neo Zhang Jianyu  [SYCL]set context default value to avoid memory issue...
2024-09-17  Michael Podvitskiy  llama-bench: correct argument parsing error message...
2024-09-17  Bert Wagner  arg : add env variable for parallel (#9513)
2024-09-17  Michael Podvitskiy  llama : fix n_vocab init for 'no_vocab' case (#9511)