]> git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog
pkg/ggml/sources/ggml
2024-05-11 Neo Zhangadd device version in device list (llama/6959)
2024-05-11 agray3Reset schedule earlier to allow overlap with ggml graph...
2024-05-11 slarenadd basic tensor data validation function (llama/6884)
2024-05-11 slarengguf : fix mismatch between alloc and free functions...
2024-05-11 Georgi GerganovMerge pull request from GHSA-p5mv-gjc5-mwqv
2024-05-11 Georgi Gerganovggml : fix redefinition of vaddvq_f32 for 32-bit ARM...
2024-05-11 Georgi Gerganovggml : fix MIN / MAX macros (llama/6904)
2024-05-11 Georgi Gerganovggml : move 32-bit arm compat in ggml-impl.h (llama...
2024-05-11 Justine Tunneyllamafile : improve sgemm.cpp (llama/6796)
2024-05-11 Dave Airlieggml : fix calloc argument ordering. (llama/6820)
2024-05-11 Georgi Gerganovggml : fix ggml_backend_cpu_supports_op() for CPY ...
2024-05-11 slarenggml : group all experts in a single ggml_mul_mat_id...
2024-05-11 Georgi Gerganovggml : fix llamafile sgemm wdata offsets (llama/6710)
2024-05-11 Justine Tunneyggml : add llamafile sgemm (llama/6414)
2024-05-11 Shijiellama : add qwen2moe (llama/6074)
2024-05-11 Neo Zhang Jianyufix mul_mat_id() for new input, make the ut pass (llama...
2024-05-11 DaveAdded support for GGML_OP_CLAMP in Metal (llama/6662)
2024-05-11 Neo Zhang Jianyufix memcpy() crash, add missed cmd in guide, fix softma...
2024-05-11 Johannes GäßlerCUDA: fix matrix multiplication logic for tests (llama...
2024-05-11 slarenmetal : unify mul_mv_id kernels (llama/6556)
2024-05-11 jiezllama : add gguf_remove_key + remove split meta during...
2024-05-01 Justina Chofeat: implemented sigmoid function (#806)
2024-04-25 Borislav Stanimirovbuild: fix and ignore msvc warnings (#805)
2024-04-18 Daniel Beveniusspec : fix typo in gguf.md (#798)
2024-04-09 Georgi Gerganovsync : whisper.cpp
2024-04-09 Brad Murraywhisper : fix DTW memory access (whisper/2012)
2024-04-09 ulatekhcommon : fix file-handle leak in read_wav() (whisper...
2024-04-09 Rotem Danmain : set stdin to binary mode on Windows (whisper...
2024-04-09 ulatekhmain : allow a response-file as the sole parameter...
2024-04-09 ulatekhwhisper : suppress tokens with a regex (whisper/1997)
2024-04-09 Georgi Gerganovsync : llama.cpp
2024-04-09 Georgi Gerganovlicense : update copyright notice + add AUTHORS
2024-04-09 Carolinabananallama : add Command R Plus support (llama/6491)
2024-04-09 Abhilash Majumderremove row=1 cond (llama/6532)
2024-04-09 Neo Zhang Jianyusupport/fix OPs GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS...
2024-04-09 Georgi Gerganovscripts : sync LICENSE and gen-authors.sh
2024-04-07 Georgi Gerganovsync : whisper.cpp (grammar-parser, skip)
2024-04-07 Georgi Gerganovsync : whisper.cpp
2024-04-07 Slava Primenkoggml: bypass code incompatible with CUDA < 11.1 (whispe...
2024-04-07 ulatekhmain : add command-style grammar (whisper/1998)
2024-04-07 Georgi Gerganovscripts : sync ggml-cuda folder
2024-04-06 Georgi Gerganovci : temporary build embeded metal library
2024-04-06 Georgi Gerganovsync : llama.cpp (skip)
2024-04-06 Ouadie EL FAROUKIFixed minor bug when enabling FP16 for non intel target...
2024-04-06 slarenggml : mul_mat_id use the same tensor for all the exper...
2024-04-06 Meng, HengyuDisable iqx on windows as WA (llama/6435)
2024-04-06 0cc4mVulkan k-quant mmq and ggml-backend offload functionali...
2024-04-06 Neo Zhang Jianyufix set main gpu crash (llama/6339)
2024-04-06 slarenggml : fix bounds checking of zero size views (llama...
2024-04-03 Daniel Beveniusbackend : fix typo in scheduler documentation (#781)
2024-04-03 Andreicmake : add `GGML_METAL_EMBED_LIBRARY` (#780)
2024-04-03 Georgi Gerganovlogo : add files (#782)
2024-03-27 JacobLinCoolgguf : update type enum (#775)
2024-03-27 Georgi Gerganovsync : whisper.cpp
2024-03-27 Georgi Gerganovsync : whisper.cpp
2024-03-27 Georgi Gerganovwhisper : improve handling of prompts (whisper/1981)
2024-03-27 Sanchit Gandhiwhisper : improve support for distil-large-v3 (whisper...
2024-03-27 denerscwhisper : token-level timestamps with DTW (whisper...
2024-03-27 Jo Lissexamples : rename --audio-context to --audio-ctx per...
2024-03-27 Georgi Gerganovexamples : more CUDA leftovers (#0)
2024-03-27 Georgi Gerganovexamples : fix CUBLAS leftovers (#0)
2024-03-27 Georgi Gerganovsync : adapt to CUDA changes (#0)
2024-03-27 Georgi Gerganovsync : llama.cpp
2024-03-27 AidanBeltonSFix batched impl for NVidia GPU (llama/6164)
2024-03-27 KawrakowMake IQ1_M work for QK_K = 64 (llama/6327)
2024-03-27 compiladellama : greatly reduce output buffer memory usage ...
2024-03-27 KawrakowIQ1_M: 1.75 bpw quantization (llama/6302)
2024-03-27 slarencuda : rename build flag to LLAMA_CUDA (llama/6299)
2024-03-27 Kawrakowtests : include IQ2_XXS and IQ2_XS in test-quantize...
2024-03-27 slarencuda : refactor into multiple files (llama/6269)
2024-03-27 Justine Tunneyggml : support AVX512VNNI (llama/6280)
2024-03-27 Rick GFix heap corruption from wmode out-of-bound writes...
2024-03-27 Meng, Hengyuoffload op (llama/6217)
2024-03-27 Jared Van Borteluse _wfopen instead of fopen on Windows (llama/6248)
2024-03-27 slarencuda : add LLAMA_CUDA_NO_PEER_COPY to workaround broken...
2024-03-27 Georgi Gerganovmetal : proper assert for mat-mat memory alignment...
2024-03-27 Georgi Gerganovmetal : pad n_ctx by 32 (llama/6177)
2024-03-27 slarencuda : disable host register by default (llama/6206)
2024-03-27 slarencuda : fix LLAMA_CUDA_F16 build (llama/6197)
2024-03-27 Kawrakowggml : same IQ4_NL quantization for CPU/CUDA/Metal...
2024-03-27 KawrakowAdd ability to use Q5_0, Q5_1, and IQ4_NL for quantized...
2024-03-27 AidanBeltonSAdd nvidia and amd backends (llama/6157)
2024-03-27 slarencuda : fix conflict with std::swap (llama/6186)
2024-03-27 slarencuda : print the returned error when CUDA initializatio...
2024-03-27 slarencuda : refactor to remove global resources (llama/6170)
2024-03-27 Abhilash Majumderincrease igpu cluster limit (llama/6159)
2024-03-27 slarenbackend : set max split inputs to GGML_MAX_SRC (llama...
2024-03-27 slarenbackend : offload large batches to GPU (llama/6083)
2024-03-27 GainLeeggml:fix finding transfer queue family index error...
2024-03-27 AmirAli Mirianggml : add AVX512F SIMD (llama/6088)
2024-03-27 slarencuda : disable unused cudaLaunchHostFunc code (llama...
2024-03-27 Neo Zhang Jianyufix set main gpu error (llama/6073)
2024-03-27 AidanBeltonSFix non-intel device selection (llama/6042)
2024-03-27 Ondřej Čertíkgguf : add support for I64 and F64 arrays (llama/6062)
2024-03-22 Bryan Lozanocmake : bump to 3.12 (#768)
2024-03-22 Bryan Lozanoexamples : fix simple (#770)
2024-03-18 Georgi Gerganovsync : whisper.cpp
2024-03-18 Georgi Gerganovwhisper : set outputs from conv graph (whisper/1959)
2024-03-18 slarenalloc : fix allocation data of pre-allocated leafs
2024-03-15 Georgi Gerganovsync : whisper.cpp
next