git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog
2024-01-27 slaren llama : run all KQV ops on the CPU with no KV offload...
2024-01-27 Kylin cuda : fix compile error in jetson platform (llama...
2024-01-26 Neuman Vong gpt-2 : clarify instructions for CLBlast on Android...
2024-01-26 Judd ggml : check ggml_add src1 type (#708)
2024-01-22 Jack Vial mnist : add tensorflow and keras to requirements.txt...
2024-01-18 Georgi Gerganov sync : whisper.cpp
2024-01-18 Georgi Gerganov sync : llama.cpp
2024-01-18 Paul Tsochantaris metal : fix memory leak, dangling pointer and unused...
2024-01-18 Georgi Gerganov ggml : fix SPM package headers
2024-01-17 Judd readme : add link (#699)
2024-01-17 Georgi Gerganov sync : llama.cpp
2024-01-17 Georgi Gerganov metal : update ggml-metal.m from llama.cpp
2024-01-17 Georgi Gerganov ggml : add IQ2 to test-backend-ops + refactoring (llama...
2024-01-17 Georgi Gerganov imatrix : offload to GPU support (llama/4957)
2024-01-17 Georgi Gerganov backend : add eval callback (llama/4935)
2024-01-17 Georgi Gerganov metal : create autorelease pool during library build...
2024-01-17 Kawrakow ggml : importance matrix support for legacy quants...
2024-01-17 Alex Azarov metal : log `recommendedMaxWorkingSetSize` on iOS 16...
2024-01-17 Justine Tunney ggml : introduce GGML_CALL function annotation (llama...
2024-01-17 Georgi Gerganov cuda : fix dequantize kernel names (llama/4938)
2024-01-17 Kawrakow CUDA: faster dequantize kernels for Q4_0 and Q4_1 ...
2024-01-17 Kawrakow Add ability to use importance matrix for all k-quants...
2024-01-14 Georgi Gerganov sync : whisper.cpp
2024-01-14 Georgi Gerganov sync : llama.cpp
2024-01-14 Alex Azarov metal : correctly set SIMD support flags on iOS (llama...
2024-01-14 Kawrakow 2-bit quantizations (llama/4897)
2024-01-13 Georgi Gerganov sync : whisper.cpp
2024-01-13 Georgi Gerganov whisper : load the model into multiple buffers of max...
2024-01-13 Georgi Gerganov sync : llama.cpp
2024-01-13 Georgi Gerganov examples : adapt to metal API
2024-01-13 Johannes Gäßler ggml: cache sin/cos for RoPE (llama/4908)
2024-01-13 Georgi Gerganov metal : remove old API (llama/4919)
2024-01-13 Georgi Gerganov metal : disable log for loaded kernels (llama/4794)
2024-01-13 texmex76 gguf : fix potential infinite for-loop (llama/4600)
2024-01-13 Georgi Gerganov metal : refactor kernel loading code (llama/4794)
2024-01-13 Johannes Gäßler CUDA: faster q8_0 -> f16 dequantization (llama/4895)
2024-01-12 Georgi Gerganov sync : whisper.cpp
2024-01-12 Georgi Gerganov ggml : fix 32-bit ARM compat for IQ2_XS (whisper/1758)
2024-01-12 Georgi Gerganov whisper : fix segment length with params.no_timestamps...
2024-01-12 George Hindle params : don't compute timestamps when not printing...
2024-01-12 slaren backend_sched : fix assignments
2024-01-12 Georgi Gerganov examples : remove obsolete starcoder mmap example
2024-01-12 slaren update gpt-2 example
2024-01-12 Georgi Gerganov sync : llama.cpp
2024-01-12 slaren llama : ggml-backend integration (llama/4766)
2024-01-12 Johannes Gäßler CUDA: fix softmax compile for old CUDA versions (llama...
2024-01-12 Kawrakow Importance Matrix calculation (llama/4861)
2024-01-11 Georgi Gerganov sync : whisper.cpp
2024-01-11 Georgi Gerganov sync : ggml
2024-01-11 Georgi Gerganov main : add cli option to disable system prints (whisper...
2024-01-11 Georgi Gerganov sync : llama.cpp
2024-01-11 Kawrakow ggml : SOTA 2-bit quants (add IQ2_XS) (llama/4856)
2024-01-11 Paul Tsochantaris metal : put encoder debug group behind a define (llama...
2024-01-11 Georgi Gerganov metal : improve dequantize precision to match CPU ...
2024-01-11 Georgi Gerganov ggml : fix vld1q_s8_x4 32-bit compat (llama/4828)
2024-01-11 Johannes Gäßler CUDA: faster softmax via shared memory + fp16 math...
2024-01-11 Georgi Gerganov metal : fix deprecation warning (#690)
2024-01-11 Timothy Cronin ggml : remove ggml_cpy_inplace and ggml_cont_inplace...
2024-01-10 otaGran Update README.md (#692)
2024-01-10 Jack Mousseau metal : wrap each operation in debug group (#690)
2024-01-10 leejet ggml : change GGML_MAX_NAME at compile time (#682)
2024-01-09 Halalaluyafail3 Fix execlp call (#689)
2024-01-08 Georgi Gerganov sync : llama.cpp
2024-01-08 Kawrakow SOTA 2-bit quants (llama/4773)
2024-01-08 Johannes Gäßler CUDA: fixed redundant value dequantization (llama/4809)
2024-01-08 Konstantin... ggml : use __builtin_amdgcn_sudot4 in __dp4a for gfx11...
2024-01-08 Georgi Gerganov ggml : do not sched_yield when calling BLAS (llama...
2024-01-08 Johannes Gäßler Print backend name on test-backend-ops failure (llama...
2024-01-08 Georgi Gerganov ggml : include stdlib.h before intrin.h (llama/4736)
2024-01-06 旺旺碎冰冰 fix ROCm on Windows (#683)
2024-01-05 postmasters gguf : add keys for kv sizes to spec (#676)
2024-01-05 Erik Scholz fix : cuda order of synchronization when setting a...
2024-01-05 Georgi Gerganov metal : switch back to default.metallib (#681)
2024-01-05 Georgi Gerganov ggml : fix q2_k bpw in comments (#680)
2024-01-04 Georgi Gerganov whisper : reset the "batched" timings (whisper/1721)
2024-01-04 Finn Voorhees ggml : add error handling to graph_compute (whisper...
2024-01-03 Georgi Gerganov scripts : allow to skip commits during sync (#678)
2024-01-03 Georgi Gerganov Update src/ggml-cuda.cu
2024-01-03 Georgi Gerganov cuda : mark I16 and I32 ops as unsupported
2024-01-03 Georgi Gerganov metal : add kernel_get_rows_i32
2024-01-03 Georgi Gerganov sync : llama.cpp
2024-01-03 Georgi Gerganov metal : optimize ggml_mul_mat_id (faster Mixtral PP...
2024-01-03 Georgi Gerganov metal : enable shader debugging (cmake option) (llama...
2024-01-03 Georgi Gerganov ggml : add ggml_vdotq_s32 alias (llama/4715)
2024-01-03 Johannes Gäßler CUDA: fixed tensor cores not being used on RDNA3 (llama...
2024-01-03 automaticcat ggml : add ggml_cpu_has_avx_vnni() (llama/4589)
2024-01-03 Johannes Gäßler CUDA: fix tensor core logic for Pascal and HIP (llama...
2024-01-03 hydai cuda: fix vmm oom issue on NVIDIA AGX Orin (llama/4687)
2024-01-03 Georgi Gerganov scripts : fix sync order + metal sed
2024-01-03 Ashraful Islam swift : add Swift Package declaration (#674)
2023-12-29 Guillaume Wenzek ggml : extend ggml_get_rows, ggml_repeat, ggml_concat...
2023-12-29 Georgi Gerganov scripts : do not sync synced commits
2023-12-29 Georgi Gerganov sync : whisper.cpp
2023-12-29 Tamotsu Takahashi ci : build with CLBlast + ggml-opencl use GGML_API...
2023-12-29 Georgi Gerganov scripts : print list of sync commits
2023-12-29 Georgi Gerganov scripts : fix format-patch range
2023-12-29 Georgi Gerganov scripts : do not sync commits from this repo
2023-12-29 Georgi Gerganov sync : whisper.cpp
2023-12-29 bobqianic whisper : Replace WHISPER_PRINT_DEBUG with WHISPER_LOG_...
2023-12-29 Georgi Gerganov scripts : add sync-whisper-am.sh