]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2023-06-06 Spencer Suttonmetal : add checks for buffer size (#1706)
2023-06-05 Yuval Peleddocs : add performance troubleshoot + example benchmark...
2023-06-05 Foul-Tarnishedreadme : fix typo (#1700)
2023-06-05 mgroeber9110llama : consistently catch and throw only exceptions...
2023-06-05 kiltyjmetal : use shared buffers between CPU and GPU (#1696)
2023-06-05 grahamethggml : fix internal overflow in ggml_time_us on Windows...
2023-06-05 Georgi Gerganovci : disable auto tidy (#1705)
2023-06-05 Kawrakowggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)
2023-06-05 Henri VassermanIncrease 3B scratch buffers. (#1698)
2023-06-05 Georgi Gerganovllama : fix Metal KV cache sync (close #1695)
2023-06-04 Georgi Gerganovreadme : update hot topics
2023-06-04 Georgi Gerganovllama : Metal inference (#1642)
2023-06-04 0cc4mOpenCL: Fix duplication of layers in VRAM and RAM,...
2023-06-03 Henri VassermanAdd info about CUDA_VISIBLE_DEVICES (#1682)
2023-06-03 Jiří PodivínDocker: change to calling convert.py (#1641)
2023-06-03 Evan JonesFix prompt cache saving and chat-persistent rollover...
2023-05-30 Henri VassermanOpenLLaMA 3B support (#1588)
2023-05-29 Georgi Gerganovggml : sync cgraph import / export API
2023-05-29 Georgi Gerganovggml : fix bug in ggml_alibi
2023-05-29 DannyDaemonicWork around for recalculating logits in cached prompts...
2023-05-29 Jiří PodivínAdding git in container package dependencies (#1621)
2023-05-28 Johannes GäßlerLLAMA_DEBUG adds debug symbols (#1617)
2023-05-28 KerfuffleOnly show -ngl option when relevant + other doc/arg...
2023-05-28 Vladimir Zorinexamples : add --alias option to gpt_params to set...
2023-05-28 Howard Suopencl : no need to allocate cl_mem on heap (#1612)
2023-05-28 Howard Suopencl : use strstr to check if fp16 supported (#1611)
2023-05-27 apcameronggml : add support for the RISCV architecture (#1616)
2023-05-27 KerfuffleInclude server in releases + other build system cleanup...
2023-05-27 Henri VassermanAdd documentation about CLBlast (#1604)
2023-05-27 Henri Vasserman[CI] Fix openblas (#1613)
2023-05-27 Georgi Gerganovggml : add ggml_tensor_overhead()
2023-05-27 Henri Vasserman[CI] CLBlast: Fix directory name (#1606)
2023-05-27 Georgi Gerganovggml : sync ggml core (minor additions, e.g. ggml_get_t...
2023-05-26 KerfuffleSome improvements to loading the session with --prompt...
2023-05-25 Johannes Gäßlercuda : performance optimizations (#1530)
2023-05-24 Henri VassermanUpdate CLBlast to 1.6.0 (#1580)
2023-05-24 Evan Jonesreadme : add docs for chat-persistent.sh (#1568)
2023-05-24 Senemuchat-persistent.sh : use bracket expressions in grep...
2023-05-23 Maarten ter... Fix handling of "invalid property" when creating OpenCL...
2023-05-22 0cc4mOpenCL Token Generation Acceleration (#1459)
2023-05-21 Steward Garciaexamples : add server example with REST API (#1443)
2023-05-21 Stefan Sydowmake : .PHONY clean (#1553)
2023-05-21 Georgi Gerganovggml : output 3d sizes in ggml_graph_dump_dot()
2023-05-20 Georgi Gerganovggml : update WASM SIMD
2023-05-20 Zenixfeature : support blis and other blas implementation...
2023-05-20 Henri VassermanOpenCL: Fixes for older devices. (#1435)
2023-05-20 Juuso Alasuutarillama : define magic numbers as integer constants ...
2023-05-20 Georgi Gerganovggml : add ggml_clamp() (#1539)
2023-05-20 Johannes Gäßlercuda : loading models directly into VRAM, norm calculat...
2023-05-20 Georgi GerganovRevert "feature : add blis and other BLAS implementatio...
2023-05-20 Zenixfeature : add blis and other BLAS implementation suppor...
2023-05-20 Georgi Gerganovllama : add llama_init_backend() API (close #1527)
2023-05-20 DannyDaemonicFix for mingw (#1462)
2023-05-20 Maximellama : fix name shadowing and C4146 (#1526)
2023-05-20 Georgi Gerganovllama : fix compile warnings in llama_set_state_data()
2023-05-20 Georgi Gerganovggml : fix scalar implementation of Q4_1 dot
2023-05-19 Georgi Gerganovggml : use F16 instead of F32 in Q4_0, Q4_1, Q8_0 ...
2023-05-19 Georgi Gerganovtests : add missing header
2023-05-19 Evan Jonesexamples : add persistent chat (#1495)
2023-05-19 Jason McCartneymain : make reverse prompt option act as a stop token...
2023-05-19 David Kennedyreadme : adds WizardLM to the list of supported models...
2023-05-19 Georgi Gerganovminor : fix compile warnings
2023-05-18 Erik Scholzmake kv_f16 the default for api users (#1517)
2023-05-18 DannyDaemonicFixes #1511 lambda issue for w64devkit (mingw) (#1513)
2023-05-17 Stephan WalterRemove unused n_parts parameter (#1509)
2023-05-17 rankaiyxbenchmark-matmul: Print the average of the test results...
2023-05-16 Tom Jobbinsconvert.py: Support models which are stored in a single...
2023-05-16 Ilya Kurdyukov~7% faster Q5_1 AVX2 code (#1477)
2023-05-16 András Salamondefine default model path once, sync path with readme...
2023-05-16 sandyiscoolAdd alternate include path for openblas (#1476)
2023-05-15 zrmfix get_num_physical_cores() (#1436)
2023-05-14 slarenbenchmark-matmul: fix clang-tidy issues, report results...
2023-05-14 Johannes Gäßlercuda : deduplicated dequantization code (#1453)
2023-05-14 xaedesggml : alternative fix for race condition bug in non...
2023-05-14 Georgi Gerganovggml : various fixes (#1450)
2023-05-14 katsu560ggml : add AVX support based on AVX2 code (#1430)
2023-05-14 Georgi Gerganovggml : add GGML_QNT_VERSION to track quantization forma...
2023-05-13 Georgi Gerganovcuda : fix convert function (#1412)
2023-05-13 Georgi Gerganovmake : fix PERF build with cuBLAS
2023-05-13 Georgi Gerganovllama : fix unused warning
2023-05-13 Georgi Gerganovggml : multi-thread mul and diag_mask ops (#1428)
2023-05-13 Johannes Gäßlerggml : GPU-accelerated token generation (#1412)
2023-05-13 xaedesggml : implement backward pass for llama + small traini...
2023-05-13 Georgi Gerganovggml : sync alibi fix from ggml repo
2023-05-13 3ooabkhxtnAdding SSE instructions to ggml_vec_dot_q4_0_q8_0 ...
2023-05-13 Georgi Gerganovllama : fix various warnings
2023-05-13 Rinneembedding : remove unused code (#1426)
2023-05-13 Georgi Gerganovreadme : update Q4_0 perplexities
2023-05-13 Georgi Gerganovllama : free ggml context in set / copy state data...
2023-05-13 Henri Vassermanopencl : fix kernels for the new formats (#1422)
2023-05-12 Georgi Gerganovllama : fix --mtest option (close #1414)
2023-05-12 Johannes GäßlerCLI args use - instead of _, backwards compatible ...
2023-05-12 slarenAdd clang-tidy reviews to CI (#1407)
2023-05-12 Rinnereadme : add C#/.NET bindings repo (#1409)
2023-05-11 Georgi Gerganovggml : remove bit shuffling (#1405)
2023-05-11 CRD716prompts : model agnostic DAN (#1304)
2023-05-10 Evan Jonesmain : add option to save full output to session (...
2023-05-09 DannyDaemonicLocale fix for Windows (#1379)
2023-05-09 Sami Farinuse pause asm insn in busyloop to run the CPU (13600K...
2023-05-09 DannyDaemonicInterface improvements and `--multiline-input` (previou...
next