2023-06-06 |
Spencer Sutton | metal : add checks for buffer size (#1706) |
2023-06-05 |
Yuval Peled | docs : add performance troubleshoot + example benchmark... |
2023-06-05 |
Foul-Tarnished | readme : fix typo (#1700) |
2023-06-05 |
mgroeber9110 | llama : consistently catch and throw only exceptions... |
2023-06-05 |
kiltyj | metal : use shared buffers between CPU and GPU (#1696) |
2023-06-05 |
grahameth | ggml : fix internal overflow in ggml_time_us on Windows... |
2023-06-05 |
Georgi Gerganov | ci : disable auto tidy (#1705) |
2023-06-05 |
Kawrakow | ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684) |
2023-06-05 |
Henri Vasserman | Increase 3B scratch buffers. (#1698) |
2023-06-05 |
Georgi Gerganov | llama : fix Metal KV cache sync (close #1695) |
2023-06-04 |
Georgi Gerganov | readme : update hot topics |
2023-06-04 |
Georgi Gerganov | llama : Metal inference (#1642) |
2023-06-04 |
0cc4m | OpenCL: Fix duplication of layers in VRAM and RAM,... |
2023-06-03 |
Henri Vasserman | Add info about CUDA_VISIBLE_DEVICES (#1682) |
2023-06-03 |
Jiří Podivín | Docker: change to calling convert.py (#1641) |
2023-06-03 |
Evan Jones | Fix prompt cache saving and chat-persistent rollover... |
2023-05-30 |
Henri Vasserman | OpenLLaMA 3B support (#1588) |
2023-05-29 |
Georgi Gerganov | ggml : sync cgraph import / export API |
2023-05-29 |
Georgi Gerganov | ggml : fix bug in ggml_alibi |
2023-05-29 |
DannyDaemonic | Work around for recalculating logits in cached prompts... |
2023-05-29 |
Jiří Podivín | Adding git in container package dependencies (#1621) |
2023-05-28 |
Johannes Gäßler | LLAMA_DEBUG adds debug symbols (#1617) |
2023-05-28 |
Kerfuffle | Only show -ngl option when relevant + other doc/arg... |
2023-05-28 |
Vladimir Zorin | examples : add --alias option to gpt_params to set... |
2023-05-28 |
Howard Su | opencl : no need to allocate cl_mem on heap (#1612) |
2023-05-28 |
Howard Su | opencl : use strstr to check if fp16 supported (#1611) |
2023-05-27 |
apcameron | ggml : add support for the RISCV architecture (#1616) |
2023-05-27 |
Kerfuffle | Include server in releases + other build system cleanup... |
2023-05-27 |
Henri Vasserman | Add documentation about CLBlast (#1604) |
2023-05-27 |
Henri Vasserman | [CI] Fix openblas (#1613) |
2023-05-27 |
Georgi Gerganov | ggml : add ggml_tensor_overhead() |
2023-05-27 |
Henri Vasserman | [CI] CLBlast: Fix directory name (#1606) |
2023-05-27 |
Georgi Gerganov | ggml : sync ggml core (minor additions, e.g. ggml_get_t... |
2023-05-26 |
Kerfuffle | Some improvements to loading the session with --prompt... |
2023-05-25 |
Johannes Gäßler | cuda : performance optimizations (#1530) |
2023-05-24 |
Henri Vasserman | Update CLBlast to 1.6.0 (#1580) |
2023-05-24 |
Evan Jones | readme : add docs for chat-persistent.sh (#1568) |
2023-05-24 |
Senemu | chat-persistent.sh : use bracket expressions in grep... |
2023-05-23 |
Maarten ter... | Fix handling of "invalid property" when creating OpenCL... |
2023-05-22 |
0cc4m | OpenCL Token Generation Acceleration (#1459) |
2023-05-21 |
Steward Garcia | examples : add server example with REST API (#1443) |
2023-05-21 |
Stefan Sydow | make : .PHONY clean (#1553) |
2023-05-21 |
Georgi Gerganov | ggml : output 3d sizes in ggml_graph_dump_dot() |
2023-05-20 |
Georgi Gerganov | ggml : update WASM SIMD |
2023-05-20 |
Zenix | feature : support blis and other blas implementation... |
2023-05-20 |
Henri Vasserman | OpenCL: Fixes for older devices. (#1435) |
2023-05-20 |
Juuso Alasuutari | llama : define magic numbers as integer constants ... |
2023-05-20 |
Georgi Gerganov | ggml : add ggml_clamp() (#1539) |
2023-05-20 |
Johannes Gäßler | cuda : loading models directly into VRAM, norm calculat... |
2023-05-20 |
Georgi Gerganov | Revert "feature : add blis and other BLAS implementatio... |
2023-05-20 |
Zenix | feature : add blis and other BLAS implementation suppor... |
2023-05-20 |
Georgi Gerganov | llama : add llama_init_backend() API (close #1527) |
2023-05-20 |
DannyDaemonic | Fix for mingw (#1462) |
2023-05-20 |
Maxime | llama : fix name shadowing and C4146 (#1526) |
2023-05-20 |
Georgi Gerganov | llama : fix compile warnings in llama_set_state_data() |
2023-05-20 |
Georgi Gerganov | ggml : fix scalar implementation of Q4_1 dot |
2023-05-19 |
Georgi Gerganov | ggml : use F16 instead of F32 in Q4_0, Q4_1, Q8_0 ... |
2023-05-19 |
Georgi Gerganov | tests : add missing header |
2023-05-19 |
Evan Jones | examples : add persistent chat (#1495) |
2023-05-19 |
Jason McCartney | main : make reverse prompt option act as a stop token... |
2023-05-19 |
David Kennedy | readme : adds WizardLM to the list of supported models... |
2023-05-19 |
Georgi Gerganov | minor : fix compile warnings |
2023-05-18 |
Erik Scholz | make kv_f16 the default for api users (#1517) |
2023-05-18 |
DannyDaemonic | Fixes #1511 lambda issue for w64devkit (mingw) (#1513) |
2023-05-17 |
Stephan Walter | Remove unused n_parts parameter (#1509) |
2023-05-17 |
rankaiyx | benchmark-matmul: Print the average of the test results... |
2023-05-16 |
Tom Jobbins | convert.py: Support models which are stored in a single... |
2023-05-16 |
Ilya Kurdyukov | ~7% faster Q5_1 AVX2 code (#1477) |
2023-05-16 |
András Salamon | define default model path once, sync path with readme... |
2023-05-16 |
sandyiscool | Add alternate include path for openblas (#1476) |
2023-05-15 |
zrm | fix get_num_physical_cores() (#1436) |
2023-05-14 |
slaren | benchmark-matmul: fix clang-tidy issues, report results... |
2023-05-14 |
Johannes Gäßler | cuda : deduplicated dequantization code (#1453) |
2023-05-14 |
xaedes | ggml : alternative fix for race condition bug in non... |
2023-05-14 |
Georgi Gerganov | ggml : various fixes (#1450) |
2023-05-14 |
katsu560 | ggml : add AVX support based on AVX2 code (#1430) |
2023-05-14 |
Georgi Gerganov | ggml : add GGML_QNT_VERSION to track quantization forma... |
2023-05-13 |
Georgi Gerganov | cuda : fix convert function (#1412) |
2023-05-13 |
Georgi Gerganov | make : fix PERF build with cuBLAS |
2023-05-13 |
Georgi Gerganov | llama : fix unused warning |
2023-05-13 |
Georgi Gerganov | ggml : multi-thread mul and diag_mask ops (#1428) |
2023-05-13 |
Johannes Gäßler | ggml : GPU-accelerated token generation (#1412) |
2023-05-13 |
xaedes | ggml : implement backward pass for llama + small traini... |
2023-05-13 |
Georgi Gerganov | ggml : sync alibi fix from ggml repo |
2023-05-13 |
3ooabkhxtn | Adding SSE instructions to ggml_vec_dot_q4_0_q8_0 ... |
2023-05-13 |
Georgi Gerganov | llama : fix various warnings |
2023-05-13 |
Rinne | embedding : remove unused code (#1426) |
2023-05-13 |
Georgi Gerganov | readme : update Q4_0 perplexities |
2023-05-13 |
Georgi Gerganov | llama : free ggml context in set / copy state data... |
2023-05-13 |
Henri Vasserman | opencl : fix kernels for the new formats (#1422) |
2023-05-12 |
Georgi Gerganov | llama : fix --mtest option (close #1414) |
2023-05-12 |
Johannes Gäßler | CLI args use - instead of _, backwards compatible ... |
2023-05-12 |
slaren | Add clang-tidy reviews to CI (#1407) |
2023-05-12 |
Rinne | readme : add C#/.NET bindings repo (#1409) |
2023-05-11 |
Georgi Gerganov | ggml : remove bit shuffling (#1405) |
2023-05-11 |
CRD716 | prompts : model agnostic DAN (#1304) |
2023-05-10 |
Evan Jones | main : add option to save full output to session (... |
2023-05-09 |
DannyDaemonic | Locale fix for Windows (#1379) |
2023-05-09 |
Sami Farin | use pause asm insn in busyloop to run the CPU (13600K... |
2023-05-09 |
DannyDaemonic | Interface improvements and `--multiline-input` (previou... |