]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-01-22 slarenllama : fix not enough space in buffer with Qwen (...
2024-01-22 KawrakowKL-divergence (#5076)
2024-01-22 Reinforce-IIggml : parallelize FP32 conversion when using BLAS...
2024-01-22 XiaotaoChenllava : MobileVLM support (#4954)
2024-01-22 Someone Sergeflake.nix: add a comment about flakes vs nix
2024-01-22 Someone Sergenix: add a comment on the many nixpkgs-with-cuda instances
2024-01-22 Someone Sergenix: add a comment about makeScope
2024-01-22 Someone Sergenix: refactor the cleanSource rules
2024-01-22 Someone Sergeworkflows: nix-ci: drop the redundant "paths" filter
2024-01-22 Someone Sergeworkflows: nix-build-aarch64: rate limit
2024-01-22 Someone Sergeworkflows: nix-ci: rebuild on flake.lock updates
2024-01-22 Kawrakowimatrix : keep intermediate imatrix results (#5077)
2024-01-22 compiladellama : support StableLM 2 1.6B (#5052)
2024-01-22 Daniel Beveniusfinetune : print sample-start/include-sample-start...
2024-01-22 Kawrakowllama : add Q3_K_XS (#5060)
2024-01-22 bobqianicci : fix Windows CI by updating Intel SDE version ...
2024-01-22 Shijiellama : add more qwen2 models (#5071)
2024-01-21 iSmaRevert LLAMA_NATIVE to OFF in flake.nix (#5066)
2024-01-21 kuronekosaikoadd safetensors support to convert-lora-to-ggml.py...
2024-01-21 bobqianicadd `#include <string>` to unicode.h (#5051)
2024-01-21 KawrakowAdd ability to evauate multiple choice tasks (#5047)
2024-01-21 KawrakowSlightly faster imatrix (#5050)
2024-01-21 Georgi Gerganovflake.lock: Update (#5054)
2024-01-20 Jared Van Bortelconvert : partially revert PR #4818 (#5041)
2024-01-20 Jared Van Bortelperplexity : fix MSVC build after #5020 (#5043)
2024-01-20 slarenllama : run all KQV ops on the CPU with no KV offload...
2024-01-20 Herman Semenovcmake : add support for ccache (#5002)
2024-01-20 adel boussakenAdd a dart/flutter binding to README.md (#4882)
2024-01-20 Kylincuda : fix compile error in jetson platform (#4975)
2024-01-19 Uzo Nwekefinetune : fix ggml_allocr lifetimes (tmp workaround...
2024-01-19 Georgi Gerganovimatrix : add README.md
2024-01-19 Shijiellama : support upcoming Qwen2 (#5037)
2024-01-19 Georgi Gerganovpy : fix flake8 lint
2024-01-19 Kawrakowwinogrande: evaluate log-probs in parallel (#5036)
2024-01-19 chirankollama : add CodeShell support (#5016)
2024-01-19 Kawrakowperplexity: avoid unnecessary alloocations and logit...
2024-01-19 Georgi Gerganovperplexity : faster Winogrande via batching (#5024)
2024-01-18 Johnllama : fix falcon arch for tied output embeddings...
2024-01-18 Georgi Gerganovcmake : add ggml public headers (#5011)
2024-01-18 Xuan Son Nguyenserver : defer tasks when "slot unavailable" (#5018)
2024-01-18 slarenllama : fix mlock with no-mmap with Metal (#5025)
2024-01-18 Georgi Gerganovimatrix : fix assert for src0 non-cont check
2024-01-18 Georgi Gerganovperplexity : fix winogrande N tasks option
2024-01-18 Georgi Gerganovscripts : add get-winogrande.sh
2024-01-18 David Sommersconvert.py : fix llama/llama2 conversion due to vocab_s...
2024-01-18 KawrakowHellaSwag: speed up by parallelizing log-prob evaluatio...
2024-01-18 Georgi Gerganovperplexity : faster HellaSwag via batching (#5017)
2024-01-18 KawrakowAdd Winogrande evaluation (#5015)
2024-01-18 Georgi Gerganovscritps : add helper script to get hellaswag data in...
2024-01-18 Paul Tsochantarismetal : fix memory leak, dangling pointer and unused...
2024-01-17 Georgi Gerganovsync : ggml
2024-01-17 Georgi Gerganovggml : add IQ2 to test-backend-ops + refactoring (...
2024-01-17 Georgi Gerganovimatrix : offload to GPU support (#4957)
2024-01-17 Georgi Gerganovbackend : add eval callback (#4935)
2024-01-17 Georgi Gerganovmetal : create autorelease pool during library build...
2024-01-17 Georgi Gerganovpy : fix whitespace
2024-01-17 Georgi Gerganovpy : fix missing added_tokens_dict for SPM and BPE...
2024-01-17 Kawrakowllama : use Q4_K for attn_v for Q2_K_S when n_gqa ...
2024-01-17 Paul Tsochantarismetal : remove unnecessary nil check (#4986)
2024-01-17 David Renshawllama : fix copy/paste error in llama_sampling_params...
2024-01-16 Georgi Gerganovpy : remove unnecessary hasattr (#4903)
2024-01-16 Philip Taronnix: remove nixConfig from flake.nix (#4984)
2024-01-16 Daniel Beveniusfinetune : add training data file to log message (...
2024-01-16 Kawrakowggml : importance matrix support for legacy quants...
2024-01-16 Maximilian... examples : add complete parallel function calling examp...
2024-01-16 Georgi Gerganovperplexity : fix kv cache handling for hellaswag (...
2024-01-16 Georgi Gerganovflake.lock: update flake-parts, flake-parts/nixpkgs...
2024-01-16 Paul Tsochantarismetal : localized logic in `ggml_metal_graph_compute...
2024-01-16 Neuman Vongandroid : introduce starter project example (#4926)
2024-01-16 Alex Azarovmetal : replace loop of dispatch_async with dispatch_ap...
2024-01-16 Alex Azarovmetal : log `recommendedMaxWorkingSetSize` on iOS 16...
2024-01-16 Maximilian... examples : fix and improv docs for the grammar generato...
2024-01-16 Justine Tunneyggml : introduce GGML_CALL function annotation (#4850)
2024-01-16 Daniel Beveniusfinetune : use LLAMA_FILE_MAGIC_GGLA (#4961)
2024-01-16 stduhpfspeculative : threading options (#4959)
2024-01-15 ngc92pass cpu-architecture arguments only to host code ...
2024-01-15 David Friehsllama : apply classifier-free guidance to logits direct...
2024-01-15 Victor Z. Pengawq-py : fix typo in awq-py/README.md (#4947)
2024-01-15 Georgi Gerganovcuda : fix dequantize kernel names (#4938)
2024-01-15 Kawrakowllama : check for 256 divisibility for IQ2_XS, IQ2_XXS...
2024-01-15 KawrakowCUDA: faster dequantize kernels for Q4_0 and Q4_1 ...
2024-01-14 David Pflugllama : fix missing quotes (#4937)
2024-01-14 KawrakowAdd ability to use importance matrix for all k-quants...
2024-01-14 Georgi Gerganovllama : check LLAMA_TRACE env for extra logging (#4929)
2024-01-14 Georgi Gerganovscripts : sync-ggml-am.sh option to skip commits
2024-01-14 Georgi Gerganovllama : use LLAMA_LOG_ macros for logging
2024-01-14 KawrakowFix ffn_down quantization mix for MoE models (#4927)
2024-01-14 Alex Azarovmetal : correctly set SIMD support flags on iOS (#4923)
2024-01-14 Karthik Kumar... llama : support WinXP build with MinGW 8.1.0 (#3419)
2024-01-14 Kawrakow2-bit quantizations (#4897)
2024-01-14 KawrakowMake Q3_K_S be the same as olf Q3_K_L for Mixtral-8x7B...
2024-01-13 Georgi Gerganovsync : ggml
2024-01-13 Johannes Gäßlerggml: cache sin/cos for RoPE (#4908)
2024-01-13 Georgi Gerganovmetal : remove old API (#4919)
2024-01-13 Georgi Gerganovserver : fix prompt caching with system prompt (#4914)
2024-01-13 Georgi Gerganovllama : fix detokenization of non-special added-tokens...
2024-01-13 Georgi Gerganovmetal : disable log for loaded kernels (#4794)
2024-01-13 David Friehsllama : minimize size used for state save/load (#4820)
2024-01-13 Someoneworkflows: unbreak nix-build-aarch64, and split it...
2024-01-13 Yann Folletmain : add parameter --no-display-prompt (#4541)
next