]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-02-04 github-actions... flake.lock: Update
2024-02-04 KawrakowAdding some imatrix tools (#5302)
2024-02-04 Welby Seelycmake : use set() for LLAMA_WIN_VER (#5298)
2024-02-03 Johannes Gäßlermake: add nvcc info print (#5310)
2024-02-03 Johannes Gäßlermake: fix nvcc optimization flags for host code (#5309)
2024-02-03 Martin Schwaighoferadd Vulkan support to Nix flake
2024-02-03 0cc4mVulkan Intel Fixes, Optimizations and Debugging Flags...
2024-02-03 Michael Klimenkorefactor : switch to emplace_back to avoid extra object...
2024-02-03 Jared Van BortelYaRN : store rope scaling type as int32_t in memory...
2024-02-03 BADRreadme : add tenere in the ui tools list (#5284)
2024-02-03 AidanBeltonSFix im2col with 32fp (#5286)
2024-02-02 kalomazeperplexity : fix KL divergence calculations on Windows...
2024-02-02 Georgi Gerganovscripts : parse wtype in server-llm.sh (#5167)
2024-02-02 Mirror Azurepy : add check for '.attn.masked_bias' layers to GPT2mo...
2024-02-02 AidanBeltonSTidy ggml-sycl (#5261)
2024-02-02 Xuan Son Nguyendocker : add build for SYCL, Vulkan + update readme...
2024-02-02 Meng, Hengyu[SYCL] get MAX_MEM_ALLOC from device property (#5270)
2024-02-02 Neo Zhang Jianyu[SYCL] update guide of SYCL backend (#5254)
2024-02-02 Ian Bullllama : fix memory leak in llama_batch_free (#5252)
2024-02-01 Neo Zhang Jianyuadd --no-mmap in llama-bench (#5257)
2024-02-01 0cc4mVulkan Phi Fix for AMD Proprietary Drivers (#5260)
2024-02-01 slarencuda : fix LLAMA_CUDA_F16 (#5262)
2024-02-01 Ali Nehzatmake : generate .a library for static linking (#5205)
2024-02-01 Guotengllama : support InternLM2 (#5184)
2024-01-31 EveFix broken Vulkan Cmake (properly) (#5230)
2024-01-31 Georgi Gerganovllama : reorder build_orion() at correct place (#5118)
2024-01-31 Georgi Gerganovllama : remove LLAMA_MAX_DEVICES and LLAMA_SUPPORTS_GPU...
2024-01-31 Georgi Gerganovmetal : add im2col F32 dst support (#5132)
2024-01-31 JidongZhang-THUllava : add MobileVLM support (#5132)
2024-01-31 Neo Zhang Jianyuformat license text, restore apache license by legal...
2024-01-31 slarenggml : limit n_threads to the max n_tasks (#5238)
2024-01-31 0cc4mVulkan Fixes (#5223)
2024-01-31 Yiming CuiFix typos of IQ2_XXS and IQ3_XXS in llama.cpp (#5231)
2024-01-31 Neo Zhang Jianyusupport SYCL backend windows build (#5208)
2024-01-31 Jared Van Bortelkompute : llama-bench support and ggml_cpu_has_kompute...
2024-01-30 Georgi GerganovRevert "server : change deps.sh xxd files to string...
2024-01-30 Georgi Gerganovserver : fix context shift (#5195)
2024-01-30 JohnnyBserver : change deps.sh xxd files to string literals...
2024-01-30 Kawrakowggml : fix IQ3_XXS on Metal (#5219)
2024-01-30 Georgi Gerganovsync : ggml (#0)
2024-01-30 Georgi Gerganovgguf : fix comparison (ggml/715)
2024-01-30 John Balis`ggml_cuda_cpy` support for 4d tensors and float16...
2024-01-30 Georgi Gerganovgguf : add input validation, prevent integer overflows...
2024-01-30 Georgi Gerganovci : fix yolo URLs + fix metal capture (ggml/712)
2024-01-30 Jack Mousseaumetal : add debug capture backend function (ggml/694)
2024-01-30 KawrakowFaster AVX2 dot product for IQ2_XS (#5187)
2024-01-30 KawrakowSOTA 3-bit quants (#5196)
2024-01-30 0cc4mVulkan Windows APU Memory Handling (#5199)
2024-01-30 Vladimir Malyutinquantize : fix typo (#5211)
2024-01-30 divinity76main : allow empty --prompt-cache file (#5176)
2024-01-30 Romain Neutronreadme : minor (#5204)
2024-01-30 Georgi Gerganovreadme : update hot topics
2024-01-30 Wu Jian Pingserver : improve README (#5209)
2024-01-29 Paul Tsochantarisggml alloc: Fix for null dereference on alloc failure...
2024-01-29 Jared Van Bortelkompute : fix fallback to CPU (#5201)
2024-01-29 Jared Van BortelNomic Vulkan backend (#4456)
2024-01-29 divinity76fix typo "RLIMIT_MLOCK" (#5175)
2024-01-29 Wu Jian Pingserver : embeddings compatibility for OpenAI (#5190)
2024-01-29 Georgi Gerganovpy : fix except (#5194)
2024-01-29 Sang-Kil Parkpy : improve BPE tokenizer support (#5189)
2024-01-29 slarenggml : add max buffer sizes to opencl and metal backend...
2024-01-29 Evecmake : fix Vulkan build (#5182)
2024-01-28 Paul Tsochantarismetal : free metal objects (#5161)
2024-01-28 Georgi Gerganovsync : ggml
2024-01-28 Georgi Gerganovggml : minor type fix (int64_t -> size_t)
2024-01-28 0cc4mggml : add Vulkan backend (#2059)
2024-01-28 Abhilash Majumderggml : add unified SYCL backend for Intel GPUs (#2690)
2024-01-28 Georgi Gerganovflake.lock: Update (#5162)
2024-01-28 Johannes GäßlerApply min_p to unsorted tokens (#5115)
2024-01-28 Johannes GäßlerTests for min_p, sampling queue (#5147)
2024-01-28 Marcus Dunnreadme : add link to rust bindings (#5148)
2024-01-28 sharpHLllama : add support for Orion-14B (#5118)
2024-01-28 Kyle Misteledocker : add server-first container images (#5157)
2024-01-27 Johnllava : support for Yi-VL and fix for mobileVLM (#5093)
2024-01-27 Georgi Gerganovsync : ggml
2024-01-27 Juddggml : check ggml_add src1 type (ggml/708)
2024-01-27 Michael KlimenkoRemove unused data and add fixes (#5154)
2024-01-27 Maximilian... server : add self-extend support (#5104)
2024-01-26 0cc4mAdd OpenCL add kernel (#5151)
2024-01-26 Jared Van Bortelcmake : pass CPU architecture flags to nvcc (#5146)
2024-01-26 slarencuda : fix tensor size calculation for non-split buffer...
2024-01-26 slarenggml-alloc : add 10% margin to the buffer sizes (#5149)
2024-01-26 snadampalggml : update softmax n_task calculation (#5126)
2024-01-26 Georgi Gerganovscripts : move run-with-preset.py from root to scripts...
2024-01-26 Georgi Gerganovtests : gitignore test-c.o
2024-01-26 Xuan Son Nguyenserver : refactored the task processing logic (#5065)
2024-01-26 crasmci : add model tests + script wrapper (#4586)
2024-01-26 Paul Tsochantarismetal : remove unused `n_buffers` and `buffers` (#5129)
2024-01-26 Riceball LEEgguf : fix "general.alignment" type in gguf_reader...
2024-01-26 Georgi Gerganovreadme : update hot topics
2024-01-26 KawrakowAnother bucket sort (#5109)
2024-01-25 XiaotaoChenreadme : add MobileVLM 1.7B/3B to the supported models...
2024-01-25 l3utterflyllama : dynamic temperature sampling (#4972)
2024-01-25 Jared Van Bortelexamples : make pydantic scripts pass mypy and support...
2024-01-25 Valentin Konovalovandroid : use release cmake build type by default ...
2024-01-25 KawrakowFix Q3_K_XS for MoE models (#5113)
2024-01-25 Georgi Gerganovmetal : show compile log messages
2024-01-24 Engininja2cuda : fix 2-bit quants on amd hip (#5105)
2024-01-24 Michael Hueschennix-shell: use addToSearchPath
2024-01-24 Michael Hueschennix: add cc to devShell LD_LIBRARY_PATH
next