git.djapps.eu Git - pkg/ggml/sources/ggml/log

]> git.djapps.eu Git - pkg/ggml/sources/ggml/log

overview / pkg / ggml / sources / ggml / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

LoganDark [Fri, 16 Jun 2023 19:39:09 +0000 (12:39 -0700)]

ggml : return input tensor in ggml_set_name (#262)

this is SO USEFUL for debugging. in order to find any cgraph node,
I can wrap it in ggml_set_name and set a conditional breakpoint.

but I can only wrap existing code if this returns its input.
otherwise the barrier becomes annoyingly high (have to move a
bunch of code around to add name to a tensor)

commit | commitdiff | tree

LoganDark [Fri, 16 Jun 2023 19:17:30 +0000 (12:17 -0700)]

ggml : fix ggml_clamp (#263)

This unconditionally failed before

commit | commitdiff | tree

M. Yusuf Sarıgöz [Fri, 16 Jun 2023 17:36:46 +0000 (20:36 +0300)]

ggml : add quick GELU (#254)

* Implement Quick GELU

* Revert "Implement Quick GELU"

This reverts commit ff220cc1f91a184f195d19b17ed4c352cc72a6f0.

* Tidy up ggml.h

* Respect to the style of ggml

* Fix: Fix minor typo

* Rename `quick_gelu` -> `gelu_quick`

commit | commitdiff | tree

Andrei [Thu, 8 Jun 2023 18:51:39 +0000 (14:51 -0400)]

cmake : export all symbols on windows when building shared library (#234)

Currently building ggml on windows as a shared library does not export all symbols by default.

commit | commitdiff | tree

LoganDark [Wed, 7 Jun 2023 16:16:19 +0000 (09:16 -0700)]

ggml : correct off-by-one bounds check in ggml_compute_forward_set_f32 (#229)

without this fix you will be unable to set a zero-length tensor to the end of another tensor

this sounds stupid, but is used in my testing

commit | commitdiff | tree

klosax [Wed, 7 Jun 2023 16:15:50 +0000 (18:15 +0200)]

gpt-neox : fix ctx size calculation (#228)

commit | commitdiff | tree

Georgi Gerganov [Wed, 7 Jun 2023 16:14:50 +0000 (19:14 +0300)]

ggml : fix ggml_clamp thresholds being read as ints instead of floats (#221)

commit | commitdiff | tree

Jiahao Li [Wed, 7 Jun 2023 16:14:27 +0000 (00:14 +0800)]

ggml : add inplace ops api in header file (#219)

commit | commitdiff | tree

Georgi Gerganov [Fri, 2 Jun 2023 12:46:59 +0000 (15:46 +0300)]

ggml : add ggml_conv_2d_sk_p0(), ggml_win_part(), ggml_win_unpart()

commit | commitdiff | tree

Georgi Gerganov [Tue, 30 May 2023 10:49:08 +0000 (13:49 +0300)]

ggml : fix ggml op conv_1d enum names

commit | commitdiff | tree

Georgi Gerganov [Tue, 30 May 2023 10:19:55 +0000 (13:19 +0300)]

ggml : better conv_1d naming

commit | commitdiff | tree

Georgi Gerganov [Tue, 30 May 2023 07:18:31 +0000 (10:18 +0300)]

ggml : rename conv_1d ops to reflect half-padding used

commit | commitdiff | tree

Georgi Gerganov [Tue, 30 May 2023 07:03:30 +0000 (10:03 +0300)]

ggml : fix compiler warnings for printf

commit | commitdiff | tree

Georgi Gerganov [Mon, 29 May 2023 18:14:52 +0000 (21:14 +0300)]

mnist : remove redundant stuff + rename ctx0

commit | commitdiff | tree

Eldar Yusupov [Mon, 29 May 2023 16:55:13 +0000 (19:55 +0300)]

mnist : add missing header (#213)

commit | commitdiff | tree

Eldar Yusupov [Mon, 29 May 2023 16:47:57 +0000 (19:47 +0300)]

common : fix compilation on Linux (#212)

commit | commitdiff | tree

Georgi Gerganov [Mon, 29 May 2023 16:28:07 +0000 (19:28 +0300)]

ggml : cgraph export/import/eval example + GPU support (#108)

* ggml : cgraph export brainstorming

* mnist : code style

* mnist : minor

* ggml : initial cgraph export

* ggml : initial graph import (wip)

* ggml : import op args correctly

* ggml : add ggml_get_tensor_by_name()

* mnist : add compute graph evaluation on CPU example

* ggml : add ggml_tensor_overhead()

* ggml : rename new functions to ggml_cgraph_...

* mnist : add Metal inference skeleton (WIP)

* mnist : working on the Metal pipeline (WIP)

* mnist : prepare the Metal encoder (WIP)

* mnist : first Metal kernel for F32 ADD

* mnist : looks like MTLHeap does not work

* mnist : initial full pass of MNIST on the GPU (not verified)

* mnist : minor cleanup

* mnist : full GPU inference works

* mnist : use custom soft_max kernel since MPSMatrixSoftMax is bugged

* mnist : use constant for soft_max instead of hardcoded 10

* mnist : check multiple predictions (Metal)

* mnist : minor

* ggml : move cgraph import / export to ggml

* mnist : remove common dependencies

* mnist : fix soft_max threadgroup size

* mnist : init no_alloc member

* ggml : improve "get tensor" API

commit | commitdiff | tree

Tyé singwa [Sun, 28 May 2023 17:41:11 +0000 (20:41 +0300)]

fix : fix ggml_alibi (#204)

commit | commitdiff | tree

Skyler Celestinian-Sterling [Sun, 28 May 2023 10:45:30 +0000 (03:45 -0700)]

readme : add "development" (#203)

You are welcome lol

commit | commitdiff | tree

apcameron [Sat, 27 May 2023 13:48:33 +0000 (14:48 +0100)]

ggml : add CLBLAST support (#197)

Enable support for the RISCV architecture

This addresses https://github.com/ggerganov/ggml/issues/129

commit | commitdiff | tree

Georgi Gerganov [Sat, 27 May 2023 13:20:24 +0000 (16:20 +0300)]

cuda : sync latest llama.cpp (control DMMV X/Y sizes)

commit | commitdiff | tree

Georgi Gerganov [Sat, 27 May 2023 13:18:28 +0000 (16:18 +0300)]

ggml : add ggml_tensor_overhead() + ggml_get_tensort_by_name()

commit | commitdiff | tree

Georgi Gerganov [Sat, 27 May 2023 08:55:25 +0000 (11:55 +0300)]

ggml : sync llama.cpp (OpenCL support for GPU offload)

commit | commitdiff | tree

Georgi Gerganov [Sat, 27 May 2023 08:51:29 +0000 (11:51 +0300)]

mnist : gitignore stuff

commit | commitdiff | tree

jaeminSon [Sat, 27 May 2023 08:47:34 +0000 (17:47 +0900)]

examples : add tokenization tests and refactor codes (#186)

* examples : [refactor] remove unnecessary lines and segments

* examples : [feature] add tokenization test for gpt-neox

* examples : [feature] handle multibyte character set

* examples : [refactor] find the longest token for word

* examples : [refactor] move test_tokenizer to common.cpp as the function affects other models

* add 'test_tokenizer' function after loading the model

* examples : [feature] add test cases for checking tokenization

* examples : [feature] tokenize with huggingface tokenizers for currently supported models

* examples : add tokenization test cases for each model

* revert conversion from string to utf-8 encoded byte strings

* [refactor] make util functions for testing tokenizers available

* [bug fix] test replit using functions and variables (e.g. tokenizer struct, tokenization method) defined in its main.cpp

* [refactor] modify function name test_tokenizer -> test_gpt_tokenizer

* [refactor] put parenthesis on single line for-loops and if-statements

* [refactor] withdraw <filesystem> and use <iostream> and <dirent.h>

* [refactor] remove 'find_test_file' function and directly set test file path from 'test_gpt_tokenizer' function

* call a function for testing tokenizer with filename specified

* revert test tokenizer in replit (replit uses seperate methods for tokenzation and decoding)

* compare vector of id to check if two tokenizations are identical.

* write token ids instead of strings.

* [refactor] use --token_test rather than --test for token-test argument

* add english test cases

* update test cases with more english prompts

* examples : tokenizer testing fixes

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

Georgi Gerganov [Sat, 27 May 2023 06:11:10 +0000 (09:11 +0300)]

Update README.md

commit | commitdiff | tree

Georgi Gerganov [Fri, 26 May 2023 09:34:29 +0000 (12:34 +0300)]

minor : fix printf warnings

commit | commitdiff | tree

Radoslav Gerganov [Fri, 26 May 2023 08:53:18 +0000 (11:53 +0300)]

mnist : smooth user input (#199)

Drawing on the canvas is now smooth. The final image which is used for
prediction is obtained by down-scaling the canvas to 28x28 pixels.
Download button is aslo added for downloading raw image values.

commit | commitdiff | tree

Radoslav Gerganov [Fri, 26 May 2023 06:36:40 +0000 (09:36 +0300)]

examples : add missing header file (#198)

Some of the examples are missing the cstring header which is needed for
memcpy().

commit | commitdiff | tree

Radoslav Gerganov [Thu, 25 May 2023 08:27:15 +0000 (11:27 +0300)]

mnist : add progress indicator on the web page (#194)

Prevent user actions before the model and the data set is loaded

commit | commitdiff | tree

Georgi Gerganov [Wed, 24 May 2023 08:49:53 +0000 (11:49 +0300)]

mnist : add WASM instructions + web-page link

commit | commitdiff | tree

Radoslav Gerganov [Wed, 24 May 2023 08:40:47 +0000 (11:40 +0300)]

mnist : add web page for the MNIST example (#190)

The web page is using WASM for model inference.
Users can draw digits on an HTML canvas and load random digits from the
MNIST dataset.

commit | commitdiff | tree

Georgi Gerganov [Wed, 24 May 2023 08:38:41 +0000 (11:38 +0300)]

mnist : cleanup main.cpp

commit | commitdiff | tree

Ettore Di Giacinto [Wed, 24 May 2023 08:01:31 +0000 (10:01 +0200)]

docs : add golang transformer bindings (#191)

This PR adds golang bindings to transformers in ggml

commit | commitdiff | tree

Georgi Gerganov [Wed, 24 May 2023 07:54:45 +0000 (10:54 +0300)]

mpt : fix n_ctx (close #165)

commit | commitdiff | tree

Georgi Gerganov [Wed, 24 May 2023 07:41:06 +0000 (10:41 +0300)]

examples : remove prompt pipe-in support

Need cross-platform solution, factored out in common

commit | commitdiff | tree

Georgi Gerganov [Wed, 24 May 2023 07:40:27 +0000 (10:40 +0300)]

common : add missing declarations

commit | commitdiff | tree

klosax [Wed, 24 May 2023 07:27:36 +0000 (09:27 +0200)]

mpt : utf-8 support, perplexity testing, repeat penalty sampling (#184)

* common: utf-8 decoder, reverted gpt_toeknize utf-8 convert

* Update common.h

* main: decode utf-8 tokens on load

* mpt import: bug fix

* common: style fixes

* common: style fix

* Update common.h

* common: revert gpt_tokenize utf-8 convert

* Update common.cpp

* Update common.cpp

* Update common.cpp

* Add perplexity to mpt

* Update CMakeLists: perplexity

* mpt-perplexity: fixes

* Update perplexity.cpp

* common: add sampling with repeat penalty

* mpt-main: add repeat penalty sampling, add commandline parameters

* Update common.h

* mpt-main: style fixes

* Update perplexity.cpp

* Delete perplexity.cpp

* mpt: move perplexity to main

* mpt: move perplexity to main

* common.cpp: Use codecvt utf-8 converter

* main.cpp: Use codecvt utf-8 converter

* mpt : code style changes

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

Georgi Gerganov [Mon, 22 May 2023 14:57:21 +0000 (17:57 +0300)]

readme : update Features

commit | commitdiff | tree

Ravindra Marella [Sun, 21 May 2023 12:32:05 +0000 (18:02 +0530)]

readme : add link to python bindings (#181)

commit | commitdiff | tree

klosax [Sun, 21 May 2023 08:21:51 +0000 (10:21 +0200)]

common : support utf-8 + fix gpt_tokenize + fix MPT model import (#179)

* Update convert-h5-to-ggml.py

* Import tokens correctly

* gpt_tokenize: Convert input to utf-8 + bug fix

* common : minor style fixes

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

Dan Forbes [Sat, 20 May 2023 18:25:25 +0000 (11:25 -0700)]

readme : add link to GGML format docs (#177)

commit | commitdiff | tree

Georgi Gerganov [Sat, 20 May 2023 17:56:35 +0000 (20:56 +0300)]

examples : use scratch buffers to reduce memory usage (#176)

* starcoder : example for using scratch buffers to reduce memory usage

* starcoder : bump scratch buffers to 256 MB

* examples : add scratch buffers to MPT and GPT-NeoX

commit | commitdiff | tree

Georgi Gerganov [Sat, 20 May 2023 17:00:27 +0000 (20:00 +0300)]

ggml : update WASM SIMD

commit | commitdiff | tree

Georgi Gerganov [Sat, 20 May 2023 15:59:04 +0000 (18:59 +0300)]

whisper : fix Hebrew lang id

commit | commitdiff | tree

Georgi Gerganov [Sat, 20 May 2023 15:01:40 +0000 (18:01 +0300)]

examples : add quantize version to MPT and Replit examples (ref #168)

commit | commitdiff | tree

Georgi Gerganov [Sat, 20 May 2023 14:45:49 +0000 (17:45 +0300)]

common : force --top_k to be at least 1

commit | commitdiff | tree

Georgi Gerganov [Sat, 20 May 2023 14:33:07 +0000 (17:33 +0300)]

examples : fix vocab loading (close #163)

commit | commitdiff | tree

Georgi Gerganov [Sat, 20 May 2023 14:22:58 +0000 (17:22 +0300)]

common : fix gpt_tokenize (ref #170)

commit | commitdiff | tree

Michael Verrilli [Sat, 20 May 2023 14:12:24 +0000 (10:12 -0400)]

dolly-v2 : par_res and neox changes (#167)

* dolly-v2 example: par_res and neox changes

* Update examples/dolly-v2/quantize.cpp

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

Georgi Gerganov [Sat, 20 May 2023 14:09:41 +0000 (17:09 +0300)]

examples : call ggml_time_init() (close #166)

commit | commitdiff | tree

Georgi Gerganov [Sat, 20 May 2023 13:48:03 +0000 (16:48 +0300)]

Update README.md

commit | commitdiff | tree

Georgi Gerganov [Sat, 20 May 2023 12:59:34 +0000 (15:59 +0300)]

ggml : sync llama.cpp - CUDA improvements + ggml minor fixes

commit | commitdiff | tree

Georgi Gerganov [Sat, 20 May 2023 11:56:14 +0000 (14:56 +0300)]

ggml : sync llama.cpp - new quantization formats Q4 + Q8

commit | commitdiff | tree

pikalover6 [Thu, 18 May 2023 06:52:22 +0000 (23:52 -0700)]

readme : update roadmap (#164)

+ MPT & Replit

commit | commitdiff | tree

Lukas Möller [Wed, 17 May 2023 19:58:21 +0000 (21:58 +0200)]

examples : sample replit + MPT inference (#145)

* Add replit model

* Add unigram tokenization support

* Remove debug log

* Port alibi attn bias fix

* Remove torch input

* Fix hardcoded path

* Remove unsupported hyperparams

* Add mpt

* Add replit quantization script

* Remove debug print

* Add quantization support to mpt

* Reformat

* Remove trailing return type

* Implement stylistic changes

* use f16 in k/v memory calculations for replit/mpt

* Update context size calculation

* Add clip_qkv and alibi_bias_max support

* fix clamping implementation, remove implicit conversions

* Fix qkv if condition

* Fix replit context size calculation

* Potentially fix gcc compilation error

* Fix warning

* Adjust object overhead

* Remove dead code

commit | commitdiff | tree

jaeminSon [Wed, 17 May 2023 15:49:37 +0000 (00:49 +0900)]

examples : fix a hyperparameter value in gpt-neox (#161) (#162)

commit | commitdiff | tree

Andrei [Wed, 17 May 2023 06:27:11 +0000 (02:27 -0400)]

ggml : fix typo in ggml_diag_mask_zero_inplace() (#159)

commit | commitdiff | tree

Georgi Gerganov [Mon, 15 May 2023 04:50:54 +0000 (07:50 +0300)]

readme : add link to training example

commit | commitdiff | tree

Georgi Gerganov [Sun, 14 May 2023 15:56:18 +0000 (18:56 +0300)]

ggml : add AVX dot products

commit | commitdiff | tree

Georgi Gerganov [Sun, 14 May 2023 15:55:29 +0000 (18:55 +0300)]

whisper : sync whisper.cpp

commit | commitdiff | tree

IGUILIZ Salah-Eddine [Sun, 14 May 2023 15:31:08 +0000 (17:31 +0200)]

starcoder : detect santacoder fix end of text token (#155)

Co-authored-by: IGUILIZ Salah-Eddine <redacted>

commit | commitdiff | tree

Georgi Gerganov [Sun, 14 May 2023 14:26:41 +0000 (17:26 +0300)]

readme : add re-quantization warning

commit | commitdiff | tree

Georgi Gerganov [Sun, 14 May 2023 12:10:32 +0000 (15:10 +0300)]

examples : use inplace calls explicitly

commit | commitdiff | tree

Georgi Gerganov [Sun, 14 May 2023 11:55:28 +0000 (14:55 +0300)]

tests : add tests from llama.cpp

commit | commitdiff | tree

Georgi Gerganov [Sun, 14 May 2023 11:45:13 +0000 (14:45 +0300)]

ggml : fix multi-threaded ggml_compute_forward_diag_mask_f32()

commit | commitdiff | tree

Georgi Gerganov [Sun, 14 May 2023 11:16:47 +0000 (14:16 +0300)]

ggml : fix rope calculation (!inplace + GPT-NeoX mode)

commit | commitdiff | tree

Georgi Gerganov [Sun, 14 May 2023 08:23:02 +0000 (11:23 +0300)]

ggml : new Q4 and Q5 quantization formats + backward ops

sync llama.cpp

- bump GGML_QNT_VERSION -> 1
- increase cwggml object overhead size from 256 to 512 in examples
- drop Q4_2 support
- tensor backend support CUDA

commit | commitdiff | tree

Georgi Gerganov [Sun, 14 May 2023 07:07:27 +0000 (10:07 +0300)]

ggml : add GGML_QNT_VERSION for tracking changes to the quantization format

ref #150

commit | commitdiff | tree

Georgi Gerganov [Sun, 14 May 2023 07:06:19 +0000 (10:06 +0300)]

whisper : sync whisper.cpp minor changes

commit | commitdiff | tree

Ravindra Marella [Sat, 13 May 2023 13:47:02 +0000 (19:17 +0530)]

starcoder : update example to follow the naming convention of other examples (#153)

commit | commitdiff | tree

Georgi Gerganov [Sat, 13 May 2023 13:02:49 +0000 (16:02 +0300)]

readme : fix gpt-neox example link

commit | commitdiff | tree

Ravindra Marella [Sat, 13 May 2023 12:24:47 +0000 (17:54 +0530)]

examples : fix warnings (#152)

commit | commitdiff | tree

Nouamane Tazi [Sat, 13 May 2023 10:46:10 +0000 (12:46 +0200)]

readme : add BLOOM example (#151)

commit | commitdiff | tree

Georgi Gerganov [Sat, 13 May 2023 10:08:56 +0000 (13:08 +0300)]

examples : update readme with new quantization usage + remove bug alert

commit | commitdiff | tree

Georgi Gerganov [Sat, 13 May 2023 10:04:57 +0000 (13:04 +0300)]

readme : update example list (#146)

commit | commitdiff | tree

Nouamane Tazi [Sat, 13 May 2023 09:54:03 +0000 (11:54 +0200)]

examples : add StarCoder/SantaCoder sample inference (#146)

* init commit

* fix building starcoder

* gen work

* fix vocab

* santacoder mha

* .

* fix quantize

* offload_state_dict

* endoftext

* rename scripts

* fix main

* scripts

* update README

* quickfixes

commit | commitdiff | tree

Eldar Yusupov [Sat, 13 May 2023 09:41:45 +0000 (12:41 +0300)]

gpt-neox : add non-parallel residual support (#139)

* Add non-parallel residual support

* Rename stablelm to gpt-neox

* Fix stablelm model name

commit | commitdiff | tree

Nevin [Sat, 13 May 2023 08:41:43 +0000 (10:41 +0200)]

common : allow prompts to be loaded from file (#102)

* common: allow prompts to be loaded from file

* common : extra help for -f

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

yangyaofei [Thu, 11 May 2023 21:47:48 +0000 (05:47 +0800)]

ggml : fix bug in alibi (#143)

commit | commitdiff | tree

Georgi Gerganov [Mon, 8 May 2023 15:07:10 +0000 (18:07 +0300)]

dolly-v2 : ggml_cgraph init (#112)

commit | commitdiff | tree

Tanmay Sachan [Mon, 8 May 2023 15:06:36 +0000 (20:36 +0530)]

examples : make struct initialization more portable (#112)

commit | commitdiff | tree

Georgi Gerganov [Mon, 8 May 2023 15:03:47 +0000 (18:03 +0300)]

dolly-v2 : minor formatting

commit | commitdiff | tree

Michael Verrilli [Sat, 6 May 2023 05:51:45 +0000 (01:51 -0400)]

examples : add dolly-v2 sample inference (#132)

* Vocab support for special tokens

* Initial dolly-v2 commit

* update README

commit | commitdiff | tree

Georgi Gerganov [Thu, 4 May 2023 15:45:39 +0000 (18:45 +0300)]

stablelm : update README.md

commit | commitdiff | tree

Georgi Gerganov [Wed, 3 May 2023 20:22:14 +0000 (23:22 +0300)]

ggml : vectorize Q8_0 quantization (#127)

commit | commitdiff | tree

Georgi Gerganov [Tue, 2 May 2023 19:14:27 +0000 (22:14 +0300)]

ggml : fix 32-bit ARM

commit | commitdiff | tree

Georgi Gerganov [Tue, 2 May 2023 18:28:21 +0000 (21:28 +0300)]

whisper : sync with latest

commit | commitdiff | tree

Georgi Gerganov [Tue, 2 May 2023 18:27:02 +0000 (21:27 +0300)]

scripts : update sync scripts

commit | commitdiff | tree

Georgi Gerganov [Tue, 2 May 2023 17:23:16 +0000 (20:23 +0300)]

ggml : sync llama.cpp (clBLAST support + tensor names)

commit | commitdiff | tree

Georgi Gerganov [Mon, 1 May 2023 07:13:59 +0000 (10:13 +0300)]

ggml : temp comment

commit | commitdiff | tree

Georgi Gerganov [Sun, 30 Apr 2023 19:28:14 +0000 (22:28 +0300)]

ggml : fix UB (int << 31)

commit | commitdiff | tree

Georgi Gerganov [Sun, 30 Apr 2023 16:03:35 +0000 (19:03 +0300)]

ggml, whisper : sync whisper.cpp (GGML_FTYPE + Q5 WASM SIMD)

commit | commitdiff | tree

Georgi Gerganov [Sun, 30 Apr 2023 07:25:13 +0000 (10:25 +0300)]

ggml : fix labels for GGML_OP_ALIBI

commit | commitdiff | tree

Georgi Gerganov [Sat, 29 Apr 2023 18:33:59 +0000 (21:33 +0300)]

ggml : fix 32-bit ARM NEON

commit | commitdiff | tree

Georgi Gerganov [Sat, 29 Apr 2023 18:13:40 +0000 (21:13 +0300)]

ggml : use vzip instead of vuzp for consistency

commit | commitdiff | tree

Georgi Gerganov [Sat, 29 Apr 2023 16:13:53 +0000 (19:13 +0300)]

ggml : fix SHARED build

commit | commitdiff | tree

Georgi Gerganov [Sat, 29 Apr 2023 16:07:19 +0000 (19:07 +0300)]

ggml : sync llama.cpp (less memory for mul_mat f16 + asserts)

commit | commitdiff | tree

Georgi Gerganov [Sat, 29 Apr 2023 09:33:57 +0000 (12:33 +0300)]

scripts : add sync-whisper.sh

commit | commitdiff | tree

Georgi Gerganov [Sat, 29 Apr 2023 07:30:56 +0000 (10:30 +0300)]

common : forgot to remove Q4_3 references

Packaging of ggml-org/ggml