[SYCL] Update README-sycl.md for Chapter "Recommended release" and "News" (#7946)

author Neo Zhang <redacted>

Mon, 17 Jun 2024 03:17:07 +0000 (11:17 +0800)

committer GitHub <redacted>

Mon, 17 Jun 2024 03:17:07 +0000 (11:17 +0800)
diff --git a/README-sycl.md b/README-sycl.md

index 93b623daf6a1a16890d973eb49e986d1ce5abcdb..bd1984706225fc6c27fec0b166a5c2723c8c95b3 100644 (file)
--- a/README-sycl.md
+++ b/README-sycl.md
@@ -1,6 +1,7 @@
  # llama.cpp for SYCL
  
  - [Background](#background)
+- [Recommended Release](#recommended-release)
  - [News](#news)
  - [OS](#os)
  - [Hardware](#hardware)
@@ -31,8 +32,23 @@ When targeting **Intel CPU**, it is recommended to use llama.cpp for [Intel oneM
  
  It has the similar design of other llama.cpp BLAS-based paths such as *OpenBLAS, cuBLAS, etc..*. In beginning work, the oneAPI's [SYCLomatic](https://github.com/oneapi-src/SYCLomatic) open-source migration tool (Commercial release [Intel® DPC++ Compatibility Tool](https://www.intel.com/content/www/us/en/developer/tools/oneapi/dpc-compatibility-tool.html)) was used for this purpose.
  
+## Recommended Release
+
+The SYCL backend can be broken by some PRs because there is no online CI covering it.
+
+The following release has been verified as good quality:
+
+|Commit ID|Tag|Release|Verified Platform|
+|-|-|-|-|
+|fb76ec31a9914b7761c1727303ab30380fd4f05c|b3038 |[llama-b3038-bin-win-sycl-x64.zip](https://github.com/ggerganov/llama.cpp/releases/download/b3038/llama-b3038-bin-win-sycl-x64.zip) |Arc770/Linux/oneAPI 2024.1<br>MTL Arc GPU/Windows 11/oneAPI 2024.1|
+
+
  ## News
  
+- 2024.5
+  - Performance increased from 34 to 37 tokens/s for llama-2-7b.Q4_0 on Arc770.
+  - Arch Linux is verified successfully.
+
  - 2024.4
    - Support data types: GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS, GGML_TYPE_IQ3_XXS, GGML_TYPE_IQ3_S, GGML_TYPE_IQ2_XXS, GGML_TYPE_IQ2_XS, GGML_TYPE_IQ2_S, GGML_TYPE_IQ1_S, GGML_TYPE_IQ1_M.