l3utterfly commited on
Commit
16a446f
β€’
1 Parent(s): 1e7e281

Add PTE files for context sizes 2048, 4096, 8192

Browse files
suzume-llama-3-8B-multilingual_kv_sdpa_xnn_qe_4_32_ctx2048.pte β†’ suzume-llama-3-8B-multilingual_kv2_sdpa_xnn_qe_4_32_ctx2048.pte RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:243396f562ffc9dbaee17c24d0034f001d992e7256d3b6f85054e5a66e6b8d6b
3
- size 4202394272
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3571a0f94164ffa51a804fd433a6434ee85ccd12f15f8dbee7beafc2a8be6640
3
+ size 4169560736
suzume-llama-3-8B-multilingual_kv_sdpa_xnn_qe_4_32_ctx4096.pte β†’ suzume-llama-3-8B-multilingual_kv2_sdpa_xnn_qe_4_32_ctx4096.pte RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fad5dce28733f0fa10dbcf2b1d45439ab0aba1b08b668df5f5cc551de9625658
3
- size 4204491424
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dbd5a123d4f1b0ee79fee27faab634fe6f5a78c8d8e99a61047d8feb57e880c9
3
+ size 4171657888
suzume-llama-3-8B-multilingual_kv_sdpa_xnn_qe_4_32_ctx8192.pte β†’ suzume-llama-3-8B-multilingual_kv2_sdpa_xnn_qe_4_32_ctx8192.pte RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a8e19225eae54e2ae2b3ef400694e134719a339c6def5ff1c77a13f1b4bc8d67
3
- size 4208685728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eff0f7df355ba65b329bc35ba849fb21644c2661c15e4ebae87956f4b99bc11e
3
+ size 4175852192