sparsh35 committed on
Commit
ada3848
1 Parent(s): 5619f8d

update config.json vocab_size to tokenizer length


I.e., 32000. Under high throughput, vLLM can sample padded token ids (ids beyond the tokenizer's actual vocabulary), which raises an error in vLLM. This is an open issue: https://github.com/vllm-project/vllm/issues/340
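A minimal sketch of the check behind this commit, using hypothetical values: if the config's `vocab_size` exceeds the tokenizer's real length, the sampler can emit padding ids, so the config is clamped to the tokenizer length.

```python
import json

# Hypothetical config fragment mirroring this commit; in practice you would
# load config.json and compare against len(tokenizer) for the real model.
config = json.loads('{"vocab_size": 32032}')
tokenizer_len = 32000  # stand-in for len(tokenizer)

if config["vocab_size"] > tokenizer_len:
    # Shrink vocab_size to the tokenizer length, as this commit does,
    # so vLLM cannot sample ids that only exist as padding rows.
    config["vocab_size"] = tokenizer_len

print(config["vocab_size"])  # 32000
```

The same one-line change is what the diff below applies directly to config.json.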

Files changed (1)
  1. config.json +1 -1
config.json CHANGED
@@ -30,5 +30,5 @@
   "torch_dtype": "float16",
   "transformers_version": "4.38.2",
   "use_cache": false,
-  "vocab_size": 32032
+  "vocab_size": 32000
 }