BEE-spoke-data
/

NanoLlama-GQA-L10-A32_KV8-v13-KI

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

NanoLlama-GQA-L10-A32_KV8-v13-KI

2 contributors

History: 26 commits

pszemraj's picture

Training in progress, step 2500

a5e3e9c 10 months ago

.gitattributes

1.52 kB

initial commit 10 months ago
config.json

717 Bytes

Training in progress, step 100 10 months ago
model.safetensors

436 MB
LFS

Training in progress, step 2500 10 months ago
special_tokens_map.json

414 Bytes

Training in progress, step 100 10 months ago
tokenizer.model

500 kB
LFS

Training in progress, step 100 10 months ago
tokenizer_config.json

959 Bytes

Training in progress, step 100 10 months ago
training_args.bin
Detected Pickle imports (8)
- "torch.device",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.trainer_utils.HubStrategy",
- "transformers.trainer_utils.SchedulerType",
- "transformers.training_args.TrainingArguments",
- "accelerate.state.PartialState",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.training_args.OptimizerNames"
How to fix it?
4.92 kB
LFS

Training in progress, step 100 10 months ago