MohammadOthman's picture
Create README.md
42e4847 verified
|
raw
history blame contribute delete
No virus
555 Bytes
metadata
license: mit
language:
  - en
pipeline_tag: text-classification
tags:
  - gpt2
  - distributed-training
  - megatron
  - accelerate

Megatron-GPT2-Classification

Description

The megatron-gpt2-classification model is a language model trained using Megatron and Accelerate frameworks. It has been fine-tuned for classification tasks and benefits from distributed training across 4 GPUs (RTX 4070).

Key Features

  • Trained with Megatron and Accelerate.
  • Distributed training on 4 GPUs (RTX 4070).
  • Fine-tuned for classification tasks.