|
--- |
|
license: mit |
|
language: |
|
- en |
|
pipeline_tag: text-classification |
|
tags: |
|
- gpt2 |
|
- distributed-training |
|
- megatron |
|
- accelerate |
|
--- |
|
# Megatron-GPT2-Classification |
|
|
|
## Description |
|
|
|
The `megatron-gpt2-classification` model is a language model trained using Megatron and Accelerate frameworks. It has been fine-tuned for classification tasks and benefits from distributed training across 4 GPUs (RTX 4070). |
|
|
|
## Key Features |
|
|
|
- Trained with **Megatron** and **Accelerate**. |
|
- Distributed training on **4 GPUs (RTX 4070)**. |
|
- Fine-tuned for classification tasks. |