MohammadOthman committed on
Commit 42e4847 (1 parent: 6802671)

Create README.md

Files changed (1)
  1. README.md +22 -0
README.md ADDED
@@ -0,0 +1,22 @@
+ ---
+ license: mit
+ language:
+ - en
+ pipeline_tag: text-classification
+ tags:
+ - gpt2
+ - distributed-training
+ - megatron
+ - accelerate
+ ---
+ # Megatron-GPT2-Classification
+
+ ## Description
+
+ The `megatron-gpt2-classification` model is a GPT-2 language model trained with the Megatron and Accelerate frameworks. It was fine-tuned for classification tasks using distributed training across 4 GPUs (RTX 4070).
+
+ ## Key Features
+
+ - Trained with **Megatron** and **Accelerate**.
+ - Distributed training on **4 GPUs (RTX 4070)**.
+ - Fine-tuned for classification tasks.
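
The README mentions distributed training on 4 GPUs with Accelerate but does not include the launch configuration. As a rough illustration only, a single-machine, 4-process Accelerate config file might look like the sketch below. Every value here is an assumption, not the author's actual setup; in particular, `mixed_precision` is not stated in the README, and a run using Accelerate's Megatron-LM integration would set a different `distributed_type` with its own options.

```yaml
# Hypothetical Accelerate config for one machine with 4 GPUs.
# Values are illustrative assumptions; the commit does not include the real config.
compute_environment: LOCAL_MACHINE
distributed_type: MULTI_GPU   # plain data-parallel; the Megatron-LM integration uses a different value
num_machines: 1
num_processes: 4              # one process per GPU (4 x RTX 4070 per the README)
machine_rank: 0
gpu_ids: all
mixed_precision: 'no'         # assumption: precision is not stated in the README
use_cpu: false
```

A file like this would typically be passed to the launcher with `accelerate launch --config_file <path-to-config> train.py`, where `train.py` is a placeholder name for a training script that is not part of this commit.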