hongjing0312 commited on
Commit
5eb0d5b
1 Parent(s): c26668e

End of training

Browse files
Files changed (2) hide show
  1. README.md +29 -29
  2. pytorch_model.bin +1 -1
README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
22
  metrics:
23
  - name: Bleu
24
  type: bleu
25
- value: 6.8568
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,9 +32,9 @@ should probably proofread and complete it, then remove this comment. -->
32
 
33
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the opus_books dataset.
34
  It achieves the following results on the evaluation set:
35
- - Loss: 1.4187
36
- - Bleu: 6.8568
37
- - Gen Len: 17.5266
38
 
39
  ## Model description
40
 
@@ -53,9 +53,9 @@ More information needed
53
  ### Training hyperparameters
54
 
55
  The following hyperparameters were used during training:
56
- - learning_rate: 2e-05
57
- - train_batch_size: 32
58
- - eval_batch_size: 32
59
  - seed: 42
60
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
61
  - lr_scheduler_type: linear
@@ -63,28 +63,28 @@ The following hyperparameters were used during training:
63
 
64
  ### Training results
65
 
66
- | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
67
- |:-------------:|:-----:|:-----:|:---------------:|:------:|:-------:|
68
- | 1.8788 | 1.0 | 3178 | 1.6359 | 5.4222 | 17.5966 |
69
- | 1.8039 | 2.0 | 6356 | 1.5794 | 5.8176 | 17.5753 |
70
- | 1.7556 | 3.0 | 9534 | 1.5462 | 6.0454 | 17.5573 |
71
- | 1.7288 | 4.0 | 12712 | 1.5209 | 6.2076 | 17.5527 |
72
- | 1.6931 | 5.0 | 15890 | 1.5033 | 6.3197 | 17.5439 |
73
- | 1.6658 | 6.0 | 19068 | 1.4886 | 6.4248 | 17.5415 |
74
- | 1.6634 | 7.0 | 22246 | 1.4757 | 6.4836 | 17.54 |
75
- | 1.644 | 8.0 | 25424 | 1.4649 | 6.5554 | 17.5357 |
76
- | 1.6315 | 9.0 | 28602 | 1.4575 | 6.6177 | 17.536 |
77
- | 1.6194 | 10.0 | 31780 | 1.4495 | 6.6509 | 17.5339 |
78
- | 1.6035 | 11.0 | 34958 | 1.4431 | 6.7028 | 17.5276 |
79
- | 1.6072 | 12.0 | 38136 | 1.4375 | 6.7392 | 17.529 |
80
- | 1.5908 | 13.0 | 41314 | 1.4331 | 6.7639 | 17.5303 |
81
- | 1.5911 | 14.0 | 44492 | 1.4284 | 6.7943 | 17.5267 |
82
- | 1.5948 | 15.0 | 47670 | 1.4255 | 6.8244 | 17.5289 |
83
- | 1.5869 | 16.0 | 50848 | 1.4227 | 6.8443 | 17.5244 |
84
- | 1.5843 | 17.0 | 54026 | 1.4209 | 6.854 | 17.5248 |
85
- | 1.5862 | 18.0 | 57204 | 1.4199 | 6.8496 | 17.5258 |
86
- | 1.5795 | 19.0 | 60382 | 1.4189 | 6.8488 | 17.5252 |
87
- | 1.585 | 20.0 | 63560 | 1.4187 | 6.8568 | 17.5266 |
88
 
89
 
90
  ### Framework versions
 
22
  metrics:
23
  - name: Bleu
24
  type: bleu
25
+ value: 8.665
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
32
 
33
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the opus_books dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 1.1887
36
+ - Bleu: 8.665
37
+ - Gen Len: 17.52
38
 
39
  ## Model description
40
 
 
53
  ### Training hyperparameters
54
 
55
  The following hyperparameters were used during training:
56
+ - learning_rate: 0.0002
57
+ - train_batch_size: 16
58
+ - eval_batch_size: 16
59
  - seed: 42
60
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
61
  - lr_scheduler_type: linear
 
63
 
64
  ### Training results
65
 
66
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
67
+ |:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
68
+ | 1.6522 | 1.0 | 6355 | 1.4220 | 6.8084 | 17.5646 |
69
+ | 1.5046 | 2.0 | 12710 | 1.3440 | 7.2961 | 17.5479 |
70
+ | 1.424 | 3.0 | 19065 | 1.3085 | 7.6625 | 17.5388 |
71
+ | 1.3927 | 4.0 | 25420 | 1.2794 | 7.8254 | 17.5447 |
72
+ | 1.3279 | 5.0 | 31775 | 1.2606 | 8.0112 | 17.5417 |
73
+ | 1.2972 | 6.0 | 38130 | 1.2440 | 8.159 | 17.5222 |
74
+ | 1.263 | 7.0 | 44485 | 1.2328 | 8.2809 | 17.5201 |
75
+ | 1.2414 | 8.0 | 50840 | 1.2263 | 8.3546 | 17.5234 |
76
+ | 1.2216 | 9.0 | 57195 | 1.2144 | 8.4076 | 17.537 |
77
+ | 1.1954 | 10.0 | 63550 | 1.2076 | 8.425 | 17.5313 |
78
+ | 1.1741 | 11.0 | 69905 | 1.2069 | 8.4543 | 17.5247 |
79
+ | 1.1573 | 12.0 | 76260 | 1.1971 | 8.5306 | 17.5245 |
80
+ | 1.1423 | 13.0 | 82615 | 1.1989 | 8.6061 | 17.5168 |
81
+ | 1.1329 | 14.0 | 88970 | 1.1946 | 8.6169 | 17.5322 |
82
+ | 1.1145 | 15.0 | 95325 | 1.1926 | 8.6135 | 17.5258 |
83
+ | 1.1007 | 16.0 | 101680 | 1.1889 | 8.6164 | 17.5314 |
84
+ | 1.1127 | 17.0 | 108035 | 1.1882 | 8.686 | 17.5217 |
85
+ | 1.0888 | 18.0 | 114390 | 1.1884 | 8.6621 | 17.5209 |
86
+ | 1.0737 | 19.0 | 120745 | 1.1883 | 8.673 | 17.5209 |
87
+ | 1.0733 | 20.0 | 127100 | 1.1887 | 8.665 | 17.52 |
88
 
89
 
90
  ### Framework versions
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e3f615c6776c3d830755371daa783911fa2270dd5952132df6db4d4e44fde25a
3
  size 242071641
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7e9a092a7f75302c912731bf7d84ac1080e9d97d379966cc058b19b9f106829c
3
  size 242071641