hongjing0312
/

my_awesome_opus_books_model

@@ -22,7 +22,7 @@ model-index:
     metrics:
     - name: Bleu
       type: bleu
-      value: 6.8568
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,9 +32,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the opus_books dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4187
-- Bleu: 6.8568
-- Gen Len: 17.5266
 ## Model description
@@ -53,9 +53,9 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -63,28 +63,28 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Bleu   | Gen Len |
-|:-------------:|:-----:|:-----:|:---------------:|:------:|:-------:|
-| 1.8788        | 1.0   | 3178  | 1.6359          | 5.4222 | 17.5966 |
-| 1.8039        | 2.0   | 6356  | 1.5794          | 5.8176 | 17.5753 |
-| 1.7556        | 3.0   | 9534  | 1.5462          | 6.0454 | 17.5573 |
-| 1.7288        | 4.0   | 12712 | 1.5209          | 6.2076 | 17.5527 |
-| 1.6931        | 5.0   | 15890 | 1.5033          | 6.3197 | 17.5439 |
-| 1.6658        | 6.0   | 19068 | 1.4886          | 6.4248 | 17.5415 |
-| 1.6634        | 7.0   | 22246 | 1.4757          | 6.4836 | 17.54   |
-| 1.644         | 8.0   | 25424 | 1.4649          | 6.5554 | 17.5357 |
-| 1.6315        | 9.0   | 28602 | 1.4575          | 6.6177 | 17.536  |
-| 1.6194        | 10.0  | 31780 | 1.4495          | 6.6509 | 17.5339 |
-| 1.6035        | 11.0  | 34958 | 1.4431          | 6.7028 | 17.5276 |
-| 1.6072        | 12.0  | 38136 | 1.4375          | 6.7392 | 17.529  |
-| 1.5908        | 13.0  | 41314 | 1.4331          | 6.7639 | 17.5303 |
-| 1.5911        | 14.0  | 44492 | 1.4284          | 6.7943 | 17.5267 |
-| 1.5948        | 15.0  | 47670 | 1.4255          | 6.8244 | 17.5289 |
-| 1.5869        | 16.0  | 50848 | 1.4227          | 6.8443 | 17.5244 |
-| 1.5843        | 17.0  | 54026 | 1.4209          | 6.854  | 17.5248 |
-| 1.5862        | 18.0  | 57204 | 1.4199          | 6.8496 | 17.5258 |
-| 1.5795        | 19.0  | 60382 | 1.4189          | 6.8488 | 17.5252 |
-| 1.585         | 20.0  | 63560 | 1.4187          | 6.8568 | 17.5266 |
 ### Framework versions

     metrics:
     - name: Bleu
       type: bleu
+      value: 8.665
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the opus_books dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1887
+- Bleu: 8.665
+- Gen Len: 17.52
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0002
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 ### Training results
+| Training Loss | Epoch | Step   | Validation Loss | Bleu   | Gen Len |
+|:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
+| 1.6522        | 1.0   | 6355   | 1.4220          | 6.8084 | 17.5646 |
+| 1.5046        | 2.0   | 12710  | 1.3440          | 7.2961 | 17.5479 |
+| 1.424         | 3.0   | 19065  | 1.3085          | 7.6625 | 17.5388 |
+| 1.3927        | 4.0   | 25420  | 1.2794          | 7.8254 | 17.5447 |
+| 1.3279        | 5.0   | 31775  | 1.2606          | 8.0112 | 17.5417 |
+| 1.2972        | 6.0   | 38130  | 1.2440          | 8.159  | 17.5222 |
+| 1.263         | 7.0   | 44485  | 1.2328          | 8.2809 | 17.5201 |
+| 1.2414        | 8.0   | 50840  | 1.2263          | 8.3546 | 17.5234 |
+| 1.2216        | 9.0   | 57195  | 1.2144          | 8.4076 | 17.537  |
+| 1.1954        | 10.0  | 63550  | 1.2076          | 8.425  | 17.5313 |
+| 1.1741        | 11.0  | 69905  | 1.2069          | 8.4543 | 17.5247 |
+| 1.1573        | 12.0  | 76260  | 1.1971          | 8.5306 | 17.5245 |
+| 1.1423        | 13.0  | 82615  | 1.1989          | 8.6061 | 17.5168 |
+| 1.1329        | 14.0  | 88970  | 1.1946          | 8.6169 | 17.5322 |
+| 1.1145        | 15.0  | 95325  | 1.1926          | 8.6135 | 17.5258 |
+| 1.1007        | 16.0  | 101680 | 1.1889          | 8.6164 | 17.5314 |
+| 1.1127        | 17.0  | 108035 | 1.1882          | 8.686  | 17.5217 |
+| 1.0888        | 18.0  | 114390 | 1.1884          | 8.6621 | 17.5209 |
+| 1.0737        | 19.0  | 120745 | 1.1883          | 8.673  | 17.5209 |
+| 1.0733        | 20.0  | 127100 | 1.1887          | 8.665  | 17.52   |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e3f615c6776c3d830755371daa783911fa2270dd5952132df6db4d4e44fde25a
 size 242071641

 version https://git-lfs.github.com/spec/v1
+oid sha256:7e9a092a7f75302c912731bf7d84ac1080e9d97d379966cc058b19b9f106829c
 size 242071641