lombardata
/

dinov2-large-linearhead-2024_03_06-with_data_aug_batch-size32_epochs93_freeze

+---
+license: apache-2.0
+base_model: facebook/dinov2-large
+tags:
+- generated_from_trainer
+metrics:
+- accuracy
+model-index:
+- name: dinov2-large-linearhead-2024_03_06-with_data_aug_batch-size32_epochs93_freeze
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# dinov2-large-linearhead-2024_03_06-with_data_aug_batch-size32_epochs93_freeze
+This model is a fine-tuned version of [facebook/dinov2-large](https://huggingface.co/facebook/dinov2-large) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0975
+- F1 Micro: 0.8499
+- F1 Macro: 0.8118
+- Roc Auc: 0.9047
+- Accuracy: 0.5522
+- Learning Rate: 0.0000
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 32
+- eval_batch_size: 32
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 93
+### Training results
+| Training Loss | Epoch | Step  | Validation Loss | F1 Micro | F1 Macro | Roc Auc | Accuracy | Rate   |
+|:-------------:|:-----:|:-----:|:---------------:|:--------:|:--------:|:-------:|:--------:|:------:|
+| No log        | 1.0   | 274   | 0.1280          | 0.7998   | 0.7183   | 0.8765  | 0.4798   | 0.001  |
+| 0.1493        | 2.0   | 548   | 0.1210          | 0.8095   | 0.7528   | 0.8793  | 0.5080   | 0.001  |
+| 0.1493        | 3.0   | 822   | 0.1156          | 0.8119   | 0.7609   | 0.8667  | 0.5237   | 0.001  |
+| 0.1156        | 4.0   | 1096  | 0.1167          | 0.8240   | 0.7881   | 0.9000  | 0.5003   | 0.001  |
+| 0.1156        | 5.0   | 1370  | 0.1073          | 0.8367   | 0.7939   | 0.8968  | 0.5254   | 0.001  |
+| 0.1069        | 6.0   | 1644  | 0.1082          | 0.8319   | 0.7918   | 0.8911  | 0.5341   | 0.001  |
+| 0.1069        | 7.0   | 1918  | 0.1065          | 0.8367   | 0.7811   | 0.8963  | 0.5390   | 0.001  |
+| 0.1026        | 8.0   | 2192  | 0.1071          | 0.8364   | 0.8001   | 0.8959  | 0.5351   | 0.001  |
+| 0.1026        | 9.0   | 2466  | 0.1101          | 0.8346   | 0.7925   | 0.9113  | 0.5111   | 0.001  |
+| 0.1           | 10.0  | 2740  | 0.1074          | 0.8345   | 0.7808   | 0.8973  | 0.5320   | 0.001  |
+| 0.0964        | 11.0  | 3014  | 0.1079          | 0.8375   | 0.7967   | 0.8985  | 0.5292   | 0.001  |
+| 0.0964        | 12.0  | 3288  | 0.1070          | 0.8353   | 0.7908   | 0.8951  | 0.5341   | 0.001  |
+| 0.0949        | 13.0  | 3562  | 0.1060          | 0.8371   | 0.7926   | 0.8987  | 0.5296   | 0.001  |
+| 0.0949        | 14.0  | 3836  | 0.1035          | 0.8443   | 0.7987   | 0.9007  | 0.5438   | 0.001  |
+| 0.0926        | 15.0  | 4110  | 0.1099          | 0.8363   | 0.8004   | 0.9060  | 0.5118   | 0.001  |
+| 0.0926        | 16.0  | 4384  | 0.1086          | 0.8327   | 0.7991   | 0.8886  | 0.5355   | 0.001  |
+| 0.0911        | 17.0  | 4658  | 0.1084          | 0.8333   | 0.7967   | 0.8952  | 0.5209   | 0.001  |
+| 0.0911        | 18.0  | 4932  | 0.1083          | 0.8358   | 0.7976   | 0.8968  | 0.5344   | 0.001  |
+| 0.0902        | 19.0  | 5206  | 0.1129          | 0.8301   | 0.7799   | 0.8829  | 0.5233   | 0.001  |
+| 0.0902        | 20.0  | 5480  | 0.1033          | 0.8464   | 0.8107   | 0.9065  | 0.5400   | 0.001  |
+| 0.0896        | 21.0  | 5754  | 0.1091          | 0.8375   | 0.8014   | 0.9018  | 0.5233   | 0.001  |
+| 0.0881        | 22.0  | 6028  | 0.1040          | 0.8412   | 0.7987   | 0.8995  | 0.5383   | 0.001  |
+| 0.0881        | 23.0  | 6302  | 0.1090          | 0.8385   | 0.7908   | 0.9012  | 0.5278   | 0.001  |
+| 0.0874        | 24.0  | 6576  | 0.1078          | 0.8338   | 0.7961   | 0.8917  | 0.5313   | 0.001  |
+| 0.0874        | 25.0  | 6850  | 0.1054          | 0.8455   | 0.8077   | 0.9023  | 0.5501   | 0.001  |
+| 0.0864        | 26.0  | 7124  | 0.1085          | 0.8346   | 0.7913   | 0.8860  | 0.5348   | 0.001  |
+| 0.0864        | 27.0  | 7398  | 0.0994          | 0.8486   | 0.8134   | 0.9040  | 0.5487   | 0.0001 |
+| 0.0793        | 28.0  | 7672  | 0.0989          | 0.8495   | 0.8123   | 0.9039  | 0.5532   | 0.0001 |
+| 0.0793        | 29.0  | 7946  | 0.0986          | 0.8485   | 0.8107   | 0.9028  | 0.5511   | 0.0001 |
+| 0.0751        | 30.0  | 8220  | 0.0986          | 0.8510   | 0.8188   | 0.9080  | 0.5501   | 0.0001 |
+| 0.0751        | 31.0  | 8494  | 0.0990          | 0.8488   | 0.8139   | 0.9034  | 0.5539   | 0.0001 |
+| 0.0753        | 32.0  | 8768  | 0.0983          | 0.8510   | 0.8181   | 0.9048  | 0.5505   | 0.0001 |
+| 0.0748        | 33.0  | 9042  | 0.0987          | 0.8494   | 0.8110   | 0.9018  | 0.5539   | 0.0001 |
+| 0.0748        | 34.0  | 9316  | 0.0980          | 0.8501   | 0.8117   | 0.9045  | 0.5515   | 0.0001 |
+| 0.0748        | 35.0  | 9590  | 0.0981          | 0.8502   | 0.8133   | 0.9064  | 0.5505   | 0.0001 |
+| 0.0748        | 36.0  | 9864  | 0.0984          | 0.8507   | 0.8130   | 0.9045  | 0.5536   | 0.0001 |
+| 0.0745        | 37.0  | 10138 | 0.0983          | 0.8507   | 0.8145   | 0.9067  | 0.5484   | 0.0001 |
+| 0.0745        | 38.0  | 10412 | 0.0986          | 0.8486   | 0.8107   | 0.9011  | 0.5546   | 0.0001 |
+| 0.0749        | 39.0  | 10686 | 0.0986          | 0.8491   | 0.8140   | 0.9029  | 0.5508   | 0.0001 |
+| 0.0749        | 40.0  | 10960 | 0.0982          | 0.8487   | 0.8114   | 0.9002  | 0.5553   | 0.0001 |
+| 0.0752        | 41.0  | 11234 | 0.0976          | 0.8505   | 0.8131   | 0.9058  | 0.5508   | 1e-05  |
+| 0.0734        | 42.0  | 11508 | 0.0977          | 0.8500   | 0.8128   | 0.9046  | 0.5515   | 1e-05  |
+| 0.0734        | 43.0  | 11782 | 0.0975          | 0.8498   | 0.8118   | 0.9053  | 0.5515   | 1e-05  |
+| 0.0736        | 44.0  | 12056 | 0.0976          | 0.8495   | 0.8118   | 0.9046  | 0.5522   | 1e-05  |
+| 0.0736        | 45.0  | 12330 | 0.0975          | 0.8503   | 0.8119   | 0.9053  | 0.5508   | 1e-05  |
+| 0.0731        | 46.0  | 12604 | 0.0976          | 0.8498   | 0.8119   | 0.9046  | 0.5511   | 1e-05  |
+| 0.0731        | 47.0  | 12878 | 0.0975          | 0.8500   | 0.8115   | 0.9046  | 0.5518   | 1e-05  |
+| 0.0736        | 48.0  | 13152 | 0.0975          | 0.8505   | 0.8141   | 0.9052  | 0.5511   | 1e-05  |
+| 0.0736        | 49.0  | 13426 | 0.0975          | 0.8504   | 0.8144   | 0.9053  | 0.5518   | 1e-05  |
+| 0.073         | 50.0  | 13700 | 0.0975          | 0.8502   | 0.8138   | 0.9052  | 0.5518   | 0.0000 |
+| 0.073         | 51.0  | 13974 | 0.0975          | 0.8499   | 0.8123   | 0.9049  | 0.5515   | 0.0000 |
+| 0.0732        | 52.0  | 14248 | 0.0975          | 0.8500   | 0.8119   | 0.9049  | 0.5515   | 0.0000 |
+| 0.0732        | 53.0  | 14522 | 0.0975          | 0.8499   | 0.8118   | 0.9047  | 0.5522   | 0.0000 |
+### Framework versions
+- Transformers 4.36.2
+- Pytorch 2.1.0+cu118
+- Datasets 2.14.5
+- Tokenizers 0.15.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:832effd6b1672bb1edd2066f8d243f4edaf6b9b6d237ef74fb77b1ebc5e62d7c
 size 1217722832

 version https://git-lfs.github.com/spec/v1
+oid sha256:d96f1bb739bca07ec7660d51ddd59d2c14144542e840b22e1f631614d18cb795
 size 1217722832