End of training

Files changed (7) hide show

README.md CHANGED Viewed

@@ -15,8 +15,6 @@ should probably proofread and complete it, then remove this comment. -->
 # multi-wiki-qa-gn-bert-tiny-cased
 This model is a fine-tuned version of [mmaguero/gn-bert-tiny-cased](https://huggingface.co/mmaguero/gn-bert-tiny-cased) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 4.2579
 ## Model description
@@ -41,18 +39,11 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 201  | 4.3459          |
-| No log        | 2.0   | 402  | 4.2300          |
-| 4.5082        | 3.0   | 603  | 4.1851          |
-| 4.5082        | 4.0   | 804  | 4.1651          |
-| 4.2284        | 5.0   | 1005 | 4.1620          |
 ### Framework versions

 # multi-wiki-qa-gn-bert-tiny-cased
 This model is a fine-tuned version of [mmaguero/gn-bert-tiny-cased](https://huggingface.co/mmaguero/gn-bert-tiny-cased) on an unknown dataset.
 ## Model description
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:61dff2ddedcda329043df564625d9c3b4435cd3832752eb1b0ea633837677bbb
 size 36522312

 version https://git-lfs.github.com/spec/v1
+oid sha256:a8d3b2eff47dc3350f2bf1a2b567f277da41c2b6533ea8b65c394d9f3a603218
 size 36522312

runs/Nov10_17-43-14_69e3e5a50abd/events.out.tfevents.1762796653.69e3e5a50abd.714.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:3d7832f9a1d83dc0716ddaa45a4858f04abab231612769efe64df1a9315c2e81
+size 5287

runs/Nov10_17-58-32_69e3e5a50abd/events.out.tfevents.1762797513.69e3e5a50abd.714.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:aa4ed5ee9dff4b763650dddb31a47dd339533de6dea136b0b6bf1229c833fb5b
+size 5858

runs/Nov10_17-58-32_69e3e5a50abd/events.out.tfevents.1762798619.69e3e5a50abd.714.2 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:d1af52bf09711fd9a4fb2279361c11a4d0565ef43a28f2d72fe902cb86a1e6bb
+size 311

tokenizer.json CHANGED Viewed

@@ -4,7 +4,7 @@
     "direction": "Right",
     "max_length": 384,
     "strategy": "OnlySecond",
-    "stride": 0
   },
   "padding": {
     "strategy": {

     "direction": "Right",
     "max_length": 384,
     "strategy": "OnlySecond",
+    "stride": 128
   },
   "padding": {
     "strategy": {

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:22b498c91b88deba5294e4c5e20dbb291422002e376190aaa95fcea606387ded
-size 5905

 version https://git-lfs.github.com/spec/v1
+oid sha256:58b7f74a315e7301f77933159355b6f8bcdd4c42310a7a9efbb14a0e27ab94fb
+size 5841