mmaguero commited on
Commit
b1c663d
·
verified ·
1 Parent(s): 0c93b8a

End of training

Browse files
README.md CHANGED
@@ -15,8 +15,6 @@ should probably proofread and complete it, then remove this comment. -->
15
  # multi-wiki-qa-gn-bert-tiny-cased
16
 
17
  This model is a fine-tuned version of [mmaguero/gn-bert-tiny-cased](https://huggingface.co/mmaguero/gn-bert-tiny-cased) on an unknown dataset.
18
- It achieves the following results on the evaluation set:
19
- - Loss: 4.2579
20
 
21
  ## Model description
22
 
@@ -41,18 +39,11 @@ The following hyperparameters were used during training:
41
  - seed: 42
42
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
43
  - lr_scheduler_type: linear
44
- - num_epochs: 5
45
  - mixed_precision_training: Native AMP
46
 
47
  ### Training results
48
 
49
- | Training Loss | Epoch | Step | Validation Loss |
50
- |:-------------:|:-----:|:----:|:---------------:|
51
- | No log | 1.0 | 201 | 4.3459 |
52
- | No log | 2.0 | 402 | 4.2300 |
53
- | 4.5082 | 3.0 | 603 | 4.1851 |
54
- | 4.5082 | 4.0 | 804 | 4.1651 |
55
- | 4.2284 | 5.0 | 1005 | 4.1620 |
56
 
57
 
58
  ### Framework versions
 
15
  # multi-wiki-qa-gn-bert-tiny-cased
16
 
17
  This model is a fine-tuned version of [mmaguero/gn-bert-tiny-cased](https://huggingface.co/mmaguero/gn-bert-tiny-cased) on an unknown dataset.
 
 
18
 
19
  ## Model description
20
 
 
39
  - seed: 42
40
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
41
  - lr_scheduler_type: linear
42
+ - num_epochs: 2
43
  - mixed_precision_training: Native AMP
44
 
45
  ### Training results
46
 
 
 
 
 
 
 
 
47
 
48
 
49
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:61dff2ddedcda329043df564625d9c3b4435cd3832752eb1b0ea633837677bbb
3
  size 36522312
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a8d3b2eff47dc3350f2bf1a2b567f277da41c2b6533ea8b65c394d9f3a603218
3
  size 36522312
runs/Nov10_17-43-14_69e3e5a50abd/events.out.tfevents.1762796653.69e3e5a50abd.714.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3d7832f9a1d83dc0716ddaa45a4858f04abab231612769efe64df1a9315c2e81
3
+ size 5287
runs/Nov10_17-58-32_69e3e5a50abd/events.out.tfevents.1762797513.69e3e5a50abd.714.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:aa4ed5ee9dff4b763650dddb31a47dd339533de6dea136b0b6bf1229c833fb5b
3
+ size 5858
runs/Nov10_17-58-32_69e3e5a50abd/events.out.tfevents.1762798619.69e3e5a50abd.714.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d1af52bf09711fd9a4fb2279361c11a4d0565ef43a28f2d72fe902cb86a1e6bb
3
+ size 311
tokenizer.json CHANGED
@@ -4,7 +4,7 @@
4
  "direction": "Right",
5
  "max_length": 384,
6
  "strategy": "OnlySecond",
7
- "stride": 0
8
  },
9
  "padding": {
10
  "strategy": {
 
4
  "direction": "Right",
5
  "max_length": 384,
6
  "strategy": "OnlySecond",
7
+ "stride": 128
8
  },
9
  "padding": {
10
  "strategy": {
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:22b498c91b88deba5294e4c5e20dbb291422002e376190aaa95fcea606387ded
3
- size 5905
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:58b7f74a315e7301f77933159355b6f8bcdd4c42310a7a9efbb14a0e27ab94fb
3
+ size 5841