-
Notifications
You must be signed in to change notification settings - Fork 8
/
eval.log
83 lines (83 loc) · 6.37 KB
/
eval.log
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
Running the following version of UD tools:
commit c1984d97df0ecdcc1b50fbeaa8c96419c6321432
Author: Dan Zeman <[email protected]>
Date: Sun Nov 10 10:33:45 2024 +0100
Evaluating the following revision of UD_Korean-Kaist:
commit 56d958bfb1e17ffbe1b498522d3e0ded1cd7ee04
Author: Dan Zeman <[email protected]>
Date: Sun May 5 12:58:58 2024 +0200
Size: counted 350090 of 350090 words (nodes).
Size: min(0, log((N/1000)**2)) = 11.7163805285701.
Size: maximum value 13.815511 is for 1000000 words or more.
Split: Found more than 10000 training words.
Split: Found at least 10000 development words.
Split: Found at least 10000 test words.
Lemmas: source of annotation (from README) factor is 0.8.
Universal POS tags: 17 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 0.8.
Features: 0 out of 350090 total words have one or more features.
Features: source of annotation (from README) factor is 0.4.
Universal relations: 32 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 0.8.
Udapi:
TOTAL 88228
Udapi: found 88228 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 350090 words.
Genres: found 3 out of 17 known.
/net/work/people/zeman/unidep/tools/validate.py --lang ko --max-err=10 UD_Korean-Kaist/ko_kaist-ud-dev.conllu
[Line 495 Sent M2TA_069-s33 Node 6]: [L3 Syntax too-many-subjects] Multiple subjects [1, 5] not subtyped as ':outer'. Outer subjects are allowed if a clause acts as the predicate of another clause.
[Line 750 Sent M2TA_069-s49 Node 7]: [L3 Syntax too-many-subjects] Multiple subjects [3, 6] not subtyped as ':outer'.
[Line 1002 Sent M2TA_089-s1 Node 6]: [L3 Syntax too-many-subjects] Multiple subjects [4, 5] not subtyped as ':outer'.
[Line 1282 Sent M2TA_089-s23 Node 5]: [L3 Syntax too-many-subjects] Multiple subjects [3, 4] not subtyped as ':outer'.
[Line 2033 Sent MH2_0069-s7 Node 13]: [L3 Syntax too-many-subjects] Multiple subjects [10, 12] not subtyped as ':outer'.
[Line 4392 Sent MH2_0069-s139 Node 4]: [L3 Syntax too-many-subjects] Multiple subjects [2, 3] not subtyped as ':outer'.
[Line 4445 Sent MH2_0069-s141 Node 20]: [L3 Syntax too-many-subjects] Multiple subjects [13, 19] not subtyped as ':outer'.
[Line 5032 Sent MH2_0069-s174 Node 13]: [L3 Syntax too-many-subjects] Multiple subjects [10, 12] not subtyped as ':outer'.
[Line 8876 Sent MH2_0069-s384 Node 10]: [L3 Syntax too-many-subjects] Multiple subjects [3, 9] not subtyped as ':outer'.
[Line 9608 Sent MH2_0069-s426 Node 19]: [L3 Syntax too-many-subjects] Multiple subjects [10, 18] not subtyped as ':outer'.
...suppressing further errors regarding Syntax
Syntax errors: 34
*** FAILED *** with 34 errors
Exit code: 1
/net/work/people/zeman/unidep/tools/validate.py --lang ko --max-err=10 UD_Korean-Kaist/ko_kaist-ud-test.conllu
[Line 1835 Sent M2TA_090-s45 Node 6]: [L3 Syntax too-many-subjects] Multiple subjects [4, 5] not subtyped as ':outer'. Outer subjects are allowed if a clause acts as the predicate of another clause.
[Line 2510 Sent MH2_0010-s30 Node 4]: [L3 Syntax too-many-subjects] Multiple subjects [2, 3] not subtyped as ':outer'.
[Line 2932 Sent MH2_0010-s54 Node 4]: [L3 Syntax too-many-subjects] Multiple subjects [2, 3] not subtyped as ':outer'.
[Line 3572 Sent MH2_0010-s90 Node 6]: [L3 Syntax too-many-subjects] Multiple subjects [3, 5] not subtyped as ':outer'.
[Line 3806 Sent MH2_0010-s103 Node 10]: [L3 Syntax too-many-subjects] Multiple subjects [6, 9] not subtyped as ':outer'.
[Line 3865 Sent MH2_0010-s107 Node 6]: [L3 Syntax too-many-subjects] Multiple subjects [3, 5] not subtyped as ':outer'.
[Line 4641 Sent MH2_0010-s154 Node 16]: [L3 Syntax too-many-subjects] Multiple subjects [10, 15] not subtyped as ':outer'.
[Line 4904 Sent MH2_0010-s169 Node 12]: [L3 Syntax too-many-subjects] Multiple subjects [9, 11] not subtyped as ':outer'.
[Line 6246 Sent MH2_0010-s246 Node 13]: [L3 Syntax too-many-subjects] Multiple subjects [9, 12] not subtyped as ':outer'.
[Line 6829 Sent MH2_0010-s280 Node 5]: [L3 Syntax too-many-subjects] Multiple subjects [1, 4] not subtyped as ':outer'.
...suppressing further errors regarding Syntax
Syntax errors: 32
*** FAILED *** with 32 errors
Exit code: 1
/net/work/people/zeman/unidep/tools/validate.py --lang ko --max-err=10 UD_Korean-Kaist/ko_kaist-ud-train.conllu
[Line 170 Sent M2TA_064-s11 Node 7]: [L3 Syntax too-many-subjects] Multiple subjects [4, 6] not subtyped as ':outer'. Outer subjects are allowed if a clause acts as the predicate of another clause.
[Line 1604 Sent M2TA_065-s48 Node 15]: [L3 Syntax too-many-subjects] Multiple subjects [11, 13] not subtyped as ':outer'.
[Line 2068 Sent M2TA_065-s83 Node 4]: [L3 Syntax too-many-subjects] Multiple subjects [2, 3] not subtyped as ':outer'.
[Line 7456 Sent M2TA_072-s23 Node 10]: [L3 Syntax too-many-subjects] Multiple subjects [7, 9] not subtyped as ':outer'.
[Line 8193 Sent M2TA_073-s26 Node 8]: [L3 Syntax too-many-subjects] Multiple subjects [3, 7] not subtyped as ':outer'.
[Line 8237 Sent M2TA_073-s28 Node 17]: [L3 Syntax too-many-subjects] Multiple subjects [14, 16] not subtyped as ':outer'.
[Line 8696 Sent M2TA_074-s3 Node 7]: [L3 Syntax too-many-subjects] Multiple subjects [3, 6] not subtyped as ':outer'.
[Line 10894 Sent M2TA_082-s44 Node 6]: [L3 Syntax too-many-subjects] Multiple subjects [4, 5] not subtyped as ':outer'.
[Line 11152 Sent M2TA_082-s63 Node 7]: [L3 Syntax too-many-subjects] Multiple subjects [5, 6] not subtyped as ':outer'.
[Line 12181 Sent M2TA_083-s58 Node 5]: [L3 Syntax too-many-subjects] Multiple subjects [1, 4] not subtyped as ':outer'.
...suppressing further errors regarding Syntax
Syntax errors: 323
*** FAILED *** with 323 errors
Exit code: 1
Validity: 0.01
(weight=0.0769230769230769) * (score{features}=0.01) = 0.000769230769230769
(weight=0.0769230769230769) * (score{genres}=0.176470588235294) = 0.0135746606334842
(weight=0.0769230769230769) * (score{lemmas}=0.8) = 0.0615384615384615
(weight=0.256410256410256) * (score{size}=0.848059901906116) = 0.217451256899004
(weight=0.0512820512820513) * (score{split}=1) = 0.0512820512820513
(weight=0.0769230769230769) * (score{tags}=0.8) = 0.0615384615384615
(weight=0.307692307692308) * (score{udapi}=0.01) = 0.00307692307692308
(weight=0.0769230769230769) * (score{udeprels}=0.691891891891892) = 0.0532224532224532
(TOTAL score=0.46245349896007) * (availability=1) * (validity=0.01) = 0.0046245349896007
STARS = 0
UD_Korean-Kaist 0.0046245349896007 0