Analytical Results/Token Analysis

From BookOfWoo
Jump to: navigation, search

The following uses Foogod's transliteration (folded), and refers to Foogod and Phlosioneer's theory that a single "token" consists of an a or w glyph, or two consecutive glyphs.

Token Frequency[edit]

The following shows the frequency with which each token appears in the text:

 w    204   15.5%       lj    25    1.9%       sq    11    0.8%
 a    181   13.7%       sy    23    1.7%       fj    10    0.8%
pj    131   9.9%         j    23    1.7%       ld    10    0.8%
xj    70    5.3%        xu    22    1.7%       su    9     0.7%
 d    69    5.2%        lq    21    1.6%       pq    9     0.7%
sd    43    3.3%         u    21    1.6%        &    8     0.6%
lu    42    3.2%        kd    20    1.5%       fu    6     0.5%
pd    39    3.0%        sj    18    1.4%       xy    6     0.5%
od    37    2.8%        xd    16    1.2%       fq    5     0.4%
 y    36    2.7%        pu    16    1.2%       gu    5     0.4%
gy    33    2.5%        fy    15    1.1%       ku    3     0.2%
gd    32    2.4%        py    15    1.1%        q    2     0.2%
oy    30    2.3%        ly    12    0.9%       ky    1     0.1%
fd    27    2.0%        oj    12    0.9%       gq    1     0.1%

Word Length[edit]

The following shows the number of words of each respective length:

1 token  long: 20
2 tokens long: 108
3 tokens long: 69
4 tokens long: 70
5 tokens long: 49
6 tokens long: 25
7 tokens long: 17
8 tokens long: 8
9 tokens long: 2