Transliterations

From BookOfWoo
Revision as of 00:35, 2 March 2018 by Narga (talk | contribs) (Text)
Jump to: navigation, search

Transliteration is the act of rewriting a given text using letters from a different alphabet. As the Book of Woo is written in an apparently previously unknown alphabet, it can be difficult to manipulate, analyze, and quote it using computers and the Internet. As such, one of the first steps typically employed is to transliterate the text into another alphabet which can be more easily used. This page contains various transliterations which have been performed and posted for others to use.

The following transliterations have been updated to include all corrections which have been discovered to date (both transcription errors and errors in the original document which have been confirmed by Novil). They have also been reformatted here to better match the layout of the original.

(Some older transliterations without these corrections have also been posted for historical reference in the Old Transliterations page.)

Ryan (corrected)

Ryan was the first user to transcribe the original text and post the result for others to use. As such, his has become something of a de-facto standard transliteration amongst those discussing the text online. The following version has been updated (by Foogod) to include the corrections noted by Novil and to correspond better with the later Satsuoni-Novil transcription.

Transliteration Key

Ryan abc.jpg

Text

Direct Download

lehvrn svrnzrn mnsn vzmn: iuoypj $ypjwpj ge#n >ufdwfem ivhn. h=mzr=
svrn ibrnzr= uadwpjwke v & >nsvt=. cvm>u adaxd odpq ljfy xjajwpd
em=zrn uxyawpemzrn #ua lyojwpj fqgy ywfem ivhn. #ua lyojwpj ge#v
vzmn: uwdpdld yw$d $d$dwlja. em= svrnzrn >=m= vzmnsn "sn c=h="
c=h=zt=m i=s= =rv i=s=zrn c=h=zc=mzs= >n ibmuwxj xjaj dpd. gyaxe
c=h= cvm>u gdodw$d pygywpd em= svrnzrn rb#nm vz#vmz>n supjwya
xdfdpd. gdod xem=zvm iuad
xem=zr= iuad em=zr=z#ua
lyoj jfywpj $ern =r=
vzuad $ypj. gyaxe
ibmuwpd em= r=>v.
cvm>u xj$ykdwpd
ae#= ibmuwpj xdad
ywadlja gdod em=zrua
gdaw$d xj$ykdwpem
c=h=zs= ibmu.

xjajwxj gdod
lqpjwpd em=zrn
#=s= c=h=zs= >nmn.
s= >nmnzrn #=in
ibrn ruaw$dwxj
fdlj lqpjwxj
fqpy pdlez>n
cvm>u dpy.

--

lehvrn svrnzrn cufj xe#=z#ua lyojwpj $d$d $ypjwxj lehvrn =rvz>n
inm>nmz>n & >nsvt=. sn iuoypjwpj ld$d dpd ywlehvrn =mcv.
m=inmzmnzr= sn iuoypjwpj ld$d:
$ypj lqpjwpj xjpja pqfd
ywlehvrn =rv #v>v#vm rb#=
ivhnz>n uadwxj $ypj lqpj.
gdawajwpd odod lyojwpj
gdodwpem rb#=z>nzsn sbcvzrn
incvrvm vzm=inmz>n s=z>n
#=in ibrn. m=inm ivhnz>n svrn
ibrnzrnzrua pdodw$j $qgy.
lehvrn =rnz>n & >nsvt=zrn hnrv
c=h= i=s= svrn ibrn. iuoypjwpj
fqgy ywaj$jwxj em= ivhnzr=
iuoypjwxj ljaxja dpywxj &
xj$ykdwpj dpdld yw$dwlja.

gdod xem=zvm iuad xem=zr=
iuoypjwxj ljaxjawxj &
xj$ykdwpj daxd ywxdfdpd.
lehvrnz>n & >nsvt=zrn #v>v#vm sbcv
vzrb#=zrn #v>v#vm rnrn vzh=h= =mcv.

--

cvm>u ljfywxj adaxd
odpqwpd lehvrnzmnzrn
sbinzruaw$d xj fdlj
lqpjwxj pdlew>n cvm>u
dpy. dpdwpj dagy.
gdle cvm>u
ljfy lqae

r=zt=m iuadwodawpj
fd$d gdod ljaxjawya
ljaxjawpemzs= >nzs=
mvm=z>n nmi= ibrn.

t=mzrn =m>= vzh=m vz=rv. =rvzr=zrn rua xe#= =mcv.
t=m iuadwpj gdodw$dw$j. $jwpj gdodw$dwxj kda lem=.
t=m sbcvzrn >nrnm n#v c=h=zsn.
t=m sbcvzrn >nrnm upja gdodw$j. kda pjpjwpj xjpja
ywlem=zrua xjw$j lehvrn. sn iuoypj $qgywpj fd$dwke
vz>nrnm >=#=r= vzsupjwpem cu$e ibrn sualqgd. lehvrn sbcv
=mcvzrn #=s=ztu ywxem=z#nm ruaw$dwxj fdlj lqpjwgda
fjoyawya fjoyawxj fqpy dpdwxjwkda lem=zmn.

--

t=m iuad $qgywpjwpem s= =mcv vziuoypjwxj & xj$ykd. gdle cvm>u
lypjwxjw$d dpywpd odle r=h= n#vzvm h=iu $ern n#vzrn uxya ywgesu
qgd gdod xdpj dpd gdod lehvrn >=rnz>n m=m>= rnrn. iuoypjwpj daxd
yw$dwlja. $j lehvrnzrn c=h= nmi=zs=z>n #=in ibrn c=h= nmi=zs=z>n
#nhvmzvm #nhvmz>n #brv =r=. sn iuoypjwpj fd$d lem= sbcv. s=s=
sntvzvm s=s= svrnzrn =m>=. c=iu gyaxe #=s=zr= iuoypj pjpjwpj
lyoj! gyaxe mnz#nmzr= m=m>=z>n iuoypjw$jwpj lqpj! gyaxe #=s=zr=
m=m>=zrn #=s= =r= rnrn. rb#= h=mzrnzrua qgd fjoyawxj jpe supj.
pqfdwgqwoda pjwpemzs= >n sbinzvm supj dpd! gyaxez>n
iuoypjwxj &
xj$ykdwpjwlja!

Satsuoni-Novil

Satsuoni posted an updated version based on Ryan's original transliteration, but accounting for the character-set overlap which had been observed, and substituting equivalent characters to reduce the effective character set to 16 symbols instead of 31. A corrected version of this transliteration (using the same character mapping as Satsuoni, but fixing several confirmed errors) was later posted by Novil. The following is Novil's corrected version of Satsuoni's mapping of the text.

Text

Direct Download

i/hvrn svrnzrn mnsn vzmn: i/hvrn svrnzrn c/#n >/#=z#/m ivhn. h=mzr=
svrn ibrnzr= /m=zrnzt/ v & >nsvt=. cvm>/ m=m>= h=rb in#v >nmnzr=
/m=zrn />vmzr/mzrn #/m ivhnzrn #bcv vz#/m ivhn. #/m ivhnzrn c/#v
vzmn: /z=r=i= vzs= s=s=zinm. /m= svrnzrn >=m= vzmnsn "sn c=h="
c=h=zt=m i=s= =rv i=s=zrn c=h=zc=mzs= >n ibm/z>n >nmn =r=. cvm>/
c=h= cvm>/ c=h=zs= rvcvzr= /m= svrnzrn rb#nm vz#vmz>n s/rnzvm
>=#=r=. c=h= >/m=zvm i/m=
>/m=zr= i/m= /m=zr=z#/m
ivhn n#vzrn s/rn =r=
vz/m= svrn. cvm>/
ibm/zr= /m= r=>v.
cvm>/ >nsvt=zr=
m/#= ibm/zrn >=m=
vzm=inm c=h= /m=zr/m
c=mzs= >nsvt=zr/m
c=h=zs= ibm/.

>nmnz>n c=h=
ibrnzr= /m=zrn
#=s= c=h=zs= >nmn.
s= >nmnzrn #=in
ibrn r/mzs=z>n
#=in ibrnz>n
#brv r=i/z>n
cvm>/ =rv.

--

i/hvrn svrnzrn c/#n >/#=z#/m ivhnzrn s=s= svrnz>n i/hvrn =rvz>n
inm>nmz>n & >nsvt=. sn i/hvrnzrn i=s= =r= vzi/hvrn =mcv.
m=inmzmnzr= sn i/hvrnzrn i=s=:
svrn ibrnzrn >nrnm rb#=
vzi/hvrn =rv #v>v#vm rb#=
ivhnz>n /m=z>n svrn ibrn.
c=mzmnzr= h=h= ivhnzrn
c=h=zr/m rb#=z>nzsn sbcvzrn
incvrvm vzm=inmz>n s=z>n
#=in ibrn. m=inm ivhnz>n svrn
ibrnzrnzr/m r=h=zsn sbcv.
i/hvrn =rnz>n & >nsvt=zrn hnrv
c=h= i=s= svrn ibrn. i/hvrnzrn
#bcv vzmnsnz>n /m= ivhnzr=
i/hvrnz>n inm>nm =rvz>n &
>nsvt=zrn =r=i= vzs=zinm.

c=h= >/m=zvm i/m= >/m=zr=
i/hvrnz>n inm>nmz>n &
>nsvt=zrn =m>= vz>=#=r=.
i/hvrnz>n & >nsvt=zrn #v>v#vm sbcv
vzrb#=zrn #v>v#vm rnrn vzh=h= =mcv.

--

cvm>/ in#vz>n m=m>=
h=rbzr= i/hvrnzmnzrn
sbinzr/mzs= >n #=in
ibrnz>n r=i/z>n cvm>/
=rv. =r=zrn =mcv.
c=i/ cvm>/
in#v ibm/

10000

r=zt=m i/m=zh=mzrn
#=s= c=h= inm>nmzvm
inm>nmzr/mzs= >nzs=
mvm=z>n nmi= ibrn.

t=mzrn =m>= vzh=m vz=rv. =rvzr=zrn r/m >/#= =mcv.
t=m i/m=zrn c=h=zs=zsn. snzrn c=h=zs=z>n t=m i/m=.
t=m sbcvzrn >nrnm n#v c=h=zsn.
t=m sbcvzrn >nrnm /rnm c=h=zsn. t=m rnrnzrn >nrnm
vzi/m=zr/m >nzsn i/hvrn. sn i/hvrn sbcvzrn #=s=zt/
vz>nrnm >=#=r= vzs/rnzr/m c/s/ ibrn s/mibc=. i/hvrn sbcv
=mcvzrn #=s=zt/ vz>/m=z#nm r/mzs=z>n #=in ibrnzc=m
#nhvmzvm #nhvmz>n #brv =r=z>nzt=m i/m=zmn.

--

t=m i/m= sbcvzrnzr/m s= =mcv vzi/hvrnz>n & >nsvt=. c=i/ cvm>/
ivrnz>nzs= =rvzr= h=i/ r=h= n#vzvm h=i/ s/rn n#vzrn />vm vzc/s/
bc= c=h= >=rn =r= c=h= i/hvrn >=rnz>n m=m>= rnrn. i/hvrnzrn =m>=
vzs=zinm. sn i/hvrnzrn c=h= nmi=zs=z>n #=in ibrn c=h= nmi=zs=z>n
#nhvmzvm #nhvmz>n #brv =r=. sn i/hvrnzrn #=s= i/m= sbcv. s=s=
sntvzvm s=s= svrnzrn =m>=. c=i/ cvm>/ #=s=zr= i/hvrn rnrnzrn
ivhn! cvm>/ mnz#nmzr= m=m>=z>n i/hvrnzsnzrn ibrn! cvm>/ #=s=zr=
m=m>=zrn #=s= =r= rnrn. rb#= h=mzrnzr/m bc= #nhvmz>n nr/ s/rn.
rb#=zcbzh=m rnzr/mzs= >n sbinzvm s/rn =r=! cvm>/z>n
i/hvrnz>n &
>nsvt=zrnzinm!

Text for Cypher Replacement Tool

If you want to use the Satsuoni-Novil version with Jamie’s "Cypher Replacement Tool", you can use this version:

Direct Download | Wiki Source <include iframe src="http://bookofwoo.foogod.com/w/images/6/67/Transliteration_Satsuoni_Novil_Repltool.txt" />

Foogod

Foogod has proposed an alternate character mapping, based on Ryan's and Satsuoni's, but designed to have the following features:

  • One-to-one mapping between glyphs and letters (no loss of information)
  • All word characters (except "&") are ASCII letters (works better with text processing tools and human eyes)
  • The Satsuoni character duplication is represented by upper/lower-case letters (so Satsuoni-folding can be achieved by simply converting to all-upper or all-lower case).

Transliteration Key

Foogod key.png

Text

The Foogod transliteration is provided in both ASCII and font-embedded format below.

Direct Download | Wiki Source <include iframe src="http://bookofwoo.foogod.com/w/images/7/78/Transliteration_Foogod.txt" />

WooGlyph

<include nopre src="http://bookofwoo.foogod.com/w/images/7/78/Transliteration_Foogod.txt" class="wootext" />

This text may not display correctly in all browsers --Iamthepiguy (talk) 07:29, 21 July 2014 (UTC)

Font

Note that an OpenType font file of the Book of Woo alphabet is also available using this mapping, and is embedded in some articles of this wiki.

nneonneo

nneonneo proposes another transliteration, this time based on logical properties of the characters and not the glyphs they represent.

Essentially, glyphs are grouped into three classes, "consonants", "vowels" and "other" based on their observed relationships. These groups are translated logically into equivalent English letters. The class names aren't meant to actually imply a relationship to the analogous English letters; they are merely convenient notation to refer to the classes.

The essential properties of this transliteration are as follows:

  • The glyph classes can be easily distinguished by eye, because they are quite simply vowels/non-vowels.
  • The text itself can now be "pronounced" because of the alternation property of these glyph classes. This aids substantially in word recall as the text is now vaguely (but not too closely) English-like.
  • The mapping is intended as a "logical" mapping rather than a "physical" mapping; consequently the characters chosen deliberately bear little to no resemblance to the glyphs
  • The mapping partially preserves the frequency distribution: the more common glyphs were mapped into more common English letters where possible.
  • The mapping is still lossless because of Satsuoni's observation, though reversing the mapping is somewhat more challenging.

Here's the mapping:

Glyph (uppercase) A D F G J K L O P Q S U W X Y &
Glyph a d f g j k l o p q s u w x y &
Satsuoni-Novil m = # c n t i h r b s / z > v &
Foogod a d f g j k l o p q s u w x y &
nneonneo n e g w a c r l t u d i ' s o &

Vowels are italicized, and special characters are bold-italicized.

The full transliterated text:

Direct Download | Wiki Source <include iframe src="http://bookofwoo.foogod.com/w/images/6/61/Transliteration_nneonneo.txt" />


kana-based token transcription

Given the regularities that have been noticed in the phonotactics (as it were) of how the glyphs are used, with consonants being consistently followed by vowels, it seemed sensible to me to produce a transcription that takes this into account, representing tokens (i.e., nneonneo syllables) as one character each. In order to avoid confusing people by overloading the Latin letters yet again, I chose the kana, because they represent syllables in Japanese, and because I am familiar with them.

When choosing individual kana for each token, I chose them so that the results would roughly correspond with the nneonneo transcription when pronounced. I also tried to avoid excessive visual similarity between characters (which would be confusing for people who do not know kana) by taking some of the symbols from katakana and others from hiragana, in order to avoid pairs that would be too similar in appearance. In a couple of cases, I allowed this desire to avoid visual similarity to trump the nneonneo pronunciation correspondance.

Working from the folded text, I assigned the foogod w (nneonneo ') to the sokuon (small tsu), which is a geminator in Japanese, on the theory (mentioned by someone else elsewhere) that it may even be used that way, although this has not really been explored or confirmed yet; and I assigned the other token that can stand on its own (foogod a, nneonneo n) to the syllabic nasal in those situations.

The remainder were chosen so that their standard romanization corresponds to nneonneo, with two exceptions: nneonneo tu (foogod pq) would have been too visually similar to either ' (w) or du (sq), so I grabbed hiragana u, which was available (because nneonneo u is represented by the katakana one); and wu (foogod gq) cannot be written in a single kana character, so I chose a wa character to stand in.

I treated c as k in all cases rather than s. Where two characters would otherwise have differed only by dakuten (voicing), such as with ta/da, or not at all, such as ra/la, I took one from katakana and the other from hiragana.

Here, then, is a draft kana transcription of the text:

リろタ どタッタ なだ オッな: リろタ どタッタ ヰが しげッぎん ロら。 れんッテ 
どタ ルタッテ イねッタッキ オ & サどケ。 をんし ねんセ れう ラご サなッテ 
イねッタ イそんッチんッタ ぎん ロらッタ ぐを オッぎん ロら。 ぎん ロらッタ ヰご 
オッな: イッエテレ オッで ででッラん。 イね どタッタ セね オッなだ "だ ヱれ" 
ヱれッケん レで エト レでッタ ヱれッヱんッで サ ルにッサ サな エテ。 をんし 
ヱれ をんし ヱれッで トをッテ イね どタッタ うがん オッごんッサ ぢタッオん 
セげテ。 ヱれ しねッオん リね 
しねッテ リね イねッテッぎん 
ロら アごッタ ぢタ エテ 
オッイね どタ。 をんし 
ルにッテ イね テそ。 
をんし サどケッテ 
にげ ルにッタ セね 
オッねラん ヱれ イねッチん 
ヱんッで サどケッチん 
ヱれッで ルに。 
サなッサ ヱれ 
ルタッテ イねッタ 
げで ヱれッで サな。 
で サなッタ げラ 
ルタ チんッでッサ 
げラ ルタッサ 
ぐト テリッサ 
をんし エト。 
ーー 
リろタ どタッタ ヰが しげッぎん ロらッタ でで どタッサ リろタ エトッサ 
ラんサんッサ & サどケ。 だ リろタッタ レで エテ オッリろタ エんを。 
ねラんッなッテ だ リろタッタ レで: 
どタ ルタッタ サタん うげ 
オッリろタ エト ごそごん うげ 
ロらッサ イねッサ どタ ルタ。 
ヱんッなッテ れれ ロらッタ 
ヱれッチん うげッサッだ づをッタ 
ラをトん オッねラんッサ でッサ 
げラ ルタ。 ねラん ロらッサ どタ 
ルタッタッチん テれッだ づを。 
リろタ エタッサ & サどケッタ らト 
ヱれ レで どタ ルタ。 リろタッタ 
ぐを オッなだッサ イね ロらッテ 
リろタッサ ラんサん エトッサ & 
サどケッタ エテレ オッでッラん。 
ヱれ しねッオん リね しねッテ 
リろタッサ ラんサんッサ & 
サどケッタ エんセ オッセげテ。 
リろタッサ & サどケッタ ごそごん づを 
オッうげッタ ごそごん タタ オッれれ エんを。 
ーー 
をんし ラごッサ ねんセ 
れうッテ リろタッなッタ 
づラッチんッで サ げラ 
ルタッサ テリッサ をんし 
エト。 エテッタ エんを。 
ヱリ をんし 
ラご ルに 
テッケん リねッれんッタ 
げで ヱれ ラんサんッオん 
ラんサんッチんッで サッで 
のねッサ アんレ ルタ。 
ケんッタ エんセ オッれん オッエト。 エトッテッタ チん しげ エんを。 
ケん リねッタ ヱれッでッだ。 だッタ ヱれッでッサ ケん リね。 
ケん づをッタ サタん アご ヱれッだ。 
ケん づをッタ サタん イタん ヱれッだ。 ケん タタッタ サタん 
オッリねッチん サッだ リろタ。 だ リろタ づをッタ げでッキ 
オッサタん セげテ オッぢタッチん ヰぢ ルタ ぢんルヱ。 リろタ づを 
エんをッタ げでッキ オッしねッがん チんッでッサ げラ ルタッヱん 
がろんッオん がろんッサ ぐト エテッサッケん リねッな。 
ーー 
ケん リね づをッタッチん で エんを オッリろタッサ & サどケ。 ヱリ をんし 
ロタッサッで エトッテ れリ テれ アごッオん れリ ぢタ アごッタ イそん オッヰぢ 
ウヱ ヱれ セタ エテ ヱれ リろタ セタッサ ねんセ タタ。 リろタッタ エんセ 
オッでッラん。 だ リろタッタ ヱれ アんレッでッサ げラ ルタ ヱれ アんレッでッサ 
がろんッオん がろんッサ ぐト エテ。 だ リろタッタ げで リね づを。 でで 
だコッオん でで どタッタ エんセ。 ヱリ をんし げでッテ リろタ タタッタ 
ロら! をんし なッがんッテ ねんセッサ リろタッだッタ ルタ! をんし げでッテ 
ねんセッタ げで エテ タタ。 うげ れんッタッチん ウヱ がろんッサ アチ ぢタ。 
うげッヲッれん タッチんッで サ づラッオん ぢタ エテ! をんしッサ 
リろタッサ & 
サどケッタッラん! 

Hopefully this transcription may prove useful. If indeed these "tokens" are the basic unit of information for the next layer of decryption, having them represented by just one character each may be helpful. (If nothing else, doing frequency analysis on this transcription shows that we are clearly not yet at the final stage of decryption, since the frequency of individual tokens/kana, except for the four most common ones, clearly falls off linearly.)

I believe there are 46 kana, if I counted correctly (plus &, which I left as & for now). Jonadab (talk) 14:50, 12 August 2014 (UTC)

token transliteration table

Here is a table showing all the different transliterations, in both alphabets, for each token: Woo-tokens-by-frequency.png Jonadab (talk) 14:48, 13 August 2014 (UTC)

Syllabary

I wondered whether Jonadab's "tokenization" was a complete syllabary or not, so I made a table! XD

The "consonants" that are on their own on the left column do not occur independently, hence they do not have an associated frequency. They are there as a headers for the rest of each row.

Syllables & Their Frequencies
"vowels" other
Template:Woo
31
Template:Woo
8
Template:Woo
2
Template:Woo
15
Template:Woo
35
Template:Woo
204
Template:Woo
8
"consonants" Template:Woo
121
Template:Woo
38
Template:Woo
15
Template:Woo
6
Template:Woo
1
Template:Woo Template:Woo
16
Template:Woo
70
Template:Woo
22
Template:Woo
6
Template:Woo Template:Woo
37
Template:Woo
12
Template:Woo
30
Template:Woo Template:Woo
20
Template:Woo
3
Template:Woo
1
Template:Woo Template:Woo
32
Template:Woo
1
Template:Woo
5
Template:Woo
33
Template:Woo Template:Woo
27
Template:Woo
10
Template:Woo
5
Template:Woo
6
Template:Woo
15
Template:Woo Template:Woo
10
Template:Woo
25
Template:Woo
21
Template:Woo
42
Template:Woo
12
Template:Woo Template:Woo
39
Template:Woo
131
Template:Woo
9
Template:Woo
16
Template:Woo
15
Template:Woo Template:Woo
43
Template:Woo
18
Template:Woo
11
Template:Woo
9
Template:Woo
23

--Trackwayback (talk) 22:51, 20 December 2014 (UTC)