Transliterations

From BookOfWoo
Jump to: navigation, search

Transliteration is the act of rewriting a given text using letters from a different alphabet. As the Book of Woo is written in an apparently previously unknown alphabet, it can be difficult to manipulate, analyze, and quote it using computers and the Internet. As such, one of the first steps typically employed is to transliterate the text into another alphabet which can be more easily used. This page contains various transliterations which have been performed and posted for others to use.

The following transliterations have been updated to include all corrections which have been discovered to date (both transcription errors and errors in the original document which have been confirmed by Novil). They have also been reformatted here to better match the layout of the original.

(Some older transliterations without these corrections have also been posted for historical reference in the Old Transliterations page.)

Toki Pona[edit]

This is the correspondence:

Glyph (uppercase) A D F G J K L O P Q S U W X Y &
Glyph a d f g j k l o p q s u w x y &
Toki Pona n a k t i j s w l u m o ' p e &

Text[edit]

soweli meli'li nimi e'ni: soweli meli'li toki poka'kon sewi. wan'la meli suli'la ona'li'jo e & pimeja. tenpo nanpa walu sike pini'la ona'li open'lon'li kon sewi'li kute e'kon sewi. kon sewi'li toke e'ni: o'alasa e'ma mama'sin. ona meli'li pana e'nimi "mi tawa" tawa'jan sama ale sama'li tawa'tan'ma pi suno'pi pini ala. tenpo tawa tenpo tawa'ma lete'la ona meli'li lukin e'ken'pi moli'en pakala. tawa pona'en sona pona'la sona ona'la'kon sewi ike'li moli ala e'ona meli. tenpo suno'la ona lape. tenpo pimeja'la noka suno'li pana e'nasin tawa ona'lon tan'ma pimeja'lon tawa'ma suno.

pini'pi tawa suli'la ona'li kama tawa'ma pini. ma pini'li kasi suli lon'ma'pi kasi suli'pi kule laso'pi tenpo ale.

--

soweli meli'li toki poka'kon sewi'li mama meli'pi soweli ale'pi sinpin'pi & pimeja. mi soweli'li sama ala e'soweli ante. nasin'ni'la mi soweli'li sama meli suli'li pilin luka e'soweli ale kepeken luka sewi'pi ona'pi meli suli. tan'ni'la wawa sewi'li tawa'lon luka'pi'mi mute'li sitelen e'nasin'pi ma'pi kasi suli. nasin sewi'pi meli suli'li'lon lawa'mi mute. soweli ali'pi & pimeja'li wile tawa sama meli suli. soweli'li kute e'nimi'pi ona sewi'la soweli'pi sinpin ale'pi & pimeja'li alasa e'ma'sin.

tawa pona'en sona pona'la soweli'pi sinpin'pi & pimeja'li anpa e'pakala. soweli'pi & pimeja'li kepeken mute e'luka'li kepeken lili e'wawa ante.

--

tenpo sike'pi nanpa walu'la soweli'ni'li musi'lon'ma pi kasi suli'pi laso'pi tenpo ale. ala'li ante. taso tenpo sike suno

la'jan sona'wan'li kama tawa sinpin'en sinpin'lon'ma pi'ma nena'pi insa suli.

jan'li anpa e'wan e'ale. ale'la'li lon poka ante. jan sona'li tawa'ma'mi. mi'li tawa'ma'pi jan sona. jan mute'li pilin ike tawa'mi. jan mute'li pilin olin tawa'mi. jan lili'li pilin e'sona'lon pi'mi soweli. mi soweli mute'li kama'jo e'pilin pakala e'moli'lon tomo suli monsuta. soweli mute ante'li kama'jo e'pona'kin lon'ma'pi kasi suli'tan kiwen'en kiwen'pi kule ala'pi'jan sona'ni.

--

jan sona mute'li'lon ma ante e'soweli'pi & pimeja. taso tenpo seli'pi'ma ale'la waso lawa ike'en waso moli ike'li open e'tomo uta tawa pali ala tawa soweli pali'pi nanpa lili. soweli'li anpa e'ma'sin. mi soweli'li tawa insa'ma'pi kasi suli tawa insa'ma'pi kiwen'en kiwen'pi kule ala. mi soweli'li kama sona mute. mama mije'en mama meli'li anpa. taso tenpo kama'la soweli lili'li sewi! tenpo ni'kin'la nanpa'pi soweli'mi'li suli! tenpo kama'la nanpa'li kama ala lili. luka wan'li'lon uta kiwen'pi ilo moli. luka'tu'wan li'lon'ma pi musi'en moli ala! tenpo'pi soweli'pi & pimeja'li'sin!


additional text of speedrun: len insa anpa

word-by-word translation[edit]

Using the word list from http://tokipona.net/tp/classicwordlist.aspx Two words, WALU and TOKE, do not appear to have a translation. Maybe they're typos. Regarding grammar, many words can be a noun or an adjective or a verb depending on context. I chose to represent the subject-verb separator as "is" and the object separator as "for".


animal woman is name for this: animal woman is speech with wind high. one context-of woman big context-of they is have for & dark. time number WALU(white?) circle end context-of they is open exist is wind high is listen for wind high. wind high is TOKE(speech?) for this: Oh hunt for earth parent new. they woman is give for name "me towards" towards human same everything same is towards cause earth of light of end not. time towards time towards earth cold context-of they woman is looking for can of death and mistake/destroy. towards good and know good context-of know they context-of wind high bad is death not for they woman. time light context-of they sleep. time dark context-of leg light is give for way towards they exist cause earth dark exist towards earth light.

end of towards big context-of they is come towards earth end. earth end is plant big exist earth of plant big of color blue of time everything.

--

animal woman is speech with wind high is parent woman of animal everything of front of & dark. me animal is same not for animal other. way this context-of me animal is same woman big is feel hand for animal everything use/with hand high of they of woman big. cause this context-of power high is towards exist hand of me lots is drawing for way of earth of plant big. way high of woman big is exist head me lots. animal all of & dark is want towards same woman big. animal is listen for name of they high context-of animal of front everything of & dark is hunt for earth new.

towards good and know good context-of animal of front of & dark is bottom for mistake/destroy. animal of & dark is use/with lots for hand is use/with small for power other.

--

time circle of number WALU(white?) context-of animal this is play exist earth of plant big of blue of time everything. not is other. only time circle light

context-of human know one is come towards front and front exist earth of earth hill of inside big.

human is bottom for one for everything. everything context-of is exist with other. human know is towards earth me. me is towards earth of human know. human lots is feel bad towards me. human lots is feel love towards me. human small is feel for know exist of me animal. me animal lots is come have for feel mistake/destroy for death exist home big monster. animal lots other is come have for good also exist earth of plant big cause stone and stone of color not of human know this.

--

human know lots is exist earth other for animal of & dark. only time fire of earth everything context-of bird head bad and bird death bad is open for home mouth towards work not towards animal work of number small. animal is bottom for earth new. me animal is towards inside earth of plant big towards inside earth of stone and stone of color not. me animal is come know lots. parent man and parent woman is bottom. only time come context-of animal small is high! time this also context-of number of animal me is big! time come context-of number is come not small. hand one is exist mouth stone of tool death. hand two one is exist earth of play and death not! time of animal of & dark is new!


additional text of speedrun: cloth inside bottom

Ryan (corrected)[edit]

Ryan was the first user to transcribe the original text and post the result for others to use. As such, his has become something of a de-facto standard transliteration amongst those discussing the text online. The following version has been updated (by Foogod) to include the corrections noted by Novil and to correspond better with the later Satsuoni-Novil transcription.

Transliteration Key[edit]

Ryan abc.jpg

Text[edit]

Direct Download

Satsuoni-Novil[edit]

Satsuoni posted an updated version based on Ryan's original transliteration, but accounting for the character-set overlap which had been observed, and substituting equivalent characters to reduce the effective character set to 16 symbols instead of 31. A corrected version of this transliteration (using the same character mapping as Satsuoni, but fixing several confirmed errors) was later posted by Novil. The following is Novil's corrected version of Satsuoni's mapping of the text.

Text[edit]

Direct Download

Text for Cypher Replacement Tool[edit]

If you want to use the Satsuoni-Novil version with Jamie’s "Cypher Replacement Tool", you can use this version:

Direct Download

Foogod[edit]

Foogod has proposed an alternate character mapping, based on Ryan's and Satsuoni's, but designed to have the following features:

  • One-to-one mapping between glyphs and letters (no loss of information)
  • All word characters (except "&") are ASCII letters (works better with text processing tools and human eyes)
  • The Satsuoni character duplication is represented by upper/lower-case letters (so Satsuoni-folding can be achieved by simply converting to all-upper or all-lower case).

Transliteration Key[edit]

Foogod key.png

Text[edit]

The Foogod transliteration is provided in both ASCII and font-embedded format below.

Direct Download


WooGlyph[edit]

lUOYPJ SYPJWPJ AJSJ YWAJ: Luoypj sypjwpj gUFJ XufdwfUA LYOJ. ODAWPD SYPJ LQPJWPD uadwpjwkU Y & XJSYKD. GYAXu adaxd odpq ljfy xjajwpd UADWPJ uxyawpUAWPJ Fua lyojwpj fqgy ywfUA LYOJ. Fua lyojwpj gUFY YWAJ: uwdpdld ywsd sdsdwlja. UAD SYPJWPJ XDAD YWAJSJ "SJ GDOD" GDODWKDA LDSD DPY LDSDWPJ GDODWGDAWSD XJ LQAuwxj xjaj dpd. gyaxU GDOD GYAXu gdodwsd pygywpd UAD SYPJWPJ PQFJA YWFYAWXJ Supjwya xdfdpd. gdod xUADWYA Luad xUADWPD Luad UADWPDWFua lyoj jfywpj sUPJ DPD YWuad sypj. gyaxU LQAuwpd UAD PDXY. GYAXu xjsykdwpd aUFD LQAuwpj xdad ywadlja gdod UADWPua gdawsd xjsykdwpUA GDODWSD LQAu.

xjajwxj gdod lqpjwpd UADWPJ FDSD GDODWSD XJAJ. SD XJAJWPJ FDLJ LQPJ Puawsdwxj fdlj lqpjwxj fqpy pdlUWXJ GYAXu dpy. -- lUOYPJ SYPJWPJ Gufj xUFDWFua lyojwpj sdsd sypjwxj lUOYPJ DPYWXJ LJAXJAWXJ & XJSYKD. SJ Luoypjwpj ldsd dpd ywlUOYPJ DAGY. ADLJAWAJWPD SJ Luoypjwpj ldsd: sypj lqpjwpj xjpja pqfd ywlUOYPJ DPY FYXYFYA PQFD LYOJWXJ uadwxj sypj lqpj. gdawajwpd odod lyojwpj gdodwpUA PQFDWXJWSJ SQGYWPJ LJGYPYA YWADLJAWXJ SDWXJ FDLJ LQPJ. ADLJA LYOJWXJ SYPJ LQPJWPJWPua pdodwsj sqgy. lUOYPJ DPJWXJ & XJSYKDWPJ OJPY GDOD LDSD SYPJ LQPJ. Luoypjwpj fqgy ywajsjwxj UAD LYOJWPD Luoypjwxj ljaxja dpywxj & xjsykdwpj dpdld ywsdwlja.

gdod xUADWYA Luad xUADWPD Luoypjwxj ljaxjawxj & xjsykdwpj daxd ywxdfdpd. lUOYPJWXJ & XJSYKDWPJ FYXYFYA SQGY YWPQFDWPJ FYXYFYA PJPJ YWODOD DAGY. -- GYAXu ljfywxj adaxd odpqwpd lUOYPJWAJWPJ SQLJWPuawsd xj fdlj lqpjwxj pdlUwXJ GYAXu dpy. dpdwpj dagy. gdlU GYAXu ljfy lqaU

PDWKDA Luadwodawpj fdsd gdod ljaxjawya ljaxjawpUAWSD XJWSD AYADWXJ JALD LQPJ.

KDAWPJ DAXD YWODA YWDPY. DPYWPDWPJ Pua xUFD DAGY. KDA Luadwpj gdodwsdwsj. sjwpj gdodwsdwxj kda lUAD. KDA SQGYWPJ XJPJA JFY GDODWSJ. KDA SQGYWPJ XJPJA upja gdodwsj. kda pjpjwpj xjpja ywlUADWPua xjwsj lUOYPJ. SJ Luoypj sqgywpj fdsdwkU YWXJPJA XDFDPD YWSupjwpUA GusU LQPJ Sualqgd. lUOYPJ SQGY DAGYWPJ FDSDWKu ywxUADWFJA Puawsdwxj fdlj lqpjwgda fjoyawya fjoyawxj fqpy dpdwxjwkda lUADWAJ.

--

KDA Luad sqgywpjwpUA SD DAGY YWLuoypjwxj & xjsykd. gdlU GYAXu lypjwxjwsd dpywpd odlU PDOD JFYWYA ODLu sUPJ JFYWPJ uxya ywgUSu qgd gdod xdpj dpd gdod lUOYPJ XDPJWXJ ADAXD PJPJ. Luoypjwpj daxd ywsdwlja. sj lUOYPJWPJ GDOD JALDWSDWXJ FDLJ LQPJ GDOD JALDWSDWXJ FJOYAWYA FJOYAWXJ FQPY DPD. SJ Luoypjwpj fdsd lUAD SQGY. SDSD SJKYWYA SDSD SYPJWPJ DAXD. GDLu gyaxU FDSDWPD Luoypj pjpjwpj lyoj! gyaxU AJWFJAWPD ADAXDWXJ Luoypjwsjwpj lqpj! gyaxU FDSDWPD ADAXDWPJ FDSD DPD PJPJ. PQFD ODAWPJWPua qgd fjoyawxj jpU Supj. pqfdwgqwoda pjwpUAWSD XJ SQLJWYA Supj dpd! gyaxUWXJ Luoypjwxj & xjsykdwpjwlja!

This text may not display correctly in all browsers --Iamthepiguy (talk) 07:29, 21 July 2014 (UTC)

Font[edit]

Note that an Open Type Font WooGlyphSans of the Book of Woo alphabet is also available using this mapping, and is embedded in some articles of this wiki.

nneonneo[edit]

nneonneo proposes another transliteration, this time based on logical properties of the characters and not the glyphs they represent.

Essentially, glyphs are grouped into three classes, "consonants", "vowels" and "other" based on their observed relationships. These groups are translated logically into equivalent English letters. The class names aren't meant to actually imply a relationship to the analogous English letters; they are merely convenient notation to refer to the classes.

The essential properties of this transliteration are as follows:

  • The glyph classes can be easily distinguished by eye, because they are quite simply vowels/non-vowels.
  • The text itself can now be "pronounced" because of the alternation property of these glyph classes. This aids substantially in word recall as the text is now vaguely (but not too closely) English-like.
  • The mapping is intended as a "logical" mapping rather than a "physical" mapping; consequently the characters chosen deliberately bear little to no resemblance to the glyphs
  • The mapping partially preserves the frequency distribution: the more common glyphs were mapped into more common English letters where possible.
  • The mapping is still lossless because of Satsuoni's observation, though reversing the mapping is somewhat more challenging.

Here's the mapping:

Glyph (uppercase) A D F G J K L O P Q S U W X Y &
Glyph a d f g j k l o p q s u w x y &
Satsuoni-Novil m = # c n t i h r b s / z > v &
Foogod a d f g j k l o p q s u w x y &
nneonneo n e g w a c r l t u d i ' s o &

Vowels are italicized, and special characters are bold-italicized.

The full transliterated text:

Direct Download

kana-based token transcription[edit]

Given the regularities that have been noticed in the phonotactics (as it were) of how the glyphs are used, with consonants being consistently followed by vowels, it seemed sensible to me to produce a transcription that takes this into account, representing tokens (i.e., nneonneo syllables) as one character each. In order to avoid confusing people by overloading the Latin letters yet again, I chose the kana, because they represent syllables in Japanese, and because I am familiar with them.

When choosing individual kana for each token, I chose them so that the results would roughly correspond with the nneonneo transcription when pronounced. I also tried to avoid excessive visual similarity between characters (which would be confusing for people who do not know kana) by taking some of the symbols from katakana and others from hiragana, in order to avoid pairs that would be too similar in appearance. In a couple of cases, I allowed this desire to avoid visual similarity to trump the nneonneo pronunciation correspondance.

Working from the folded text, I assigned the foogod w (nneonneo ') to the sokuon (small tsu), which is a geminator in Japanese, on the theory (mentioned by someone else elsewhere) that it may even be used that way, although this has not really been explored or confirmed yet; and I assigned the other token that can stand on its own (foogod a, nneonneo n) to the syllabic nasal in those situations.

The remainder were chosen so that their standard romanization corresponds to nneonneo, with two exceptions: nneonneo tu (foogod pq) would have been too visually similar to either ' (w) or du (sq), so I grabbed hiragana u, which was available (because nneonneo u is represented by the katakana one); and wu (foogod gq) cannot be written in a single kana character, so I chose a wa character to stand in.

I treated c as k in all cases rather than s. Where two characters would otherwise have differed only by dakuten (voicing), such as with ta/da, or not at all, such as ra/la, I took one from katakana and the other from hiragana.

Here, then, is a draft kana transcription of the text:

リろタ どタッタ なだ オッな: リろタ どタッタ ヰが しげッぎん ロら。 れんッテ 
どタ ルタッテ イねッタッキ オ & サどケ。 をんし ねんセ れう ラご サなッテ 
イねッタ イそんッチんッタ ぎん ロらッタ ぐを オッぎん ロら。 ぎん ロらッタ ヰご 
オッな: イッエテレ オッで ででッラん。 イね どタッタ セね オッなだ "だ ヱれ" 
ヱれッケん レで エト レでッタ ヱれッヱんッで サ ルにッサ サな エテ。 をんし 
ヱれ をんし ヱれッで トをッテ イね どタッタ うがん オッごんッサ ぢタッオん 
セげテ。 ヱれ しねッオん リね 
しねッテ リね イねッテッぎん 
ロら アごッタ ぢタ エテ 
オッイね どタ。 をんし 
ルにッテ イね テそ。 
をんし サどケッテ 
にげ ルにッタ セね 
オッねラん ヱれ イねッチん 
ヱんッで サどケッチん 
ヱれッで ルに。 
サなッサ ヱれ 
ルタッテ イねッタ 
げで ヱれッで サな。 
で サなッタ げラ 
ルタ チんッでッサ 
げラ ルタッサ 
ぐト テリッサ 
をんし エト。 
ーー 
リろタ どタッタ ヰが しげッぎん ロらッタ でで どタッサ リろタ エトッサ 
ラんサんッサ & サどケ。 だ リろタッタ レで エテ オッリろタ エんを。 
ねラんッなッテ だ リろタッタ レで: 
どタ ルタッタ サタん うげ 
オッリろタ エト ごそごん うげ 
ロらッサ イねッサ どタ ルタ。 
ヱんッなッテ れれ ロらッタ 
ヱれッチん うげッサッだ づをッタ 
ラをトん オッねラんッサ でッサ 
げラ ルタ。 ねラん ロらッサ どタ 
ルタッタッチん テれッだ づを。 
リろタ エタッサ & サどケッタ らト 
ヱれ レで どタ ルタ。 リろタッタ 
ぐを オッなだッサ イね ロらッテ 
リろタッサ ラんサん エトッサ & 
サどケッタ エテレ オッでッラん。 
ヱれ しねッオん リね しねッテ 
リろタッサ ラんサんッサ & 
サどケッタ エんセ オッセげテ。 
リろタッサ & サどケッタ ごそごん づを 
オッうげッタ ごそごん タタ オッれれ エんを。 
ーー 
をんし ラごッサ ねんセ 
れうッテ リろタッなッタ 
づラッチんッで サ げラ 
ルタッサ テリッサ をんし 
エト。 エテッタ エんを。 
ヱリ をんし 
ラご ルに 
テッケん リねッれんッタ 
げで ヱれ ラんサんッオん 
ラんサんッチんッで サッで 
のねッサ アんレ ルタ。 
ケんッタ エんセ オッれん オッエト。 エトッテッタ チん しげ エんを。 
ケん リねッタ ヱれッでッだ。 だッタ ヱれッでッサ ケん リね。 
ケん づをッタ サタん アご ヱれッだ。 
ケん づをッタ サタん イタん ヱれッだ。 ケん タタッタ サタん 
オッリねッチん サッだ リろタ。 だ リろタ づをッタ げでッキ 
オッサタん セげテ オッぢタッチん ヰぢ ルタ ぢんルヱ。 リろタ づを 
エんをッタ げでッキ オッしねッがん チんッでッサ げラ ルタッヱん 
がろんッオん がろんッサ ぐト エテッサッケん リねッな。 
ーー 
ケん リね づをッタッチん で エんを オッリろタッサ & サどケ。 ヱリ をんし 
ロタッサッで エトッテ れリ テれ アごッオん れリ ぢタ アごッタ イそん オッヰぢ 
ウヱ ヱれ セタ エテ ヱれ リろタ セタッサ ねんセ タタ。 リろタッタ エんセ 
オッでッラん。 だ リろタッタ ヱれ アんレッでッサ げラ ルタ ヱれ アんレッでッサ 
がろんッオん がろんッサ ぐト エテ。 だ リろタッタ げで リね づを。 でで 
だコッオん でで どタッタ エんセ。 ヱリ をんし げでッテ リろタ タタッタ 
ロら! をんし なッがんッテ ねんセッサ リろタッだッタ ルタ! をんし げでッテ 
ねんセッタ げで エテ タタ。 うげ れんッタッチん ウヱ がろんッサ アチ ぢタ。 
うげッヲッれん タッチんッで サ づラッオん ぢタ エテ! をんしッサ 
リろタッサ & 
サどケッタッラん! 

Hopefully this transcription may prove useful. If indeed these "tokens" are the basic unit of information for the next layer of decryption, having them represented by just one character each may be helpful. (If nothing else, doing frequency analysis on this transcription shows that we are clearly not yet at the final stage of decryption, since the frequency of individual tokens/kana, except for the four most common ones, clearly falls off linearly.)

I believe there are 46 kana, if I counted correctly (plus &, which I left as & for now). Jonadab (talk) 14:50, 12 August 2014 (UTC)

token transliteration table[edit]

Here is a table showing all the different transliterations, in both alphabets, for each token: Woo-tokens-by-frequency.png Jonadab (talk) 14:48, 13 August 2014 (UTC)

Syllabary[edit]

I wondered whether Jonadab's "tokenization" was a complete syllabary or not, so I made a table! XD

The "consonants" that are on their own on the left column do not occur independently, hence they do not have an associated frequency. They are there as a headers for the rest of each row.

Syllables & Their Frequencies
"vowels" other
dD
31
jJ
8
qQ
2
Uu
15
yY
35
wW
204
&
8
"consonants" aA
121
ad AD
38
aj AJ
15
aU Au
6
ay AY
1
xX xd XD
16
xj XJ
70
xU Xu
22
xy XY
6
oO od OD
37
oj OJ
12
oy OY
30
kK kd KD
20
kU Ku
3
ky KY
1
gG gd GD
32
gq GQ
1
gU Gu
5
gy GY
33
fF fd FD
27
fj FJ
10
fq FQ
5
fU Fu
6
fy FY
15
lL ld LD
10
lj LJ
25
lq LQ
21
lU Lu
42
ly LY
12
pP pd PD
39
pj PJ
131
pq PQ
9
pU Pu
16
py PY
15
sS sd SD
43
sj SJ
18
sq SQ
11
sU Su
9
sy SY
23

--Trackwayback (talk) 22:51, 20 December 2014 (UTC)