Transliterations
Transliteration is the act of rewriting a given text using letters from a different alphabet. As the Book of Woo is written in an apparently previously unknown alphabet, it can be difficult to manipulate, analyze, and quote it using computers and the Internet. As such, one of the first steps typically employed is to transliterate the text into another alphabet which can be more easily used. This page contains various transliterations which have been performed and posted for others to use.
The following transliterations have been updated to include all corrections which have been discovered to date (both transcription errors and errors in the original document which have been confirmed by Novil). They have also been reformatted here to better match the layout of the original.
(Some older transliterations without these corrections have also been posted for historical reference in the Old Transliterations page.)
Contents
Ryan (corrected)
Ryan was the first user to transcribe the original text and post the result for others to use. As such, his has become something of a de-facto standard transliteration amongst those discussing the text online. The following version has been updated (by Foogod) to include the corrections noted by Novil and to correspond better with the later Satsuoni-Novil transcription.
Transliteration Key
Text
lehvrn svrnzrn mnsn vzmn: iuoypj $ypjwpj ge#n >ufdwfem ivhn. h=mzr= svrn ibrnzr= uadwpjwke v & >nsvt=. cvm>u adaxd odpq ljfy xjajwpd em=zrn uxyawpemzrn #ua lyojwpj fqgy ywfem ivhn. #ua lyojwpj ge#v vzmn: uwdpdld yw$d $d$dwlja. em= svrnzrn >=m= vzmnsn "sn c=h=" c=h=zt=m i=s= =rv i=s=zrn c=h=zc=mzs= >n ibmuwxj xjaj dpd. gyaxe c=h= cvm>u gdodw$d pygywpd em= svrnzrn rb#nm vz#vmz>n supjwya xdfdpd. gdod xem=zvm iuad xem=zr= iuad em=zr=z#ua lyoj jfywpj $ern =r= vzuad $ypj. gyaxe ibmuwpd em= r=>v. cvm>u xj$ykdwpd ae#= ibmuwpj xdad ywadlja gdod em=zrua gdaw$d xj$ykdwpem c=h=zs= ibmu. xjajwxj gdod lqpjwpd em=zrn #=s= c=h=zs= >nmn. s= >nmnzrn #=in ibrn ruaw$dwxj fdlj lqpjwxj fqpy pdlez>n cvm>u dpy. -- lehvrn svrnzrn cufj xe#=z#ua lyojwpj $d$d $ypjwxj lehvrn =rvz>n inm>nmz>n & >nsvt=. sn iuoypjwpj ld$d dpd ywlehvrn =mcv. m=inmzmnzr= sn iuoypjwpj ld$d: $ypj lqpjwpj xjpja pqfd ywlehvrn =rv #v>v#vm rb#= ivhnz>n uadwxj $ypj lqpj. gdawajwpd odod lyojwpj gdodwpem rb#=z>nzsn sbcvzrn incvrvm vzm=inmz>n s=z>n #=in ibrn. m=inm ivhnz>n svrn ibrnzrnzrua pdodw$j $qgy. lehvrn =rnz>n & >nsvt=zrn hnrv c=h= i=s= svrn ibrn. iuoypjwpj fqgy ywaj$jwxj em= ivhnzr= iuoypjwxj ljaxja dpywxj & xj$ykdwpj dpdld yw$dwlja. gdod xem=zvm iuad xem=zr= iuoypjwxj ljaxjawxj & xj$ykdwpj daxd ywxdfdpd. lehvrnz>n & >nsvt=zrn #v>v#vm sbcv vzrb#=zrn #v>v#vm rnrn vzh=h= =mcv. -- cvm>u ljfywxj adaxd odpqwpd lehvrnzmnzrn sbinzruaw$d xj fdlj lqpjwxj pdlew>n cvm>u dpy. dpdwpj dagy. gdle cvm>u ljfy lqae r=zt=m iuadwodawpj fd$d gdod ljaxjawya ljaxjawpemzs= >nzs= mvm=z>n nmi= ibrn. t=mzrn =m>= vzh=m vz=rv. =rvzr=zrn rua xe#= =mcv. t=m iuadwpj gdodw$dw$j. $jwpj gdodw$dwxj kda lem=. t=m sbcvzrn >nrnm n#v c=h=zsn. t=m sbcvzrn >nrnm upja gdodw$j. kda pjpjwpj xjpja ywlem=zrua xjw$j lehvrn. sn iuoypj $qgywpj fd$dwke vz>nrnm >=#=r= vzsupjwpem cu$e ibrn sualqgd. lehvrn sbcv =mcvzrn #=s=ztu ywxem=z#nm ruaw$dwxj fdlj lqpjwgda fjoyawya fjoyawxj fqpy dpdwxjwkda lem=zmn. -- t=m iuad $qgywpjwpem s= =mcv vziuoypjwxj & xj$ykd. gdle cvm>u lypjwxjw$d dpywpd odle r=h= n#vzvm h=iu $ern n#vzrn uxya ywgesu qgd gdod xdpj dpd gdod lehvrn >=rnz>n m=m>= rnrn. iuoypjwpj daxd yw$dwlja. $j lehvrnzrn c=h= nmi=zs=z>n #=in ibrn c=h= nmi=zs=z>n #nhvmzvm #nhvmz>n #brv =r=. sn iuoypjwpj fd$d lem= sbcv. s=s= sntvzvm s=s= svrnzrn =m>=. c=iu gyaxe #=s=zr= iuoypj pjpjwpj lyoj! gyaxe mnz#nmzr= m=m>=z>n iuoypjw$jwpj lqpj! gyaxe #=s=zr= m=m>=zrn #=s= =r= rnrn. rb#= h=mzrnzrua qgd fjoyawxj jpe supj. pqfdwgqwoda pjwpemzs= >n sbinzvm supj dpd! gyaxez>n iuoypjwxj & xj$ykdwpjwlja!
Satsuoni-Novil
Satsuoni posted an updated version based on Ryan's original transliteration, but accounting for the character-set overlap which had been observed, and substituting equivalent characters to reduce the effective character set to 16 symbols instead of 31. A corrected version of this transliteration (using the same character mapping as Satsuoni, but fixing several confirmed errors) was later posted by Novil. The following is Novil's corrected version of Satsuoni's mapping of the text.
Text
Direct Download | Wiki Source <include iframe src="http://bookofwoo.foogod.com/w/images/e/ed/Transliteration_Satsuoni_Novil.txt" />
Text for Cypher Replacement Tool
If you want to use the Satsuoni-Novil version with Jamie’s "Cypher Replacement Tool", you can use this version:
Direct Download | Wiki Source <include iframe src="http://bookofwoo.foogod.com/w/images/6/67/Transliteration_Satsuoni_Novil_Repltool.txt" />
Foogod
Foogod has proposed an alternate character mapping, based on Ryan's and Satsuoni's, but designed to have the following features:
- One-to-one mapping between glyphs and letters (no loss of information)
- All word characters (except "&") are ASCII letters (works better with text processing tools and human eyes)
- The Satsuoni character duplication is represented by upper/lower-case letters (so Satsuoni-folding can be achieved by simply converting to all-upper or all-lower case).
Transliteration Key
Text
The Foogod transliteration is provided in both ASCII and font-embedded format below.
Direct Download | Wiki Source <include iframe src="http://bookofwoo.foogod.com/w/images/7/78/Transliteration_Foogod.txt" />
WooGlyph
<include nopre src="http://bookofwoo.foogod.com/w/images/7/78/Transliteration_Foogod.txt" class="wootext" />
This text may not display correctly in all browsers --Iamthepiguy (talk) 07:29, 21 July 2014 (UTC)
Font
Note that an OpenType font file of the Book of Woo alphabet is also available using this mapping, and is embedded in some articles of this wiki.
nneonneo
nneonneo proposes another transliteration, this time based on logical properties of the characters and not the glyphs they represent.
Essentially, glyphs are grouped into three classes, "consonants", "vowels" and "other" based on their observed relationships. These groups are translated logically into equivalent English letters. The class names aren't meant to actually imply a relationship to the analogous English letters; they are merely convenient notation to refer to the classes.
The essential properties of this transliteration are as follows:
- The glyph classes can be easily distinguished by eye, because they are quite simply vowels/non-vowels.
- The text itself can now be "pronounced" because of the alternation property of these glyph classes. This aids substantially in word recall as the text is now vaguely (but not too closely) English-like.
- The mapping is intended as a "logical" mapping rather than a "physical" mapping; consequently the characters chosen deliberately bear little to no resemblance to the glyphs
- The mapping partially preserves the frequency distribution: the more common glyphs were mapped into more common English letters where possible.
- The mapping is still lossless because of Satsuoni's observation, though reversing the mapping is somewhat more challenging.
Here's the mapping:
Glyph (uppercase) | A | D | F | G | J | K | L | O | P | Q | S | U | W | X | Y | & |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Glyph | a | d | f | g | j | k | l | o | p | q | s | u | w | x | y | & |
Satsuoni-Novil | m | = | # | c | n | t | i | h | r | b | s | / | z | > | v | & |
Foogod | a | d | f | g | j | k | l | o | p | q | s | u | w | x | y | & |
nneonneo | n | e | g | w | a | c | r | l | t | u | d | i | ' | s | o | & |
Vowels are italicized, and special characters are bold-italicized.
The full transliterated text:
Direct Download | Wiki Source <include iframe src="http://bookofwoo.foogod.com/w/images/6/61/Transliteration_nneonneo.txt" />
kana-based token transcription
Given the regularities that have been noticed in the phonotactics (as it were) of how the glyphs are used, with consonants being consistently followed by vowels, it seemed sensible to me to produce a transcription that takes this into account, representing tokens (i.e., nneonneo syllables) as one character each. In order to avoid confusing people by overloading the Latin letters yet again, I chose the kana, because they represent syllables in Japanese, and because I am familiar with them.
When choosing individual kana for each token, I chose them so that the results would roughly correspond with the nneonneo transcription when pronounced. I also tried to avoid excessive visual similarity between characters (which would be confusing for people who do not know kana) by taking some of the symbols from katakana and others from hiragana, in order to avoid pairs that would be too similar in appearance. In a couple of cases, I allowed this desire to avoid visual similarity to trump the nneonneo pronunciation correspondance.
Working from the folded text, I assigned the foogod w (nneonneo ') to the sokuon (small tsu), which is a geminator in Japanese, on the theory (mentioned by someone else elsewhere) that it may even be used that way, although this has not really been explored or confirmed yet; and I assigned the other token that can stand on its own (foogod a, nneonneo n) to the syllabic nasal in those situations.
The remainder were chosen so that their standard romanization corresponds to nneonneo, with two exceptions: nneonneo tu (foogod pq) would have been too visually similar to either ' (w) or du (sq), so I grabbed hiragana u, which was available (because nneonneo u is represented by the katakana one); and wu (foogod gq) cannot be written in a single kana character, so I chose a wa character to stand in.
I treated c as k in all cases rather than s. Where two characters would otherwise have differed only by dakuten (voicing), such as with ta/da, or not at all, such as ra/la, I took one from katakana and the other from hiragana.
Here, then, is a draft kana transcription of the text:
リろタ どタッタ なだ オッな: リろタ どタッタ ヰが しげッぎん ロら。 れんッテ どタ ルタッテ イねッタッキ オ & サどケ。 をんし ねんセ れう ラご サなッテ イねッタ イそんッチんッタ ぎん ロらッタ ぐを オッぎん ロら。 ぎん ロらッタ ヰご オッな: イッエテレ オッで ででッラん。 イね どタッタ セね オッなだ "だ ヱれ" ヱれッケん レで エト レでッタ ヱれッヱんッで サ ルにッサ サな エテ。 をんし ヱれ をんし ヱれッで トをッテ イね どタッタ うがん オッごんッサ ぢタッオん セげテ。 ヱれ しねッオん リね しねッテ リね イねッテッぎん ロら アごッタ ぢタ エテ オッイね どタ。 をんし ルにッテ イね テそ。 をんし サどケッテ にげ ルにッタ セね オッねラん ヱれ イねッチん ヱんッで サどケッチん ヱれッで ルに。 サなッサ ヱれ ルタッテ イねッタ げで ヱれッで サな。 で サなッタ げラ ルタ チんッでッサ げラ ルタッサ ぐト テリッサ をんし エト。 ーー リろタ どタッタ ヰが しげッぎん ロらッタ でで どタッサ リろタ エトッサ ラんサんッサ & サどケ。 だ リろタッタ レで エテ オッリろタ エんを。 ねラんッなッテ だ リろタッタ レで: どタ ルタッタ サタん うげ オッリろタ エト ごそごん うげ ロらッサ イねッサ どタ ルタ。 ヱんッなッテ れれ ロらッタ ヱれッチん うげッサッだ づをッタ ラをトん オッねラんッサ でッサ げラ ルタ。 ねラん ロらッサ どタ ルタッタッチん テれッだ づを。 リろタ エタッサ & サどケッタ らト ヱれ レで どタ ルタ。 リろタッタ ぐを オッなだッサ イね ロらッテ リろタッサ ラんサん エトッサ & サどケッタ エテレ オッでッラん。 ヱれ しねッオん リね しねッテ リろタッサ ラんサんッサ & サどケッタ エんセ オッセげテ。 リろタッサ & サどケッタ ごそごん づを オッうげッタ ごそごん タタ オッれれ エんを。 ーー をんし ラごッサ ねんセ れうッテ リろタッなッタ づラッチんッで サ げラ ルタッサ テリッサ をんし エト。 エテッタ エんを。 ヱリ をんし ラご ルに テッケん リねッれんッタ げで ヱれ ラんサんッオん ラんサんッチんッで サッで のねッサ アんレ ルタ。 ケんッタ エんセ オッれん オッエト。 エトッテッタ チん しげ エんを。 ケん リねッタ ヱれッでッだ。 だッタ ヱれッでッサ ケん リね。 ケん づをッタ サタん アご ヱれッだ。 ケん づをッタ サタん イタん ヱれッだ。 ケん タタッタ サタん オッリねッチん サッだ リろタ。 だ リろタ づをッタ げでッキ オッサタん セげテ オッぢタッチん ヰぢ ルタ ぢんルヱ。 リろタ づを エんをッタ げでッキ オッしねッがん チんッでッサ げラ ルタッヱん がろんッオん がろんッサ ぐト エテッサッケん リねッな。 ーー ケん リね づをッタッチん で エんを オッリろタッサ & サどケ。 ヱリ をんし ロタッサッで エトッテ れリ テれ アごッオん れリ ぢタ アごッタ イそん オッヰぢ ウヱ ヱれ セタ エテ ヱれ リろタ セタッサ ねんセ タタ。 リろタッタ エんセ オッでッラん。 だ リろタッタ ヱれ アんレッでッサ げラ ルタ ヱれ アんレッでッサ がろんッオん がろんッサ ぐト エテ。 だ リろタッタ げで リね づを。 でで だコッオん でで どタッタ エんセ。 ヱリ をんし げでッテ リろタ タタッタ ロら! をんし なッがんッテ ねんセッサ リろタッだッタ ルタ! をんし げでッテ ねんセッタ げで エテ タタ。 うげ れんッタッチん ウヱ がろんッサ アチ ぢタ。 うげッヲッれん タッチんッで サ づラッオん ぢタ エテ! をんしッサ リろタッサ & サどケッタッラん!
Hopefully this transcription may prove useful. If indeed these "tokens" are the basic unit of information for the next layer of decryption, having them represented by just one character each may be helpful. (If nothing else, doing frequency analysis on this transcription shows that we are clearly not yet at the final stage of decryption, since the frequency of individual tokens/kana, except for the four most common ones, clearly falls off linearly.)
I believe there are 46 kana, if I counted correctly (plus &, which I left as & for now). Jonadab (talk) 14:50, 12 August 2014 (UTC)
token transliteration table
Here is a table showing all the different transliterations, in both alphabets, for each token: Jonadab (talk) 14:48, 13 August 2014 (UTC)
Syllabary
I wondered whether Jonadab's "tokenization" was a complete syllabary or not, so I made a table! XD
The "consonants" that are on their own on the left column do not occur independently, hence they do not have an associated frequency. They are there as a headers for the rest of each row.
"vowels" | other | |||||||
Template:Woo 31 |
Template:Woo 8 |
Template:Woo 2 |
Template:Woo 15 |
Template:Woo 35 |
Template:Woo 204 |
Template:Woo 8 | ||
"consonants" | Template:Woo 121 |
Template:Woo 38 |
Template:Woo 15 |
Template:Woo 6 |
Template:Woo 1 | |||
Template:Woo | Template:Woo 16 |
Template:Woo 70 |
Template:Woo 22 |
Template:Woo 6 | ||||
Template:Woo | Template:Woo 37 |
Template:Woo 12 |
Template:Woo 30 | |||||
Template:Woo | Template:Woo 20 |
Template:Woo 3 |
Template:Woo 1 | |||||
Template:Woo | Template:Woo 32 |
Template:Woo 1 |
Template:Woo 5 |
Template:Woo 33 | ||||
Template:Woo | Template:Woo 27 |
Template:Woo 10 |
Template:Woo 5 |
Template:Woo 6 |
Template:Woo 15 | |||
Template:Woo | Template:Woo 10 |
Template:Woo 25 |
Template:Woo 21 |
Template:Woo 42 |
Template:Woo 12 | |||
Template:Woo | Template:Woo 39 |
Template:Woo 131 |
Template:Woo 9 |
Template:Woo 16 |
Template:Woo 15 | |||
Template:Woo | Template:Woo 43 |
Template:Woo 18 |
Template:Woo 11 |
Template:Woo 9 |
Template:Woo 23 |
--Trackwayback (talk) 22:51, 20 December 2014 (UTC)