| Cirth: U+E080 - U+E0FF
Proposals 1993-04-08, 1996-05-06; revision 1997-11-03
NOTE: This is still a proposed encoding and has not been standardized. A discussion paper is available here
The Cirth script was invented by the philologist and author
J. R. R. Tolkien as part of the mythological world he created and
was widely popularized through his work, The Lord of the Rings, The Silmarillion, etc. Along with a family of artificial languages and a large corpus of
etymological data describing their relationships, the Cirth
script has attracted the attention of a large community of
linguists and other enthusiasts interested in this expression of
Tolkien's expertise in historical and comparative linguistics.
It can be categorized as a
Category D (Attested Extinct) alphabet: there is a relatively limited
corpus, and a relatively small (but existent) scholarly body studying it. In
order to set a standard Cirth character coding for such
scholars and enthusiasts, it has been suggested that this
character set be included into the Unicode standard and ISO 10646.
8 columns are reserved to encode the Cirth. The last column is currently unused, and is reserved for future discoveries in the Tolkien manuscripts. The Cirth was and is used to write the languages Sindarin, Khuzdul, and others. It has also been used to write English, as on the title page
of The Lord Of The Rings.
General Principles of the Cirth script
The Cirth are a Runic-type alphabet, although they are not connected
with Nordic runes except due to a general resemblance resulting from
the constraints of letterforms carved in wood or stone. Some of the Cirth had two
different forms, which represented glyphic variants in some languages, but could be used for different purposes in others. The Cirth
were written from left to right. No positional variants or non-spacing
marks exist.
Ordering follows the presentation of the Angerthas Daeron with its Eregion and Morian extensions (from The Return of the King, Appendix E), to which the earliest Beleriand runes, additional early Doriathrin and Noldorin Cirth, and other Cirth used for English, etc. have been added following the same structural order (from The Treason of Isengard, Appendix on Runes, and other sources). Where duplication in letter names occurs, a modifier has been added to the name to differentiate it from the primary form. Pronouncible or meaningful names are not known for the Cirth, so their phonetic values are given in the names. Long vowels are written doubled.
Punctuation
Little is known about punctuation marks, though four have been identified: a single dot serves sometimes to separate letters or words; two vertical
dots is used to break up groups longer than a word; three or four vertical dots are used at the beginning and ending of texts. Only three Cirth
digits are extant; each is formed by placing a dot beneath an existing Certh, so that non-spacing dot has been encoded here.
Sometimes word space is not used; word separation may be achieved in that case with U+200B,
ZERO WIDTH SPACE. Hyphenation is not used; words may be broken after any LETTER.
U+E080 CIRTH LETTER P
U+E081 CIRTH LETTER B
U+E082 CIRTH LETTER F
U+E083 CIRTH LETTER V
U+E084 CIRTH LETTER HW
U+E085 CIRTH LETTER M
U+E086 CIRTH LETTER MB
U+E087 CIRTH LETTER T
U+E088 CIRTH LETTER D
U+E089 CIRTH LETTER TH
U+E08A CIRTH LETTER DH
U+E08B CIRTH LETTER N
U+E08C CIRTH LETTER CH
U+E08D CIRTH LETTER J
U+E08E CIRTH LETTER SH
U+E08F CIRTH LETTER ZH
U+E090 CIRTH LETTER NJ
U+E091 CIRTH LETTER K
U+E092 CIRTH LETTER G
U+E093 CIRTH LETTER KH
U+E094 CIRTH LETTER GH
U+E095 CIRTH LETTER ENG
U+E096 CIRTH LETTER KW
U+E097 CIRTH LETTER GW
U+E098 CIRTH LETTER KHW
U+E099 CIRTH LETTER GHW
U+E09A CIRTH LETTER NGW
U+E09B CIRTH LETTER NW
U+E09C CIRTH LETTER R
U+E09D CIRTH LETTER RH
U+E09E CIRTH LETTER L
U+E09F CIRTH LETTER LH
U+E0A0 CIRTH LETTER NG
U+E0A1 CIRTH LETTER S
U+E0A2 CIRTH LETTER KHUZDUL GLOTTAL STOP
U+E0A3 CIRTH LETTER Z
U+E0A4 CIRTH LETTER KHUZDUL NG
U+E0A5 CIRTH LETTER ND
U+E0A6 CIRTH LETTER EI
U+E0A7 CIRTH LETTER I
U+E0A8 CIRTH LETTER KHUZDUL Y
U+E0A9 CIRTH LETTER KHUZDUL HY
U+E0AA CIRTH LETTER U
U+E0AB CIRTH LETTER UU
U+E0AC CIRTH LETTER W
U+E0AD CIRTH LETTER UE
U+E0AE CIRTH LETTER UI
U+E0AF CIRTH LETTER E
U+E0B0 CIRTH LETTER EE
U+E0B1 CIRTH LETTER A
U+E0B2 CIRTH LETTER AA
U+E0B3 CIRTH LETTER O
U+E0B4 CIRTH LETTER OO
U+E0B5 CIRTH LETTER VARIANT OO
U+E0B6 CIRTH LETTER OE
U+E0B7 CIRTH LETTER VARIANT OE
U+E0B8 CIRTH LETTER KHUZDUL N
U+E0B9 CIRTH LETTER H
U+E0BA CIRTH LETTER KHUZDUL LEFT-POINTING SCHWA
U+E0BB CIRTH LETTER SHORT LEFT-POINTING SCHWA
U+E0BC CIRTH LETTER KHUZDUL RIGHT-POINTING SCHWA
U+E0BD CIRTH LETTER SHORT RIGHT-POINTING SCHWA
U+E0BE CIRTH LETTER KHUZDUL PS
U+E0BF CIRTH LETTER KHUZDUL TS
U+E0C0 CIRTH MODIFIER LETTER H
U+E0C1 CIRTH AMPERSAND
U+E0C2 CIRTH LETTER SP
U+E0C3 CIRTH LETTER SB
U+E0C4 CIRTH LETTER SC
U+E0C5 CIRTH LETTER SG
U+E0C6 CIRTH LETTER NDZH
U+E0C7 CIRTH LETTER DORIATHRIN KW
U+E0C8 CIRTH LETTER DORIATHRIN GW
U+E0C9 CIRTH LETTER DORIATHRIN KHW
U+E0CA CIRTH LETTER DORIATHRIN GHW
U+E0CB CIRTH LETTER DORIATHRIN L
U+E0CC CIRTH LETTER ENGLISH ND
U+E0CD CIRTH LETTER DORIATHRIN Z
U+E0CE CIRTH LETTER IU
U+E0CF CIRTH LETTER AI
U+E0D0 CIRTH LETTER AU
U+E0D1 CIRTH LETTER AY
U+E0D2 CIRTH LETTER AE
U+E0D3 CIRTH LETTER EA
U+E0D4 CIRTH LETTER EW
U+E0D5 CIRTH LETTER NOLDORIN O
U+E0D6 CIRTH LETTER NOLDORIN OO
U+E0D7 CIRTH LETTER IO
U+E0D8 CIRTH LETTER EU
U+E0D9 CIRTH LETTER OU
U+E0DA CIRTH LETTER NOLDORIN OE
U+E0DB CIRTH LETTER DORIATHRIN O
U+E0DC CIRTH LETTER ENGLISH THE
U+E0DD CIRTH LETTER NOLDORIN L
U+E0DE CIRTH LETTER ENGLISH OF
U+E0DF CIRTH LETTER Y
U+E0E0 CIRTH LETTER ENGLISH IS
U+E0E1 CIRTH LETTER VARIANT Y
U+E0E2 CIRTH LETTER YY
U+E0E3 CIRTH LETTER NOLDORIN OE
U+E0E4 CIRTH LETTER NOLDORIN OOE
U+E0E5 CIRTH SEPARATOR SINGLE DOT
U+E0E6 CIRTH SEPARATOR DOUBLE DOT
U+E0E7 CIRTH SEPARATOR TRIPLE DOT
U+E0E8 CIRTH START OR END OF TEXT
U+E0E9 CIRTH SEPARATOR DOUBLE PIPE
U+E0EA CIRTH COMBINING NASAL MARK
U+E0EB CIRTH COMBINING LENGTH MARK
U+E0EC CIRTH COMBINING NUMERIC DOT
U+E0ED (This position shall not be used)
U+E0EE (This position shall not be used)
U+E0EF (This position shall not be used)
U+E0F0 (This position shall not be used)
U+E0F1 (This position shall not be used)
U+E0F2 (This position shall not be used)
U+E0F3 (This position shall not be used)
U+E0F4 (This position shall not be used)
U+E0F5 (This position shall not be used)
U+E0F6 (This position shall not be used)
U+E0F7 (This position shall not be used)
U+E0F8 (This position shall not be used)
U+E0F9 (This position shall not be used)
U+E0FA (This position shall not be used)
U+E0FB (This position shall not be used)
U+E0FC (This position shall not be used)
U+E0FD (This position shall not be used)
U+E0FE (This position shall not be used)
U+E0FF (This position shall not be used)
| |
|
HTML Michael Everson, Evertype, Cnoc na Sceiche, Leac an Anfa, Cathair na Mart, Co. Mhaigh Eo, Éire, 2006-05-28
Copyright © 1993-2006 Evertype. All Rights Reserved
|
|