Language Encoding System Convertor
Latest version 0.91.3 beta
What is it?
LESC stands for Language Encoding System Convertor. It is a Java application that allows users to convert among the various encoding schemes that may exist for a particular language. It will hopefully help some people with the transition to Unicode, or with the adaptation of high-quality encoding shemes.
LESC works by mapping one character encoding to another, both of which are defined on a particular language. A "language" consists of a DTD file that lists all of the characters related to a human language that may need to be converted among encoding schemes. You can use LESC, for example, to convert regular Latin-character text ("hello") into an expanded "radio" letters-format ("hotel echo lima lima oscar "). The DTD for English defines the abstract letters a through z. Character encoding files (XML) define implementations of those characters. In ASCII, the abstract English character "A" is represented by character number 65. In radio letters, the abstract English character "A" is represented by the string "ALPHA".
News
04 Apr 2004: Javadocs are now available online.
17 Mar 2004: Screenshot posted--a primitive example of LESC in operation.
Downloads
Name | Type | Description | Language | Date Updated |
---|---|---|---|---|
lesc-0.91.3.bin.zip | binary (JAR) | most recent stable build | ||
lesc-0.91.3.src.zip | Source (Java) | most recent source distribution | ||
English.dtd | Language definition (DTD) | Defines basic English alphabet | Modern English | 27 Mar 2004 |
English.ASCII.xml | Encoding definition (XML) | Implements the English Latin alphabet in ASCII/Unicode | Modern English | 27 Mar 2004 |
English.RadioLetters.xml | Encoding definition (XML) | Example implementation of radio letters on English (alpha, bravo, charlie, etc.). | Modern English | 27 Mar 2004 |
German.dtd | Language definition (DTD) | Defines German alphabet (basic English plus umlaut characters and ess) | Modern German | 08 May 2004 |
German.Unicode.xml | Encoding definition (XML) | Implements Unicode on German.dtd | Modern German | 08 May 2004 |
German.ASCII.xml | Encoding definition (XML) | Allows typing of German with English keyboard using backslashes to represent "special" chars | Modern German | 08 May 2004 |
Russian.dtd | Language definition (DTD) | Defines Russian Cyrillic alphabet and some punctuation | Modern Russian | 17 Mar 2004 |
Russian.Unicode.xml | Encoding definition (XML) | Implements Unicode on Russian.dtd | Modern Russian | 17 Mar 2004 |
Russian.KOI8.xml | Encoding definition (XML) | Implements the KOI8R codepage on Russian | Modern Russian | 17 Mar 2004 |
Russian.CP1251.xml | Encoding definition (XML) | Implements codepage 1251 (Windows Cyrillic) on Russian | Modern Russian | 17 Mar 2004 |
Russian.LOCTranslit.FROM.xml | Encoding definition (XML) | Implements a variation on Library of Congress Cyrillic transliteration on Russian for converting FROM transliteration | Modern Russian | 17 Mar 2004 |
Contribute new languages or encodings |
© 2004 Jim Tonn <jmtonn at users * sourceforge * net>