Internationalization (i18n)


FlorèsTM,Character set conversion to automate standardization for universal access

The problem


You want to internationalize your application so you can market it around the world. Right away, you're faced with a client that has to process text (e.g. multilingual database) in different character sets, at least those that are supported by the platforms the application runs on. The traditional solution is to develop separate versions that are each linked to a specific language/platform combination. This method is no longer viable, first due to the cost of maintaining all the versions, and second because users--most likely already online--want to transmit information in more than one character set.


The solution 


Part of the solution lies in developing applications that are based on Unicode, which eliminates the problem of multiple versions and allows your applications to run in several languages. Until Unicode has been adopted worldwide, conversion will continue to be necessary to handle the different platforms, pre-existing data, etc. That's why we developed Florès--to provide a powerful character set conversion tool that can cover all the bases. Florès converts between any pair of character sets on the following list using Unicode as the pivot:

 

 

 
 
Advantages of Florès:
  • Based on Unicode, eliminates the problem of multiple versions and allows your applications to run in several languages
  • Provides a powerful character set conversion tool
  • Converts between any pair of character sets
  • Is platform-independent
  • Adapts to multilingual environments
  • Supports multiple languages at the same time
  • Has a conversion API that  handles the simplest as well as the most complex cases symmetrically
  • Accesses the filters directly
  • Selects character sets automatically

  • ASMO-708-FR
  • ASMO-708-Plus
  • Big5
  • CP437
  • CP737
  • CP775
  • CP850
  • CP852
  • CP853
  • CP855
  • CP857
  • CP860
  • CP861
  • CP863
  • CP864
  • CP865
  • CP866
  • CP869
  • EUC-JP
  • GB2312
  • HZ
  • ISCII
  • ISIRI-3342
  • ISO-2022-JP
  • ISO-2022-KR
  • ISO-8859-1
  • ISO-8859-2
  • ISO-8859-3
  • ISO-8859-4
  • ISO-8859-5
  • ISO-8859-6
  • ISO-8859-7
  • ISO-8859-8
  • ISO-8859-8-I
  • ISO-8859-9
  • ISO-8859-15
  • Jawi-Al-Arabi
  • Kamenicky
  • KOI8
  • KOI8-R
  • KSC5601
  • Macintosh Arabic
  • Macintosh Croatian
  • Macintosh Cyrillic
  • Macintosh Eastern Eur.
  • Macintosh Greek
  • Macintosh Icelandic
  • Macintosh Roman
  • Macintosh Rumanian
  • Macintosh Turkish
  • Macintosh Ukrainian
  • MS_1368
  • Murasu
  • Mylai
  • Shift-JIS
  • TamilNet
  • TIS-620
  • UCS-2
  • US-ASCII
  • UTF-7
  • UTF-8
  • VISCII
  • VIQR
  • VNCII
  • VietNet
  • Windows 1250
  • Windows 1251
  • Windows 1252
  • Windows 1253
  • Windows 1255
  • Windows 1255-V
  • Windows 1256
  • Windows 1257
  • Windows 12
The conversion API

The conversion tool is based on the concept of filters that symmetrically handle the simplest cases (8-bit character sets) as well as the most complex (control sequence sets). This means that the complexity of managing states is handled transparently, inside the filters.

The API has two ways to access the conversion tool: a simple API converts memory buffers effortlessly, while a more comprehensive API accesses the filters directly, filtering dataflow on the fly.

 

 


Other internationalization products and
services:
  • Batam TM: Internationalization library
  • ¿Qué? TM: Language and encoding identifier for character sets
Find out more about Florès™ or contact Alis for further information.