Transforms

Contents

  1. Overview
    1. Case Mappings
    2. Normalization
    3. Transforms
    4. Bidirectional Algorithm

Overview

Transforms are used to process Unicode text in many different ways. Some include case mapping, normalization, transliteration and bidirectional text handling.

Case Mappings

Case mapping is used to handle mappings of upper- and lower-case characters from one language to another language, and writing systems that use letters of the same alphabet to handle titlecase mappings that are particular to some class. They provide for certain language-specific mappings as well.

Normalization

Normalization is used to convert text to a unique, equivalent form. Systems can normalize Unicode-encoded text to one particular sequence, such as a normalizing composite character sequences into precomposed characters. While Normalization Forms are specified for Unicode text, they can also be extended to non-Unicode (legacy) character encodings. This is based on mapping the legacy character set strings to and from Unicode.

Transforms

Transforms provide a general-purpose package for processing Unicode text. They are a powerful and flexible mechanism for handling a variety of different tasks, including:

  • Uppercase, Lowercase, Titlecase, Full/Halfwidth conversions

  • Normalization

  • Hex and Character Name conversions

  • Script to Script conversion

Bidirectional Algorithm

The Bidirectional Algorithm was developed to specify the direction of text in a text flow.


Table of contents