International Components for Unicode


Normalization Browser




Decomposition exclusions:
Unicode version:

Normalization Results
Mode   Quick Check  Normalized Text
Input               (empty)
NFD    YES          (empty)
NFC    YES          (empty)
NFKD   YES          (empty)
NFKC   YES          (empty)
FCD    YES          (empty)
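
A table of this kind can be approximated outside the demo. The following is a minimal sketch that uses Python's standard `unicodedata` module rather than ICU; the helper name `normalization_results` is hypothetical. Note that `unicodedata.is_normalized` (Python 3.8+) answers only True or False, whereas a quick check in the sense of UAX #15 may also answer MAYBE, and that the FCD row has no counterpart in this module.

```python
import unicodedata

# The four Unicode normalization forms listed in the table (FCD is not one of them).
FORMS = ("NFD", "NFC", "NFKD", "NFKC")


def normalization_results(text):
    """Return one (form, already_normalized, normalized_text) row per form.

    Hypothetical helper for illustration; not part of ICU or of the demo.
    """
    rows = []
    for form in FORMS:
        # True/False only: this is a full check, not a YES/NO/MAYBE quick check.
        already = unicodedata.is_normalized(form, text)
        rows.append((form, already, unicodedata.normalize(form, text)))
    return rows


if __name__ == "__main__":
    # Empty input, a precomposed letter (U+00E9), and a compatibility ligature (U+FB01).
    for sample in ("", "\u00e9", "\ufb01"):
        print(repr(sample))
        for form, already, result in normalization_results(sample):
            shown = result.encode("unicode_escape").decode("ascii") or "(empty)"
            print(f"  {form:<5}{'YES' if already else 'NO':<5}{shown}")
```

For empty input every form reports that the text is already normalized, which matches the table above.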

About this demo

The option flags belong to a prototype that demonstrates tailored normalization, as described in Unicode Public Review Issue 7. Uncheck all of these options for regular Unicode normalization.

The Hangul option excludes the precomposed Hangul syllables U+AC00..U+D7A3 from decomposition. The CJK Compat. option excludes the CJK Compatibility Ideographs (those with a canonical decomposition).
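
The effect of the Hangul exclusion on NFD can be sketched as follows. This is an illustration under stated assumptions, not ICU's implementation: it uses Python's `unicodedata` module, the function name `nfd_excluding_hangul` is hypothetical, and it covers decomposition only. It relies on the fact that Hangul syllables are starters (combining class 0), so canonical reordering never moves a mark across them and the runs between them can be normalized independently.

```python
import re
import unicodedata

# Runs of precomposed Hangul syllables, U+AC00..U+D7A3. The capture group
# makes re.split() keep the matched runs in its result.
HANGUL_RUNS = re.compile("([\uac00-\ud7a3]+)")


def nfd_excluding_hangul(text):
    """NFD-normalize text but copy Hangul syllables through unchanged.

    Hypothetical helper for illustration; not an ICU API.
    """
    parts = HANGUL_RUNS.split(text)
    out = []
    for index, part in enumerate(parts):
        if index % 2:
            # Odd positions hold the captured Hangul runs: leave them as they are.
            out.append(part)
        else:
            # Even positions hold everything else: decompose normally.
            out.append(unicodedata.normalize("NFD", part))
    return "".join(out)


if __name__ == "__main__":
    sample = "\ud55c\u00e9"  # HANGUL SYLLABLE HAN followed by e-acute
    for label, value in (("regular NFD ", unicodedata.normalize("NFD", sample)),
                         ("Hangul kept ", nfd_excluding_hangul(sample))):
        print(label, value.encode("unicode_escape").decode("ascii"))
```

A CJK Compat. exclusion could be sketched the same way by widening the pattern to the relevant ideographs; that variant is not shown here.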

The Unicode 3.2 option performs normalization according to Unicode 3.2 (except for NormalizationCorrections) even if ICU otherwise supports a higher version.
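
Python happens to offer a loosely comparable facility: its standard `unicodedata` module exposes a second database object, `unicodedata.ucd_3_2_0`, pinned to Unicode 3.2.0. The sketch below is an analogy only and says nothing about how ICU implements its option or how either library treats NormalizationCorrections; the helper name `compare_nfd` is hypothetical.

```python
import unicodedata

# Database object pinned to Unicode 3.2.0; it offers the same methods
# as the module itself, which follows the interpreter's Unicode version.
OLD = unicodedata.ucd_3_2_0


def compare_nfd(text):
    """Return (NFD per Unicode 3.2.0 data, NFD per current data).

    Hypothetical helper for illustration.
    """
    return OLD.normalize("NFD", text), unicodedata.normalize("NFD", text)


if __name__ == "__main__":
    print("pinned version: ", OLD.unidata_version)
    print("current version:", unicodedata.unidata_version)
    # U+00E9 predates Unicode 3.2; U+1B06 (Balinese) was encoded later,
    # so the two databases may disagree on it.
    for sample in ("\u00e9", "\u1b06"):
        old, new = compare_nfd(sample)
        print(old.encode("unicode_escape").decode("ascii"),
              new.encode("unicode_escape").decode("ascii"))
```

Characters that were unassigned in Unicode 3.2 would be expected to pass through the pinned database unchanged, which is the kind of difference such a version option makes visible.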

FCD is not a normalization form but a test for whether text is canonically ordered. "Normalizing to FCD" does not produce a unique form, only one of potentially many forms that are canonically ordered. See UTN #5, "Canonical Equivalence in Applications".
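
The FCD test can be sketched with combining classes alone. The following is a minimal illustration in Python's `unicodedata` module, not ICU's code, and the function name `is_fcd` is hypothetical. For each character it takes the combining class of the first and last code point of that character's canonical decomposition, and rejects the text when a non-zero leading class is lower than the trailing class of the preceding character.

```python
import unicodedata


def is_fcd(text):
    """Return True if text passes the FCD test sketched above.

    Hypothetical helper for illustration; not an ICU API.
    """
    prev_trail = 0
    for ch in text:
        # Full canonical decomposition of this one character.
        decomposed = unicodedata.normalize("NFD", ch)
        lead = unicodedata.combining(decomposed[0])
        trail = unicodedata.combining(decomposed[-1])
        # A non-starter with a lower class than the preceding trailing
        # class means decomposition would leave marks out of order.
        if lead != 0 and prev_trail > lead:
            return False
        prev_trail = trail
    return True


if __name__ == "__main__":
    samples = {
        "e-acute, precomposed": "\u00e9",
        "e + acute": "e\u0301",
        "a + dot below + acute": "a\u0323\u0301",
        "a + acute + dot below": "a\u0301\u0323",
        "e-acute + dot below": "\u00e9\u0323",
    }
    for label, text in samples.items():
        print(f"{label:<24}{'YES' if is_fcd(text) else 'NO'}")
```

The first two samples are canonically equivalent and both pass, which illustrates why FCD does not define a unique form. The last two fail because the dot below (combining class 220) follows an acute accent (class 230).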


Unicode version 12.1 — ICU 65.1