com.vgrs.xcode.common.unicodedata
Class UnicodeDataLoader

java.lang.Object
  extended by com.vgrs.xcode.common.unicodedata.UnicodeDataLoader

public final class UnicodeDataLoader
extends java.lang.Object

This class reads and parses the information in the "data/unicode/UnicodeData.txt.gz" file. This file contains all the code points assigned by Unicode. A number of fields are read and parsed along with the code point. This information is then used to implement the IDNA2008 Protocol and also by the [@link Normalize} class to perform NFKC normalization as required by the IDNA2008 Protocol. The general category and the BIDI class is also stored for each assigned code point.

Version:
1.0 Jun 14, 2010
Author:
nchigurupati

Constructor Summary
UnicodeDataLoader()
           
 
Method Summary
protected static TIntObjectMap<java.lang.String> getBidiClassTable()
          Returns the bidiClassTable
protected static TIntIntMap getCanonicalClassTable()
          Returns the canonicalClassTable
protected static TIntSet getCompatibilityTable()
          Returns the compatibilityTable
protected static TLongIntMap getComposeTable()
          Returns the composeTable
protected static TIntObjectMap<int[]> getDecomposeTable()
          Returns the decomposeTable
protected static TIntSet getExcludedTable()
          Returns the excludedTable
protected static TIntObjectMap<java.lang.String> getGeneralCategoryTable()
          Returns the generalCategoryTable
protected static TIntSet getUnicodeDataPoints()
          Returns the unicodeDataPoints
protected static void nullifyUnicodeDataPointsTable()
          Once the derived property of each Unicode code point is calculated, the UNICODE_DATA_POINTS data structure is no longer needed.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

UnicodeDataLoader

public UnicodeDataLoader()
Method Detail

getUnicodeDataPoints

protected static TIntSet getUnicodeDataPoints()
Returns the unicodeDataPoints

Returns:
the unicodeDataPoints

getExcludedTable

protected static TIntSet getExcludedTable()
Returns the excludedTable

Returns:
the excludedTable

getCompatibilityTable

protected static TIntSet getCompatibilityTable()
Returns the compatibilityTable

Returns:
the compatibilityTable

getCanonicalClassTable

protected static TIntIntMap getCanonicalClassTable()
Returns the canonicalClassTable

Returns:
the canonicalClassTable

getComposeTable

protected static TLongIntMap getComposeTable()
Returns the composeTable

Returns:
the composeTable

getDecomposeTable

protected static TIntObjectMap<int[]> getDecomposeTable()
Returns the decomposeTable

Returns:
the decomposeTable

getGeneralCategoryTable

protected static TIntObjectMap<java.lang.String> getGeneralCategoryTable()
Returns the generalCategoryTable

Returns:
the generalCategoryTable

getBidiClassTable

protected static TIntObjectMap<java.lang.String> getBidiClassTable()
Returns the bidiClassTable

Returns:
the bidiClassTable

nullifyUnicodeDataPointsTable

protected static void nullifyUnicodeDataPointsTable()
Once the derived property of each Unicode code point is calculated, the UNICODE_DATA_POINTS data structure is no longer needed. Null out this data structure to free up memory.



Copyright © 2000-2010 VeriSign Inc. All Rights Reserved