[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Missing Character Set Encodings?



Dear IANA:
 
I have been examining the Character Set name specification you have list at the following URL on your Web site
 
http://www.isi.edu/in-notes/iana/assignments/character-sets
 
It seems that there are a number of character sets that are not listed in the above online specification.  I was wondering if you could provide some insight to why these are missing as well as comment on the possibility of their addition.
 
The character sets I found missing are based on an examination of current Java supported encodings.  My company is currently trying to link the Java-based encodings to the IANA-based character set encodings used in XML, HMTL, etc.  This link will help us provide more coherent, international support for our internet products.
 
The missing character sets for Microsoft's GBK, Thai (874), and multibyte code pages (932, 936, 949 and 950) are of the greatest concern.  The list of character sets I found missing are as follows (I don't claim this to be comprehensive):
 
  No IANA definition for       "Cp737"
  No IANA definition for       "Cp838"
  No IANA definition for       "Cp874"    
  No IANA definition for       "Cp875"    
  No IANA definition for       "Cp921"    
  No IANA definition for       "Cp922"    
  No IANA definition for       "Cp930"    
  No IANA definition for       "Cp933"    
  No IANA definition for       "Cp935"    
  No IANA definition for       "Cp937"    
  No IANA definition for       "Cp939"    
  No IANA definition for       "Cp942"    
  No IANA definition for       "Cp948"    
  No IANA definition for       "Cp949"    
  No IANA definition for       "Cp950"    
  No IANA definition for       "Cp964"    
  No IANA definition for       "Cp970"    
  No IANA definition for       "Cp1006"  
  No IANA definition for       "Cp1025"   
  No IANA definition for       "Cp1046"   
  No IANA definition for       "Cp1097"   
  No IANA definition for       "Cp1098"   
  No IANA definition for       "Cp1112"   
  No IANA definition for       "Cp1122"   
  No IANA definition for       "Cp1123"   
  No IANA definition for       "Cp1124"   
  No IANA definition for       "Cp1252"   
  No IANA definition for       "Cp1381"   
  No IANA definition for       "Cp1383"   
  No IANA definition for       "Cp33722" 
  No IANA definition for       "EUC-TW" 
  No IANA definition for       "GBK"      
  No IANA definition for       "ISO2022CN-CNS"     
  No IANA definition for       "ISO2022CN-GB"      
  No IANA definition for       "MS874"       //Though IBM-THAI might be the alias 
  No IANA definition for       "MS932"       //Though SHIFT-JIS might be the alias
  No IANA definition for       "MS936"       //Though BIG5 might be the alias
  No IANA definition for       "MS949"       //Though EUC-KR might be the alias
  No IANA definition for       "MS950"      
 
 
You comments on how we can obtain a more complete IANA character set specification would be appreciated.  If a later specification exists that does include some (or all) of the missing character sets I listed above, please let me know how I might access this updated specification. 
 
Thank you,
 
 
Craig R. Cummings
Product Internationalization Manager
Tools Division
NLS Group
Oracle Corporation
500 Oracle Parkway
M/S 2op11
Redwood Shores, CA  94065  USA
(email) crcummin@us.oracle.com
(tel) +1-650-506-4273
(fax) +1-650-506-7432
(intranet) http://toolsnls.us.oracle.com

Privacy Policy | Terms of Service | Cookies Policy