ICANN ICANN Email List Archives

[gnso-rn-wg]


<<< Chronological Index >>>    <<< Thread Index >>>

[gnso-rn-wg] RE: [gnso-idn-wg] Re: Banning CCHH anywhere in a label

  • To: "'Shahram Soboutipour'" <ceo@xxxxxxxxxxx>, <owner-gnso-idn-wg@xxxxxxxxx>
  • Subject: [gnso-rn-wg] RE: [gnso-idn-wg] Re: Banning CCHH anywhere in a label
  • From: "Edmon Chung" <edmon@xxxxxxxxxxx>
  • Date: Tue, 6 Mar 2007 17:52:43 +0800

Hi Shahram,

 

There was an extensive discussion in the original IDN protocol development 
about the use of the prefix (or suffix or other possible identifiers), and 
finally CCHH was chosen.  I highly doubt that we would be choosing a scheme 
that would split up a label (for many good reasons including bidi and single 
script considerations) into different chunks with different prefixes, but no 
one can predict the future I suppose :-)

 

Nevertheless, with regards to our discussion at hand, I am quite certain we 
have comprehensive protection with the CCHH reserved as a prefix.

 

Edmon

 

 

 

 

 

From: owner-gnso-idn-wg@xxxxxxxxx [mailto:owner-gnso-idn-wg@xxxxxxxxx] On 
Behalf Of Shahram Soboutipour
Sent: Tuesday, March 06, 2007 4:50 PM
To: owner-gnso-idn-wg@xxxxxxxxx
Cc: 'Sophia Bekele'; gnso-idn-wg@xxxxxxxxx; gnso-rn-wg@xxxxxxxxx
Subject: RE: [gnso-idn-wg] Re: Banning CCHH anywhere in a label

 

Dear Edmon

 

Regarding the sample CITIBANKchina.TLD (where china is in Chinese charset), I 
think there is a 3rd possibility which might be Sophia’s idea:

<CCHH>citibank-<CCHH><encodedCHINA>.tld

It means that every separate part of a label in non-ascii strings be translated 
with a CCHH at first. I am not sure if there is a rule for this right now or 
not, but I myself do not agree with this type. I prefer 
<CCHH>citibank-<encodedCHINA>.tld cause:

1. I think there is enough space for possible further changes and developments 
in IDNA standard in CC part of CCHH, so there must be no worries.

2. the CCHH (at first) is a good rule to define an IDN , and I think it can be 
a rule in all the levels of a url (not only 2nd and 3rd) but seems higher 
levels other than 3rd are out of scope of ICANN’s policy, BUT must be mentioned 
in their own technical decision makings.

 

 

Regards,

 

 <BLOCKED::mailto:soboutipour@xxxxxxxxxxx> Shahram Soboutipour

President and CEO

 <BLOCKED::http://www.karmania.ir/> Karmania Media

Tel: +98 341 2117844,5

Mobile: +98 913 1416626

Fax: +98 341 2117851

-----Original Message-----
From: owner-gnso-idn-wg@xxxxxxxxx [mailto:owner-gnso-idn-wg@xxxxxxxxx] On 
Behalf Of Edmon Chung
Sent: Tuesday, March 06, 2007 6:09 AM
To: 'Tan, William'; rmohan@xxxxxxxxxxxx
Cc: 'Sophia B'; gnso-idn-wg@xxxxxxxxx; gnso-rn-wg@xxxxxxxxx
Subject: RE: [gnso-idn-wg] Re: Banning CCHH anywhere in a label

 

I dont think you are missing anything William.

Was trying to speak up during the call earlier, I dont think the concern Sophia 
was articulating should be an issue.

 

If I am not mistaken, Sophia was asking whether it would be necessary to 
reserve names such as:

 

abc<CCHH>xyz.tld

 

These names would NOT be considered IDN nor parts of which IDN, but are simply 
ASCII domains.  <CCHH> can be best seen as a prefix to denote that a domain 
label (i.e. between two dots) has at least one non LDH (letter-digit-hyphen) 
character.

 

Using the example described:

 

citibank<CHINA>.tld

 

where <CHINA> is in Chinese, William's explanation is correct, it should become:

 

<CCHH>citibank-<encodedCHINA>.tld

 

And NOT

 

Citibank<CCHH><encodedCHINA>.tld

 

So, by reserving <CCHH> at the front (i.e. first 4 characters, or more 
precisely, hyphens in the third and fourth character> we cover all cases of 
intended IDN expressions.

 

Edmon

 

 

 

 

> -----Original Message-----

> From: owner-gnso-idn-wg@xxxxxxxxx [mailto:owner-gnso-idn-wg@xxxxxxxxx] On

> Behalf Of Tan, William

> Sent: Tuesday, March 06, 2007 7:47 AM

> To: rmohan@xxxxxxxxxxxx

> Cc: 'Sophia B'; gnso-idn-wg@xxxxxxxxx; gnso-rn-wg@xxxxxxxxx

> Subject: Re: [gnso-idn-wg] Re: Banning CCHH anywhere in a label

> 

> Hi all,

> 

> I believe the motivations behind banning strings with hyphens in the

> *third *and *fourth *positions are:

> 1. to protect registries who do not offer IDN registrations from

> unknowingly registering IDNs; and

> 2. to reserve future revisions to the IDNA standard where a different

> prefix might be assigned.

> 

> 

> Ram Mohan wrote:

> >

> > Are you saying – something like <*CITIBANKchina.TLD*> where “china” is

> > in local script while CITIBANK is in Latin script should be banned,

> > because its Punycode translation would result in an <xn--> midway

> > through the string?

> >

> I'm not sure I follow this. CITIBANKchina.TLD would translate to

> xn--citibank-encodedchunk.TLD, so xn-- would not occur midway in the ACE

> string.

> 

> > In general, the rationale for banning “CCHH” at a position other than

> > the beginning of a string/label is unclear.

> I have not seen any documents that suggest banning CCHH at anything but

> the beginning of a string. Am I missing something?

> 

> 

> Sophia said:

> > All registrations should

> > be in the IDN label, and that the ACE label should be internal to the

> > operations of the registration. *One should not be offering to

> > register xn--.... as a label or any ACE label since it is an internal

> > encoding, so as to prevent confusion and other malfeasance (phishing)*.

> Many registries today use the ACE string at the registration protocol

> level, so your statement would essentially be advising against that

> practice. Personally, I don't think it is a problem unless the registry

> does NOT offer IDN and is accepting xn-- labels (in which case it

> probably simply treats the registration as ASCII and does not check for

> IDNA validity.) We may be in agreement here, but I wanted to further

> qualify your statement.

> 

> 

> In table 4.4 of "Recommendation Tables for RN-WG Reports.doc":

> > For each IDN gTLD proposed, applicant must provide both the "ASCII

> > compatible (ACE) form of an IDNA valid string" (“A-label”) and in

> > local script form (Unicode) of the top level domain (“U-label”).

> I would also add that the applicant should provide additional strings

> that, after applying IDNA ToASCII operation, result in the A-label.

> 

> Additionally, there may also be complications where the U-label could be

> entered into an application using an input method editor ("keyboard")

> that may produce a sequence of Unicode characters that may not convert

> to the A-label (either becomes a different A-label or fails conversion.)

> This may be due to user perception that a character is what one thinks

> it is, but when entered using the local input software produces a

> different character due to locale differences. I will try to dig up some

> examples. This is not a technical / policy issue, but is a usability

> issue that affects the stability of IDNs.

> 

> 

> 

> Best,

> 

> =wil

 

 



<<< Chronological Index >>>    <<< Thread Index >>>

Privacy Policy | Terms of Service | Cookies Policy