[tei-council] Internationalised domains

Kevin Hawkins kevin.s.hawkins at ultraslavonic.info
Tue Sep 20 09:26:24 EDT 2011


I'm not sure about prescribing use of RFC 3492.  This seems to me like 
prescribing use of US-ASCII with character entity references instead of 
UTF-8 within XML documents to ensure that we can use our documents with 
a full range of software toolS -- something that fewer and fewer people 
support doing.

On 9/20/2011 4:49 AM, Stuart A. Yeates wrote:
> Currently domain names in TEI can occur in typed fields (such as
> data.pointer) or in many other fields where type checking is more
> relaxed (or non-existent). I would like to propose the following note
> to appear somewhere in the standard (I'm thinking the data.pointer
> page, but I'm open to suggestions). The URL in the example is perhaps
> the best-known punycode URL (see
> http://en.wikipedia.org/wiki/Masr_%28domain_name%29 ), but if Arabic
> script causes problems in the publishing process I can probably find a
> more Latin-esque one.
>
> cheers
> stuart
>
> ----
>
> Internationalised domains containing non-ASCII characters should
> always be escaped using RFC 3492 syntax ("punycode") Thus
> http://موقع.وزارة-الاتصالات.مصر/ is written
> http://xn--4gbrim.xn----rmckbbajlc6dj7bxne2c.xn--wgbh1c/ Such escaping
> permits internationalised domains to be used with a full range of
> software tools.
>
> ----
> _______________________________________________
> tei-council mailing list
> tei-council at lists.village.Virginia.EDU
> http://lists.village.Virginia.EDU/mailman/listinfo/tei-council
>
> PLEASE NOTE: postings to this list are publicly archived


More information about the tei-council mailing list