[tei-council] More on TEI Lite: work in progress

Sebastian Rahtz Sebastian.Rahtz at oucs.ox.ac.uk
Tue Feb 21 05:06:37 EST 2006


>> I don't agree. you specify the encoding that your document uses.
> 
> But you are always free to use any codepoint with numeric references.
> These NCRs are to *Unicode* not to the declared encoding. 

agreed. but the non-entity characters are in the declared encoding
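
a small illustration of that distinction, as a sketch only: it assumes Python's standard xml.etree.ElementTree parser, and the sample document is made up. the raw byte 0xE9 is read through the declared ISO-8859-1 encoding, while the reference &#x4E2D; names a Unicode code point directly, whatever the declaration says.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, declared as ISO-8859-1.
# The raw byte 0xE9 is "e with acute accent" in that encoding.
# &#x4E2D; is a numeric character reference to U+4E2D, a character
# that ISO-8859-1 cannot represent as a raw byte.
doc = b'<?xml version="1.0" encoding="ISO-8859-1"?><p>caf\xe9 &#x4E2D;</p>'

# The parser transcodes the raw bytes and resolves the reference;
# both end up as ordinary Unicode code points in the result.
text = ET.fromstring(doc).text
print([hex(ord(c)) for c in text])
```

after parsing, both characters are plain Unicode code points (U+00E9 and U+4E2D); nothing in the result records which one arrived as a raw byte and which as a reference.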

> So what you
> declare is a subset.

I guess everything is a subset of Unicode, yes

>> Most parsers will transcode it, but do not have to.
>>
> Show me one that does not transcode.  Life would be pretty hard then,
> I assume.

they all transcode; but they don't all support all encodings

> My friend uses CCCII, the Chinese Character Coding for Information
> Interchange defined in Taiwan 1985.  There is also more recently the
> TRON character set in Japan, which is used in PC-like operating
> systems and cell phones.


and these are not subsets of Unicode?
-- 
Sebastian Rahtz
Information Manager, Oxford University Computing Services
13 Banbury Road, Oxford OX2 6NN. Phone +44 1865 283431

OSS Watch: JISC Open Source Advisory Service
http://www.oss-watch.ac.uk
