Charset/CCDD (was: Let's develop an open-source media archive

ben franchuk bfranchuk at jetnet.ab.ca
Thu Aug 12 15:14:12 CDT 2004


Sean 'Captain Napalm' Conner wrote:
> It was thus said that the Great Hans Franke once stated:
> 
>>Rather than restricting the encoding of the XML file to a
>>specific charset, we need to restrict the USAGE within the
>>standard to certain characters, regardless of the encoding.
> 
> 
>   Unless otherwise noted, XML files are assumed to be encoded in UTF-8,
> *but* an XML parser is required to abort at the first error in the XML file.
> If a parser is reading an XML file without an explicit character set
> encoding scheme (which means it's assuming UTF-8) and it reads a character
> that is illegal (say the file was encoded in ISO-8859-3) it gives up
> (usually with an "illegal character at such-n-such position" error).
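
For illustration, the behaviour described in the quoted text can be
reproduced with a small sketch. It assumes Python's standard
xml.etree.ElementTree module (an expat-based parser) as a stand-in for
"an XML parser"; the helper name try_parse and the sample document are
made up for this example and are not part of any proposed standard.

```python
import xml.etree.ElementTree as ET

def try_parse(data: bytes) -> str:
    """Parse raw XML bytes; report either the root text or the parser error.

    try_parse is a hypothetical helper used only for this illustration.
    """
    try:
        return "ok: " + (ET.fromstring(data).text or "")
    except ET.ParseError as err:
        # The parser gives up at the first byte sequence it cannot decode.
        return "error: " + str(err)

# U+0127 (h with stroke) exists in ISO-8859-3, where it is the single
# byte 0xB1. That byte on its own is not valid UTF-8.
body = "<note>\u0127</note>"

# No encoding declaration: the parser assumes UTF-8 and rejects the file.
undeclared = body.encode("iso8859_3")

# Same bytes, but with the encoding stated explicitly in the XML declaration.
declared = b'<?xml version="1.0" encoding="ISO-8859-3"?>' + undeclared

# The same document actually encoded as UTF-8, again without a declaration.
utf8 = body.encode("utf-8")

for label, data in (("undeclared", undeclared),
                    ("declared", declared),
                    ("utf-8", utf8)):
    print(label, "->", try_parse(data))
```

Only the undeclared ISO-8859-3 input should be rejected; the exact
wording of the error message depends on the parser in use.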

What about a bootstrappable subset and revised error handling?
Ben.




