[jdom-interest] JDOM Exception: invalid XML character (Unicode: 0xb) found

Les Hill leh at galaxynine.com
Tue Jul 23 19:23:42 PDT 2002


From: "Charlie Wu" <cwu at brocade.com>
> The other question then, is: can I go over my XML file as a character
> stream and evaluate them byte by byte and remove anything between 0 and
> 0x20 (except the 3 you mentioned)? Would this be a problem for UTF-8 because
> they could be multi-byted?

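No. In UTF-8, the characters U+0000 through U+007F are each encoded as a
single byte (0x00-0x7F), and bytes in that range never occur inside a
multi-byte sequence. Every byte of a multi-byte character is 0x80 or
higher, so dropping control bytes one at a time cannot damage one.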
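
As a rough illustration of the byte-by-byte filtering you describe, here is
a minimal sketch. It assumes the input is UTF-8 and that the three exceptions
are tab (0x09), line feed (0x0A) and carriage return (0x0D), the control
characters XML 1.0 permits. The class and method names are made up for the
example and are not part of JDOM.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not a JDOM class.
public class XmlControlByteFilter {

    // True for bytes below 0x20 other than tab, LF and CR.
    // Assumption: these three are the allowed exceptions (per XML 1.0).
    static boolean isIllegalControlByte(byte b) {
        int v = b & 0xFF; // treat the byte as unsigned
        return v < 0x20 && v != 0x09 && v != 0x0A && v != 0x0D;
    }

    // Copies the input, skipping illegal control bytes. Bytes of multi-byte
    // UTF-8 sequences are all >= 0x80, so they are never matched here.
    static byte[] strip(byte[] utf8) {
        ByteArrayOutputStream out = new ByteArrayOutputStream(utf8.length);
        for (byte b : utf8) {
            if (!isIllegalControlByte(b)) {
                out.write(b);
            }
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        // U+00E9 and U+20AC are multi-byte in UTF-8; 0x0B is the bad character.
        String dirty = "<a>caf\u00e9" + (char) 0x0B + " \u20ac</a>";
        byte[] cleaned = strip(dirty.getBytes(StandardCharsets.UTF_8));
        System.out.println(new String(cleaned, StandardCharsets.UTF_8));
    }
}
```

This only holds for UTF-8 and other ASCII-compatible encodings. In UTF-16,
for example, bytes below 0x20 do appear as halves of ordinary characters, so
there you would have to decode to characters first.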

For more info, here is a recycled answer:

Alex Rosen writes:
>Read more about Unicode and the various
>encodings, e.g. http://www.cl.cam.ac.uk/~mgk25/unicode.html

Les Hill
leh at galaxynine.com




