SV: [jdom-interest] encoding problem linux vs windows
pernorrman at telia.com
Tue Oct 28 16:13:44 PST 2003
Ok, this sounds like a linux jvm problem. Which character encoding are you
Which XMLOutputter method are you using?
As for the é stuff, JDOM doesn't do entity "roundtripping". The
is defined in the XHTML dtd but is converted by the SAX parser before it
SAXBuilder/SAXHandler. If the encoding in effect does not "support" a
directly, it is written as a character reference, &#xNNNN;. The only
by JDOM is <, > and &, which are predefined in the XML
Från: manish sharan [mailto:manish.sharan at divlogic.com]
Skickat: den 29 oktober 2003 00:14
Till: Per Norrman; jdom-interest at jdom.org
Ämne: Re: [jdom-interest] encoding problem linux vs windows
I am running my program on Linux and Windows and then bringing over the
result to my winodws folder where I open it with Notepad.
The output from the Windows is ok while the output from Linux has the ? in
place of é
Obviously , this has nothing to do with JDOM per se but more to do with
differences in character encoding schemes etc. between Linux and Windows.
Can someone please point me to a knowledge resource that can help me figure
it out ?
ps: on another note , JDOM Ouputter converts "é" in my xhtml to
"é" . I havent tested it with other HTML entirties but Is this a known bug
----- Original Message -----
From: Per <mailto:pernorrman at telia.com> Norrman
To: 'manish sharan' <mailto:manish.sharan at divlogic.com> ;
jdom-interest at jdom.org
Sent: Tuesday, October 28, 2003 4:54 PM
Subject: SV: [jdom-interest] encoding problem linux vs windows
My guess is that the problem is not with JDOM or Java or the platform,
but with the editor/viewer/console (whatever), i.e. the application that
to look at the result. How do you determine the problem? If you transfer the
result to windows, is the problem still there?
Från: jdom-interest-admin at jdom.org [mailto:jdom-interest-admin at jdom.org] För
Skickat: den 28 oktober 2003 21:49
Till: jdom-interest at jdom.org
Ämne: [jdom-interest] encoding problem linux vs windows
I am using JDOM to process an XHTML page. The problem is with html entities
such as and é
On Windows , it handles them without problem. On Linux RH AS 2 , it turns
them into '?' . I am using sun jdk 1.4.2 on both.
Can anyone please tell me what could be the problem ? Why is it behaving
differently on Linux ?
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the jdom-interest