[jdom-interest] <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML Basic 1.0//EN" "http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd"> error?

Jason Long jason at jmlie.com
Mon Nov 19 11:21:47 PST 2001


This is the only thing that is in the document.  These pages come from
another web application that my company will not allow to be modified.

<!DOCTYPE html
PUBLIC "-//W3C//DTD XHTML Basic 1.0//EN"
"http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd">

JTidy has no problem building a DOM object from these files, but JDOM
throws errors because of the DOCTYPE declaration. I need to know how to build
a JDOM document that contains this type of DOCTYPE.  I would be happy if I
could just ignore it.
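
One thing that might be worth trying, sketched here with the plain JAXP
classes from the JDK rather than JDOM itself: keep the DOCTYPE in the text
but hand the parser an empty stream whenever it asks for the external DTD.
The class name IgnoreExternalDtd and the parse helper are made up for
illustration. Whether fetching the DTD is what actually triggers the error
is an assumption, since the stack trace is not in this message. If the JDOM
build in use exposes a similar setEntityResolver method on SAXBuilder (check
its javadoc), the same resolver could be passed there.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.EntityResolver;
import org.xml.sax.InputSource;

public class IgnoreExternalDtd {

    // Hypothetical helper: parse XML text without fetching the DTD
    // named in its DOCTYPE declaration.
    public static Document parse(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setValidating(false); // no validation against the DTD
        DocumentBuilder builder = factory.newDocumentBuilder();

        // Answer every external entity request (the DTD included)
        // with an empty character stream instead of a network fetch.
        EntityResolver emptyResolver =
            (publicId, systemId) -> new InputSource(new StringReader(""));
        builder.setEntityResolver(emptyResolver);

        return builder.parse(new InputSource(new StringReader(xml)));
    }

    public static void main(String[] args) throws Exception {
        String xml =
            "<!DOCTYPE html\n"
            + "PUBLIC \"-//W3C//DTD XHTML Basic 1.0//EN\"\n"
            + "\"http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd\">\n"
            + "<html><head><title>t</title></head>"
            + "<body><p>hello</p></body></html>";
        Document doc = parse(xml);
        System.out.println(doc.getDocumentElement().getNodeName());
    }
}
```

With this approach the DOCTYPE stays in the document, so nothing has to be
edited out of pages that cannot be modified at the source.
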

Any suggestions?

Jason Long
JML Internet Enterprises - www.jmlie.com
BS Physics, MS Chemical Engineering

-----Original Message-----
From: rpcee [mailto:rpcee at operamail.com]
Sent: Monday, November 19, 2001 12:48 PM
To: Jason Long
Subject: RE: [jdom-interest] <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML
Basic 1.0//EN" "http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd">
error?


In fact, colons are disallowed because of namespaces, rather than the XML 1.0
recommendation:

http://www.w3.org/TR/REC-xml-names/#Conformance

(referenced by the javaDoc)
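
To illustrate the distinction, here is a small sketch using the JAXP parser
that ships with the JDK (not JDOM's own name checks, which may behave
differently). It tries the same text with namespace processing off and on.
The class name ColonInNames and the parses helper are invented for this
example. The idea is that a name with two colons is acceptable to plain
XML 1.0 parsing but not to a namespace-aware parser, while a colon inside
an attribute value such as a URL is not a name at all and is unaffected.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;

public class ColonInNames {

    // Hypothetical helper: true if the text parses, false if the
    // parser rejects it with a SAXException.
    public static boolean parses(String xml, boolean namespaceAware)
            throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(namespaceAware);
        try {
            factory.newDocumentBuilder()
                   .parse(new InputSource(new StringReader(xml)));
            return true;
        } catch (SAXException e) {
            return false;
        }
    }

    public static void main(String[] args) throws Exception {
        String twoColons = "<a:b:c/>";
        String colonInValue = "<a href=\"http://www.w3.org/\"/>";

        // Element name containing colons, namespaces off and on
        System.out.println(parses(twoColons, false));
        System.out.println(parses(twoColons, true));

        // Colon only inside an attribute value, namespaces on
        System.out.println(parses(colonInValue, true));
    }
}
```
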

>===== Original Message From Jason Long <jason at jmlie.com> =====
>I am considering removing anything that is between <! and > from the XML
>text using a regex, but this is just a hack and there has to be a better way.
>The stack traces of the errors I receive are attached to the email before
>this one.  I would appreciate any comments on this behavior.  Also, what
>problems might I encounter if I remove the string in this manner and rebuild
>the document?
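
On the regex idea, a minimal sketch of the difference between removing
everything between <! and > and removing only the DOCTYPE declaration. The
class DoctypeStripper and both method names are invented for illustration,
and the narrow pattern assumes a DOCTYPE without an internal subset (no
[...] part) and without a > inside its quoted strings, which holds for the
declaration in this thread.

```java
import java.util.regex.Pattern;

public class DoctypeStripper {

    // Narrow: a DOCTYPE declaration with no internal subset.
    private static final Pattern DOCTYPE =
        Pattern.compile("<!DOCTYPE[^>\\[]*>", Pattern.CASE_INSENSITIVE);

    // Broad: anything from "<!" up to the next ">". This also hits
    // comments and CDATA sections, cutting them at their first ">".
    private static final Pattern ANY_DECLARATION =
        Pattern.compile("<![^>]*>");

    public static String stripDoctype(String xml) {
        return DOCTYPE.matcher(xml).replaceFirst("");
    }

    public static String stripAnyDeclaration(String xml) {
        return ANY_DECLARATION.matcher(xml).replaceAll("");
    }

    public static void main(String[] args) {
        String xml =
            "<!DOCTYPE html\n"
            + "PUBLIC \"-//W3C//DTD XHTML Basic 1.0//EN\"\n"
            + "\"http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd\">"
            + "<html><!-- a > b --><p>x</p></html>";

        // Only the DOCTYPE is removed; the comment survives.
        System.out.println(stripDoctype(xml));

        // The comment is cut at its first ">", leaving stray text.
        System.out.println(stripAnyDeclaration(xml));
    }
}
```

Even the narrow version has a cost: once the DOCTYPE is gone, named
entities that the DTD declares (for example &nbsp;) are no longer defined,
so pages that use them would stop parsing.
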
>
>Best Regards,
>
>Jason Long
>JML Internet Enterprises - www.jmlie.com
>BS Physics, MS Chemical Engineering
>
>-----Original Message-----
>From: jdom-interest-admin at jdom.org
>[mailto:jdom-interest-admin at jdom.org]On Behalf Of Jason Long
>Sent: Saturday, November 17, 2001 10:58 AM
>To: JDOM; philip.nelson at omniresources.com
>Subject: RE: [jdom-interest] <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML
>Basic 1.0//EN" "http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd">
>error?
>
>
>I have attached the stack traces for both the SAXBuilder and DOMBuilder.
>For some reason my email software kept wrapping the text and not seeing the
>newlines.
>I really want to just ignore the DOCTYPE declaration altogether when
>building these documents.
>
>Thanks for your help,
>
>Jason Long
>JML Internet Enterprises - www.jmlie.com
>BS Physics, MS Chemical Engineering
>
>
>-----Original Message-----
>From: jdom-interest-admin at jdom.org
>[mailto:jdom-interest-admin at jdom.org]On Behalf Of
>philip.nelson at omniresources.com
>Sent: Saturday, November 17, 2001 7:59 AM
>To: jason at jmlie.com; jdom-interest at jdom.org
>Subject: RE: [jdom-interest] <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML
>Basic 1.0//EN" "http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd">
>error?
>
>
>You don't actually say what the error is?!
>
>> -----Original Message-----
>> From: Jason Long [mailto:jason at jmlie.com]
>> Sent: Saturday, November 17, 2001 3:15 AM
>> To: JDOM
>> Subject: RE: [jdom-interest] <!DOCTYPE html PUBLIC
>> "-//W3C//DTD XHTML Basic 1.0//EN"
>> "http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd"> error?
>>
>>
>> It is the colon in the URL that seems to be causing the
>> problem.  The DOCTYPE is from the original document taken from
>> the web, and it is left unchanged by JTidy.  I would
>> appreciate any assistance in resolving this matter.
>>
>> -----Original Message-----
>> From: jdom-interest-admin at jdom.org
>> [mailto:jdom-interest-admin at jdom.org]On Behalf Of Jason Long
>> Sent: Friday, November 16, 2001 4:38 PM
>> To: JDOM
>> Subject: [jdom-interest] <!DOCTYPE html PUBLIC "-//W3C//DTD
>> XHTML Basic 1.0//EN"
>> "http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd"> error?
>
>
>I am writing an application that makes use of JTidy and JDOM.  I use JTidy
>to grab a page and convert it to DOM and then convert DOM to JDOM.  There is
>no error thrown by JTidy.  I get an error with both the DOMBuilder and
>SAXBuilder due to the following DOCTYPE declaration.
>
><!DOCTYPE html
>PUBLIC "-//W3C//DTD XHTML Basic 1.0//EN"
>"http://www.w3.org/TR/xhtml-basic/xhtml-basic10.dtd">
>
>What is the best way to deal with this?
>
>Jason Long
>JML Internet Enterprises - www.jmlie.com
>BS Physics, MS Chemical Engineering
>
>_______________________________________________
>To control your jdom-interest membership:
>http://lists.denveronline.net/mailman/options/jdom-interest/youraddr@yourhost.com



