<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
  <meta content="text/html;charset=ISO-8859-1" http-equiv="Content-Type">
  <title></title>
</head>
<body bgcolor="#ffffff" text="#000000">
William Krick wrote:<br>
<blockquote cite="midKEEAIOEEIFICLFEIJDHMMEGMCGAA.wkrick@eio-online.com"
 type="cite">
  <pre wrap="">...

XMLOutputter fmt = new XMLOutputter("", false, "UTF-8");
  </pre>
</blockquote>
<br>
Puzzling. Has this constructor been removed recently?&nbsp; It's not in the
CVS trunk.<br>
<br>
<blockquote cite="midKEEAIOEEIFICLFEIJDHMMEGMCGAA.wkrick@eio-online.com"
 type="cite">
  <pre wrap="">...and the problem seems to be gone.


The "byte order mark"  FF FE  is still there when viewed
in a hex editor but the XML output is no longer clipped
at the beginning.
  </pre>
</blockquote>
<br>
<br>
Does this possibly suggest a bug somewhere?&nbsp; When writing UTF-8, the
BOM should be EF BB BF not FF FE&nbsp; (FF FE indicates UTF16-LE).&nbsp; A quick
look at XMLOutputter makes me think it's not the problem: it merely
calls the standard Java APIs.<br>
<br>
>From <a class="moz-txt-link-freetext" href="http://www.unicode.org/unicode/faq/utf_bom.html#BOM">http://www.unicode.org/unicode/faq/utf_bom.html#BOM</a> :<br>
<br>
<table border="1" cellpadding="2" cellspacing="0">
  <tbody>
    <tr>
      <th width="50%">Bytes</th>
      <th width="50%">Encoding Form</th>
    </tr>
    <tr>
      <td width="50%">00 00 FE FF</td>
      <td width="50%">UTF-32, big-endian</td>
    </tr>
    <tr>
      <td width="50%">FF FE 00 00</td>
      <td width="50%">UTF-32, little-endian</td>
    </tr>
    <tr>
      <td width="50%">FE FF</td>
      <td width="50%">UTF-16, big-endian</td>
    </tr>
    <tr>
      <td width="50%">FF FE</td>
      <td width="50%">UTF-16, little-endian</td>
    </tr>
    <tr>
      <td width="50%">EF BB BF</td>
      <td width="50%">UTF-8</td>
    </tr>
  </tbody>
</table>
<br>
<br>
Rick :-)<br>
<br>
</body>
</html>