XML Encoding
XML documents can contain non-ASCII characters, like Norwegian æ ø å, or French ê è é.
To avoid errors, specify the XML encoding, or save XML files as
Unicode.
XML Encoding Errors
If you load an XML document, you can get two different errors
indicating encoding problems:
An invalid character was found in text content.
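You get this error if your XML
contains non-ASCII characters and the file was saved in a
single-byte ANSI encoding (such as Windows-1252) with no encoding specified.
Without an encoding declaration the parser assumes UTF-8, and the ANSI bytes
for these characters are not valid UTF-8.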
Example: single-byte XML file with encoding attribute.
Example: same single-byte XML file with no encoding attribute.
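The first error can be reproduced outside a browser. The following is a minimal sketch in Python, assuming the standard library parser xml.etree.ElementTree; it reports the problem with its own message, not the wording quoted above. The helper name parses is hypothetical.

```python
import xml.etree.ElementTree as ET

def parses(data: bytes) -> bool:
    """Return True if the parser accepts the bytes, False if it rejects them."""
    try:
        ET.fromstring(data)
        return True
    except ET.ParseError:
        return False

body = "<note><message>Norwegian: æøå</message></note>"

# The same single-byte (ISO-8859-1) bytes, with and without an encoding attribute.
with_encoding = ('<?xml version="1.0" encoding="ISO-8859-1"?>' + body).encode("iso-8859-1")
without_encoding = ('<?xml version="1.0"?>' + body).encode("iso-8859-1")

print(parses(with_encoding))     # declared encoding matches the bytes
print(parses(without_encoding))  # bytes are read as UTF-8, where they are invalid
```

The only difference between the two inputs is the encoding attribute, which is what decides whether the non-ASCII bytes can be interpreted.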
Switch from current encoding to specified encoding not supported.
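You get this error if your XML file was saved as double-byte Unicode (UTF-16)
with a single-byte encoding (Windows-1252, ISO-8859-1) or UTF-8 specified.
UTF-8 is a variable-width encoding, but like the single-byte encodings it is
built on 8-bit units, so it conflicts with UTF-16 in the same way.
You also get this error if your XML file was saved as single-byte ANSI (or
ASCII) with a double-byte encoding (UTF-16) specified.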
Example: double-byte XML file without encoding.
Example: same double-byte XML file with single-byte encoding.
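The second case described above (a file stored one byte per character but declared as UTF-16) can be sketched in Python as follows. This assumes the standard library parser xml.etree.ElementTree, whose error text differs from the message quoted above; the helper name parse_error is hypothetical.

```python
import xml.etree.ElementTree as ET

def parse_error(data: bytes):
    """Return the parser's error message, or None if the bytes parse cleanly."""
    try:
        ET.fromstring(data)
        return None
    except ET.ParseError as exc:
        return str(exc)

doc = '<?xml version="1.0" encoding="UTF-16"?><note><to>Tove</to></note>'

# Declared as UTF-16 but actually stored one byte per character.
stored_single_byte = doc.encode("ascii")
# Declared as UTF-16 and actually stored as UTF-16 (with a byte order mark).
stored_utf16 = doc.encode("utf-16")

print(parse_error(stored_single_byte))  # an error message from the parser
print(parse_error(stored_utf16))        # None
```

The parser detects how the file is really stored from its first bytes, then refuses to switch when the declaration names an encoding with a different unit size.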
Windows Notepad
Older versions of Windows Notepad save files as single-byte ANSI by default; newer versions (Windows 10 version 1903 and later) default to UTF-8.
If you select "Save as...", you can specify double-byte Unicode (UTF-16).
Save the
XML file below as Unicode (note that the document does not contain any encoding
attribute):
<?xml version="1.0"?>
<note>
<from>Jani</from>
<to>Tove</to>
<message>Norwegian: æøå. French: êèé</message>
</note>
The file above, note_encode_none_u.xml, will NOT generate
an error. But if you specify a single-byte encoding or UTF-8, it will.
The following encoding declaration
will give an error message:
<?xml version="1.0" encoding="windows-1252"?>
The following encoding declaration
will give an error message:
<?xml version="1.0" encoding="ISO-8859-1"?>
The following encoding declaration
will give an error message:
<?xml version="1.0" encoding="UTF-8"?>
The following encoding declaration will NOT
give an error:
<?xml version="1.0" encoding="UTF-16"?>
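The experiment above can also be run in a script instead of saving files by hand. This is a rough sketch, assuming Python's standard xml.etree.ElementTree parser in place of the browser parser this tutorial describes; the helper name loads_as_utf16 is hypothetical. It stores the note as UTF-16 each time and changes only the declared encoding.

```python
import xml.etree.ElementTree as ET

BODY = ("<note><from>Jani</from><to>Tove</to>"
        "<message>Norwegian: æøå. French: êèé</message></note>")

def loads_as_utf16(declared):
    """Store the note as UTF-16 with the given declared encoding (None for
    no encoding attribute) and report whether the parser accepts it."""
    attr = f' encoding="{declared}"' if declared else ""
    data = (f'<?xml version="1.0"{attr}?>' + BODY).encode("utf-16")
    try:
        ET.fromstring(data)
        return True
    except ET.ParseError:
        return False

declarations = (None, "windows-1252", "ISO-8859-1", "UTF-8", "UTF-16")
results = {d: loads_as_utf16(d) for d in declarations}

for declared, ok in results.items():
    print(declared, "loads" if ok else "error")
```

Only the declared encoding varies between runs, so any difference in the results comes from the mismatch between the declaration and the stored bytes.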
Conclusion
- Always use the encoding attribute
- Use an editor that supports encoding
- Make sure you know what encoding the editor uses
- Use the same encoding in your encoding attribute
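When XML is produced by a program instead of an editor, the same advice applies: let the tool write both the bytes and the encoding attribute so they cannot disagree. A minimal sketch in Python, assuming the standard library xml.etree.ElementTree module; the helper name serialize is hypothetical.

```python
import io
import xml.etree.ElementTree as ET

def serialize(root, encoding):
    """Serialize with an XML declaration that names the encoding actually
    used for the output bytes."""
    buf = io.BytesIO()
    ET.ElementTree(root).write(buf, encoding=encoding, xml_declaration=True)
    return buf.getvalue()

note = ET.Element("note")
ET.SubElement(note, "message").text = "Norwegian: æøå. French: êèé"

for enc in ("UTF-8", "ISO-8859-1"):
    data = serialize(note, enc)
    # The declaration and the bytes agree, so the text reads back unchanged.
    assert ET.fromstring(data).findtext("message") == note.findtext("message")
```

Because the encoding name is passed once and used for both the declaration and the byte output, the two stay in sync.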