Difference between revisions of "Prokee Module: unireader"
(Created page with "The <strong>unireader</strong> reads characters from text-files. == Supported Encodings == The following encodings are supported: * UTF-8 with and without BOM (byte order m...")
Line 14:
== Handling of Special Characters ==
− * Different variants of line breaks are converted to '\ n'.
+ * Different variants of line breaks are converted to '\n'.
* The BOM character (if present) is not ignored but handled as a valid character.
== Implementations ==
* [http://www.andreaspollhammer.com/lab/docu/html/unireader_v01.php unireader (version v01)]
Revision as of 20:56, 30 April 2019
The unireader reads characters from text files.
Supported Encodings
The following encodings are supported:
- UTF-8 with and without BOM (byte order mark)
- UTF-16 with BOM (Big or Little Endian)
- UTF-32 with BOM (Big or Little Endian)
- UTF-16BE (Big Endian)
- UTF-16LE (Little Endian)
- UTF-32BE (Big Endian)
- UTF-32LE (Little Endian)
- ASCII (as a subset of UTF-8)
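For the variants that carry a BOM, the encoding can in principle be recognised from the first bytes of the file. The following is a minimal Python sketch of that idea, not the unireader implementation: the helper name detect_encoding and the UTF-8 fallback for files without a BOM are assumptions made for illustration.

```python
import codecs

# Hypothetical lookup table (not taken from unireader). The 4-byte UTF-32
# BOMs are checked before the 2-byte UTF-16 BOMs, because the UTF-32LE BOM
# (FF FE 00 00) begins with the UTF-16LE BOM (FF FE).
_BOMS = [
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
]


def detect_encoding(data: bytes, default: str = "utf-8") -> str:
    """Return a codec name based on the BOM at the start of data.

    Assumption: input without a BOM is treated as UTF-8, which also
    covers plain ASCII.
    """
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name
    return default
```

BOM sniffing cannot tell a UTF-32LE file apart from a UTF-16LE file whose first character after the BOM is U+0000, and it cannot detect BOM-less UTF-16 or UTF-32 at all; for those variants the caller has to name the encoding explicitly.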
Handling of Special Characters
- Different variants of line breaks are converted to '\n'.
- The BOM character (if present) is not skipped; it is handled like any other valid character.
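The two rules above can be illustrated with a short Python sketch. It is an approximation under stated assumptions, not the unireader code: the page does not list which line-break variants are recognised, so the sketch assumes CR LF, lone CR and LF, and the function names normalize_newlines and read_text are hypothetical.

```python
def normalize_newlines(text: str) -> str:
    """Convert CR LF and lone CR to '\n' (assumed set of variants)."""
    # CR LF is replaced first so that it yields one '\n', not two.
    return text.replace("\r\n", "\n").replace("\r", "\n")


def read_text(data: bytes, encoding: str) -> str:
    """Decode data and normalise line breaks, keeping any BOM.

    Python's endianness-specific codecs ('utf-16-le', 'utf-32-be', ...)
    and plain 'utf-8' do not strip a leading BOM, so it is returned as
    the character U+FEFF, matching the behaviour described above.
    """
    return normalize_newlines(data.decode(encoding))
```

For example, read_text(b"\xef\xbb\xbfa\r\nb", "utf-8") yields the string "\ufeffa\nb": the BOM survives as the first character and the CR LF pair becomes a single '\n'.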