Difference between revisions of "Prokee Module: unireader"

From prokee
Jump to navigation Jump to search
Line 19: Line 19:
 
== Implementations ==
 
== Implementations ==
 
* [http://www.andreaspollhammer.com/lab/docu/html/unireader_v01.php unireader (version v01)]
 
* [http://www.andreaspollhammer.com/lab/docu/html/unireader_v01.php unireader (version v01)]
 +
 +
[[Category:Prokee Modules]]

Revision as of 00:46, 7 May 2019

The unireader reads characters from text-files.

Supported Encodings

The following encodings are supported:

  • UTF-8 with and without BOM (byte order mark)
  • UTF-16 with BOM (Big or Little Endian)
  • UTF-32 with BOM (Big or Little Endian)
  • UTF-16BE (Big Endian)
  • UTF-16LE (Little Endian)
  • UTF-32BE (Big Endian)
  • UTF-32LE (Little Endian)
  • ASCII as subset of UTF-8

Handling of Special Characters

  • Different variants of line breaks are converted to '\n'.
  • The BOM character (if present) is not ignored but handled as a valid character.

Implementations