UTF.Unknown 1.0.0
Detects the character set of files, streams and other byte sequences.
This package is based on Ude (https://github.com/errepi/ude), which is a port of the Mozilla Universal Charset Detector (https://mxr.mozilla.org/mozilla/source/extensions/universalchardet/).
Features:
- Detects 28 charsets
- Easy to use API
- .NET Standard 1.0 + 2.0 support
- Strong-named assembly
- XML documentation included
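
As a rough illustration of the "easy to use API" point, the sketch below shows one way detection from a byte array might look. The namespace `UtfUnknown` and the names `CharsetDetector.DetectFromBytes`, `DetectionResult`, `DetectionDetail`, `Detected`, `EncodingName` and `Confidence` are taken from the project's README as best recalled and are not stated on this page, so treat them as assumptions and check them against the XML documentation of the installed version. `DetectionSketch` and `Describe` are hypothetical names used only for this example.

```csharp
using System;
using System.Text;
using UtfUnknown; // assumed namespace of the UTF.Unknown package

public static class DetectionSketch
{
    // Hypothetical helper: returns a one-line summary of the best guess,
    // or "unknown" when the detector reports no candidate.
    public static string Describe(byte[] bytes)
    {
        // Assumed API: static detection entry point returning a result object.
        DetectionResult result = CharsetDetector.DetectFromBytes(bytes);

        // Assumed API: the best candidate; may be null when nothing matched.
        DetectionDetail best = result.Detected;
        if (best == null)
        {
            return "unknown";
        }

        return best.EncodingName + " (confidence " + best.Confidence + ")";
    }

    public static void Main()
    {
        // Sample input: UTF-8 text containing non-ASCII characters.
        byte[] sample = Encoding.UTF8.GetBytes("Grüße aus Köln");
        Console.WriteLine(Describe(sample));
    }
}
```

The README also describes `DetectFromFile` and `DetectFromStream` entry points for the file and stream cases; they are assumed here to return the same result type. Checking `Detected` for null before reading it keeps the caller safe when no charset could be determined.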
Compared to Ude:
- Refactored API and namespaces, removed dead code
- Added some documentation
- Improved error handling
- Improved unit tests
Bug fixes:
- EUCTW: System.IndexOutOfRangeException
- pureascii detection issue
- Bug in the Reset function of the SBCSGroupProber class
- Detection fails on particular, simple ANSI file
See https://github.com/CharsetDetector/UTF-unknown/milestone/1?closed=1
.NET Framework 4.0
- No dependencies.
.NET Standard 1.0
- NETStandard.Library (>= 1.6.1)
.NET Standard 1.3
- NETStandard.Library (>= 1.6.1)