linguist/iso8859-8-i.txt at 26ab33754f95b587ee9b7d24aa0b15026c446cb2 - linguist - Schleppe Git

KevinMidboe/linguist

mirror of https://github.com/KevinMidboe/linguist.git synced 2025-10-29 17:50:22 +00:00


Andy Lindeman aa5a94cc3e Handle case where newline chars don't transcode to detected encoding

We've seen cases where binary files are detected as encodings such as
ISO-8859-8-I. This usually happens when the binary files are short. The
detector is mistaken, but with so little data available to the detection
algorithm in the first place, the mistake is understandable.

In these cases, the code to convert ASCII newline characters to
encodings such as ISO-8859-8-I fails because there is no conversion
between them.

We now simply assume that the data is all one line in those cases. In
reality the data is binary, but this is obviously difficult to detect
reliably.
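
The fallback described above can be sketched as follows. This is a
Python analogue written for illustration, not the code from this commit
(linguist itself is Ruby). The function name split_lines and the
encoding name "x-no-such-encoding" are hypothetical. The sketch assumes
that a missing conversion surfaces as LookupError or UnicodeEncodeError
when the newline characters are encoded.

```python
import re
from typing import List

# Newline sequences to look for, longest first so "\r\n" wins over "\r".
NEWLINES = ("\r\n", "\r", "\n")


def split_lines(data: bytes, detected_encoding: str) -> List[bytes]:
    """Split raw bytes on newlines as they would appear in detected_encoding.

    Hypothetical helper: if the newline characters cannot be converted to
    the detected encoding, treat the whole payload as a single line.
    """
    try:
        encoded = [nl.encode(detected_encoding) for nl in NEWLINES]
    except (LookupError, UnicodeEncodeError):
        # No conversion available: assume the data is one big line.
        return [data]
    pattern = re.compile(b"|".join(re.escape(nl) for nl in encoded))
    return pattern.split(data)


if __name__ == "__main__":
    print(split_lines(b"a\nb\r\nc", "ascii"))
    print(split_lines(b"\x00\x01\x02\x03", "x-no-such-encoding"))
```

Returning the data as a single line keeps callers working on short
binary inputs without trying to second-guess the detector.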

2014-06-03 12:26:23 -04:00

1 line

4 B

Plaintext


%<25>בי