zeripath bc4764ffc6 Detect truncated utf-8 characters at the end of content as still representing utf-8 (#19773)
Our character detection algorithm can potentially incorrectly detect utf-8 as iso-8859-x
if there is a truncated character at the end of the partially read file.

This PR changes the detection algorithm to truncated utf8 characters at the end of the
buffer.

Fix #19743

Signed-off-by: Andrew Thornton <art27@cantab.net>
2022-05-21 14:06:24 +01:00
..
2022-01-20 18:46:10 +01:00
2022-05-06 10:12:30 +01:00
2022-04-01 02:15:46 +02:00
2022-01-20 18:46:10 +01:00
2022-01-20 18:46:10 +01:00
2022-01-20 18:46:10 +01:00
2022-01-19 19:31:39 -05:00
2022-01-20 18:46:10 +01:00
2022-04-19 12:55:35 -04:00
2022-01-20 18:46:10 +01:00
2022-01-20 18:46:10 +01:00
2022-03-30 16:42:47 +08:00
2022-01-20 18:46:10 +01:00
2021-12-20 04:41:31 +00:00
2022-05-10 23:55:54 +02:00
2022-04-25 13:55:12 +08:00