diff options
author | Carlo Marcelo Arenas Belón <carenas@gmail.com> | 2019-08-28 07:54:44 -0700 |
---|---|---|
committer | Junio C Hamano <gitster@pobox.com> | 2019-09-09 11:50:08 -0700 |
commit | ad7c543e3b0f80befd26f4115f8fec4285a018bf (patch) | |
tree | dec38a9cdfb93d1519241a6e1caddbe5b4911143 /t/t9350/broken-iso-8859-7-commit-message.txt | |
parent | Git 2.22.1 (diff) | |
download | tgif-ad7c543e3b0f80befd26f4115f8fec4285a018bf.tar.xz |
grep: skip UTF8 checks explicitly
18547aacf5 ("grep/pcre: support utf-8", 2016-06-25) that was released
with git 2.10 added the PCRE_UTF8 flag to PCRE1 matching including a
call to has_non_ascii() to try to avoid breakage if there was non-utf8
encoded content in the haystack.
Usually PCRE is compiled with JIT support (even if is not the default),
and therefore the codepath used includes calling pcre_jit_exec, which
skips UTF-8 validation by design (which might result in crashes or hangs)
but when JIT support wasn't compiled we use pcre_exec instead with the
posibility that grep might be aborted if invalid UTF-8 is found in the
haystack.
PCRE1 provides a flag since Mar 5, 2007 that could be used to skip the
checks explicitly so use that to make both codepaths equivalent (the
flag is ignored by pcre1_jit_exec)
this fix is only implemented for PCRE1 because PCRE2 is likely to have
a better solution (without the risks) instead in the future
Helped-by: Johannes Schindelin <Johannes.Schindelin@gmx.de>
Helped-by: Eric Sunshine <sunshine@sunshineco.com>
Helped-by: Ævar Arnfjörð Bjarmason <avarab@gmail.com>
Suggested-by: Junio C Hamano <gitster@pobox.com>
Signed-off-by: Carlo Marcelo Arenas Belón <carenas@gmail.com>
Signed-off-by: Junio C Hamano <gitster@pobox.com>
Diffstat (limited to 't/t9350/broken-iso-8859-7-commit-message.txt')
0 files changed, 0 insertions, 0 deletions