← Back to team overview

maria-developers team mailing list archive

Re: Why local modifications to PCRE library?

 

Hello Pavel,

On 11/16/2013 11:11 AM, Pavel Ivanov wrote:
Alexander,

I see you've included the full source of PCRE library into MariaDB
tree and even made some local modifications which are not included
into PCRE upstream. Could you please explain why did you do that? Why
not allow to build with system standard PCRE library?

I understand that you could have needed some modifications and thus
could want to include sources from PCRE trunk to allow users to build
with modifications when their system-wide library is too old. But as I
don't see your modifications on PCRE trunk that means you probably had
something different in mind, and decided to incur the pain of
constantly updating the included sources and merging security fixes.
So why did you decide to do that?

There is a bug in the PCRE library. It crashes in the function compile_regex() because of the stack overrun in case when the pattern
consists of a deep enough parenthesizes level like this:

SELECT 'x' RLIKE CONCAT(REPEAT('(',300), 'x', REPEAT(')',300));

I reported the problem to Philip Hazel (the author and the main maintainer of PCRE). Philip already made a fix in the PCRE sources
to address this issue.

The Philip's fix is different comparing to our version, this is
probably why you did not recognize it in the PCRE's trunk.

Our fix makes compile_regex() watch the available stack size through
a callback function and stop the recursion when the execution is near
to run out of stack. It works very well with all possible values of
the thread_stack MariaDB system variable, which is quite small by
default (256 Kb).

The Philip's version of the fix limits recursion depth to 250,
which should be enough for any reasonable regular expression,
and should *hopefully* not run out of stack.
This fix will be included into the next release PCRE-8.34,
which is planned around Christmas time.

After the fixed version of PCRE is released we'll check if
it solves the problem on all supported platforms with the
minimum possible value of the MariaDB system variable "thread_stack".
In case it does, we'll add an option to compile MariaDB against
the system installed PCRE library instead of the bundled version.
Otherwise, we'll report to Philip again asking again to consider
including our fix.




Thank you,
Pavel



Follow ups

References