Is this implementation close to the theoretical concept of regular expressions? ...

burntsushi · on April 27, 2014

Yes. It's a reasonably faithful port of RE2, which elides features like backreferences.

pjscott · on April 28, 2014

This is usually a small price to pay for guaranteed O(n) matching speed. (Kudos to the author for doing the Right Thing here.)

burntsushi · on April 28, 2014

Thanks :-) The implementation is basically the Pike VM as described by Russ Cox. Recursion depth in particular has an upper bound corresponding to the number of instructions in the regex. In practice, this means it's safe to run a regex on untrusted data.

(Creating a regex from untrusted data still needs a bit of work, but is fixable!)