For this? No, it is not considered performant. Regular Expressions are efficient...

burntsushi · on Jan 21, 2017

This isn't true. Regular expressions can be fast even when supporting Unicode by building finite state machines that recognize UTF-8 directly. This particular benchmark explains a bit: http://blog.burntsushi.net/ripgrep/#linux-unicode-word
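
To make "recognize UTF-8 directly" concrete, here is a minimal sketch of a hand-written state machine that consumes raw bytes and accepts exactly one lowercase Greek letter (U+03B1 to U+03C9) without ever decoding to code points. The function name and the choice of character range are assumptions made for illustration; this is not the code of ripgrep or any particular regex engine, which would generate such byte-level automata automatically.

```python
def is_greek_lower_utf8(data: bytes) -> bool:
    """Accept exactly one code point in U+03B1..U+03C9, given as UTF-8 bytes.

    In UTF-8 that range is two byte sequences:
      U+03B1..U+03BF -> 0xCE followed by 0xB1..0xBF
      U+03C0..U+03C9 -> 0xCF followed by 0x80..0x89
    """
    # States: 0 = start, 1 = saw 0xCE, 2 = saw 0xCF, 3 = accept
    state = 0
    for b in data:
        if state == 0 and b == 0xCE:
            state = 1
        elif state == 0 and b == 0xCF:
            state = 2
        elif state == 1 and 0xB1 <= b <= 0xBF:
            state = 3
        elif state == 2 and 0x80 <= b <= 0x89:
            state = 3
        else:
            # No transition defined for this byte: reject.
            return False
    return state == 3


if __name__ == "__main__":
    print(is_greek_lower_utf8("α".encode("utf-8")))
    print(is_greek_lower_utf8("a".encode("utf-8")))
```

Each input byte costs one table-like transition, which is why a matcher built this way has no separate decoding step to pay for.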

UnoriginalGuy · on Jan 21, 2017

What isn't true? I never said that regular expressions cannot support Unicode quickly. I said that regular expressions are slower than hand-written code in all scenarios because of their overhead.

You're responding to a point never made.

burntsushi · on Jan 21, 2017

I am responding to your claim. I'm saying that not all regex implementations are created equal. Some can be just as fast as what you might write by hand.

MichaelGG · on Jan 21, 2017

Regular expressions can be Unicode aware, right? You should be able to use a shortcut specifier that's the equivalent of calling something like IsWhitespace.
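
As a rough illustration of that idea, the sketch below uses Python, where `str.isspace()` stands in for an IsWhitespace-style call (an assumption; the comment does not name a language). In Python's `re` module, `\s` on a `str` pattern is Unicode-aware by default, and the `re.ASCII` flag restricts it to ASCII whitespace. The helper names are hypothetical.

```python
import re

# \s on a str pattern matches Unicode whitespace by default.
UNICODE_WS = re.compile(r"\s")
# With re.ASCII, \s is limited to [ \t\n\r\f\v].
ASCII_WS = re.compile(r"\s", re.ASCII)


def is_ws_unicode(ch: str) -> bool:
    """True if the single character ch matches Unicode-aware \\s."""
    return UNICODE_WS.fullmatch(ch) is not None


def is_ws_ascii(ch: str) -> bool:
    """True if the single character ch matches ASCII-only \\s."""
    return ASCII_WS.fullmatch(ch) is not None


if __name__ == "__main__":
    # No-break space, em space, ideographic space.
    for ch in ["\u00a0", "\u2003", "\u3000"]:
        print(f"U+{ord(ch):04X}", ch.isspace(), is_ws_unicode(ch), is_ws_ascii(ch))
```

For these three characters the Unicode-aware pattern agrees with `str.isspace()`, while the ASCII-only pattern rejects them.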

UnoriginalGuy · on Jan 21, 2017

Yes. It is people's ability to write good Unicode regular expressions that is at issue.