Regex escape with \ or \\?

C# does not have a special syntax for construction of regular expressions, like Perl, Ruby or JavaScript do. It instead uses a constructor that takes a string. However, strings have their own escaping mechanism, because you want to be able to put quotes inside the string. Thus, there are two levels of escaping.

So, in a regular expression, w means the letter “w”, while \w means a word character. However, if you write the string literal "\w", you are asking the string parser to escape the character “w”, which makes no sense, since “w” is not a quote, a backslash, or any other character with a string escape. In JavaScript the backslash is silently dropped, so "w" == "\w", and the regexp constructor ends up matching the letter “w” instead of any word character; the C# compiler instead rejects "\w" with an unrecognized escape sequence error (CS1009). Either way, to pass the backslash to the regexp, you need to put two backslashes in the string literal ("\\w"): one is removed when the string literal is interpreted, and the other is used by the regular expression.
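A minimal sketch of the two levels, assuming C# 9 or later so that top-level statements compile; the variable names and the sample input "hello world" are made up for illustration:

```csharp
using System;
using System.Text.RegularExpressions;

// Level 1: the C# string parser turns the two-character escape \\ into one backslash,
// so this literal holds three characters: \ w +
string pattern = "\\w+";
Console.WriteLine(pattern.Length);   // 3
Console.WriteLine(pattern);          // \w+

// Level 2: the regex engine receives \w+ and reads it as "one or more word characters".
Match word = Regex.Match("hello world", pattern);
Console.WriteLine(word.Value);       // hello

// Without any backslash, the pattern is just the literal letter "w".
Match letter = Regex.Match("hello world", "w");
Console.WriteLine(letter.Index);     // 6
```

Printing the pattern string before handing it to Regex is a quick way to see what the regex engine will actually receive.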

When working with regular expressions directly (such as on most online regexp testers, or when using verbatim strings @"..."), you don’t have to worry about the interpretation of string literals, and you always write just one backslash (except when you want to match the backslash itself, but then you’re escaping the backslash for the regexp, not for the string).
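As a rough illustration of the verbatim-string case, again assuming C# 9 or later for top-level statements, with hypothetical variable names and a made-up sample path:

```csharp
using System;
using System.Text.RegularExpressions;

// The same pattern written both ways; the verbatim form needs no string-level escaping.
string regular = "\\d+\\.\\d+";
string verbatim = @"\d+\.\d+";
Console.WriteLine(regular == verbatim);          // True

// Matching a literal backslash: the regex itself needs \\ (escaped for the regexp).
// In a verbatim string that is written as-is; in a regular literal each backslash doubles again.
string backslashVerbatim = @"\\";
string backslashRegular = "\\\\";
Console.WriteLine(backslashVerbatim == backslashRegular);          // True
Console.WriteLine(Regex.IsMatch(@"C:\temp", backslashVerbatim));   // True
```

The four-backslash form is the clearest sign of both levels at work: the string parser halves it to two, and the regex engine halves it again to one literal backslash.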
