System.Text.RegularExpressions

Regex Author - stackoverflow

Get IP address - simple non capturing example

(?:...) is used when you want to group an expression but do not want to save it as a matched/captured portion of the string. An example would be a pattern to match an IP address: (?:\d{1,3}\.){3}\d{1,3} Note that I don't care about saving the first 3 octets, but the (?:...) grouping lets me shorten the regex without incurring the overhead of capturing and storing a match. This pattern only checks the dotted-number shape; it does not verify that each octet is in the range 0 to 255.
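A minimal sketch of the pattern in use. Python's `re` module stands in here for .NET's Regex class (an assumption made only to keep the example short; the `(?:...)` syntax is the same in both), and the sample text is made up for illustration.

```python
import re

# Pattern from the entry above: three "digits + dot" groups, then a final octet.
IP_PATTERN = re.compile(r"(?:\d{1,3}\.){3}\d{1,3}")

# Hypothetical sample text, invented for this illustration.
text = "Server 192.168.0.1 answered, gateway 10.0.0.254 did not."

# With no capturing groups, findall returns the full matches.
addresses = IP_PATTERN.findall(text)
print(addresses)  # ['192.168.0.1', '10.0.0.254']

# The same pattern with a capturing group, for comparison: findall now
# returns only the last text captured by the repeated group.
capturing = re.compile(r"(\d{1,3}\.){3}\d{1,3}")
captured = capturing.findall(text)
print(captured)  # ['0.', '0.']
```

The second result shows why the non-capturing form is usually what you want here: the repeated capturing group keeps only its final repetition, which is rarely useful.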

Type: match, Date: 7/12/2015 3:29:13 PM, Author: stackoverflow

How to get numbers 1st, 2nd, 3rd, 4th from text

You can use capturing groups to organize and parse an expression. A non-capturing group has the first benefit (organizing), but doesn't have the overhead of the second (capturing). You can still make a non-capturing group optional, for example. Say you want to match numeric text, but some numbers could be written as 1st, 2nd, 3rd, 4th... If you want to capture the numeric part, but not the (optional) suffix, you can use a non-capturing group: ([0-9]+)(?:st|nd|rd|th)? That will match numbers in the form 1, 2, 3... or in the form 1st, 2nd, 3rd,... but it will only capture the numeric part.
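A short sketch of this pattern, again using Python's `re` as a stand-in for .NET's Regex (an assumption for illustration). The sample sentence is hypothetical.

```python
import re

# Pattern from the entry above: capture the digits, skip the optional suffix.
ORDINAL = re.compile(r"([0-9]+)(?:st|nd|rd|th)?")

# Hypothetical sample text, invented for this illustration.
text = "She finished 1st, he came 22nd, and 7 runners dropped out on the 3rd lap."

# With exactly one capturing group, findall returns that group's text.
numbers = ORDINAL.findall(text)
print(numbers)  # ['1', '22', '7', '3']

# The whole match still includes the suffix; only group 1 omits it.
first = ORDINAL.search(text)
print(first.group(0))  # 1st
print(first.group(1))  # 1
```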

Type: match, Date: 7/12/2015 3:26:19 PM, Author: stackoverflow

Simple example - non capturing group

If I apply the regex below to text containing the URL http://stackoverflow.com/: (http|ftp)://([^/\r\n]+)(/[^\r\n]*)? I would get the following result: Match "http://stackoverflow.com/", Group 1: "http", Group 2: "stackoverflow.com", Group 3: "/". But I don't care about the protocol - I just want the host and path of the URL. So, I change the regex to use the non-capturing group (?:...): (?:http|ftp)://([^/\r\n]+)(/[^\r\n]*)? Now, my result looks like this: Match "http://stackoverflow.com/", Group 1: "stackoverflow.com", Group 2: "/".
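The two results above can be reproduced with a small sketch. Python's `re` is used as a stand-in for .NET's Regex (an assumption for illustration); the variable names are arbitrary.

```python
import re

url = "http://stackoverflow.com/"

# First version: the protocol is captured as group 1.
with_capture = re.compile(r"(http|ftp)://([^/\r\n]+)(/[^\r\n]*)?")
# Second version: (?:...) groups the alternatives without capturing them.
without_capture = re.compile(r"(?:http|ftp)://([^/\r\n]+)(/[^\r\n]*)?")

m1 = with_capture.search(url)
print(m1.group(0))  # http://stackoverflow.com/
print(m1.groups())  # ('http', 'stackoverflow.com', '/')

m2 = without_capture.search(url)
print(m2.group(0))  # http://stackoverflow.com/
print(m2.groups())  # ('stackoverflow.com', '/')
```

Both patterns match the same text; only the numbering of the captured groups changes.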

Type: match, Date: 7/12/2015 3:23:20 PM, Author: stackoverflow

Regex - \d is less efficient than [0-9] - get ALL Unicode digits

\d matches all Unicode digits, while [0-9] is limited to the ten characters 0123456789, so [0-9] isn't equivalent to \d. Persian digits, ۱۲۳۴۵۶۷۸۹, and Eastern Arabic numerals, ٠١٢٣٤٥٦٧٨٩, are examples of Unicode digits that are matched by \d but not by [0-9].
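A minimal sketch of the difference. It uses Python's `re` as a stand-in for .NET's Regex (an assumption for illustration); for string patterns Python's \d also matches Unicode decimal digits by default, and its `re.ASCII` flag plays a role similar to .NET's `RegexOptions.ECMAScript` in restricting \d to 0-9.

```python
import re

ascii_digits = "0123456789"
# Eastern Arabic numerals, U+0660 through U+0669, built from code points
# to avoid any encoding trouble in the source file.
eastern_arabic = "".join(chr(cp) for cp in range(0x0660, 0x066A))

# \d accepts both strings in full.
d_matches_ascii = re.fullmatch(r"\d+", ascii_digits) is not None
d_matches_arabic = re.fullmatch(r"\d+", eastern_arabic) is not None

# [0-9] accepts only the ASCII digits.
range_matches_arabic = re.fullmatch(r"[0-9]+", eastern_arabic) is not None

# With the ASCII flag, \d falls back to [0-9] behaviour.
ascii_flag_matches_arabic = re.fullmatch(r"\d+", eastern_arabic, re.ASCII) is not None

print(d_matches_ascii, d_matches_arabic)  # True True
print(range_matches_arabic)               # False
print(ascii_flag_matches_arabic)          # False
```

If only 0-9 should be accepted, for example when validating numeric input, [0-9] states that intent explicitly.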

Type: match, Date: 7/12/2015 2:34:00 PM, Author: stackoverflow