In programming, regular expressions are a very useful tool designed to validate, search, and match text patterns. In this article, I have compiled more than 10 incredibly useful regular expressions, for any language, that will probably be very beneficial to you.

Validate an URL

Is a particular url valid? The following regexp will let you know.

/^(https?:\/\/)?([\da-z\.-]+)\.([a-z\.]{2,6})([\/\w \?=.-]*)*\/?$/

Source: http://snipplr.com/view/19502/validate-a-url/

Validate US phone number

This regexp will verify that a US phone number is valid.

/^(\+\d)*\s*(\(\d{3}\)\s*)*\d{3}(-{0,1}|\s{0,1})\d{2}(-{0,1}|\s{0,1})\d{2}$/

Source: http://snippets.dzone.com/posts/show/597

Test if a password is strong

Weak passwords are one of the quickest ways to get hacked. The following regexp will make sure that:

  • Passwords will contain at least (1) upper case letter
  • Passwords will contain at least (1) lower case letter
  • Passwords will contain at least (1) number or special character
  • Passwords will contain at least (8) characters in length
  • Password maximum length should not be arbitrarily limited
(?=^.{8,}$)((?=.*\d)|(?=.*\W+))(?![.\n])(?=.*[A-Z])(?=.*[a-z]).*$

Source: http://imar.spaanjaars.com/QuickDocId.aspx?quickdoc=297

Get code within <?php and ?>

If for some reason you need to grab all the code contained within the <?php and ?> tags, this regexp will do the job:

<\?[php]*([^\?>]*)\?>

Source: http://snipplr.com/view/12845/get-all-the-php-code-between/

Match tel: urls

In a recent post, I showed you how you can use iPhone special link prfixes to automatically call someone.
This regular expression will match those tel: urls.

^tel:((?:\+[\d().-]*\d[\d().-]*|[0-9A-F*#().-]*[0-9A-F*#][0-9A-F*#().-]*(?:;[a-z\d-]+(?:=(?:[a-z\d\[\]\/:&+$_!~*'().-]|%[\dA-F]{2})+)?)*;phone-context=(?:\+[\d().-]*\d[\d().-]*|(?:[a-z0-9]\.|[a-z0-9][a-z0-9-]*[a-z0-9]\.)*(?:[a-z]|[a-z][a-z0-9-]*[a-z0-9])))(?:;[a-z\d-]+(?:=(?:[a-z\d\[\]\/:&+$_!~*'().-]|%[\dA-F]{2})+)?)*(?:,(?:\+[\d().-]*\d[\d().-]*|[0-9A-F*#().-]*[0-9A-F*#][0-9A-F*#().-]*(?:;[a-z\d-]+(?:=(?:[a-z\d\[\]\/:&+$_!~*'().-]|%[\dA-F]{2})+)?)*;phone-context=\+[\d().-]*\d[\d().-]*)(?:;[a-z\d-]+(?:=(?:[a-z\d\[\]\/:&+$_!~*'().-]|%[\dA-F]{2})+)?)*)*)$

Source: http://tools.ietf.org/html/rfc3966#section-3

Validate US zip code

When building a registration form, it is common to ask the user’s zip code. As forms are often boring, there’s a strong chance that the user will try to register false data. This regular expression will make sure he entered a valid American zip code.

^[0-9]{5}(-[0-9]{4})?$

Source: http://reusablecode.blogspot.com/2008/08/isvalidzipcode.html

Validate Canadian postal code

This regexp is very similar to the previous one, but it will match Canadian postal codes instead.

^[ABCEGHJ-NPRSTVXY]{1}[0-9]{1}[ABCEGHJ-NPRSTV-Z]{1}[ ]?[0-9]{1}[ABCEGHJ-NPRSTV-Z]{1}[0-9]{1}$

Source: http://reusablecode.blogspot.com/2008/08/isvalidpostalcode.html

Grab unclosed img tags

As you probably know, the xhtml standard requires all tags to be properly closed. This regular expression will search for unclosed img tags. It could be easily modified to grab any other unclosed html tags.

<img([^>]+)(\s*[^\/])>

Source: http://snipplr.com/view/6632/grab-any-unclosed-xhtml-img-tags/

Find all CSS attributes

This regexp will find CSS attributes, such as background:red; or padding-left:25px;.

\s(?[a-zA-Z-]+)\s[:]{1}\s*(?[a-zA-Z0-9\s.#]+)[;]{1}

Source: http://snipplr.com/view/17903/find-css-attributes/

Validate an IBAN

I have recently worked on a banking application and this one was definitely a life-saver. It will verify that the given IBAN is valid.

[a-zA-Z]{2}[0-9]{2}[a-zA-Z0-9]{4}[0-9]{7}([a-zA-Z0-9]?){0,16}

Source: http://snipplr.com/view/15322/iban-regex-all-ibans/

Validate a BIC code

Another one very useful for any banking application or website: This regexp will validate a BIC code.

([a-zA-Z]{4}[a-zA-Z]{2}[a-zA-Z0-9]{2}([a-zA-Z0-9]{3})?)

Source: http://snipplr.com/view/15320/bic-bank-identifier-code-regex/

If you’re interested in regular expressions, make sure you have read our “15 PHP regular expression for developers” post.

 

16 Comments

  1. Posted March 30, 2010 at 11:02 am | Permalink

    Useful collection. Thanks for posting.

  2. Posted March 30, 2010 at 11:08 am | Permalink

    Great post – I am absolute pants at regexp’s, so to see a summary of the most helpful is really, er, helpful!!

  3. Posted March 30, 2010 at 11:08 am | Permalink

    One I use a lot is =”[^"]*["] to get attribute values if I’m trying to clean up html. For example, you could use style=”[^"]*["] to detect any element with an inline style and replace both the inline style attribute and the attribute value with the find/replace function in your IDE.

  4. Posted March 30, 2010 at 1:02 pm | Permalink

    Very useful, thanks for sharing!

  5. Rory M
    Posted March 31, 2010 at 5:32 am | Permalink

    Useful list, but nothing for email validation?

  6. Michael
    Posted March 31, 2010 at 7:05 am | Permalink

    The URL expression incorrectly rejects a number of valid URLs. For example, it rejects a URL where you use the IP address instead of a hostname. It rejects a URL that specifies a port number. It rejects any URL that is not http or https.

    There is an RFC that defines what makes a valid URL. I recommend using that as a specification to guide the development of your regular expression.

    Based on that, I have to wonder about the accuracy of the other regular expressions in the list. I would rate this list as Not Recommended.

  7. Posted March 31, 2010 at 12:50 pm | Permalink

    Thanks for sharing this! very useful

  8. Jonathan Allen
    Posted April 1, 2010 at 3:54 pm | Permalink

    Ugh. Those don’t even begin to consider validity, they just determine if the pattern is remotely plausable. If your intent is to give meaningful feedback to the user you need to consider the checksum.

  9. victor.
    Posted April 1, 2010 at 3:54 pm | Permalink

    @Rory

    validating email and even urls with regex is asking for trouble.

  10. Jonathan Allen
    Posted April 1, 2010 at 3:56 pm | Permalink

    @Jenna Molby

    It is impossible to validate an email address using regular expressions alone. Foruntately you should be able to find a real emal validation function in most platforms. Unfortuantely many of them are also broken, just less so.

  11. Posted April 8, 2010 at 3:30 am | Permalink

    What you guys mean by saying it is trouble to validate emails with regex? I guess all the websites do that. Can you guys be more specific?

  12. Posted April 17, 2010 at 4:25 pm | Permalink

    Regexes are great, and I use them a lot, but sometimes a special-purpose parser is best. For example, in Python there’s the urlparse module for URLs, which I think is more convenient than a regex (it also lets you access each part of the url as a named attribute, e.g. parsed_url.scheme, parsed_url.path).

    Long ago, I came across what was claimed to be an RFC-compliant regex for e-mail addresses. It was hundreds and hundreds of characters long. Can’t find it anymore.

  13. Posted May 14, 2010 at 6:13 am | Permalink

    Wow! I’m not particular with any web development techniques yet. Thanks for posting it here. I now have another idea on how=)

    • EskiMag
      Posted May 19, 2010 at 5:51 am | Permalink

      That “Validate an URL” was falling into infinite cycle and causing 100% CPU load. I think better version should be: /^(https?:\/\/)?([\da-z\.-]+)\.([a-z\.]{2,6})([\/\w\?&=.-]*)\/?$/

  14. Posted June 12, 2010 at 4:12 pm | Permalink

    very intersting collection. Thanks for posting.

  15. Posted June 27, 2010 at 5:20 am | Permalink

    Maybe i am wrong … but doesn’t the URL-Validating Code fail to validate UTF8-URLs (especially with umlauts and stuff)? So in my opinion “\w” would be the better choice than “\da-z”.

3 Trackbacks

  1. By uberVU - social comments on March 30, 2010 at 10:14 am

    Social comments and analytics for this post…

    This post was mentioned on Twitter by logobliss: 10+ regular expressions for efficient web development http://bit.ly/bGCDcQ…

  2. [...] 10+ regular expressions for efficient web development [...]

  3. By Browsat – 19 April, 2010 | born to hula on April 19, 2010 at 4:30 pm

    [...] – 10+ regular expressions for efficient web development30 March, 2010 – Bra [...]

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

Subscribe without commenting

  • Smashing Network
  • Hosted by VPS.net and Akamai CDN
WordPress Appliance - Powered by TurnKey Linux