Regex to find a href links and add rel="nofollow" to them

Posted by Rodney on Stack Overflow
Published on 2010-03-15T22:42:45Z

I am trying to write a regular expression that finds every <a href="..."> link on my webpage and adds rel="nofollow" to it.

However, I have a list of URLs that must be excluded. For example, any internal link should be excluded by wildcard, so that every link containing my domain name (e.g. pokerdiy.com) is left alone. I also want to be able to put exact URLs in the exclude list, for example http://www.example.com/link.aspx.
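To make the exclusion rules concrete, here is a minimal Python sketch of the check described above. The names EXCLUDED_DOMAINS, EXCLUDED_URLS and is_excluded are hypothetical, and it assumes that "internal" means the link's host is the domain or one of its subdomains, and that relative links count as internal.

```python
from urllib.parse import urlsplit

# Hypothetical settings; the domain and the exact URL are taken from the question.
EXCLUDED_DOMAINS = {"pokerdiy.com"}
EXCLUDED_URLS = {"http://www.example.com/link.aspx"}


def is_excluded(url: str) -> bool:
    """Return True if the link should be left without rel="nofollow"."""
    # Exact-match exclusions.
    if url in EXCLUDED_URLS:
        return True
    host = (urlsplit(url).hostname or "").lower()
    if not host:
        # Assumption: a link with no host is relative, hence internal.
        return True
    # Wildcard exclusion: the domain itself or any subdomain of it.
    return any(host == d or host.endswith("." + d) for d in EXCLUDED_DOMAINS)
```

Comparing the parsed host, rather than searching the whole URL for the domain name, avoids excluding an external link that merely mentions the domain in its path or query string.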

Here is what I have so far, which is not working:

(<a[^>]+)(href="http://.*?(?!(pokerdiy))[^>]+>)
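One likely reason the negative lookahead (?!(pokerdiy)) has no effect: it is tested at a single position, and the lazy .*? in front of it lets the engine pick any position where "pokerdiy" does not start, which almost always exists. The following Python sketch moves the lookahead to sit directly after href=" so that it tests the URL itself. PATTERN and add_nofollow are hypothetical names, and the pattern assumes double-quoted href values and absolute http/https URLs. For arbitrary markup an HTML parser would be more robust than a regex.

```python
import re

# Sketch only: assumes double-quoted attributes and absolute http(s) URLs.
PATTERN = re.compile(
    r'(<a\b(?![^>]*\brel=)'                      # skip tags that already have rel=
    r'[^>]*?\bhref="'                            # anything up to the href value
    r'(?![^"]*pokerdiy)'                         # wildcard exclusion inside the URL
    r'(?!http://www\.example\.com/link\.aspx")'  # exact-URL exclusion
    r'https?://[^"]*"[^>]*)>',                   # rest of the URL and of the tag
    re.IGNORECASE,
)


def add_nofollow(html: str) -> str:
    """Append rel="nofollow" to every matching opening <a> tag."""
    return PATTERN.sub(r'\1 rel="nofollow">', html)
```

Each extra exact URL would need its own lookahead, so for a long exclude list it may be simpler to match every link and decide in a replacement callback instead.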

If you need more background/info you can see the full thread and requirements here (skip the top part to get to the meat): http://www.snapsis.com/Support/tabid/601/aff/9/aft/13117/afv/topic/afpgj/1/Default.aspx#14737

© Stack Overflow or respective owner