php regex to remove HTML

Posted by Me1000 on Stack Overflow See other posts from Stack Overflow or by Me1000
Published on 2009-04-17T02:53:10Z Indexed on 2010/03/20 11:21 UTC
Read the original article Hit count: 311

Filed under:
|
|
|

Before we start, strip_tags() doesn't work.

now,

I've got some data that needs to be parsed, the problem is, I need to get rid of all the HTML that has been formated very strangely. the tags look like this: (notice the spaces)

< p > blah blah blah < / p > < a href= " link.html " > blah blah blah < /a >

All the regexs I've been trying aren't working, and I don't know enough about regex formating to make them work. I don't care about preserving anything inside of the tags, and would prefer to get rid of the text inside a link if I could.

Anyone have any idea?

(I really need to just sit down and learn regular expressions one day)

© Stack Overflow or respective owner

Related posts about php

Related posts about regex