Robust, Mature HTML Parser for PHP

Posted by Alan Storm on Stack Overflow See other posts from Stack Overflow or by Alan Storm
Published on 2008-11-15T19:09:52Z Indexed on 2010/05/12 18:44 UTC
Read the original article Hit count: 444

Filed under:
|
|
|
|

Are there any robust and mature HTML parsers available for PHP? A quick skimming of PEAR didn't turn anything up (lots of classes for generating HTML, not so much for consuming), and Google taught me a lot of people have started and then abandoned a variety of parser projects.

Not interested in XML parsers (unless then can consume non-well formed HTML) or hacking it on my own with regular expressions.

Clarification of Intent: I'm not interested in filtering of HTML content, I'm interesting in extracting information from HTML documents.

© Stack Overflow or respective owner

Related posts about php

Related posts about html