Remote XML retrieval

Posted by rrrfusco on Stack Overflow See other posts from Stack Overflow or by rrrfusco
Published on 2010-12-14T20:43:20Z Indexed on 2011/01/01 8:54 UTC
Read the original article Hit count: 310

Filed under:

I'm retrieving a remote XML file with wget -O remotefile localfile with a cron tab.

Every so often the local file becomes malformed probably because of double whitespace (according to firefox) between the CDATA tags.

Sometimes the parser gives an error for ' > ' missing, but upon checking the xml file the ' > ' exists...

The remote xml file never gives malformed errors when called in the url.

EDIT
It seems CDATA is not parsed by the xml parser. (w3schools)

Is there a way to set wget to retain wellformed XML?

If not wget, What is a good way to continously retrieve a remote XML file and keep it well formed?

© Stack Overflow or respective owner

Related posts about Xml