Valid content-type for XML, HTML and XHTML documents
        Posted  
        
            by astropanic
        on Stack Overflow
        
        See other posts from Stack Overflow
        
            or by astropanic
        
        
        
        Published on 2010-06-03T11:46:05Z
        Indexed on 
            2010/06/03
            12:04 UTC
        
        
        Read the original article
        Hit count: 260
        
http
|web-standards
What are correctly content-types for this documents ?
I need to write a simple crawler, that only fetches this kind of files.
Nowadays http://somedomain.com/index.html can serve for example an JPEG file due to mod_rewrite, so I need to check the content-type from the response header and compare it with a list of allowed content-types.
From where I can get such list ?
© Stack Overflow or respective owner