Batch convert malformed PDFs to TIF on Linux

Posted by Mike Driscoll on Stack Overflow
Published on 2012-12-14T16:46:08Z

I need to convert a multipage PDF to TIF, but the PDF our client provided appears to be malformed. I tried ImageMagick and Ghostscript, but neither converts the file correctly: the output is only about 85-90% correct. The only tool I've found that renders it properly is GIMP, but I can't find an example of driving it through its batch-processing mode for PDFs.
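
For reference, a batch Ghostscript run of the kind described might be sketched as below. The exact command used is not given here, so the device (`tiffg4`), the resolution, and the helper names `build_gs_command` and `convert_all` are all assumptions for illustration, not the command that produced the warnings.

```python
import subprocess
from pathlib import Path


def build_gs_command(pdf_path, tif_path, dpi=300, device="tiffg4"):
    """Build the Ghostscript argument list for one PDF -> one multipage TIFF.

    Assumed settings: tiffg4 (bilevel, CCITT Group 4) at 300 dpi.
    Another TIFF device such as tiff24nc (24-bit colour) can be passed instead.
    """
    return [
        "gs",
        "-dNOPAUSE",                 # do not prompt between pages
        "-dBATCH",                   # exit once the input has been processed
        "-dSAFER",                   # restrict file access from within the PDF
        f"-sDEVICE={device}",
        f"-r{dpi}",
        f"-sOutputFile={tif_path}",  # no %d in the name, so all pages go to one file
        str(pdf_path),
    ]


def convert_all(src_dir, dst_dir):
    """Convert every *.pdf in src_dir and return the names of files gs failed on."""
    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    failed = []
    for pdf in sorted(Path(src_dir).glob("*.pdf")):
        tif = dst / (pdf.stem + ".tif")
        result = subprocess.run(
            build_gs_command(pdf, tif), capture_output=True, text=True
        )
        if result.returncode != 0:
            failed.append(pdf.name)
    return failed
```

Calling `convert_all("incoming", "converted")` (both directory names hypothetical) would write one TIF per PDF. A zero exit code only means Ghostscript finished, not that the pages rendered correctly, so the output still needs a visual check.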

Here are the warnings I get from ImageMagick and Ghostscript:

 **** Warning: Tf refers to an unknown resource name: FORMS$.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: P06BOB.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN308E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: P06BOB.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN308E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: P06BOB.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN308E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN307A.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN104A.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN308E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN208E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN308E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN208E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: HE14BP.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN208E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN106E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN208E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN106E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN208E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: HE08BP.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: HE11BP.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: AR10NP.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN308E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: JIMP2.l Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: HS11C.l Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: FORMS$.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN104A.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN208E.f Assuming it's a font name.
 **** Warning: Tf refers to an unknown resource name: UN307E.f Assuming it's a font name.

 **** This file had errors that were repaired or ignored.
 **** Please notify the author of the software that produced this
 **** file that it does not conform to Adobe's published PDF
 **** specification.
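
Most of the warnings above repeat the same few resource names. A minimal sketch for collapsing output like this into a count per name, assuming the messages have been captured as text (for example from Ghostscript's console output); the names `WARNING_RE` and `count_unknown_fonts` are made up for this example:

```python
import re
from collections import Counter

# Matches the resource name in lines such as:
#   **** Warning: Tf refers to an unknown resource name: UN308E.f Assuming it's a font name.
WARNING_RE = re.compile(r"Tf refers to an unknown resource name: (\S+)")


def count_unknown_fonts(log_text):
    """Return a Counter mapping each unknown resource name to its number of warnings."""
    return Counter(WARNING_RE.findall(log_text))


def summarize(log_text):
    """Return one 'name: count' line per distinct name, most frequent first."""
    counts = count_unknown_fonts(log_text)
    return [f"{name}: {n}" for name, n in counts.most_common()]
```

The resulting list shows which font resources the PDF's text operators refer to without the file defining them, which may help when reporting the problem to whoever generated the file.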

I'm open to other suggestions too. Thanks!
