How to identify doc, ppt, xls files

Posted by Shelby. S on Ask Ubuntu
Published on 2012-07-03T14:50:03Z


How can I tell ppt, xls and doc files apart on Linux, regardless of their extensions? I tried 'file', but from the looks of it all MS Office files are reported as the same file type. I'm having the same trouble with docx, xlsx and pptx files, since they are all essentially zip archives containing a bunch of XML.
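For the zip-based formats, one possible approach is to look inside the archive rather than at its first bytes. The sketch below is only an illustration: the function name guess_ooxml_type and the OOXML_MARKERS table are made up for this example, and it assumes the main part sits at its conventional path (word/document.xml, xl/workbook.xml, ppt/presentation.xml), which is typical for files saved by Office but not something this check verifies against the package's content-type declarations.

```python
import zipfile

# Conventional location of the main part in each package type.
# Assumption: the file follows the usual layout written by Office.
OOXML_MARKERS = {
    "word/document.xml": "docx",
    "xl/workbook.xml": "xlsx",
    "ppt/presentation.xml": "pptx",
}


def guess_ooxml_type(source):
    """Return 'docx', 'xlsx', 'pptx', or None if nothing matches.

    source may be a file path or a binary file-like object.
    """
    try:
        with zipfile.ZipFile(source) as archive:
            names = set(archive.namelist())
    except zipfile.BadZipFile:
        # Not a zip archive at all, so not one of these formats.
        return None
    for member, kind in OOXML_MARKERS.items():
        if member in names:
            return kind
    return None
```

Calling guess_ooxml_type("somefile") ignores the extension entirely. Macro-enabled variants (docm, xlsm, pptm) use the same layout, so this sketch would report them as docx, xlsx and pptx.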

Thank you for your help! P.S. I also tried a Python script that imports the magic module, but that did not work either.
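For the older doc, xls and ppt files, a rough standard-library check is also possible without the magic module. These files share one container format (OLE2 compound file), which is why they look identical from their first bytes, but each application stores its data in a differently named stream: "WordDocument" for Word, "Workbook" (or "Book" in older versions) for Excel, and "PowerPoint Document" for PowerPoint. The sketch below is a heuristic only: guess_ole_type is a hypothetical helper, and it simply searches the raw bytes for those names in UTF-16LE instead of parsing the container's directory, so it can be fooled by embedded objects or by text that happens to contain the same words.

```python
# Signature shared by every OLE2 compound file (doc, xls, ppt and others).
OLE_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")

# Stream names to look for, checked in this order.
# The directory of a compound file stores names as UTF-16LE.
OLE_STREAMS = [
    ("WordDocument", "doc"),
    ("PowerPoint Document", "ppt"),
    ("Workbook", "xls"),
    ("Book", "xls"),  # older Excel versions
]


def guess_ole_type(data):
    """Guess 'doc', 'ppt' or 'xls' from the raw bytes of a file.

    Returns None if the data is not an OLE2 file, and 'ole-unknown'
    if it is one but no known stream name is found.
    """
    if not data.startswith(OLE_MAGIC):
        return None
    for name, kind in OLE_STREAMS:
        if name.encode("utf-16-le") in data:
            return kind
    return "ole-unknown"
```

To use it on a file, read the contents first, for example with open(path, "rb") and read(), and pass the resulting bytes in. A more reliable version would walk the container's directory entries rather than scanning the whole file.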

© Ask Ubuntu or respective owner
