Question about PDF manipulation

der Mouse mouse at Rodents.Montreal.QC.CA
Thu Jun 2 23:16:09 CDT 2005


>   - Ran identify again on the resulting TIFF file, and the comment's now
> changed to: "Image generated by ESP Ghostscript (device=pnmraw)"

> ... so it looks like any TIFF 'metadata' isn't getting preserved.

Worse, it is probably re-rendering the pixels, so unless it's quite
careful, it's introduced blur due to mismatches between the original's
pixel boundaries and the output's pixel boundaries.  Did you compare
the pixel contents bit-for-bit?

> Looking at the PDF file, I'm not convinced there's any TIFF data in
> there to be honest.  It looks more like the image is re-encoded from
> the input TIFF to PDFs own way of storing bitmap data - in other
> words it's not simply a wrapper for a bunch of TIFF images, but
> merely a wrapper for bitmap data in PDF's own format.

It's not *quite* that simple.  I've seen PDFs containing JPEGs which I
could pick the JPEGs out of simply by looking for the
\xff\xd8\xff\xe0..JFIF marker.

Unless of course the "PDF's own format" *is* JPEG, which would be both
surprising and disappointing.

/~\ The ASCII				der Mouse
\ / Ribbon Campaign
 X  Against HTML	       mouse at rodents.montreal.qc.ca
/ \ Email!	     7D C8 61 52 5D E7 2D 39  4E F1 31 3E E8 B3 27 4B


More information about the cctalk mailing list