Finished up the John Pory PDF. Still heavy at ~250Mb, but I’m not going to get it much smaller - the images I embed in the PDF need to be good enough to do OCR on.

Scripts I wrote to batch process the images. I don’t think they’ll be much use to anyone else though.

The whole thing was only supposed to take a couple of days, but getting the scans into a state I was happy with took a lot longer than I thought (although it probably would have gone faster if I hadn’t stopped to build my own tools).


Surprisingly, the ~250Mb PDF increased the build time not at all.


Might prod at the site a bit more. I’m not happy with how it handles PDFs, for a start.