Converting Old Publications to Text Using OCR

This document describes how to set up an OCR engine, prepare the images for recognition, and convert from pdf files to text.  Here at the Military Yearbook Project we often have the need to convert old military yearbooks in.pdf format to text, then ultimately to html.  Most of the old MOS codes used as references on this website were converted using this method.  We use a combination of software to output the text. 

Privacy Policy  |  Terms of Service  |  Sitemap 

(C) 2009-2021 The Military Yearbook Project

Contact:  webmaster-(at)-militaryyearbookproject.org