For our indexing system, we prioritize digitizing the Title,
Acknowledgements, Copyright, Table of Contents, Introduction, and
Index pages. Once the scans are edited, all of the introductory pages
are combined into one PDF, while the Index pages are put into another.
Together they serve as a preview of the book, much like the previews
Google Books provides.
Next, we run Optical Character Recognition (OCR) software over the
PDFs, which makes their text searchable. These two PDFs are then
posted to blogs for public access.
If we are able to acquire the rights for a heritage resource, we can
then continue to digitize the rest of its contents. Each chapter is
saved as its own PDF.
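
The grouping described above could be sketched in Python roughly as follows. This is only an illustration of the idea, not the tooling we actually use: the function name `plan_pdfs`, the output file names, the section labels, and the `rights_acquired` flag are all assumptions made for the example. It only decides which scans go into which PDF; the merging and OCR steps are left out.

```python
# Hypothetical section labels, based on the page types named in the post.
FRONT_MATTER = {"Title", "Acknowledgements", "Copyright",
                "Table of Contents", "Introduction"}


def plan_pdfs(pages, rights_acquired=False):
    """Group (scan_file, section) pairs into output PDFs.

    Returns a dict mapping each output PDF name (assumed naming scheme)
    to the list of scan files it should contain, in input order.
    """
    plan = {}
    for scan_file, section in pages:
        if section in FRONT_MATTER:
            target = "preview_front_matter.pdf"
        elif section == "Index":
            target = "preview_index.pdf"
        elif rights_acquired:
            # With rights cleared, every other section gets its own PDF.
            target = section.lower().replace(" ", "_") + ".pdf"
        else:
            continue  # skip pages we are not yet allowed to publish
        plan.setdefault(target, []).append(scan_file)
    return plan


if __name__ == "__main__":
    scans = [
        ("p001.tif", "Title"),
        ("p002.tif", "Copyright"),
        ("p010.tif", "Chapter 1"),
        ("p200.tif", "Index"),
    ]
    for pdf_name, files in plan_pdfs(scans).items():
        print(pdf_name, files)
```

Without the rights flag, only the two preview PDFs are planned. Calling `plan_pdfs(scans, rights_acquired=True)` also plans a separate PDF for each chapter.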