FACTSnet Heritage Index: FACTSnet Indexing System

For our indexing system, we prioritize digitizing the Title, Acknowledgements, Copyright, Table of Contents, Introduction, and Index pages. Once the scans are edited, all of the introductory pages are combined into one PDF, while the Index pages go into another. Together they serve as a preview of the book, much like the previews Google Books provides.
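The grouping step above could be sketched roughly as follows. This is only an illustration, not the tool we actually use: the function name `group_preview_pages`, the file names, and the idea of tagging each scan with a section label are all assumptions made for the example.

```python
# Hypothetical section labels, taken from the list of prioritized pages.
FRONT_MATTER = [
    "Title",
    "Acknowledgements",
    "Copyright",
    "Table of Contents",
    "Introduction",
]
INDEX = "Index"


def group_preview_pages(scans):
    """Split (filename, section) pairs into the two preview bundles.

    Returns a dict with 'front_matter' and 'index' lists of filenames.
    Front-matter pages are ordered by section, then by input order;
    pages from any other section are left out of the preview.
    """
    order = {name: i for i, name in enumerate(FRONT_MATTER)}
    front = [
        (order[section], position, filename)
        for position, (filename, section) in enumerate(scans)
        if section in order
    ]
    front.sort()
    return {
        "front_matter": [filename for _, _, filename in front],
        "index": [filename for filename, section in scans if section == INDEX],
    }
```

Each list could then be handed to whatever PDF tool is in use to build the two preview files.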

Next, we run optical character recognition (OCR) software over the PDFs, which makes their text searchable. These two PDFs are then posted on blogs for public access.

If we are able to acquire the rights to a heritage resource, we then go on to digitize the rest of its contents. Each chapter is separated into its own PDF.
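As a rough sketch of how the chapter split might be planned, the snippet below turns a list of chapter start pages into page ranges, one per chapter PDF. The function name `chapter_ranges` and the page numbers are hypothetical; the actual splitting would be done by a PDF tool, which is not shown here.

```python
def chapter_ranges(chapter_starts, last_page):
    """Map each chapter title to an inclusive (first, last) page range.

    chapter_starts: dict of chapter title -> first page of that chapter.
    last_page: final page of the last chapter (assumed known from the scan).
    """
    ordered = sorted(chapter_starts.items(), key=lambda item: item[1])
    ranges = {}
    for i, (title, start) in enumerate(ordered):
        # A chapter ends on the page before the next chapter begins.
        if i + 1 < len(ordered):
            end = ordered[i + 1][1] - 1
        else:
            end = last_page
        ranges[title] = (start, end)
    return ranges
```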


Thursday, 21 August 2014

