Digital Heritage Network
FACTSnet Heritage Index
Thursday, 21 August 2014
FACTSnet Indexing System
For our indexing system, we prioritize the digitization of the
Title, Acknowledgements, Copyright, Table of Contents, Introduction,
and Index pages. Once the scans are edited, all of the introductory
pages are combined into one PDF, while the Index pages are put into
another. These serve as a preview of the book, much like the previews
Google Books provides.
Next, we run Optical Character Recognition (OCR) software over the PDFs, which makes their text searchable. These two PDFs are then posted onto blogs for public access.
If we are able to acquire the rights for a heritage resource, we can then continue to digitize the rest of the contents. Chapters are separated into their own PDFs.
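The grouping described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the tooling we actually use: the function name `plan_pdfs`, the output file names, and the idea of labelling each scanned page with its section are all hypothetical. It plans which pages go into which PDF and leaves the scanning, PDF assembly, and OCR steps to other software.

```python
# Section labels come from the post; everything else here is a hypothetical sketch.
FRONT_MATTER = {
    "Title",
    "Acknowledgements",
    "Copyright",
    "Table of Contents",
    "Introduction",
}


def plan_pdfs(pages, rights_acquired=False):
    """Group scanned pages into the PDFs described in the post.

    pages: list of (page_number, section_label) tuples in book order.
    rights_acquired: if True, chapters are also planned, one PDF each.
    Returns a dict mapping a PDF file name to its list of page numbers.
    """
    plan = {}
    for number, section in pages:
        if section in FRONT_MATTER:
            name = "front_matter.pdf"  # all introductory pages share one PDF
        elif section == "Index":
            name = "index.pdf"  # index pages get their own PDF
        elif rights_acquired:
            # Each chapter becomes its own PDF, named after its label.
            name = section.lower().replace(" ", "_") + ".pdf"
        else:
            continue  # without the rights, the remaining pages are skipped
        plan.setdefault(name, []).append(number)
    return plan


pages = [
    (1, "Title"),
    (2, "Copyright"),
    (3, "Table of Contents"),
    (4, "Introduction"),
    (5, "Chapter 1"),
    (6, "Chapter 1"),
    (7, "Chapter 2"),
    (8, "Index"),
]

preview_plan = plan_pdfs(pages)
full_plan = plan_pdfs(pages, rights_acquired=True)
```

With this made-up page list, `preview_plan` contains only the two preview PDFs, while `full_plan` also contains one entry per chapter.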