Purpose
and expected outcome of the project:
Scanning images for the creation of digital image libraries
is now a well established and documented process on campus. The ELI project has made great contributions
to bringing these digital image collections into the classroom. Scanning text
for research and pedagogical purposes is beginning to make inroads in technical
projects on campus when the texts involved are printed in Western
languages. Non-Western texts present
special challenges when planning a digital text collection. In the case of texts printed in Arabic, the
challenges multiply because of varying fonts used in the printing process,
these dependent in the main on the date and country of publication.
Methodology:
Project
Requirements:
(1) A survey of the scanning resources
on campus would need to be drawn up and distributed electronically or conducted
in person.
(2) Some cooperation among the
internal library units would be sought to cover costs normally assessed for
special scanning projects.
(3) A simple yet flexible web site
would need to be constructed to display the results of the discovery
project. Cooperation would be sought
from the YUL Systems group to determine the best location for the site. Also, attention would be paid to the current
MED DL project using Greenstone software such that the resulting files from
this SCOPA project would be compatible to that projects data structure.
(4) Software to conduct the OCR to
text processing would need to be purchased.
An academic discount will be sought.
|
Purchase of OCR software from Sakhr (does not reflect an
academic discount) |
$1400.00 |
|
Student time for scanning @ $11.00/hr |
$110.00 |
Total
|
$1510.00 |
The successful results of this grant will create a guideline for librarians and scholars at Yale to follow when undertaking projects involving the scanning and processing of non-Western text. The files produced from the manuscript can serve as a seed project for new electronic collection efforts or as additional data for existing collections in the Medical Historical Library. The scanning of the modern counterpart will provide additional information on successful OCR techniques for future digitization projects managed by the Library.
Yale University Library
SCOPA Grant Proposal: 2004
submitted: 10/18/2003