mperrie48
Image1_Letter
This image is part of a scanned book that describes the crafts and wildlife found along the Appalachian Trail. The OCR software captured the essence of the original adequately, but it returned a few errors. For instance, a couple of the capital “H”s came back as “I-I”, long dashes (“—”) turned into short hyphens (“-”), and numeric characters (e.g. 1, 2, 3) were sometimes inserted where letters should have been: “Hold yo1u' thumb.” The software also did not pick up oversized letters within the original text. Instead, it clipped the text as if the letter had never been there (“hese Blue Ridge Mountains”). Additionally, the software had a slight tendency to run together terms that are positioned close together on the page (“nakedness and can-dor, tmder high empty skies.”). Finally, the OCR output did not clearly mark where paragraphs begin and end, which makes the text a bit difficult to read. I think these errors are due to the typesetting of the original document and the quality of the scan.