{"id":1028,"date":"2019-03-06T11:14:30","date_gmt":"2019-03-06T16:14:30","guid":{"rendered":"https:\/\/wordpress.ocps.net\/presenceblog\/?p=1028"},"modified":"2019-04-03T11:54:09","modified_gmt":"2019-04-03T15:54:09","slug":"oh-no-you-have-a-pdf-that-is-a-scanned-image","status":"publish","type":"post","link":"https:\/\/wordpress.ocps.net\/presenceblog\/oh-no-you-have-a-pdf-that-is-a-scanned-image\/","title":{"rendered":"Oh No!  You Have a PDF That is a Scanned Image"},"content":{"rendered":"<p>In looking through the content on the OCPS websites, I find many PDFs that appear to have been documents scanned into a PDF format.  Most likely, these documents were created by using your local printer and scanning the document to your email address rather than printing a copy of the document.  This process creates a scanned image PDF of the document and emails it to you.  A scanned image not only losses all text information, but it also loses all other ADA settings that the original document may have had.  At the very least, text on the image is not readable by most screen readers unless they have an OCR component built in.  So what can you do?\n<\/p>\n<p>The current answer that I recommend is go back to the original Microsoft Word or other source document, fix accessibility issues there first and then recreate a new PDF.  But what if you don&#8217;t have the source document anymore or cannot find it.  Then the next best option is to run these PDF&#8217;s through Adobe Acrobat DC.  I&#8217;ve talked about Adobe Acrobat DC before so I recommend you check out that discussion at:   <a href=\"https:\/\/wordpress.ocps.net\/presenceblog\/how-to-check-your-pdf-for-accessiblity\/\">https:\/\/wordpress.ocps.net\/presenceblog\/how-to-check-your-pdf-for-accessiblity\/<\/a>\n\t<\/p>\n<p>Suppose however, you already have a scanned document that was published on your portal site.  In Chrome, if you open the document on the portal and move your mouse toward the top of the screen, you will see a black bar across the top of the browser window.  On the right side of this bar you should see several buttons.\n<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/wordpress.ocps.net\/wp-content\/uploads\/2019\/03\/030619_1614_OhNoYouHav1.png\" alt=\"\"\/>\n\t<\/p>\n<p>The button with the down pointing arrow is the Download button.  In Microsoft Edge, you may have to click on the document to display to black bar at the top and it displays a slightly different set of buttons on the right side:\n<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/wordpress.ocps.net\/wp-content\/uploads\/2019\/03\/030619_1614_OhNoYouHav2.png\" alt=\"\"\/>\n\t<\/p>\n<p>The button that looks like 3.5&#8243; floppy disk on the right side is the download button.   Other browse may have other variations of this.  After downloading the PDF file, open it in Adobe Acrobat DC as discussed in the previously referenced blog post and perform a Full Accessibility Check.  For a scanned image document, you will see an error under the Document section that informs you that the document is an Image-only PDF,\n<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/wordpress.ocps.net\/wp-content\/uploads\/2019\/03\/030619_1614_OhNoYouHav3.png\" alt=\"\"\/>\n\t<\/p>\n<p>If you right-click on the error, the dropdown menu displays your options.  First, let&#8217;s look at what the Explain option tells us about this error:\n<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/wordpress.ocps.net\/wp-content\/uploads\/2019\/03\/030619_1614_OhNoYouHav4.png\" alt=\"\"\/>\n\t<\/p>\n<p>The first line tells us that a scanned image document is not accessible.  However, there &#8216;may&#8217; be a way to extract the text from the image and &#8216;fix&#8217; the document to make it accessible by selecting the &#8216;Fix&#8217; option.  For simple documents, this may fix the problem.  If not, you could also try a 3<sup>rd<\/sup> party OCR (Optical Character Reader) to convert the document.  You could also try the steps listed in the Explain text to extract the text thus making the document readable.\n<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/wordpress.ocps.net\/wp-content\/uploads\/2019\/03\/030619_1614_OhNoYouHav5.png\" alt=\"\"\/>\n\t<\/p>\n<p>The Fix option displays a dialog box with three configuration questions.  For the first pass I will leave the Output as a Searchable Image.  Another option you might choose for output is Editable Text and Images.  This provides some access to images, but neither method is as accurate as working with the source document for the PDF and then recreating the PDF.\n<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/wordpress.ocps.net\/wp-content\/uploads\/2019\/03\/030619_1614_OhNoYouHav6.png\" alt=\"\"\/>\n\t<\/p>\n<p>After clicking OK, you may still have to go through the document to fix other issues, but for simple documents, this may be all you need to create an accessible document when the document does not contain images, tables or lists.  In that case, you can simply save and upload the new PDF replacing the PDF on your portal site.\n<\/p>\n<p>However, if you have images, tables, or lists, you may need to open the document in Microsoft Word and use the Accessibility Checker to find and to identify issues such as missing alt-text, fix those issues and recreate the PDF replacing the original one loaded on your web site.\n<\/p>\n<p>Sometimes, the images, tables, or lists do not convert cleanly in either Adobe Acrobat Pro DC or in Microsoft Word.  In these cases, you may have to either replace the images with fresh versions or the images or recreate the document using just the text you were able to recover.\n<\/p>\n<p>As you can see, the process of &#8216;fixing&#8217; a PDF can go from simple to complex quickly.  If you have the original source document, starting from that source to fix accessibility issues and recreate a PDF after addressing those issues is always the best choice.  However, when you cannot find the original source document or it no longer exists, the goal then becomes to &#8216;fix&#8217; as much as possible.\n<\/p>\n<p>\n\u00a0<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In looking through the content on the OCPS websites, I find many PDFs that appear to have been documents scanned into a PDF format. Most likely, these documents were created by using your local printer and scanning the document to your email address rather than printing a copy of the document. This process creates a &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/wordpress.ocps.net\/presenceblog\/oh-no-you-have-a-pdf-that-is-a-scanned-image\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Oh No!  You Have a PDF That is a Scanned Image&#8221;<\/span><\/a><\/p>\n","protected":false},"author":8,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[28],"tags":[80],"class_list":["post-1028","post","type-post","status-publish","format-standard","hentry","category-news-related-to-ada-and-accessibility","tag-scanned-pdfs-scanned-images"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":1415,"url":"https:\/\/wordpress.ocps.net\/presenceblog\/alt-text-for-images-with-text\/","url_meta":{"origin":1028,"position":0},"title":"Alt Text for Images with Text","author":"Carlos Hernandez","date":"January 27, 2020","format":false,"excerpt":"Background If you use an image that contains text that you want site visitors to read, that text must be included elsewhere on the page so that screen readers can read the text for the visually impaired and blind. In most cases, the image can be used as long as\u2026","rel":"","context":"In &quot;ADA News&quot;","block_context":{"text":"ADA News","link":"https:\/\/wordpress.ocps.net\/presenceblog\/category\/portal-related-news\/news-related-to-ada-and-accessibility\/"},"img":{"alt_text":"Backstage Menu within Micrsoft Word","src":"https:\/\/i0.wp.com\/wordpress.ocps.net\/wp-content\/uploads\/2020\/01\/012720_1804_AltTextforI1.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":832,"url":"https:\/\/wordpress.ocps.net\/presenceblog\/creating-your-accessible-pdf-from-word\/","url_meta":{"origin":1028,"position":1},"title":"Creating Your Accessible PDF from Word","author":"Carlos Hernandez","date":"September 24, 2018","format":false,"excerpt":"Now that you have a Word Document that is accessible, how can you create a PDF that you can distribute to others outside of OCPS. Since the 2007 version of Word, you have had an option to change the file type when you save a document. In the Save As\u2026","rel":"","context":"In &quot;ADA News&quot;","block_context":{"text":"ADA News","link":"https:\/\/wordpress.ocps.net\/presenceblog\/category\/portal-related-news\/news-related-to-ada-and-accessibility\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":799,"url":"https:\/\/wordpress.ocps.net\/presenceblog\/checking-the-accessibility-of-your-documents\/","url_meta":{"origin":1028,"position":2},"title":"Checking the Accessibility of your Documents","author":"Carlos Hernandez","date":"September 13, 2018","format":false,"excerpt":"So now that you have created your Word document, how can you check if that document is accessible? There are many accessibility checkers available on the Internet. However, did you know that you have an accessibility checker built right into Microsoft Word? I'm sure most of you did not because\u2026","rel":"","context":"In &quot;ADA News&quot;","block_context":{"text":"ADA News","link":"https:\/\/wordpress.ocps.net\/presenceblog\/category\/portal-related-news\/news-related-to-ada-and-accessibility\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":767,"url":"https:\/\/wordpress.ocps.net\/presenceblog\/accessibility-of-images-in-pdfs\/","url_meta":{"origin":1028,"position":3},"title":"Accessibility of Images in PDFs","author":"Carlos Hernandez","date":"August 29, 2018","format":false,"excerpt":"Accessibility of Images in PDFs follows essentially the same rules as images on web pages. There are three basic types of images: Decoration Example of something Actual content All three types of images must have alt-text associated with them. However, the contents of that text varies based on the image\u2026","rel":"","context":"In &quot;ADA News&quot;","block_context":{"text":"ADA News","link":"https:\/\/wordpress.ocps.net\/presenceblog\/category\/portal-related-news\/news-related-to-ada-and-accessibility\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":1534,"url":"https:\/\/wordpress.ocps.net\/presenceblog\/can-alt-text-have-line-breaks\/","url_meta":{"origin":1028,"position":4},"title":"Can Alt-Text Have Line Breaks","author":"Carlos Hernandez","date":"October 8, 2020","format":false,"excerpt":"Well, that depends. It depends on where you try to create them. Let's start with a word document. If you have a word document that contains an image, you can right-click on the image and select the \"Edit Alt Text...\" from the popup menu. Then in the panel that appears\u2026","rel":"","context":"In &quot;ADA News&quot;","block_context":{"text":"ADA News","link":"https:\/\/wordpress.ocps.net\/presenceblog\/category\/portal-related-news\/news-related-to-ada-and-accessibility\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/wordpress.ocps.net\/wp-content\/uploads\/2020\/10\/Yoda2.jpg?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/wordpress.ocps.net\/wp-content\/uploads\/2020\/10\/Yoda2.jpg?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/wordpress.ocps.net\/wp-content\/uploads\/2020\/10\/Yoda2.jpg?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/wordpress.ocps.net\/wp-content\/uploads\/2020\/10\/Yoda2.jpg?resize=700%2C400&ssl=1 2x, https:\/\/i0.wp.com\/wordpress.ocps.net\/wp-content\/uploads\/2020\/10\/Yoda2.jpg?resize=1050%2C600&ssl=1 3x"},"classes":[]},{"id":751,"url":"https:\/\/wordpress.ocps.net\/presenceblog\/accessibility\/","url_meta":{"origin":1028,"position":5},"title":"Thinking About Accessibility of PDFs on your Website","author":"Carlos Hernandez","date":"August 28, 2018","format":false,"excerpt":"First, let me be clear that accessibility is for everyone, not just the visually impaired. Some aspects of accessibility are directed toward those who have reasonably good vision or hearing and are meant to make documents easier to understand. Other aspects are directed toward users who need to use screen\u2026","rel":"","context":"In &quot;ADA News&quot;","block_context":{"text":"ADA News","link":"https:\/\/wordpress.ocps.net\/presenceblog\/category\/portal-related-news\/news-related-to-ada-and-accessibility\/"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"jetpack_likes_enabled":true,"jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/wordpress.ocps.net\/presenceblog\/wp-json\/wp\/v2\/posts\/1028","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wordpress.ocps.net\/presenceblog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wordpress.ocps.net\/presenceblog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wordpress.ocps.net\/presenceblog\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/wordpress.ocps.net\/presenceblog\/wp-json\/wp\/v2\/comments?post=1028"}],"version-history":[{"count":1,"href":"https:\/\/wordpress.ocps.net\/presenceblog\/wp-json\/wp\/v2\/posts\/1028\/revisions"}],"predecessor-version":[{"id":1029,"href":"https:\/\/wordpress.ocps.net\/presenceblog\/wp-json\/wp\/v2\/posts\/1028\/revisions\/1029"}],"wp:attachment":[{"href":"https:\/\/wordpress.ocps.net\/presenceblog\/wp-json\/wp\/v2\/media?parent=1028"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wordpress.ocps.net\/presenceblog\/wp-json\/wp\/v2\/categories?post=1028"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wordpress.ocps.net\/presenceblog\/wp-json\/wp\/v2\/tags?post=1028"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}