cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 

How to make my scans searchable PDF's

MarinaNBW
Apprentice

Hi All,

I am new to the forum and a new scanner user. We are currently scanning documents via Image Formula DR G2110 to a workstation and then to Google Drive. Unfortunately, the scans are not searchable? Is there a way to set up scanner or scans so that they produce searchable PDF's? 

6 REPLIES 6

shadowsports
Legend
Legend

Greetings,

I am not at work to check, but couldn't you use your CaptureOnTouch v4 to customize the file names?  You should be able to do this for destinations too.  Not sure as you haven't defined the "searchable" criteria?

 

GD.png

~Rick
Bay Area - CA


~R5 C (1.0.7.1) ~RF Trinity, ~RF 100 Macro, ~RF 100~400, ~RF 100~500, ~RF 200-800 +RF 1.4x TC, BG-R10, 430EX III-RT ~DxO PhotoLab Elite ~DaVinci Resolve ~ImageClass MF644Cdw/MF656Cdw ~Pixel 8 ~CarePaks Are Worth It

BurnUnit
Whiz
Whiz

I suspect that in order to make a searchable PDF you'll need to generate a true "text" file that the search software can detect. If you're scanning a text document what you end up with is a bitmap image of the text, actually a photo of the document. The search software is looking for specific sequences of individual letters but your scans consist only of a picture of all the letters in the document.

 

Essentially your choices consist of retyping the documents into a Word (or similar) document and saving it as a searchable PDF. Or you may be able to download some OCR (Optical Character Recognition) software which can decipher the scanned text images and convert them into true text documents for you. Some scanner software may also include a "lite" version of some OCR software, but you may have to find your scanner's installation disk and install it separately.

 

There is a bit of a learning curve involved with the OCR software, especially depending on the quality of your scans. It will still involve some proofreading and editing for the sake of accuracy but it's still quicker than retyping complete documents.

 

I'm not sure if Google Drive will produce searchable PDFs or not as I've never used that service. I'd think that Adobe Acrobat and some other PDF software could do the job, as well as maybe some of the freeware PDF converters. But I don't know if the "searchable" function is a standard feature or something of an upgrade.

Thank you both. I know that it must be one of the settings on my ON Capture as documentation states following:

 

OCR function

Yes (create searchable and editable PDF files, PPTX files)

 

I just am having trouble figuring out what I have done or need to do to get this to work.

I think I may need to update my or upgrade my Capture ONtouch software or add some plugins?

@MarinaNBW,

Didn't know you were trying to do OCR.

 

 

BurnUnit picked right up on it and I should have too.

 

You will need a full featured PDF application.  Acrobat Professional or similar. 

 

~Rick
Bay Area - CA


~R5 C (1.0.7.1) ~RF Trinity, ~RF 100 Macro, ~RF 100~400, ~RF 100~500, ~RF 200-800 +RF 1.4x TC, BG-R10, 430EX III-RT ~DxO PhotoLab Elite ~DaVinci Resolve ~ImageClass MF644Cdw/MF656Cdw ~Pixel 8 ~CarePaks Are Worth It


@MarinaNBW wrote:
I think I may need to update my or upgrade my Capture ONtouch software or add some plugins?

It looks like your CaptureOnTouch software may already include some OCR software. (OmniPage SE, possibly?)

 

Here's a couple links to give you an idea of what's involved. Especially recommended reading if you haven't worked with OCR software before.

 

https://support.usa.canon.com/kb/index?page=content&id=ART116634&actp=search&viewlocale=en_US&search...

 

https://support.usa.canon.com/kb/index?page=content&id=ART127046

 

...More general information here that might be helpful.

 

https://support.usa.canon.com/kb/index?page=answers&question_box_status=changed&question_box=capture...

 

How well this all works depends on the quality of the documents you're scanning and the quality of your output scans. Try converting just a couple pages initially to get a feel for how it works. You may get more accurate conversions (with less editing required) depending on the scanning resolution you choose. And some original documents may provide more "readable" output for OCR if scanned in either "Greyscale" or "Black & White" scanning mode. Adjustments to brightness and contrast settings in your scanner settings can make things better or worse as well.

 

Are your original documents text only, or do they also contain photos, illustrations or other bitmap images that have to be included in your final PDFs?

 

Avatar
Announcements