EMC QSP Scan Profile Editor - Introduction

Creating scan profiles enables users to switch quickly from scanning one type of document to another. Each profile represents one type of document and has all of the settings unique to that document type.

QSP_Profiles.png

Facts about QuickScanPro Scan Profiles

These profile scan settings once configured control the OCR (optical character recognition) settings and the export functions of the files that QuickScanPro processes for the document management system.

  • When a profile is selected, a description of the profile will appear in the box below the list. When a user creates a profile, they will have to provide a description in order for one to appear when the profile is selected.
  • Export profiles sends the file metadata to the local drive XML folder that is accompanied by the PDFoutput folder that was targeted on the general tab for file output.
  • OCR and indexing provide for the automatic capture of searchable metadata from the files as they are processed by QSP.
  • In order for OCR to work to its best capability of around 80% accurate, the documents to be OCR's must be standardized forms with no handwriting.
    • Anomalies such as documents being printed from different printers, the type of print (i.e. dot matrix) and overall print quality can effect the accuracy of the OCR.

Batch Scan Settings

QSP_ScanProfileEditorMenu.png

Click on the different menu items to access the settings.

General: Contains the profile name, description, profile type (public or private) and whether to allow profile deletion. It also shows where the document images are saved. Finally it contains a summary of the settings for the entire profile.

Scan: sets the options for how the software will scan the document. The settings include things like simplex/duplex, dpi, size if paper and whether and the scanner settings.

Image Format and Naming: configure file naming scheme, file type, colour format and compression. File names can be formatted to include information like the date and also to follow a certain formula.

Image Processing: sets up image cleanup. As the pages are scanned actions like skewing, noise removal and having blank pages detected and removed can be automatically preformed.

OCR: Unless there is a real need it is advised to not "turn on" full text OCR. OCRing slows down the scanning process and is best left for scanning written documents, magazine articles, historical records, books.

FileHold's search engine has two components:

  1. Full Text Search
  2. Metadata search

Both search indexes are searched simultaneously in FileHold. If you do enable OCR in QuickScan Pro - please note that the XML based import system will need a small adjustment in QSP settings to export out the path of the OCR'd documents.

Full Text OCR, Zonal OCR, bar codes, manual index fields, etc are all capable of being imported into FileHold using the XML system. This location is vital to the XML based system as this way the file path is known to the import system and it can locate and import the PDF or Tiff file. 

If you wish to setup full text OCR for a given QSP batch, as a best practice, we recommend you create a folder where both the XML export file and the OCR'd PDF are placed upon completion of each batch.

IMPORTANT: XML files must be in the same directory as the PDF file(s) being imported from QuickScan Pro, Kofax Capture, Kofax Express or Kodak Capture Pro!

OCR-Settings.PNG

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 Index: This tab allows for zonal OCR capture of fields captured from a reference file (TIFF format). This is great for structured forms where the information for each index field (Invoice #, Customer Name) is in the same location on the form so that you scan in batches and automate the capture of these fixed location fields

Invoices are a perfect example. Zonal OCR can be set to capture information such as the Invoice Number, the Customer Name and the date automatically. Users can create multiple index profiles for each type of document\form so that specific things happen when this batch launched in QuickScan Pro.

  • Stamp scanned image - leaves a stamp on the image to indicate scanned status.
  • Auto Index - recommended that you check "before the batch is closed". This allows the individual pages / files to be reviewed to make sure that they are being scanned correctly.
  • Profile - these profiles control the fields that will be indexed as the files are scanned.
    • Add/Edit/Delete.

Export: Set the export path for the image files once they are scanned. Users can create multiple export profiles.

  • Auto Export - just like the indexing should happen before the batch closes.
  • Profile - this profile controls where the files are exported to on the local drive* and what (if any metadata) is associated from Indexing.  *Files are exported to a set of folders created on a local drive. From there they are moved automatically into the document management system by an Import Tool that is configured in FDA. See Importing QSP processed files into the document management system for more details.
    • Add/Edit/Delete.

To export out non full text OCR'd documents - include from the <Page Values> drop down - the Page: FileName, and then include the various index/metadata fields from the Indexer profile you created for this batch profile.

REMINDER: XML files must be in the same directory as the PDF file(s) being imported from QuickScan Pro, Kofax Capture, Kofax Express or Kodak Capture Pro!

 No-fulltextOCR-export.PNG

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

To export out full text OCR'd documents - you don't include the Page: File Name like you do with non full text OCR results, instead you check the include OCR results - with an OCR count of 1.

OCRExportResults.PNG

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Batch: We recommend a basic configuration in the batch tab so that the batch function is left to save itself without prompting the user and slowing down your work.

batch.PNG

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Important:

Make sure your XML export file setting for If output file exists:

  • Generate new output file name
  • Reminder - make sure XML file is always in same folder as the PDF files that will be imported into FileHold.

XML Export.PNG