Carlow Home
   
Adobe Acrobat Capture and Index Documents
     
  - Capturing a Document - Creating an Index of Documents
  - Correcting a Captured Document - Searching an Index
  - Importing an Image File - Deleting an Index
     
     
     
     
   
  Capturing a Document
  1. Click File, Import, Image.

2. Select the file name, e.g. Ariel June98.tif from Capture folder, click Open.

3. Choose Document, Capture Pages.

4. Click Preferences, choose primary OCR language, select Normal (PDF Output style), click OK.

5. Click OK to capture current page.
NOTE: Image file must have a minimum 200 dpi (dots per inch) for capture to work.

6. After the page is captured, choose Edit, Select All.
Note: Highlighted text indicates captured text in bitmap form. Some decorative type that has no similar font type will remain as bitmap text.

7. To deselect Captured text, click in an area outside of captured text.
  Back to top  
     
   
Correcting a Captured Document  
  1. Open the captured document that needs editing.

2. Click Edit, Show Capture Suspects.

3. Click Edit, Find First Suspect.

4. Click Zoom tool for a better look at suspect word, then drag it across to highlight the text to be enlarged.
Note: If software does not find the word you entered as the first suspect, click Next button until the correct word is found. If word is not found after searching entire document, go to Step 6.

5. Click Accept to accept the correct word and discard the bitmap image.

6. Suspect word is highlighted and a bitmap image appears because the software incorrectly guessed the letters. To correct the word, click (T) touchup tool, then type in the word as it should appear.

7. Click Accept in Capture Suspect window to discard the bitmap and highlight the next suspect.
Note: Sometimes a bitmap character or symbol, that is correct as it appears, will be incorrectly diagnosed as being suspect. Click Next, Next, to keep the bitmap and discard the incorrect diagnosis.

8. To touchup character formatting in a PDF document, choose View, Actual Size.

9. Click the Touch-up tool icon T, to select text for editing.

10. Choose Edit, Text Attributes, Font tab to view/change font, character spacing, line justification, etc.

11. Caption Alignment is done by placing the mouse cursor on the left side of the caption; Hold the left mouse button while you drag the line marker until the text lines up with the edge of the picture, then release the mouse button.

12. Choose File, Save As, and choose a folder in which to save the PDF document.

13. Click Save button, then click Close.

14. Click File, Quit to exit Acrobat Exchange.

  Back to top  
     
   
Importing an Image File  
  You can capture an image file into Acrobat

1. Start Acrobat Exchange 3.0.

2. Choose File, Import, Image.

3. Select the file (.tiff, gif, etc), in capture folder.

4. Click File, Save As

5. Enter a name for the image file, e.g. helpdesk-people.pdf and choose the location of your PDF documents folder (capture destination folder).

6. Click Save button.

7. Click File, Close to close the file.

  Back to top  
     
   
Creating an Index of Documents  
  You can build full-text indexes for collections of PDF documents that users of Acrobat Exchange and Acrobat Reader can search. A full-text index makes a searchable database of all the text in the documents.

1. Start the Adobe Acrobat Catalog 3.0 application.

2. Choose Index, New, and enter a name for the Index Title, e.g. Ariel-Up and enter an Index Description, e.g. Ariel Update Newsletters.

3. Select the folder, Example N:\Infosys\ArielPDF that contains the PDF documents for the index, click Add under Include Directories. Exclude Directories is used to exclude sub-folders for the selected directory. The folder must be on the disk or network server volume where the documents to be indexed are stored.
Note: The folder .PDX file (Index file) contains the 9 sub-folders (Assists, Morgue, Parts, Pdd, Style, Temp, Topicidx, Trans, and Work) for the index data files created, by default.

4. Select PDF docs folders in Capture folder, click OK.

5. Identify ‘stopwords’ (words to exclude from the index search). Example: click Options, enter A in word text box, click Add. Repeat to add more ‘stopwords’ such as a, An, an, The, the, etc., to make searching more accurate, then click OK.

6. In Index Description field, type Stopwords include A, a, An, an, The, the etc., to let users know that these words are not included in the search.

7. Click Build, type in example, helpdesk-instructions.pdx as filename for index, click OK.
Catalog now saves the index definition and builds the index, showing progress in the messages window. Completed index and associated files are placed in the .PDF docs folder and are ready to be searched.

8. Click Quit to exit Catalog screen.

  Back to top  
     
   
Searching an Index  
  1. Start Adobe Acrobat Exchange 3.0 application.

2. Click Search button (binocular icon) in toolbar.

3. In Search Dialog box, click Indexes.

4. In Index Selection dialog box, click Add.

5. To add Online Guides inbox, select the file exchhelp.pdx in (Windows 95) Acrobat 3/Acrobat Exchange/Help folder; (Windows 3.x) Acrobat3/Exch16/Help; (Macintosh) Acrobat3.0/Help, then click Open.

6. Click Add, select the file cardindex.pdx from PDFdocs folder

7. Click Open to add the helpdesk-instructions index to the list.

8. Click OK to close the index selection box.

9. Type in search word, for example, security into the Find Results Containing Text entry area, then click Search.

10. Search results windows lists all documents containing security, ranked in terms of relevance.

11. View a document that contains the search word by selecting the file and then click View.

12. Close any open documents.

13. Click File, Quit to exit Acrobat Exchange.

  Back to top  
     
   
Deleting an Index  
 

To delete an Adobe index :

1. Delete the index definition file, example, Ariel-Up.pdx

2. Delete the log file for the index, example, Ariel-Up.log file.

3. Delete the index folder and all the nested index sub-folders.

  Back to top