Batch Conversion to PDF/A

In my two previous articles (see the archives ) on PDF for Archiving—PDF/A—I discussed the benefits of this archival format and how to create and validate PDF/A.

One challenge facing government agencies, law firms and their clients is the conversion of large numbers of legend files to PDF/A.

In this article, I’ll cover how to use the Batch Processing facility in Acrobat 8 Professional to transform common file formats found in the legal market to PDF/A-1b.

Read on to learn more…

What document types can be converted in batch to PDF/A-1b?

I’ve tested this method with the following common legal market document types.

If you need to convert a text file to PDF/A, open the file in Word and save as .doc files.

As noted in my previous articles, it is very difficult to convert PDF Normal documents without embedded fonts to PDF/A.

Converting Office Files to PDF/A-1b in Batch

Converting Office application files to PDF involves two steps:

  1. Change conversion preferences
  2. Create a Batch Sequence for the conversion

Note that you will need the right application installed to create PDF/A compatible files. For example, if PowerPoint is not installed, Acrobat cannot convert PPT files to PDF/A.

As a practical matter, you should install all authoring applications first, and then install Acrobat 8 Professional. Successful PDF/a conversion requires the installation of the one-button PDFMakers supplied by Acrobat Professional.

Changing Conversion settings to PDF/A-1b

For PDF/A-1B conversions, you can change settings easily in Acrobat Preferences.

Follow these steps to change your default conversion settings to PDF/A-1B:

  1. Edit—>Preferences
  2. In the Categories list at left, click on Convert to PDF
  3. Conversion Preferences
  4. Select the file type(s) you need to convert from the list. E.g.
    – Microsoft Word
    – Microsoft Excel
    – Microsoft PowerPoint
  5. Click the Edit Settings button
  6. Change the conversion setting to PDF/A-1b:
  7. Choosing PDF/A-1b
  8. Repeat as necessary for all document types.
  9. Close the Preferences Window

Creating the Batch Sequence

Follow these steps to create a Batch Sequence in Acrobat 8 Professional:

  1. First, move the documents you wish to convert into a folder. Nested folders (sub folders) are OK.
  2. In this example, we’ll use a folder named doc-source which contains Word, Excel , PowerPoint, TIFF and PDF image-only files.
    Folder of Files
  3. Create a destination folder on your hard drive for the converted documents. In this example, we’ll use a folder named doc-destination
  4. Choose Advanced—>Document Processing—>Batch Processing
  5. Click the New Sequence button
    Batch Sequence Window
  6. Give the sequence a descriptive name
    Name the sequence
  7. Change the following in the Edit Batch Sequence window:
    A Click the Browse button and locate the doc-source folder
    B Select the file types you wish to convert. You should deselect any file types which cannot be converted to PDF/A directly. Consult the list of eligible file types previously in this article.
    C Select your doc-destination folder
    D Click Output Options and choose PDF/A as the type
    Setting up the convesion
  8. Click OK

Running the Batch Sequence

  1. Choose Advanced—>Document Processing—>Batch Processing
  2. Select the sequence you created from the list and click the Run Sequence button.
  3. Running the sequence window
  4. Click the OK button to start.
  5. Authoring applications will open as necessary. You may see progress bars:
  6. Progress bar

Batch Conversion Issues

By default, Acrobat’s batch processor will stop if it cannot convert a file to PDF/A. An onscreen warning will appear.

You will need to click OK to each warning before Acrobat will continue on and process the next file.

The primary reason existing PDF files cannot be converted to PDF/A is that fonts are not embedded. Unfortunately, there is no way to add the fonts to the PDF after it is converted.

I’m hopeful that future releases of Acrobat will allow for post facto font embedding.