Redacting SSN By Text Search With AutoRedact® Plug-in For Adobe® Acrobat®
Introduction
The AutoRedact® is an advanced plug-in for Adobe® Acrobat® software for performing PDF redaction. It is designed for removing sensitive information from PDF documents. Redacting text patterns by search is probably the most powerful method offered by the software. The AutoRedact® can search PDF documents for occurrences of a specific text or a text pattern and automatically mark them for redacting.
The software offers a set of common text patterns:
  • Social Security Number
  • Employer Identification Number
  • Phone and fax number
  • Email address
  • Date
  • Postal Address
  • Company names (limited subset of patterns recognized)
  • Any text within square brackets
  • Any user-defined custom text pattern (via regular expressions)
What's in this Tutorial?
This tutorial shows how to automatically mark up SSN text pattern in the PDF document and redact it using AutoRedact® plug-in for Adobe® Acrobat®. The tutorial also provides instructions on how to do a custom redacting of all but last 4 digits of SSN numbers.
What is Redacting?
Redaction, by definition, means removing certain types of information from documents. In the context of United States government agency documents, redaction refers to the process of removing classified information from a document prior to its publication. For attorneys, redacting is very important procedure of protecting confidential information. Here are few examples of the redacting applied to a PDF document:
Prerequisites
You need a copy of Adobe® Acrobat® along with AutoRedact® plug-in installed on your computer in order to use this tutorial. You can download trial versions of both Adobe® Acrobat® and AutoRedact®.
Step 1 - Open a PDF file
Start Adobe® Acrobat® application and open a PDF file using “File > Open…” menu to open a PDF document that needs to be redacted.
Step 2 - Open the "Mark Up Text Patterns" Dialog
Select "Plug-ins > Redacting > Mark Up Text Pattern…" from the main Acrobat® menu to open the "Mark Up Text Patterns" dialog.
Step 3 - Select Text Patterns
Select one or more text pattern(s) to search for in the "Mark Up Text Patterns" dialog. In the tutorial "Social Security and EIN numbers" box is checked. Note, that if the "Perform direct redacting of text without creating any markup" box is checked, then the redaction will be executed without any intermediate text highlighting step.
Optionally, click “Edit Preferences…” to change style and appearance of the redacting annotations. Go to step 7 to skip changing preferences.
Step 4 - Select Processing Options (Optionally)
Select desired processing options in the "General" tab of  the "Redacting Preferences" dialog.
Step 5 - Specify Redacting Markup Settings (Optionally)
Click on "Markup" tab. Specify redacting markup settings in the "Markup" tab. It controls how document content is marked for redacting.
Markup settings
Step 6 - Select Visual Appearance of the Redacted Areas (Optionally)
Click on "Redacting" tab. The "Redacting" tab controls a visual appearance and a content of the redacted documents. Specify the redacting preferences. Select in the "Redacted Areas Appearance" section the color that will fill the redacted areas (if required). Click "OK" once done.
Redacting tab
Here are the various appearance options that can be achieved:
styling examples
Step 7 - Start the Text Search
Optionally, select a processing page range and subset. Click "OK" to start the markup process.
Select processing page range
Step 8 - Examine the Stats
The dialog with the markup results will be displayed at the end of the processing. Click "OK" to finish.
Examine the statistics
Step 9- Optional: Redacting a Custom Text Pattern (Last 4 digits of SSN)
It is a common task to redact all but last 4 digits of the SSN. Check "Find a custom text pattern" option and enter the following text pattern into "Find what:" box:
\b\d{3}\p{Pd}\d{2}\p{Pd}(?=\d{4})
In case if you want to redact only last 4 digits use another expression:
(?<=\d{3}\p{Pd}\d{2}\p{Pd})\d{4}\b
Use a custom regular expression to redact a part of the SSN
Step 10 - Examine the Markup Results
All text that matches a set of selected text pattern(s) is now marked up for redaction. In the tutorial, all SSNs/EINs are now covered by redacting annotations.
Markup results
Step 11 - Apply Redaction
Select "Plug-ins > Redacting > Redact Marked Content…" from the main Acrobat® menu.
Apply redaction
Click "OK" to start the redaction process.
Start redacting
Step 12 - Examine the Stats
The dialog with the redaction process results will be displayed. Examine it. Click "OK" to finish.
Step 13 - Examine the Results
By default, the output document with redacted content is shown on screen. The appearance of the redacted areas are controlled by the application preferences (see Step 6).
Examine redacting results
Step 14 - Save the Redacted PDF File
Save the redacted PDF file by using "File > Save As..." menu.