Automating PDF Processing with DocsParse, Make.com, Airtable, and Gmail

Roman Furman

Table of Contents

  1. Introduction
  2. Prerequisites
  3. Overview of the Workflow
  4. Step-by-Step Guide
  5. Conclusion

Introduction

Efficient document processing matters for businesses of all sizes. This tutorial walks you through setting up an automated document processing workflow using DocsParse, Make.com (formerly Integromat), Airtable, and Gmail. By the end, you'll have a system that automatically parses PDF attachments from your email and stores the structured results in Airtable.

Prerequisites

Before we begin, ensure you have accounts set up for the following services:

- DocsParse: an AI document processing and parsing service.
- Make: a workflow automation service. Any other service that supports webhooks will also work.
- Airtable: a cloud-based spreadsheet-database hybrid.
- Gmail: an email service. Any other service that supports email forwarding and filters will also work.

You should also have basic familiarity with these platforms. If not, don't worry - we'll guide you through each step.

Overview of the Workflow

Our automated workflow will follow these steps:

  1. Emails with document attachments are forwarded to the DocsParse Workflow Inbox
  2. DocsParse processes the documents and extracts the relevant data
  3. Make.com receives the processed data from DocsParse through a webhook
  4. The extracted data is automatically added to an Airtable table

Step-by-Step Guide

Step 1: Setting up DocsParse for Document Processing

  1. Log into your DocsParse account and create your first workflow.
  2. Upload your first document. It may take a few seconds for DocsParse to process your document.
  3. Once the document is parsed, modify the scheme by clicking "Add custom property".
  4. First, add a field name (which should be a unique string). We will use "email". In the next field, you can specify instructions for our AI Processor. Click "Add" to confirm.
  5. After processing, let's add one more property with a bit more complexity. You can specify almost anything in the Instruction field.
  6. Click "Finish scheme" to complete the process. Don't worry, you'll be able to modify it later if needed.
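
The scheme built in the steps above is essentially a list of uniquely named properties, each with an instruction for the AI Processor. The sketch below models that idea in Python only to make the structure concrete. DocsParse configures this through its UI, so the `CustomProperty` class, the `validate_scheme` helper, and the `total_amount` property are all hypothetical illustrations, not part of any DocsParse API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CustomProperty:
    """Hypothetical stand-in for one scheme property: a name plus an AI instruction."""
    name: str
    instruction: str


def validate_scheme(properties):
    """Return the properties as a list, raising ValueError on empty or repeated names."""
    seen = set()
    for prop in properties:
        if not prop.name.strip():
            raise ValueError("property name must not be empty")
        if prop.name in seen:
            raise ValueError(f"duplicate property name: {prop.name}")
        seen.add(prop.name)
    return list(properties)


# "email" comes from the tutorial; "total_amount" is an invented second property.
scheme = validate_scheme([
    CustomProperty("email", "Extract the sender's email address."),
    CustomProperty("total_amount", "Extract the invoice total as a number, without the currency symbol."),
])
print([prop.name for prop in scheme])
```

Keeping names unique matters later, because each property ends up as its own column in Airtable.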

Now you have a fully functional workflow. In the next step, we'll set up Airtable to store the processed data.

Step 2: Creating the Airtable Base for Processed Data

  1. Log into your Airtable account and create a new table. If you're not sure how to do this, check out this step-by-step guide.
  2. Go to the Data tab in DocsParse and click "Export" to download the processed data.
  3. Select "CSV" as the file format and click "Export".
  4. Go to the Airtable base you created and click "Import" to upload the CSV file, then confirm the import.
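
Before importing, it can be useful to look at the exported CSV and confirm that its header row contains the property names you expect, since the import works from that file. The following is a minimal sketch using Python's standard `csv` module. The sample contents and the `read_export` helper are assumptions for illustration; a real DocsParse export will have its own columns.

```python
import csv
import io

# Hypothetical export contents; replace with the text of your downloaded CSV file.
SAMPLE_EXPORT = "email,total_amount\nbilling@example.com,120.50\n"


def read_export(csv_text):
    """Return (column_names, rows) parsed from CSV text with a header row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = list(reader)
    return list(reader.fieldnames or []), rows


columns, rows = read_export(SAMPLE_EXPORT)
print(columns)           # column names taken from the header row
print(rows[0]["email"])  # first data row, looked up by column name
```

To check a real file, read it with `open(path, newline="", encoding="utf-8").read()` and pass the text to `read_export`.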

At this point, you've successfully set up the structure for your Airtable base and have imported the processed data from DocsParse. In the next step, we'll configure Make.com to connect DocsParse and Airtable.

Step 3: Configuring Make.com to Connect DocsParse and Airtable

  1. Log into your Make account and create a new scenario.
  2. Click "Add a trigger", select "Webhook", and choose "Custom webhook".
  3. Enter a webhook name and click "Save".
  4. Make sure data structure determination is running (you should see a "Stop" button with a loader), then copy the webhook URL.
  5. Go to the Integrations tab in DocsParse and add a new webhook.
  6. Enter a webhook name, paste the webhook URL, and click "Add".
  7. The new webhook should appear in the list.
  8. Go to the Documents tab, select the document you want to process, and click "Re-parse".
  9. Switch back to Make. It should show that the data structure was successfully determined.
  10. Now let's add Airtable as a destination. Click the plus button and select "Airtable". Choose "Create a Record".
  11. Find your base and table in the relevant dropdowns.
  12. Fill in the Airtable fields with data from the DocsParse response. Each column needs the index of the matching property. Example: "1. result.extractedProperties[{NUMBER_OF_FIELD}]: data.value", where NUMBER_OF_FIELD is the index of the field in the response.
  13. (Optional) You can set up filters or other logic to process the data before it reaches Airtable. For example, you can add a filter that only passes documents with an "email" property.
  14. Let's test the connection. Click "Run once" in Make, then switch back to DocsParse and click "Re-parse" on the document, as you did earlier.
  15. Make should now show the processed data, and the same data should appear in Airtable.
  16. If everything works well, activate the scenario and you're done.
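
The field mapping and the optional filter from the steps above can be sketched in Python to show what the scenario does with each webhook payload. The payload shape here is an assumption inferred from the mapping expression `result.extractedProperties[N]: data.value`; the real DocsParse payload may contain more keys or differ in structure. The column names and both helper functions are hypothetical. Python lists are zero-based, so the index numbers may not match the ones shown in Make's mapping panel.

```python
# Assumed payload shape, inferred from the mapping expression in the tutorial.
SAMPLE_PAYLOAD = {
    "result": {
        "extractedProperties": [
            {"data": {"value": "billing@example.com"}},
            {"data": {"value": "120.50"}},
        ]
    }
}

# Hypothetical Airtable column names mapped to positions in extractedProperties.
COLUMN_INDEX = {"email": 0, "total_amount": 1}


def to_airtable_fields(payload, column_index):
    """Pick data.value for each configured column, skipping missing or empty values."""
    props = payload.get("result", {}).get("extractedProperties", [])
    fields = {}
    for column, index in column_index.items():
        if index < len(props):
            value = props[index].get("data", {}).get("value")
            if value not in (None, ""):
                fields[column] = value
    return fields


def passes_filter(fields, required="email"):
    """Mimic the optional filter: only continue when the required property is present."""
    return bool(fields.get(required))


fields = to_airtable_fields(SAMPLE_PAYLOAD, COLUMN_INDEX)
if passes_filter(fields):
    print(fields)  # these fields would become one new Airtable record
```

Mapping by position mirrors what the tutorial does in Make. It also means that reordering properties in the scheme would require updating the mapping.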

Step 4: Setting up Gmail Forwarding to DocsParse Workflow Inbox

  1. First, let's switch to DocsParse and copy our Workflow Inbox email address.
  2. Then, log into your Gmail account.
  3. Go to Settings > Forwarding and POP/IMAP.
  4. Click "Add a forwarding address".
  5. Enter your DocsParse Workflow Inbox email address.
  6. At this point, you will need to verify your DocsParse Workflow Inbox email address.
  7. Go to DocsParse. You will see that a new email has been received. Click on it.
  8. On the details page, find the confirmation link. Copy it and paste it into your browser.
  9. Click "Confirm".
  10. Then go back to Gmail and select "Forward a copy of incoming mail to...". Make sure to click "Save Changes" at the bottom.
  11. You can now create a new filter with rules that match the emails you want to forward.
  12. Make sure to select the "Forward it to..." option and choose the DocsParse Workflow Inbox email address. Click "Create filter" or "Update filter".
  13. In the list of filters, you will see your new filter.
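
One way to define the filter rules is a search query that matches only emails with PDF attachments, optionally from a single sender. The sketch below composes such a query from Gmail's `from:`, `has:attachment`, and `filename:` search operators. The `build_filter_query` helper and the example sender address are hypothetical; the resulting string is meant to be pasted into the filter's search criteria by hand.

```python
def build_filter_query(sender=None, extension="pdf"):
    """Compose a Gmail search query for messages with attachments of one file type."""
    terms = []
    if sender:
        terms.append(f"from:{sender}")
    terms.append("has:attachment")
    terms.append(f"filename:{extension}")
    return " ".join(terms)


print(build_filter_query("invoices@example.com"))
# from:invoices@example.com has:attachment filename:pdf
```

Narrowing the filter this way keeps unrelated mail out of the Workflow Inbox, so DocsParse only processes messages that actually carry documents.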

We've connected DocsParse with Airtable and set up Gmail forwarding. Now we are ready to test the workflow. Let's do it!

Step 5: Testing the Workflow

  1. Send a test email to your Gmail address. Make sure the test email matches your Gmail filter.
  2. After you send the test email, forwarding might take anywhere from a few seconds to a few minutes. Once it's done, you will see a new document in DocsParse.
  3. (Optional) DocsParse also processes the email body. In some cases, you might want to disable this feature.
  4. Now let's switch to Airtable and check that the data was successfully added.
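
If you plan to repeat this test often, the test email can be built with a script instead of by hand. The following is a minimal sketch using Python's standard `email` and `smtplib` modules. The addresses, subject line, and placeholder bytes are assumptions for illustration, and the placeholder is not a valid PDF, so substitute the contents of a real file. `send_via_gmail` is defined but not called, because it needs real credentials (Gmail typically requires an app password for SMTP logins).

```python
import smtplib
from email.message import EmailMessage


def build_test_email(sender, recipient, pdf_bytes, filename="test-invoice.pdf"):
    """Create an email with a plain-text body and one PDF attachment."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = "DocsParse workflow test"
    msg.set_content("Test message with a PDF attachment.")
    msg.add_attachment(pdf_bytes, maintype="application", subtype="pdf", filename=filename)
    return msg


def send_via_gmail(msg, username, app_password):
    """Send through Gmail's SMTP server over SSL; requires valid credentials."""
    with smtplib.SMTP_SSL("smtp.gmail.com", 465) as smtp:
        smtp.login(username, app_password)
        smtp.send_message(msg)


# Placeholder bytes only; read a real PDF with open(path, "rb").read() instead.
message = build_test_email("me@example.com", "you@example.com", b"%PDF-1.4\n% placeholder\n")
attachments = list(message.iter_attachments())
print(attachments[0].get_filename(), attachments[0].get_content_type())
```

Send the message to the Gmail address that has the forwarding filter, then watch for the new document in DocsParse and the new record in Airtable.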

Conclusion

Congratulations! You've successfully set up an automated document processing workflow. PDFs that arrive by email are now parsed and recorded in Airtable without manual data entry, which saves time and reduces errors. As you become more familiar with these tools, you can further customize and expand your workflow to meet your specific needs.