Resources

Knowledge Base

PDF data extraction. Scrape PDF text

pdftocsv

 

  • To be able to run this example you need to install UiPath.PDF.Activities. See more details on how to install packages here.

Task

Extract data from PDF. This sample demonstrates UiPath's PDF data extraction capabilities. It automatically scrapes data from a PDF file and saves it as a text document.

Steps to automate

  1. Extract the PDF text.
  2. Format the text. 
  3. Write the document.

Solution

  1. Extract the PDF text document using Read PDF Text activity.
  2. Split the output into an array of strings containing each word from the table.
  3. Iterate through the data using a While activity in order to place every word under the proper column header.
  4. Write the text document using Write Text File activity.

 ex_pdf_data_extraction