Auto-suggest helps you quickly narrow down your search results by suggesting possible matches as you type.
Showing results for Show only | Search instead for Did you mean:/t5/acrobat-discussions/how-do-you-collect-text-data-from-filled-forms-in-local-pdf-files/td-p/11482876 Oct 04, 2020 Oct 04, 2020
Copy link to clipboard
I have about 80 local PDF files having input forms that have been filled by students. I would like to extract text data from them so that I can easily score their answers. How do you do that by the latest Acrobat Pro? I need do that on local files.
Community guidelinesBe kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
You didn't mention your version of Acrobat but it can be done using the Merge Data Files into Spreadsheet command, which is under Tools - Prepare Form (and then under More Form Options, in some versions).
9 Replies 9 Adobe Employee ,/t5/acrobat-discussions/how-do-you-collect-text-data-from-filled-forms-in-local-pdf-files/m-p/11483189#M279965 Oct 05, 2020 Oct 05, 2020
Copy link to clipboard
We are sorry for the trouble. As described, you want to extract data from the filled PDF form.
Please try the following steps and see if that helps
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
/t5/acrobat-discussions/how-do-you-collect-text-data-from-filled-forms-in-local-pdf-files/m-p/11483241#M279968 Oct 05, 2020 Oct 05, 2020
Copy link to clipboard
The PDF files were collected via a web form as a file attachment, and so the individual users have not submitted the form. In this case, how do I create and initializethe response file you mentioned? Thank you very much for your help.
Community guidelinesBe kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
Community Expert ,/t5/acrobat-discussions/how-do-you-collect-text-data-from-filled-forms-in-local-pdf-files/m-p/11483289#M279972 Oct 05, 2020 Oct 05, 2020
Copy link to clipboard
You didn't mention your version of Acrobat but it can be done using the Merge Data Files into Spreadsheet command, which is under Tools - Prepare Form (and then under More Form Options, in some versions).
Community guidelinesBe kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
/t5/acrobat-discussions/how-do-you-collect-text-data-from-filled-forms-in-local-pdf-files/m-p/11483682#M280012 Oct 05, 2020 Oct 05, 2020
Copy link to clipboard
Thank you very much. It is what I was looking for and it worked, but all the Japanese characters in the form fields are broken after exporting to a CSV file.
Community guidelinesBe kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
Community Expert ,/t5/acrobat-discussions/how-do-you-collect-text-data-from-filled-forms-in-local-pdf-files/m-p/11483703#M280014 Oct 05, 2020 Oct 05, 2020
Copy link to clipboard
The encoding of the file created is UTF8, which might not cover Japanese characters. In order to do that you would need to use some other tool, I'm afraid. Maybe try exporting files as TXT or FDF files, and then merge them using a different utility. Another option is to use a script to do it, instead of the built-in Merge Data Files command.
Community guidelinesBe kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
/t5/acrobat-discussions/how-do-you-collect-text-data-from-filled-forms-in-local-pdf-files/m-p/11484043#M280037 Oct 05, 2020 Oct 05, 2020
Copy link to clipboard
Thank you agai. The text encoding looks to be UTF-8 because I could etract fields text by using PyPDF2, which is a Python module to handle PDF forms. For the moment, the use of PyPDF2 is good enough for my purpose, but your suggestion to use the native Acrobat functionality was much easier except for the Japanese character problem.
If I find a fix for my problem, I will post it in this thread for someone else.
Community guidelinesBe kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
Community Expert ,/t5/acrobat-discussions/how-do-you-collect-text-data-from-filled-forms-in-local-pdf-files/m-p/11484063#M280041 Oct 05, 2020 Oct 05, 2020
Copy link to clipboard
Can you share a sample file with fields that has Japanese text in them?
Community guidelinesBe kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
/t5/acrobat-discussions/how-do-you-collect-text-data-from-filled-forms-in-local-pdf-files/m-p/11484139#M280049 Oct 05, 2020 Oct 05, 2020
Copy link to clipboard
Here is a sample file.
"Answer1" and "Answer2" should be "日本語 Japanese 日本語" but it is convereted to " . Japanese . ".
Community guidelinesBe kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
Community Expert ,/t5/acrobat-discussions/how-do-you-collect-text-data-from-filled-forms-in-local-pdf-files/m-p/11484261#M280057 Oct 05, 2020 Oct 05, 2020
Copy link to clipboard
When exporting it in UTF-8 explicitly it does seem to work correctly. I guess the default encoding is just plain ANSI, then. You can use this code I wrote to export it properly (you can run it from the JS Console, or from an Action, or something like that):
var names = []; var values = []; for (var i=0; i var doName = this.documentFileName.replace(/\.pdf$/i, "_data.txt"); this.createDataObject(doName, ""); var s = names.join("\t") + "\r\n" + values.join("\t"); this.setDataObjectContents(doName, util.streamFromString(s, "utf-8")); this.exportDataObject(doName); this.removeDataObject(doName);