[ https://issues.apache.org/jira/browse/PDFBOX-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13117890#comment-13117890 ]
Rubesh MX commented on PDFBOX-1123: ----------------------------------- Hi Andreas, Thanks very much for the quick reply, the code snippet is given below - This would just show in a message box, all field names with types etc., once this succeeds, we will be building the code to what is expected of the project. This is with the latest version for .Net, I have also tried taking the latest version that is not officially relased as mentioned by you in http://pdfbox.lehmi.de/ string fileIn = @"C:\fspl.pdf"; PDDocument pdDoc = PDDocument.load(fileIn); PDDocumentCatalog pdCat = pdDoc.getDocumentCatalog(); var obj = pdCat.getAcroForm().getFields().toArray(); foreach (var stx in obj) { PDField pdd = (PDField)stx; MessageBox.Show((pdd.getPartialName() + "|" + pdd.getFieldType() + "|" + pdd.getValue())); } pdDoc.save(fileIn); pdDoc.close(); Please let me know if there is something wrong with the way I am reading the field names. Your suggestions/comments on this will be much appreciated. Thanks again. > Not able to read field values from a PDF File if the field contains special > characters. > --------------------------------------------------------------------------------------- > > Key: PDFBOX-1123 > URL: https://issues.apache.org/jira/browse/PDFBOX-1123 > Project: PDFBox > Issue Type: Bug > Reporter: Rubesh MX > Priority: Minor > Labels: acroform > Attachments: fspl.pdf > > Original Estimate: 12h > Remaining Estimate: 12h > > Hi, I am trying to read the field names in a PDF file, it is working with > most of the files, but in some files we are not able to read the field > Id/name, the reason being we have some field names as - > topmostSubform[0].Page1[0].c1_04_0_[0] > topmostSubform[0].Page1[0].c1_09_0_ > topmostSubform[0].Page2[0].Table_Line4a[0].#subform[1].p2-t69[0] > Here all the field names starts with topmostSubform[0]. so when we try to get > the field names like PDField.getpartialname() - the field name is getting > truncated at '.' and we get only - topmostSubform[0] and since all the field > names starts with the same name the total count of fields are coming as 1. > Since there are some special characters like '.'; '_'; '#' this is causing > the issue. Could you please suggest on this? This is very critical. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira