Rahul Veeramalla created TIKA-2493: -------------------------------------- Summary: Allow Extraction of Javascript from PDFs Key: TIKA-2493 URL: https://issues.apache.org/jira/browse/TIKA-2493 Project: Tika Issue Type: Improvement Reporter: Rahul Veeramalla Priority: Blocker
I have a use case wherein I need to upload PDFs as part of a File Upload feature that I am currently building for my application. Based on Security teams recommendation, I need to scan the PDFs for any embedded Javascript, attachments and links contained in them and block such PDFs. I was able to figure out the solution to extract hyperlinks and attachments from the PDF using TIKA. However, I am unable to find anything to extract javascript from PDFs. **I need help to figure out if a PDF contains Javascript elements/code or not. -- This message was sent by Atlassian JIRA (v6.4.14#64029)