[ 
https://issues.apache.org/jira/browse/TIKA-663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13150324#comment-13150324
 ] 

Dave Meikle commented on TIKA-663:
----------------------------------

Thanks Nick.  Was going to add it last night but forgot my SVN password (reset 
now).

Yes, would be interested in others views as this begs the same question re 
other similar file types which could have a mix of HTML and scripting tags that 
would also be missed if picked up by the HtmlParser.

                
> JSP files data extraction failed
> --------------------------------
>
>                 Key: TIKA-663
>                 URL: https://issues.apache.org/jira/browse/TIKA-663
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.9
>         Environment: Windows, JAva 6
>            Reporter: samraj
>         Attachments: File_1.jsp, File_2.jsp, File_3.jsp
>
>
> We have worked with tika extraction. In 0.8 jsp file contents extracted 
> well.. But in 0.9 the same files are not extracted well. Pls give the solution

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to