[
https://issues.apache.org/jira/browse/LABS-118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12637572#action_12637572
]
Javier Puerto commented on LABS-118:
------------------------------------
>One question:
>- Is there a reason why you used the EchoHandler and not the
>XHTMLContentHandler?
No, you could implement with the XHTMLContentHandler. I will try it later.
>It seems that you have unreleated changes in the patch:
>- core/java/regex-urlfilter.txt (the complete file)
>- in dynamics/java/org/apache/droids/droids-core-context.xml
>...
>- <property name="locations"
>value="classpath:org/apache/droids/droids-core.properties"/>
>+ <property name="locations"
>value="classpath:org/apache/droids/droids-test.properties"/>
Ops, only testing changes. The regex-urlfilter.txt to fit the web and in the
spring context the test properties file.
There's a few TODOs in the patch i want to post a more complete version soon.
> Create tied integration with Apache Tika (for parser and handler)
> -----------------------------------------------------------------
>
> Key: LABS-118
> URL: https://issues.apache.org/jira/browse/LABS-118
> Project: Labs
> Issue Type: New Feature
> Components: Droids
> Reporter: Thorsten Scherler
> Attachments: tikaparser.diff, tikaparser.diff
>
>
> http://incubator.apache.org/tika/
> Apache Tika is a toolkit for detecting and extracting metadata and structured
> text content from various documents using existing parser libraries.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]