[ 
https://issues.apache.org/jira/browse/LABS-118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12637257#action_12637257
 ] 

Thorsten Scherler commented on LABS-118:
----------------------------------------

I had a look at your patch. 
Thanks for your contribution.

I like the idea that LinkExtractor is a handler very much.

One question: 
- Is there a reason why you used the EchoHandler and not the 
XHTMLContentHandler?

It seems that you have unreleated changes in the patch:
- core/java/regex-urlfilter.txt  (the complete file)
- in dynamics/java/org/apache/droids/droids-core-context.xml
...
-    <property name="locations" 
value="classpath:org/apache/droids/droids-core.properties"/>
+    <property name="locations" 
value="classpath:org/apache/droids/droids-test.properties"/>
...
the block around org.apache.droids.handle.Save

> Create tied integration with Apache Tika (for parser and handler)
> -----------------------------------------------------------------
>
>                 Key: LABS-118
>                 URL: https://issues.apache.org/jira/browse/LABS-118
>             Project: Labs
>          Issue Type: New Feature
>          Components: Droids
>            Reporter: Thorsten Scherler
>         Attachments: tikaparser.diff, tikaparser.diff
>
>
> http://incubator.apache.org/tika/
> Apache Tika is a toolkit for detecting and extracting metadata and structured 
> text content from various documents using existing parser libraries.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to