On Fri, 2008-01-11 at 15:48 +0200, Jukka Zitting wrote: > Hi, > > On Dec 16, 2007 7:46 PM, Thorsten Scherler <[EMAIL PROTECTED]> wrote: > > Otis as well wrote me the other day asking about droids and I can repeat > > what I told him: any help is welcome. > > Have you looked at Apache Tika?
I can remember that I had a look last year but not really ATM. I could not really find any docs at http://incubator.apache.org/tika/ nor did http://svn.apache.org/repos/asf/incubator/tika/trunk/README.txt told me where I can find a getting started guide. > I think it would make a nice > dependency for droids, and if you like I'd be eager to look at > integrating Tika with whatever you currently have. > Actually I reckon tika can help defining the org.apache.droids.parse.ParseData object of droids. Till now this are just outlinks but since tika can extract many more I reckon we can implement some methods there which will have a dep on tika. I need to finish migrating (still missing filter and handler factories) but then it will be usable. I will use droids at work so I had some business time to spend on it. In my usecase I will extend the DefaultCrawler and only implement 2-3 methods. I need to crawl a calendar and invoke a custom parsing with the calendar objects. I will write a separate mail when I finished the rewrite. I would be nice if you could have a look at it then. Thanks for your interest. salu2 > BR, > > Jukka Zitting > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > -- Thorsten Scherler thorsten.at.apache.org Open Source Java consulting, training and solutions --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
