Re: couple of Sandbox components

2010-12-01 Thread Tommaso Teofili
Hi Thilo, 2010/11/29 Thilo Götz > > > Hi Tommaso, > > do you know what algorithm Tika uses for language identification? > Tika uses a collection of existing language profiles, then a language profile is created from the text to analyze; after that the language profile which has the lowest distan

Re: couple of Sandbox components

2010-11-29 Thread Thilo Götz
On 11/26/2010 09:35, Tommaso Teofili wrote: > Hi all, > following Burn's proposal for multimodal analysis component skeleton I also > have a couple of components to propose for inclusion inside the sandbox: > >- Solr CAS Consumer - to consume CAS/types/features inside Solr fields. >This co

Re: couple of Sandbox components

2010-11-27 Thread Tommaso Teofili
2010/11/26 Jörn Kottmann > On 11/26/10 9:35 AM, Tommaso Teofili wrote: > >> Hi all, >> following Burn's proposal for multimodal analysis component skeleton I >> also >> have a couple of components to propose for inclusion inside the sandbox: >> >>- Solr CAS Consumer - to consume CAS/types/fea

Re: couple of Sandbox components

2010-11-26 Thread Jörn Kottmann
On 11/26/10 9:35 AM, Tommaso Teofili wrote: Hi all, following Burn's proposal for multimodal analysis component skeleton I also have a couple of components to propose for inclusion inside the sandbox: - Solr CAS Consumer - to consume CAS/types/features inside Solr fields. This could be p

couple of Sandbox components

2010-11-26 Thread Tommaso Teofili
Hi all, following Burn's proposal for multimodal analysis component skeleton I also have a couple of components to propose for inclusion inside the sandbox: - Solr CAS Consumer - to consume CAS/types/features inside Solr fields. This could be put inside Lucas or in a separate project - a