Re: Solr Extracting request handler

2014-06-23 Thread Alessandro Benedetti
...@sourcesense.com wrote: Hi During my first indexing I noticed that manifold uses Solr extracting request handler to extract the content of an xml file For performance reasons it would be better if Manifold handled the extraction

Re: Solr Extracting request handler

2014-06-19 Thread Karl Wright
...@sourcesense.com : Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send text and binaries to solr? I mean the end result is the same solr indexes

Re: Solr Extracting request handler

2014-06-18 Thread Alessandro Benedetti
15:59 GMT+01:00 Matteo Grolla m.gro...@sourcesense.com : Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send text and binaries to solr? I mean the end result is the same solr indexes text

Re: Solr Extracting request handler

2014-06-18 Thread Alessandro Benedetti
2014-06-16 15:59 GMT+01:00 Matteo Grolla m.gro...@sourcesense.com : Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send text and binaries to solr? I mean the end

Re: Solr Extracting request handler

2014-06-18 Thread Matteo Grolla
m.gro...@sourcesense.com : Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send text and binaries to solr? I mean the end result is the same solr indexes text and stores text So if manifold supports text

Re: Solr Extracting request handler

2014-06-18 Thread Karl Wright
-06-16 15:59 GMT+01:00 Matteo Grolla m.gro...@sourcesense.com : Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send text and binaries to solr? I mean the end

Re: Solr Extracting request handler

2014-06-18 Thread Alessandro Benedetti
15:59 GMT+01:00 Matteo Grolla m.gro...@sourcesense.com : Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send text and binaries to solr? I mean

Re: Solr Extracting request handler

2014-06-18 Thread Karl Wright
Grolla m.gro...@sourcesense.com : Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send text and binaries to solr? I mean the end

Re: Solr Extracting request handler

2014-06-17 Thread Karl Wright
://issues.apache.org/jira/browse/CONNECTORS-959) So can fit there. Cheers 2014-06-16 15:59 GMT+01:00 Matteo Grolla m.gro...@sourcesense.com: Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send text and binaries

Re: Solr Extracting request handler

2014-06-17 Thread Shinichiro Abe
://issues.apache.org/jira/browse/CONNECTORS-959) So can fit there. Cheers 2014-06-16 15:59 GMT+01:00 Matteo Grolla m.gro...@sourcesense.com: Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send

Re: Solr Extracting request handler

2014-06-17 Thread Karl Wright
a pipe-line processor architecture has been thought ( https://issues.apache.org/jira/browse/CONNECTORS-959) So can fit there. Cheers 2014-06-16 15:59 GMT+01:00 Matteo Grolla m.gro...@sourcesense.com : Since Solr extracting request handler takes the binary and extracts

Re: Solr Extracting request handler

2014-06-17 Thread Shinichiro Abe
there. Cheers 2014-06-16 15:59 GMT+01:00 Matteo Grolla m.gro...@sourcesense.com : Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send text and binaries to solr? I mean the end result is the same solr indexes text

Re: Solr Extracting request handler

2014-06-17 Thread Karl Wright
Matteo Grolla m.gro...@sourcesense.com : Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send text and binaries to solr? I mean the end result is the same solr indexes text and stores text So

Re: Solr Extracting request handler

2014-06-16 Thread Alessandro Benedetti
PM, Matteo Grolla m.gro...@sourcesense.com wrote: Hi During my first indexing I noticed that manifold uses Solr extracting request handler to extract the content of an xml file For performance reasons it would be better if Manifold handled the extraction letting Solr do the search engine

Re: Solr Extracting request handler

2014-06-16 Thread Alessandro Benedetti
there. Cheers 2014-06-16 15:59 GMT+01:00 Matteo Grolla m.gro...@sourcesense.com: Since Solr extracting request handler takes the binary and extracts text what is the point of not using Manifold extractor and send text and binaries to solr? I mean the end result is the same solr indexes text