Re: Adding pdf/word file using JSON/XML

2013-06-16 Thread Jack Krupansky
e to master, and how many reasonable levels of mastery there are. Spatial... the final frontier. -- Jack Krupansky -Original Message- From: Walter Underwood Sent: Sunday, June 16, 2013 7:27 PM To: solr-user@lucene.apache.org Subject: Re: Adding pdf/word file using JSON/XML 1. Total mastery

Re: Adding pdf/word file using JSON/XML

2013-06-16 Thread Lance Norskog
No, they just learned a few features and then stopped because it was "good enough", and they had a thousand other things to code. As to REST- yes, it is worth having a coherent API. Solr is behind the curve here. Look at the HATEOS paradigm. It's ornate (and a really goofy name) but it provide

Re: Adding pdf/word file using JSON/XML

2013-06-16 Thread Otis Gospodnetic
. > > One final way of putting it. If a manager or project leader wanted to staff > a dev position to be "in-house Solr expert", can they just hire any old > average Java programmer with no Solr experience and expect that he will > rapidly "master" Solr? > >

Re: Adding pdf/word file using JSON/XML

2013-06-16 Thread Alexandre Rafalovitch
On Sun, Jun 16, 2013 at 7:27 PM, Walter Underwood wrote: > 2. Someone who expects partial update in a search engine, or transactions, > has a deep misunderstandings of the tradeoffs you make for what search can > do. That isn't mastery of arcane details, that is search 101. Yes, they might (hav

Re: Adding pdf/word file using JSON/XML

2013-06-16 Thread Walter Underwood
>> will happen if they make slight and subtle changes or what exactly the fix >> will be for certain runtime errors? For the "average application developer" >> I'm talking about, not the elite expert Solr consultants. >> >> One final way of putting it. If a m

Re: Adding pdf/word file using JSON/XML

2013-06-16 Thread Yago Riveiro
ik Seeley > Sent: Sunday, June 16, 2013 6:41 PM > To: solr-user@lucene.apache.org (mailto:solr-user@lucene.apache.org) > Subject: Re: Adding pdf/word file using JSON/XML > > On Sun, Jun 16, 2013 at 6:05 PM, Jack Krupansky (mailto:j...@basetechnology.com)> > wrote: > >

Re: Adding pdf/word file using JSON/XML

2013-06-16 Thread Jack Krupansky
made it more clear what the application developer's intentions were - and maybe have completely avoided this entire thread. -- Jack Krupansky -Original Message- From: Yonik Seeley Sent: Sunday, June 16, 2013 6:41 PM To: solr-user@lucene.apache.org Subject: Re: Adding pdf/word fil

Re: Adding pdf/word file using JSON/XML

2013-06-16 Thread Yonik Seeley
On Sun, Jun 16, 2013 at 6:05 PM, Jack Krupansky wrote: > Except, that Solr's divergence from a true, pure REST API is certainly one > of the elements of its "badness". Most complex systems seem to feel the need to diverge from pure REST for the sake of being practical. >From that perspective "pur

Re: Adding pdf/word file using JSON/XML

2013-06-16 Thread Jack Krupansky
true, pure REST API is certainly one of the elements of its "badness". The fact that SolrCell does not support partial update as a true REST CRUD API should, is a good example of relative "badness" in Solr.] -- Jack Krupansky -Original Message----- From: Jan Høydahl

Re: Adding pdf/word file using JSON/XML

2013-06-16 Thread Jan Høydahl
r or project leader wanted to staff a > dev position to be "in-house Solr expert", can they just hire any old average > Java programmer with no Solr experience and expect that he will rapidly > "master" Solr? > > I mean, why would so many recruiters be lookin

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Jack Krupansky
ence and expect that he will rapidly "master" Solr? I mean, why would so many recruiters be looking for a "Solr expert" or engaging the services of Solr sonsultancies if mastery of Solr by "average application developers" was a reality?! [I want to hear Otis

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Grant Ingersoll
On Jun 15, 2013, at 12:54 PM, Alexandre Rafalovitch wrote: > On Sat, Jun 15, 2013 at 10:35 AM, Grant Ingersoll wrote: >> That being said, it truly amazes me that people were ever able to implement >> Solr, given some of the FUD in this thread. I guess those tens of thousands >> of deployment

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Alexandre Rafalovitch
On Sat, Jun 15, 2013 at 10:35 AM, Grant Ingersoll wrote: > That being said, it truly amazes me that people were ever able to implement > Solr, given some of the FUD in this thread. I guess those tens of thousands > of deployments out there were all done by above average devs... I would not cla

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Grant Ingersoll
On Jun 13, 2013, at 11:24 AM, Walter Underwood wrote: > That was my thought exactly. Contribute a REST request handler. --wunder > +1. The bits are already in place for a lot of it now that RESTlet is in. That being said, it truly amazes me that people were ever able to implement Solr, gi

Re: Adding pdf/word file using JSON/XML

2013-06-13 Thread Walter Underwood
That was my thought exactly. Contribute a REST request handler. --wunder On Jun 13, 2013, at 6:04 AM, Alexandre Rafalovitch wrote: > And sometimes useful projects come out from the annoying, confusing > corner situations like yours. > > See if you can get permission to open-source your implement

Re: Adding pdf/word file using JSON/XML

2013-06-13 Thread Alexandre Rafalovitch
And sometimes useful projects come out from the annoying, confusing corner situations like yours. See if you can get permission to open-source your implementation and you may find more people interested in the same thing. It could also be a good visibility for your consultancy. Worst case, there a

Re: Adding pdf/word file using JSON/XML

2013-06-13 Thread Jack Krupansky
, specific questions. -- Jack Krupansky -Original Message- From: Roland Everaert Sent: Thursday, June 13, 2013 3:32 AM To: solr-user@lucene.apache.org Subject: Re: Adding pdf/word file using JSON/XML I apologize also for my obscure questions and I thanks you and the list for your help so f

Re: Adding pdf/word file using JSON/XML

2013-06-13 Thread Roland Everaert
e. It does not have full CRUD > - no delete, and no partial update, but it does support add and full update. > > As someone else already suggested, you can do the work of SolrCell > yourself by calling Tika directly in your app layer and then sending normal > Solr CRUD requests. > >

Re: Adding pdf/word file using JSON/XML

2013-06-12 Thread Jack Krupansky
and then sending normal Solr CRUD requests. -- Jack Krupansky -Original Message- From: Roland Everaert Sent: Wednesday, June 12, 2013 5:21 AM To: solr-user@lucene.apache.org Subject: Re: Adding pdf/word file using JSON/XML 1) Being aggressive and insulting is not a way to help people

Re: Adding pdf/word file using JSON/XML

2013-06-12 Thread Gora Mohanty
On 12 June 2013 14:51, Roland Everaert wrote: [...] > Any way I still have a question regarding the /extract interface. It seems > that every time a file is updated in Solr, the lucene document is recreated > from scratch which means that any extra information we want to be > indexed/stored along

Re: Adding pdf/word file using JSON/XML

2013-06-12 Thread Roland Everaert
PI would be hidden inside the application layer. > > > -- Jack Krupansky > > -Original Message- From: Roland Everaert > Sent: Tuesday, June 11, 2013 8:48 AM > > To: solr-user@lucene.apache.org > Subject: Re: Adding pdf/word file using JSON/XML > > We are working on an

Re: Adding pdf/word file using JSON/XML

2013-06-11 Thread Jack Krupansky
.**openxmlformats-officedocument.** wordprocessingml.document and HelloWorld.docx HelloWorld.docx Or, what is it that you are really string to do? Simply tell us in plain language what problem you are trying to solve. -- Jack Krupansky -----Original Message----- From: Roland Everaert Se

Re: Adding pdf/word file using JSON/XML

2013-06-11 Thread Roland Everaert
t; >> HelloWorld.docx > >> stream_content_type > >> application/octet-stream<**/str> > >> stream_size > >> 10096 > >> stream_name > >> HelloWorld.docx > >> Content-Type > >> application/vnd.**openxmlformats-officedocum

Re: Adding pdf/word file using JSON/XML

2013-06-11 Thread Jan Høydahl
_name >> HelloWorld.docx >> Content-Type >> application/vnd.**openxmlformats-officedocument.** >> wordprocessingml.document >> >> >> and >> >> >> HelloWorld.docx >> >> >> >> HelloWorld.docx >> >>

Re: Adding pdf/word file using JSON/XML

2013-06-11 Thread Roland Everaert
language what problem you are trying to solve. > > -- Jack Krupansky > > -Original Message- From: Roland Everaert > Sent: Monday, June 10, 2013 9:23 AM > To: solr-user@lucene.apache.org > Subject: Re: Adding pdf/word file using JSON/XML > > > Sorry if it was not

Re: Adding pdf/word file using JSON/XML

2013-06-10 Thread Jack Krupansky
ne 10, 2013 9:23 AM To: solr-user@lucene.apache.org Subject: Re: Adding pdf/word file using JSON/XML Sorry if it was not clear. What I would like is to know how to construct an XML/JSON request that provide any necessary information (supposedly the full path on disk) to solr to retrieve and index a p

Re: Adding pdf/word file using JSON/XML

2013-06-10 Thread Gora Mohanty
On 10 June 2013 18:53, Roland Everaert wrote: > Sorry if it was not clear. > > What I would like is to know how to construct an XML/JSON request that > provide any necessary information (supposedly the full path on disk) to > solr to retrieve and index a pdf/ms word document. > > So, an XML reques

Re: Adding pdf/word file using JSON/XML

2013-06-10 Thread Roland Everaert
Sorry if it was not clear. What I would like is to know how to construct an XML/JSON request that provide any necessary information (supposedly the full path on disk) to solr to retrieve and index a pdf/ms word document. So, an XML request could look like this: doc10 BLAH /path/to/file.pdf

Re: Adding pdf/word file using JSON/XML

2013-06-10 Thread Gora Mohanty
On 10 June 2013 17:47, Roland Everaert wrote: > Hi, > > Based on the wiki, below is an example of how I am currently adding a pdf > file with an extra field called name: > curl " > http://localhost:8080/solr/update/extract?literal.id=doc10&literal.name=BLAH&defaultField=text"; > --data-binary @/pa

Adding pdf/word file using JSON/XML

2013-06-10 Thread Roland Everaert
Hi, Based on the wiki, below is an example of how I am currently adding a pdf file with an extra field called name: curl " http://localhost:8080/solr/update/extract?literal.id=doc10&literal.name=BLAH&defaultField=text"; --data-binary @/path/to/file.pdf -H "Content-Type: application/pdf" Is it pos