Re: Download document from solr
: Subject: Download document from solr
: References: aanlkti=ajq4qpifn2r0dyz=s9hv1i=pc-nqnxp3hw...@mail.gmail.com
: In-Reply-To: aanlkti=ajq4qpifn2r0dyz=s9hv1i=pc-nqnxp3hw...@mail.gmail.com

http://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing Lists

When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh email. Even if you change the subject line of your email, other mail headers still track which thread you replied to and your question is hidden in that thread and gets less attention. It makes following discussions in the mailing list archives particularly difficult.

See Also: http://en.wikipedia.org/wiki/User:DonDiego/Thread_hijacking

-Hoss

--
http://lucenerevolution.org/ ... October 7-8, Boston
http://bit.ly/stump-hoss ... Stump The Chump!
RE: Download document from solr
You could index the download URL of the document into your id field (or another field) and then use that value to enable download of the document.

Ephraim Ofir

In reply to Lance Norskog goks...@gmail.com, sent Friday, September 03, 2010 1:07 AM to solr-user@lucene.apache.org
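A minimal sketch of the "index the download URL in a field" idea, assuming a stored field named `download_url` (a hypothetical name, not part of any default Solr schema) and a JSON response shaped like the one quoted in this thread. It only shows how a client could turn search results into download links; how the field gets populated at index time is left open.

```python
import json

# Hypothetical Solr JSON response. "download_url" is an assumed stored field
# that the indexing side would have to fill in; it is not a Solr default.
SAMPLE_RESPONSE = """
{"response": {"numFound": 1, "start": 0, "docs": [
  {"id": "doc1",
   "content_type": ["application/pdf"],
   "download_url": "http://solr.machine.com/path/file.doc"}
]}}
"""


def download_links(raw_json, url_field="download_url"):
    """Return (id, url) pairs for every result document that has a URL field."""
    docs = json.loads(raw_json)["response"]["docs"]
    return [(doc["id"], doc[url_field]) for doc in docs if url_field in doc]
```

A UI layer could loop over `download_links(...)` and render each pair as a link. Documents without the field are skipped rather than raising an error.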
Re: Download document from solr
Solr can return the list of results in JSON or PHP format, so that your UI can allow a download. You can write a UI in the Velocity toolkit; it's pretty easy.

-- Lance Norskog goks...@gmail.com

In reply to Matteo Moci mox...@libero.it, Wed, Sep 1, 2010 at 8:24 AM
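As a rough illustration of asking Solr for JSON results, the sketch below builds a select URL with the `wt` parameter. The base URL `http://localhost:8983/solr/select` is an assumption matching the tutorial's example setup; adjust it for a real installation.

```python
from urllib.parse import urlencode


def build_select_url(query, wt="json",
                     base="http://localhost:8983/solr/select"):
    """Build a Solr select URL; `base` is an assumed example-setup address."""
    params = {"q": query, "wt": wt}
    return base + "?" + urlencode(params)
```

The resulting URL can be fetched with any HTTP client and the body parsed as JSON, after which the UI decides how to present each result, for example as a download link.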
Re: Download document from solr
Thank you for the suggestions. I just completed the tutorial at http://lucene.apache.org/solr/tutorial.html and I understood that in the GET parameters I can choose wt=standard (and obtain an XML structure in the results), wt=json or wt=php. All of them display the results inline, in the sense that they are embedded and entirely included in the response.

If I submit PDF files to Solr (I think the same applies to DOC and CSV files), I will get something like this in the JSON results:

[some part of response omitted]
"response":{"numFound":1,"start":0,"maxScore":0.34002018,"docs":[{"last_modified":"2010-08-05T14:07:24Z","id":"doc1","content_type":["application/pdf"],"score":0.34002018}]}}

(example taken from http://wiki.apache.org/solr/ExtractingRequestHandler), which shows no content at all. The only way I have to retrieve and download the PDF file is to use id=doc1 to access some repository (even a database table) that can provide the content starting from the id. Does this look like a common practice?

Thank you

In reply to Lance Norskog, 02/09/10 08:47
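The "use the id to access some repository" approach could be sketched as follows, assuming a database table keyed by the same id that Solr returns. The table name `files`, its columns, and the placeholder bytes are all hypothetical; an in-memory SQLite database stands in for whatever store actually holds the documents.

```python
import sqlite3


def make_repository():
    """Create a throwaway in-memory repository with one hypothetical file."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE files (id TEXT PRIMARY KEY, content_type TEXT, data BLOB)"
    )
    conn.execute(
        "INSERT INTO files VALUES (?, ?, ?)",
        ("doc1", "application/pdf", b"%PDF-1.4 placeholder"),
    )
    conn.commit()
    return conn


def fetch_file(conn, doc_id):
    """Return (content_type, data) for a Solr document id, or None if unknown."""
    return conn.execute(
        "SELECT content_type, data FROM files WHERE id = ?", (doc_id,)
    ).fetchone()
```

The search layer returns only ids and metadata; a separate download layer calls something like `fetch_file` with the id from the result and streams the bytes back to the user.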
Re: Download document from solr
Yes. Indexing a PDF or other file types with '/extract' means that Solr finds words in the document and indexes those in a field 'content'. It does not save the binary contents of the file. You could make a request handler that fetches one document and generates a redirect to the link.

-- Lance Norskog goks...@gmail.com

In reply to Matteo Moci mox...@libero.it, Thu, Sep 2, 2010 at 7:35 AM
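The redirect idea can be sketched outside Solr as a small standalone HTTP handler. This is not a Solr request handler (which would be written in Java inside Solr); it is a stand-in showing the same flow: take a document id, look up its location, answer with a redirect. The `LOCATIONS` table and the `/download?id=...` path are assumptions for illustration.

```python
from http.server import BaseHTTPRequestHandler
from urllib.parse import parse_qs, urlparse

# Hypothetical id -> URL table. In practice the URL might come from a stored
# Solr field or from an external repository.
LOCATIONS = {"doc1": "http://solr.machine.com/path/file.doc"}


def resolve(path, locations=LOCATIONS):
    """Map a request path such as /download?id=doc1 to (status, location)."""
    query = parse_qs(urlparse(path).query)
    doc_id = query.get("id", [None])[0]
    if doc_id in locations:
        return 302, locations[doc_id]
    return 404, None


class DownloadHandler(BaseHTTPRequestHandler):
    """Answer GET requests with a redirect to the document's location."""

    def do_GET(self):
        status, location = resolve(self.path)
        self.send_response(status)
        if location is not None:
            self.send_header("Location", location)
        self.end_headers()
```

To try it, the handler can be served with `http.server.HTTPServer(("localhost", 8000), DownloadHandler).serve_forever()`. Keeping the lookup in the plain function `resolve` makes the logic easy to check without starting a server.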
Download document from solr
Hello to all, I am a newbie with Solr and I am trying to understand whether I can use it for my purpose. I was wondering how Solr lists the result documents: do they appear as downloadable files, like http://solr.machine.com/path/file.doc, or do I need to develop another layer to take care of downloading? Even a link to the docs might work... Thank you, Matteo
Re: Download document from solr
Solr returns an XML packet (you can also specify other response formats, e.g. JSON). Within that XML, there will be some overall response characteristics (e.g. the number of matches) and a list of documents. If you do the example setup (http://lucene.apache.org/solr/tutorial.html) and submit a query, you'll see the XML returned (the default) right in your browser. If you're using Firefox or Chrome, you might have to install an XML plugin to see it nicely formatted.

HTH
Erick

In reply to Matteo Moci mox...@libero.it, Wed, Sep 1, 2010 at 11:24 AM
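For the default XML response, a client could pull out the number of matches and the document ids roughly as below. The sample XML is a hand-written approximation of the response layout (a `result` element carrying `numFound`, with one `doc` element per hit); the field values are reused from the JSON example in this thread, and real responses contain more fields.

```python
import xml.etree.ElementTree as ET

# Hand-written approximation of a Solr XML response; real output has more fields.
SAMPLE_XML = """
<response>
  <lst name="responseHeader"><int name="status">0</int></lst>
  <result name="response" numFound="1" start="0">
    <doc>
      <str name="id">doc1</str>
      <date name="last_modified">2010-08-05T14:07:24Z</date>
    </doc>
  </result>
</response>
"""


def summarize(xml_text):
    """Return (number of matches, list of document ids) from a Solr XML response."""
    root = ET.fromstring(xml_text)
    result = root.find("result")
    num_found = int(result.get("numFound"))
    ids = [el.text for el in result.findall("doc/str[@name='id']")]
    return num_found, ids
```

Note that `numFound` is the total number of matches, while the list of `doc` elements holds only the page of results actually returned.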