Re: [Dspace-tech] External Searching of DSpace contents

2015-03-05 Thread LiYu Lilly
It might not be very relevant-- but has anyone started or considering 
integrating VIVO and DSpace? 
Thanks,Lilly

> From: heli...@centrum.sk
> Date: Thu, 5 Mar 2015 21:20:45 +0100
> To: pe...@longsight.com
> CC: dspace-tech@lists.sourceforge.net
> Subject: Re: [Dspace-tech] External Searching of DSpace contents
> 
> I used to use this SRU module[1] in DSpace 1.6-1.8. It should still
> work, but it uses Lucene as the search backend. It was an annoyance to
> upgrade though, so I changed my client to talk to DSpace Solr
> directly, the client is a component on another server, so I didn't
> have to secure Solr, only add its IP to RemoteAddrValve. Back then I
> had this idea for creating a SRU interface as a thin layer on top of
> Solr, it should be quite easily doable [2], see the link there.
> 
> Anyway, I asked you what your use case was. Is it just search query
> autosuggestion using JQuery? We used to have that before 3.0, but it
> was removed in the huge patch which brought Access Rights Awareness to
> Discovery, supposedly due to being memory-inefficient. It's worth
> looking there how it was implemented.
> 
> For completeness, I'll also mention solr-security-proxy [3] here, but
> generally you might want to avoid that approach [4]. Going off on a
> tangent, Elasticsearch has the same problem, but the company behind ES
> very recently released a commercial product that addresses this highly
> requested feature [5].
> 
> 
> [1] https://code.google.com/p/oclcsrwdspacelucene/
> [2] https://jira.duraspace.org/browse/DS-1264
> [3] https://github.com/dergachev/solr-security-proxy
> [4] http://stackoverflow.com/questions/3637014/running-solr-in-read-only-mode
> [5] http://www.elasticsearch.org/overview/shield/
> 
> Regards,
> ~~helix84
> 
> Compulsory reading: DSpace Mailing List Etiquette
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
> 
> --
> Dive into the World of Parallel Programming The Go Parallel Website, sponsored
> by Intel and developed in partnership with Slashdot Media, is your hub for all
> things parallel software development, from weekly thought leadership blogs to
> news, videos, case studies, tutorials and more. Take a look and join the 
> conversation now. http://goparallel.sourceforge.net/
> ___
> DSpace-tech mailing list
> DSpace-tech@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette: 
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
  --
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] External Searching of DSpace contents

2015-03-05 Thread helix84
I used to use this SRU module[1] in DSpace 1.6-1.8. It should still
work, but it uses Lucene as the search backend. It was an annoyance to
upgrade though, so I changed my client to talk to DSpace Solr
directly, the client is a component on another server, so I didn't
have to secure Solr, only add its IP to RemoteAddrValve. Back then I
had this idea for creating a SRU interface as a thin layer on top of
Solr, it should be quite easily doable [2], see the link there.

Anyway, I asked you what your use case was. Is it just search query
autosuggestion using JQuery? We used to have that before 3.0, but it
was removed in the huge patch which brought Access Rights Awareness to
Discovery, supposedly due to being memory-inefficient. It's worth
looking there how it was implemented.

For completeness, I'll also mention solr-security-proxy [3] here, but
generally you might want to avoid that approach [4]. Going off on a
tangent, Elasticsearch has the same problem, but the company behind ES
very recently released a commercial product that addresses this highly
requested feature [5].


[1] https://code.google.com/p/oclcsrwdspacelucene/
[2] https://jira.duraspace.org/browse/DS-1264
[3] https://github.com/dergachev/solr-security-proxy
[4] http://stackoverflow.com/questions/3637014/running-solr-in-read-only-mode
[5] http://www.elasticsearch.org/overview/shield/

Regards,
~~helix84

Compulsory reading: DSpace Mailing List Etiquette
https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette


Re: [Dspace-tech] External Searching of DSpace contents

2015-03-05 Thread Bruno Zanette
Peter,

It was good you have touched on this subject!! I'm also thinking about
it for some time since it's a requirement for the project i'm working
on.

I thought about creating a RubyGem to integrate with SOLR directly,
and use it side-by-side with rest-api gem, but i'm pretty sure this
idea has several security issues. The other ideia is to use REST API,
as you said, but i don't know how far it can go. But i think that's
the best solution for our problem, once we're already using rest-api.

I'll do some research about OpenSearch, and the other ideas you gave.
I haven't heard anything about them yet.

Right now we're fixing some bugs on our site and in next weeks we will
start working on the search feature.

I'll follow this thread closely!!!

2015-03-05 14:27 GMT-03:00 Terry Brady :
> We have a bento-box style search built into our Drupal website.
>
> http://www.library.georgetown.edu/search/gusuper/yearbook
>
> We have de-emphasized this interface since we adopted a discovery layer
> since the discovery layer also searches our DSpace instance.
>
> If you are curious, here is the code:
> https://github.com/Georgetown-University-Libraries/gusuper
>
> The code constructs a PHP SolrClient to perform the search.
>
> https://github.com/Georgetown-University-Libraries/gusuper/blob/master/gudspacesolr/gudspacesolr.module#L83
>
> Terry
>
>
> On Thu, Mar 5, 2015 at 9:15 AM, Peter Dietz  wrote:
>>
>> I was wondering if anyone has built any integrations where an external web
>> site (such as your institution website or CMS) can search and show results
>> from your DSpace? One approach that ought to be pretty straight forward that
>> I'm thinking of is the OpenSearch web service.
>>
>> https://trydspace.longsight.com/open-search/discover?query=test
>>
>> Getting that to work on a site would require some AJAX to update as the
>> user types in a search query, and then display the updated search results.
>>
>> Another route could be the DSpace 5 REST API. There is a
>> find-by-metadata-field (key, value), but isn't suited for full text
>> searching. Other possible routes that I'm thinking are SRW/U or Z39.50.
>>
>> So, I was just wondering if anyone has already accomplished something in
>> this area, and has client integration examples. I suppose a cheap route
>> would be to have some DSpace-iframe solution, where you homepage/searchpage,
>> just embeds DSpace search page inside.
>>
>> Thanks
>> 
>> Peter Dietz
>> Longsight
>> www.longsight.com
>> pe...@longsight.com
>> p: 740-599-5005 x809
>>
>>
>> --
>> Dive into the World of Parallel Programming The Go Parallel Website,
>> sponsored
>> by Intel and developed in partnership with Slashdot Media, is your hub for
>> all
>> things parallel software development, from weekly thought leadership blogs
>> to
>> news, videos, case studies, tutorials and more. Take a look and join the
>> conversation now. http://goparallel.sourceforge.net/
>> ___
>> DSpace-tech mailing list
>> DSpace-tech@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> List Etiquette:
>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>
>
>
>
> --
> Terry Brady
> Applications Programmer Analyst
> Georgetown University Library Information Technology
> https://www.library.georgetown.edu/lit/code
> 425-298-5498 (Seattle, WA)
>
> --
> Dive into the World of Parallel Programming The Go Parallel Website,
> sponsored
> by Intel and developed in partnership with Slashdot Media, is your hub for
> all
> things parallel software development, from weekly thought leadership blogs
> to
> news, videos, case studies, tutorials and more. Take a look and join the
> conversation now. http://goparallel.sourceforge.net/
> ___
> DSpace-tech mailing list
> DSpace-tech@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette:
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette



-- 
Bruno Nocera Zanette
+55 41 9992-2508

--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette


Re: [Dspace-tech] External Searching of DSpace contents

2015-03-05 Thread Terry Brady
We have a bento-box style search built into our Drupal website.

http://www.library.georgetown.edu/search/gusuper/yearbook

We have de-emphasized this interface since we adopted a discovery layer
since the discovery layer also searches our DSpace instance.

If you are curious, here is the code:
https://github.com/Georgetown-University-Libraries/gusuper

The code constructs a PHP SolrClient to perform the search.

https://github.com/Georgetown-University-Libraries/gusuper/blob/master/gudspacesolr/gudspacesolr.module#L83

Terry


On Thu, Mar 5, 2015 at 9:15 AM, Peter Dietz  wrote:

> I was wondering if anyone has built any integrations where an external web
> site (such as your institution website or CMS) can search and show results
> from your DSpace? One approach that ought to be pretty straight forward
> that I'm thinking of is the OpenSearch web service.
>
> https://trydspace.longsight.com/open-search/discover?query=test
>
> Getting that to work on a site would require some AJAX to update as the
> user types in a search query, and then display the updated search results.
>
> Another route could be the DSpace 5 REST API. There is a
> find-by-metadata-field (key, value), but isn't suited for full text
> searching. Other possible routes that I'm thinking are SRW/U or Z39.50.
>
> So, I was just wondering if anyone has already accomplished something in
> this area, and has client integration examples. I suppose a cheap route
> would be to have some DSpace-iframe solution, where you
> homepage/searchpage, just embeds DSpace search page inside.
>
> Thanks
> 
> Peter Dietz
> Longsight
> www.longsight.com
> pe...@longsight.com
> p: 740-599-5005 x809
>
>
> --
> Dive into the World of Parallel Programming The Go Parallel Website,
> sponsored
> by Intel and developed in partnership with Slashdot Media, is your hub for
> all
> things parallel software development, from weekly thought leadership blogs
> to
> news, videos, case studies, tutorials and more. Take a look and join the
> conversation now. http://goparallel.sourceforge.net/
> ___
> DSpace-tech mailing list
> DSpace-tech@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette:
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>



-- 
Terry Brady
Applications Programmer Analyst
Georgetown University Library Information Technology
https://www.library.georgetown.edu/lit/code
425-298-5498 (Seattle, WA)
--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

[Dspace-tech] External Searching of DSpace contents

2015-03-05 Thread Peter Dietz
I was wondering if anyone has built any integrations where an external web
site (such as your institution website or CMS) can search and show results
from your DSpace? One approach that ought to be pretty straight forward
that I'm thinking of is the OpenSearch web service.

https://trydspace.longsight.com/open-search/discover?query=test

Getting that to work on a site would require some AJAX to update as the
user types in a search query, and then display the updated search results.

Another route could be the DSpace 5 REST API. There is a
find-by-metadata-field (key, value), but isn't suited for full text
searching. Other possible routes that I'm thinking are SRW/U or Z39.50.

So, I was just wondering if anyone has already accomplished something in
this area, and has client integration examples. I suppose a cheap route
would be to have some DSpace-iframe solution, where you
homepage/searchpage, just embeds DSpace search page inside.

Thanks

Peter Dietz
Longsight
www.longsight.com
pe...@longsight.com
p: 740-599-5005 x809
--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette