Hi Tim

I hit the first issue myself and submitted a patch to trunk for this, r8591,
the patch is attached in case you want to apply it yourself to the last
released version.

I don't think there are any particular issues with the number of
external/managed datastreams (maybe the wiki is referring to the example
indexing stylesheet), but there are issues if you want to index
managed/external XML as at the moment the helper functions only deal with
text.

There is a work-around for this (which Simon Lamb made me aware of):

1) Select the document to a variable, eg:

<xsl:variable name="externalDS"
select="document(concat('http://localhost:8080/fedora/objects/', $PID,
'/datastreams/externalDS'))" />

(if you need authentication you'll have to encode that in the URL)

2) Then you can extract the value(s) to index using xpath, eg

<field name="external.field">
  <xsl:value-of select="$externalDS/xpath/to/element/>
</field>

It is possible to index disseminators using this approach also.

It would be useful if there were some helper functions available to deal
with this.

Regards
Steve


> -----Original Message-----
> From: Tim Cornwell [mailto:[email protected]] 
> Sent: 11 August 2010 19:56
> To: [email protected]
> Subject: [fcrepo-user] Gsearch and fedora datastreams
> 
> 
> All,
> 
> I am going to attempt to implement gsearch to index multiple 
> datastreams, but have two potential issues:
> 
> 1. Configuration instructions say that Gsearch accepts only 
> "text/plain text/xml text/html application/pdf" mime types.  
> Unfortunately, my data is typed "application/xml".  Is there 
> a way to include these?   Gsearch does accept text/xml - 
> which is functionally equivalent in my case.  
> 
> 2. I see in the configuration document  
> https://wiki.duraspace.org/display/FCSVCS/Generic+Search+Service+2.2):
>  "...One managed or external datastream may be indexed per 
> FedoraObject (which one is configurable), assuming that they 
> contain the same text in different mimetypes. ..."
> I am concerned that this is some kind of firm limitation.  I 
> have multiple datastreams per object with different content in each.  
> 
> Thanks in advance,
> Tim
> 
> Timothy Cornwell, Programmer/Analyst
> National Science Digital Library
> http://nsdl.org
> 
> 
> --------------------------------------------------------------
> ----------------
> This SF.net email is sponsored by 
> 
> Make an app they can't live without
> Enter the BlackBerry Developer Challenge
> http://p.sf.net/sfu/RIM-dev2dev 
> _______________________________________________
> Fedora-commons-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
> 

Attachment: gsearch-patch-sb-2010-05-27.patch
Description: Binary data

------------------------------------------------------------------------------
This SF.net email is sponsored by 

Make an app they can't live without
Enter the BlackBerry Developer Challenge
http://p.sf.net/sfu/RIM-dev2dev 
_______________________________________________
Fedora-commons-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users

Reply via email to