Hi Tim I hit the first issue myself and submitted a patch to trunk for this, r8591, the patch is attached in case you want to apply it yourself to the last released version.
I don't think there are any particular issues with the number of
external/managed datastreams (maybe the wiki is referring to the example
indexing stylesheet), but there are issues if you want to index
managed/external XML as at the moment the helper functions only deal with
text.
There is a work-around for this (which Simon Lamb made me aware of):
1) Select the document to a variable, eg:
<xsl:variable name="externalDS"
select="document(concat('http://localhost:8080/fedora/objects/', $PID,
'/datastreams/externalDS'))" />
(if you need authentication you'll have to encode that in the URL)
2) Then you can extract the value(s) to index using xpath, eg
<field name="external.field">
<xsl:value-of select="$externalDS/xpath/to/element/>
</field>
It is possible to index disseminators using this approach also.
It would be useful if there were some helper functions available to deal
with this.
Regards
Steve
> -----Original Message-----
> From: Tim Cornwell [mailto:[email protected]]
> Sent: 11 August 2010 19:56
> To: [email protected]
> Subject: [fcrepo-user] Gsearch and fedora datastreams
>
>
> All,
>
> I am going to attempt to implement gsearch to index multiple
> datastreams, but have two potential issues:
>
> 1. Configuration instructions say that Gsearch accepts only
> "text/plain text/xml text/html application/pdf" mime types.
> Unfortunately, my data is typed "application/xml". Is there
> a way to include these? Gsearch does accept text/xml -
> which is functionally equivalent in my case.
>
> 2. I see in the configuration document
> https://wiki.duraspace.org/display/FCSVCS/Generic+Search+Service+2.2):
> "...One managed or external datastream may be indexed per
> FedoraObject (which one is configurable), assuming that they
> contain the same text in different mimetypes. ..."
> I am concerned that this is some kind of firm limitation. I
> have multiple datastreams per object with different content in each.
>
> Thanks in advance,
> Tim
>
> Timothy Cornwell, Programmer/Analyst
> National Science Digital Library
> http://nsdl.org
>
>
> --------------------------------------------------------------
> ----------------
> This SF.net email is sponsored by
>
> Make an app they can't live without
> Enter the BlackBerry Developer Challenge
> http://p.sf.net/sfu/RIM-dev2dev
> _______________________________________________
> Fedora-commons-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
>
gsearch-patch-sb-2010-05-27.patch
Description: Binary data
------------------------------------------------------------------------------ This SF.net email is sponsored by Make an app they can't live without Enter the BlackBerry Developer Challenge http://p.sf.net/sfu/RIM-dev2dev
_______________________________________________ Fedora-commons-users mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
