Hello,

I've been struggling with gsearch in the last few days. I was quickly 
able to set it up, and it seemed to work fine, with the basic config 
working as designed, indexing all my objects using the default dublin 
core. However, i simply can't make it index managed text/xml datastreams.

I edited the basicFoxmlToLucene.xslt file, uncommented the call to the 
example-of-xml-not-inline template, and manually added a RIGHTS2 managed 
datastream to the demo:10 object with identical XML content as the 
RIGHTS1 inline XML datastream. That went without result, as the 
resulting index, which i inspected using Luke, doesn't contain the 
uva.access field.

I also tried adding an index field specific to my objects, trying to 
create an index from the text contained in each objects 'TEI' 
datastream, in the <TEI.2> -> <text> -> <body> subsection:

<IndexField IFname="tei.text" index="TOKENIZED" store="YES" termVector="NO">
<xsl:value-of 
select="document(concat('http://localhost:8080/fedora/get/', $PID, 
'/TEI'))/TEI.2/text/body/text()"/>
</IndexField>

But same as with the example: the tei.text index doesn't appear in the 
BasicIndex when inspected with Luke. The issue also doesn't seem to be 
access to the datastream as the 
http://localhost:8080/fedora/get/$PID/TEI url, called from the Fedora 
host, correctly returns the $PID object's TEI datastream, ie a 
well-formed XML document.

The only clue i have of something going wrong is a multitude of the same 
error message in Tomcat whenever i create an index from the foxml files 
(be it the demo objects or my own), in catalina.out:

SystemId Unknown; Line #149; Column #181; java.lang.NullPointerException

The same exact message appears hundreds of times in the log file, with 
the same line/column numbers.

Would anyone have a remote idea of what i could have done wrong? I'm 
running gsearch 2.2 on Fedora 3.1, in an external Tomcat 6, with Java 1.5.

Regards,
-- 
Bruno Voisin
Irish Centre for High-End Computing (ICHEC)
National University of Ireland, Galway

------------------------------------------------------------------------------
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
_______________________________________________
Fedora-commons-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users

Reply via email to