Rich-
  I think Gert is saying that you can use those parameters to override
the name of the field, eg:

Quality & File Format = Quality___File_Format

I'm not sure where this field spec gets passed in (we don't actually
use GSearch, I'm just a busybody today).
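
As a rough illustration of the override described above, here is a minimal Python sketch (not GSearch code). It mimics the default naming rule as stated in the quoted documentation and builds a selectedFields value in the metadataFieldName=indexFieldName form. The function names, and the stricter rule of replacing every non-alphanumeric character with '_', are assumptions made for illustration. How the resulting string is handed to getDatastreamFromTika in the template is not shown in this thread.

```python
# Characters the quoted GSearch documentation lists as replaced with '_'
# when no indexFieldName is given. Note that '&' is not among them.
DOC_REPLACED = " :/=()"


def default_index_name(metadata_field_name: str) -> str:
    """Default name per the quoted documentation: listed characters become '_'."""
    return "".join("_" if ch in DOC_REPLACED else ch for ch in metadata_field_name)


def safe_index_name(metadata_field_name: str) -> str:
    """Hypothetical stricter rule: every non-alphanumeric character becomes '_'."""
    return "".join(ch if ch.isalnum() else "_" for ch in metadata_field_name)


def selected_fields(names):
    """Join 'metadataFieldName=indexFieldName' specs into a comma-separated list."""
    return ",".join(f"{name}={safe_index_name(name)}" for name in names)


print(default_index_name("Quality & File Format"))
print(selected_fields(["Quality & File Format"]))
```

A metadata field name that itself contains ',', '=' or '/' would clash with the fieldSpec syntax; the sketch does not handle that case.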

- Ben

On Wed, Jul 18, 2012 at 4:56 PM, Rich d'Rich <[email protected]> wrote:
> Thanks, yes, Ben is correct on the problem. Glad to know it's being
> corrected. We can't really use that workaround, as our repository is meant
> to index everything it finds in EXIFs, etc. (although I'll try it on the
> client anyway).
>
> Cheers, Rich
>
>
> On 19 July 2012 01:28, Gert Schmeltz Pedersen <[email protected]> wrote:
>>
>> Ben is right about the ampersand not being modified. This will be
>> corrected in the next release of GSearch, which will be as soon as possible
>> after the expected release of Fedora 3.6 in August.
>>
>> But there is no problem for you now: you can use the parameters of
>> getDatastreamFromTika to avoid it. Here is a quote from the documentation
>> page:
>>
>> "
>> selectedFields - comma-separated list of metadata fieldSpecs, if empty
>>     then all fields are included with default params
>> fieldSpec - metadataFieldName [ '=' indexFieldName] [ '/' [index]
>>     [ '/' [store] [ '/' [termVector] [ '/' [boost]]]]]
>> - metadataFieldName must be exactly as extracted by Tika from the
>>     document. You may see the available names, if you log in debug mode
>>     and look for "METADATA name=" under "fullDsId=" in the log, when
>>     "getFromTika" was called during updateIndex
>> - indexFieldName is used as the generated index field name. If not
>>     given, GSearch uses metadataFieldName after replacement of the
>>     characters ' ', ':', '/', '=', '(', ')' with '_'
>> "
>>
>> So use the selectedFields parameter to list the metadata fields that you
>> want to include in your index, giving a valid indexFieldName for the
>> metadataFieldName.
>>
>> -Gert
>>
>>
>> On 18/07/2012, at 14.24, Benjamin Armintor wrote:
>>
>> > I think he's saying that the field is called 'Quality & File Format',
>> > and gsearch replaces the whitespace with underscores but leaves the
>> > ampersand unmodified. Then the resulting solr xml document is
>> > malformed, because the ampersand isn't encoded.
>> >
>> > On Wed, Jul 18, 2012 at 4:19 AM, Richard Green <[email protected]>
>> > wrote:
>> >> Could you be more specific about “XML special chars”?
>> >>
>> >>
>> >>
>> >> Richard Green
>> >>
>> >>
>> >>
>> >> From: Rich d'Rich [mailto:[email protected]]
>> >> Sent: 18 July 2012 6:21 AM
>> >> To: [email protected]
>> >> Subject: [fcrepo-user] SOLR indexing fails when XML chars in EXIF
>> >> fields extracted in gsearch
>> >>
>> >>
>> >>
>> >> We have a large repository of image files that include EXIF data.
>> >>
>> >>
>> >>
>> >> Some of these have fields (e.g. 'Quality & File Format') that contain
>> >> XML special chars in the field name.
>> >>
>> >>
>> >>
>> >> When the getDatastreamFromTika function in the gsearch template
>> >> extracts these fields, the resulting document has a badly formed
>> >> entity, &_File_Format, that causes SOLR to fail to index the document.
>> >>
>> >>
>> >>
>> >> Is this a known issue? Any workarounds?
>> >>
>> >>
>> >>
>> >>
>> >>
>> >> ------------------------------------------------------------------------------
>> >> Live Security Virtual Conference
>> >> Exclusive live event will cover all the ways today's security and
>> >> threat landscape has changed and how IT managers can respond.
>> >> Discussions
>> >> will include endpoint security, mobile security and the latest in
>> >> malware
>> >> threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
>> >> _______________________________________________
>> >> Fedora-commons-users mailing list
>> >> [email protected]
>> >> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
>> >>
>>
>
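
The failure discussed in this thread can be reproduced outside GSearch. The sketch below is Python using only the standard library. The <field name="..."> element is an assumed shape for the generated Solr document, not GSearch's actual output. It checks that a field name containing a bare ampersand makes the XML ill-formed, while the same name escaped as &amp; parses and comes back unchanged.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import quoteattr

# The name GSearch would generate by default: spaces replaced, '&' left alone.
name = "Quality_&_File_Format"


def is_well_formed(text: str) -> bool:
    """Return True if the text parses as XML, False on a parse error."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


# Assumed element shape, for illustration only.
raw = '<field name="' + name + '">Fine</field>'

# quoteattr escapes '&', '<' and '>' and adds the surrounding quotes.
escaped = "<field name=" + quoteattr(name) + ">Fine</field>"

print(is_well_formed(raw))
print(is_well_formed(escaped))
print(ET.fromstring(escaped).get("name"))
```

Escaping makes the document parse, but it does not make the name a convenient index field name, which is why renaming the field via selectedFields is the workaround suggested above.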
