[Dspace-tech] How to find item with strange metadata value of "|||"

2015-07-30 Thread Alan Orth
Hi,

I've got an strange metadata value shown in one of our XMLUI Discovery
sidebar facets, and I'm not sure how to find the offending item to fix it.

The value is displaying as "|||", and I've narrowed down the collection the
item belongs to by navigating through my communities and looking at the
facet in question. I've exported the metadata as CSV and attempted to find
the item using LibreOffice and OpenRefine but nothing strange sticks out...

Does anyone have any suggestions?

Thanks,

Alan
--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] How to find item with strange metadata value of "|||"

2015-07-30 Thread Kim Shepherd
Hi Alan,

Discovery filters use ||| as a separator between lowercase value and a
display/stored value, eg. something|||SomeThing
I'm sort of surprised it displays in the sidebar, because acting as the
separator, it should be pulled out, but I think the first thing I'd look
for is empty or null metadata values in your database (probably easier than
CSV because of how metadata export works), for the field in question.

What version is this happening in?

M: k...@shepherd.nz
T: @kimshepherd
P: +6421883635

0CCB D957 0C35 F5C1 497E CDCF FC4B ABA3 2A1A FAEC
https://keybase.io/kshepherd

On 30 July 2015 at 20:45, Alan Orth  wrote:

> Hi,
>
> I've got an strange metadata value shown in one of our XMLUI Discovery
> sidebar facets, and I'm not sure how to find the offending item to fix it.
>
> The value is displaying as "|||", and I've narrowed down the collection
> the item belongs to by navigating through my communities and looking at the
> facet in question. I've exported the metadata as CSV and attempted to find
> the item using LibreOffice and OpenRefine but nothing strange sticks out...
>
> Does anyone have any suggestions?
>
> Thanks,
>
> Alan
>
>
> --
>
> ___
> DSpace-tech mailing list
> DSpace-tech@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette:
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>
--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] How to find item with strange metadata value of "|||"

2015-07-30 Thread Alan Orth
Thanks for the interesting suggesting, Kim! This is happening in DSpace
4.3. I will try to poke around in the database, as you're probably right
that this anomaly probably wouldn't make it through a CSV export...

Alan

On Thu, Jul 30, 2015 at 1:44 PM Kim Shepherd  wrote:

> Hi Alan,
>
> Discovery filters use ||| as a separator between lowercase value and a
> display/stored value, eg. something|||SomeThing
> I'm sort of surprised it displays in the sidebar, because acting as the
> separator, it should be pulled out, but I think the first thing I'd look
> for is empty or null metadata values in your database (probably easier than
> CSV because of how metadata export works), for the field in question.
>
> What version is this happening in?
>
> M: k...@shepherd.nz
> T: @kimshepherd
> P: +6421883635
>
> 0CCB D957 0C35 F5C1 497E CDCF FC4B ABA3 2A1A FAEC
> https://keybase.io/kshepherd
>
> On 30 July 2015 at 20:45, Alan Orth  wrote:
>
>> Hi,
>>
>> I've got an strange metadata value shown in one of our XMLUI Discovery
>> sidebar facets, and I'm not sure how to find the offending item to fix it.
>>
>> The value is displaying as "|||", and I've narrowed down the collection
>> the item belongs to by navigating through my communities and looking at the
>> facet in question. I've exported the metadata as CSV and attempted to find
>> the item using LibreOffice and OpenRefine but nothing strange sticks out...
>>
>> Does anyone have any suggestions?
>>
>> Thanks,
>>
>> Alan
>>
>>
>> --
>>
>> ___
>> DSpace-tech mailing list
>> DSpace-tech@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>> List Etiquette:
>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>>
>
--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] How to find item with strange metadata value of "|||"

2015-07-31 Thread Alan Orth
So I found the metadata_field_id for this metadata item using a known valid
value:

dspace=# select * from metadatavalue where text_value='DRYLAND CEREALS';

... and then I tried to find text values containing several variations of
"|||", like '\|\|\|', '%|%', '%\|%' etc...

dspace=# select item_id from metadatavalue where metadata_field_id=86 and
text_value='|||';

But I am not getting any hits. I also figured that, since we figured CSV
export by nature wouldn't expose this anomaly, that I could simple export
and re-import, but DSpace said no changes were detected upon re-import.

Hmmm. Anyone know how I could query Solr directly for this information?

Alan



On Fri, Jul 31, 2015 at 9:35 AM Alan Orth  wrote:

> Thanks for the interesting suggesting, Kim! This is happening in DSpace
> 4.3. I will try to poke around in the database, as you're probably right
> that this anomaly probably wouldn't make it through a CSV export...
>
> Alan
>
> On Thu, Jul 30, 2015 at 1:44 PM Kim Shepherd 
> wrote:
>
>> Hi Alan,
>>
>> Discovery filters use ||| as a separator between lowercase value and a
>> display/stored value, eg. something|||SomeThing
>> I'm sort of surprised it displays in the sidebar, because acting as the
>> separator, it should be pulled out, but I think the first thing I'd look
>> for is empty or null metadata values in your database (probably easier than
>> CSV because of how metadata export works), for the field in question.
>>
>> What version is this happening in?
>>
>> M: k...@shepherd.nz
>> T: @kimshepherd
>> P: +6421883635
>>
>> 0CCB D957 0C35 F5C1 497E CDCF FC4B ABA3 2A1A FAEC
>> https://keybase.io/kshepherd
>>
>> On 30 July 2015 at 20:45, Alan Orth  wrote:
>>
>>> Hi,
>>>
>>> I've got an strange metadata value shown in one of our XMLUI Discovery
>>> sidebar facets, and I'm not sure how to find the offending item to fix it.
>>>
>>> The value is displaying as "|||", and I've narrowed down the collection
>>> the item belongs to by navigating through my communities and looking at the
>>> facet in question. I've exported the metadata as CSV and attempted to find
>>> the item using LibreOffice and OpenRefine but nothing strange sticks out...
>>>
>>> Does anyone have any suggestions?
>>>
>>> Thanks,
>>>
>>> Alan
>>>
>>>
>>> --
>>>
>>> ___
>>> DSpace-tech mailing list
>>> DSpace-tech@lists.sourceforge.net
>>> https://lists.sourceforge.net/lists/listinfo/dspace-tech
>>> List Etiquette:
>>> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>>>
>>
--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] How to find item with strange metadata value of "|||"

2015-08-01 Thread Kim Shepherd
Hi Alan, that's close, but not quite what I was suggesting: the ||| will
not be stored in the database, that's added in when indexing the values to
Solr, so I think a SQL query like:

select item_id from metadatavalue where metadata_field_id=86 and
(text_value='' OR text_value IS NULL);

Will be closer to what you're after.

If you want to try and find items that might match in solr, you could try
something like

curl http://localhost:8080/solr/search/select?query=*:*&fq=FIELDNAME_filter
:"|||"

Where FIELDNAME is the name of the field as configured in the discovery
spring configuration (eg. 'subject', 'type', 'author', and so on)

Cheers!

Kim

M: k...@shepherd.nz
T: @kimshepherd
P: +6421883635

0CCB D957 0C35 F5C1 497E CDCF FC4B ABA3 2A1A FAEC
https://keybase.io/kshepherd

On 1 August 2015 at 01:28, Alan Orth  wrote:

> So I found the metadata_field_id for this metadata item using a known
> valid value:
>
> dspace=# select * from metadatavalue where text_value='DRYLAND CEREALS';
>
> ... and then I tried to find text values containing several variations of
> "|||", like '\|\|\|', '%|%', '%\|%' etc...
>
> dspace=# select item_id from metadatavalue where metadata_field_id=86 and
> text_value='|||';
>
> But I am not getting any hits. I also figured that, since we figured CSV
> export by nature wouldn't expose this anomaly, that I could simple export
> and re-import, but DSpace said no changes were detected upon re-import.
>
> Hmmm. Anyone know how I could query Solr directly for this information?
>
> Alan
>
>
>
> On Fri, Jul 31, 2015 at 9:35 AM Alan Orth  wrote:
>
>> Thanks for the interesting suggesting, Kim! This is happening in DSpace
>> 4.3. I will try to poke around in the database, as you're probably right
>> that this anomaly probably wouldn't make it through a CSV export...
>>
>> Alan
>>
>> On Thu, Jul 30, 2015 at 1:44 PM Kim Shepherd 
>> wrote:
>>
>>> Hi Alan,
>>>
>>> Discovery filters use ||| as a separator between lowercase value and a
>>> display/stored value, eg. something|||SomeThing
>>> I'm sort of surprised it displays in the sidebar, because acting as the
>>> separator, it should be pulled out, but I think the first thing I'd look
>>> for is empty or null metadata values in your database (probably easier than
>>> CSV because of how metadata export works), for the field in question.
>>>
>>> What version is this happening in?
>>>
>>> M: k...@shepherd.nz
>>> T: @kimshepherd
>>> P: +6421883635
>>>
>>> 0CCB D957 0C35 F5C1 497E CDCF FC4B ABA3 2A1A FAEC
>>> https://keybase.io/kshepherd
>>>
>>> On 30 July 2015 at 20:45, Alan Orth  wrote:
>>>
 Hi,

 I've got an strange metadata value shown in one of our XMLUI Discovery
 sidebar facets, and I'm not sure how to find the offending item to fix it.

 The value is displaying as "|||", and I've narrowed down the collection
 the item belongs to by navigating through my communities and looking at the
 facet in question. I've exported the metadata as CSV and attempted to find
 the item using LibreOffice and OpenRefine but nothing strange sticks out...

 Does anyone have any suggestions?

 Thanks,

 Alan


 --

 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech
 List Etiquette:
 https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

>>>
--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] How to find item with strange metadata value of "|||"

2015-08-02 Thread Alan Orth
Bingo, I found the item, and then got its handle using the item_id:

dspace=# SELECT * FROM item, handle WHERE handle.resource_id = item.item_id
AND item.item_id = '66841';

The item did indeed have empty contents for that field. I deleted it from
the "Edit this item" function in the web interface and now my facets are
clean again. ;)

Thanks a bunch,

Alan

On Sun, Aug 2, 2015 at 5:24 AM Kim Shepherd  wrote:

> Hi Alan, that's close, but not quite what I was suggesting: the ||| will
> not be stored in the database, that's added in when indexing the values to
> Solr, so I think a SQL query like:
>
> select item_id from metadatavalue where metadata_field_id=86 and
> (text_value='' OR text_value IS NULL);
>
> Will be closer to what you're after.
>
> If you want to try and find items that might match in solr, you could try
> something like
>
> curl
> http://localhost:8080/solr/search/select?query=*:*&fq=FIELDNAME_filter
> :"|||"
>
> Where FIELDNAME is the name of the field as configured in the discovery
> spring configuration (eg. 'subject', 'type', 'author', and so on)
>
> Cheers!
>
> Kim
>
> M: k...@shepherd.nz
> T: @kimshepherd
> P: +6421883635
>
> 0CCB D957 0C35 F5C1 497E CDCF FC4B ABA3 2A1A FAEC
> https://keybase.io/kshepherd
>
> On 1 August 2015 at 01:28, Alan Orth  wrote:
>
>> So I found the metadata_field_id for this metadata item using a known
>> valid value:
>>
>> dspace=# select * from metadatavalue where text_value='DRYLAND CEREALS';
>>
>> ... and then I tried to find text values containing several variations of
>> "|||", like '\|\|\|', '%|%', '%\|%' etc...
>>
>> dspace=# select item_id from metadatavalue where metadata_field_id=86 and
>> text_value='|||';
>>
>> But I am not getting any hits. I also figured that, since we figured CSV
>> export by nature wouldn't expose this anomaly, that I could simple export
>> and re-import, but DSpace said no changes were detected upon re-import.
>>
>> Hmmm. Anyone know how I could query Solr directly for this information?
>>
>> Alan
>>
>>
>>
>> On Fri, Jul 31, 2015 at 9:35 AM Alan Orth  wrote:
>>
>>> Thanks for the interesting suggesting, Kim! This is happening in DSpace
>>> 4.3. I will try to poke around in the database, as you're probably right
>>> that this anomaly probably wouldn't make it through a CSV export...
>>>
>>> Alan
>>>
>>> On Thu, Jul 30, 2015 at 1:44 PM Kim Shepherd 
>>> wrote:
>>>
 Hi Alan,

 Discovery filters use ||| as a separator between lowercase value and a
 display/stored value, eg. something|||SomeThing
 I'm sort of surprised it displays in the sidebar, because acting as the
 separator, it should be pulled out, but I think the first thing I'd look
 for is empty or null metadata values in your database (probably easier than
 CSV because of how metadata export works), for the field in question.

 What version is this happening in?

 M: k...@shepherd.nz
 T: @kimshepherd
 P: +6421883635

 0CCB D957 0C35 F5C1 497E CDCF FC4B ABA3 2A1A FAEC
 https://keybase.io/kshepherd

 On 30 July 2015 at 20:45, Alan Orth  wrote:

> Hi,
>
> I've got an strange metadata value shown in one of our XMLUI Discovery
> sidebar facets, and I'm not sure how to find the offending item to fix it.
>
> The value is displaying as "|||", and I've narrowed down the
> collection the item belongs to by navigating through my communities and
> looking at the facet in question. I've exported the metadata as CSV and
> attempted to find the item using LibreOffice and OpenRefine but nothing
> strange sticks out...
>
> Does anyone have any suggestions?
>
> Thanks,
>
> Alan
>
>
> --
>
> ___
> DSpace-tech mailing list
> DSpace-tech@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dspace-tech
> List Etiquette:
> https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
>

>
--
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette