[jira] [Commented] (ANY23-131) Nested Microdata are not extracted

2016-01-06 Thread Daniel Sachse (JIRA)

[ 
https://issues.apache.org/jira/browse/ANY23-131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15085632#comment-15085632
 ] 

Daniel Sachse commented on ANY23-131:
-

Hey folks,

I tried using any23 with this page 
(https://www.otto.de/p/aeg-waschmaschine-lavamat-l14as7-aplusplusplus-7-kg-1400-u-min-508571361/#variationId=504747671-M48)
 and it did for example extract the itemprop "aggregateRating" but not the 
nested "ratingValue" and "reviewCount" - neither in 1.1 nor 1.2-SNAPSHOT.
Any ideas?

Best

Daniel

> Nested Microdata are not extracted
> --
>
> Key: ANY23-131
> URL: https://issues.apache.org/jira/browse/ANY23-131
> Project: Apache Any23
>  Issue Type: Bug
>  Components: microdata
>Affects Versions: 0.7.0
>Reporter: Sebastien Richard
>Assignee: Lewis John McGibbney
> Fix For: 1.2
>
>
> Proposed patch:
> core/src/main/java/org/apache/any23/extractor/microdata/MicrodataParser.java:
> remove incorrect optim:
> L166
> - return getUnnestedNodes( topLevelItemScopes ); 
> + return topLevelItemScopes;



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (ANY23-131) Nested Microdata are not extracted

2016-01-06 Thread Daniel Sachse (JIRA)

[ 
https://issues.apache.org/jira/browse/ANY23-131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15085638#comment-15085638
 ] 

Daniel Sachse edited comment on ANY23-131 at 1/6/16 3:13 PM:
-

I also modified the 1.1 version locally and edited the MicrodataParser by 
commenting this code:
{code:java}
// if (attributes.getNamedItem(ITEMSCOPE_ATTRIBUTE) != null) {
//// Don't visit descendants of nodes that 
define a new scope
//return FILTER_REJECT;
//}
{code}

Afterwards the mentioned itemprops are found, but the node value is used and 
not the content attribute.


was (Author: danielsachse):
I also modified the 1.1 version locally and edited the MicrodataParser by 
commenting this code:
// if (attributes.getNamedItem(ITEMSCOPE_ATTRIBUTE) != null) {
//// Don't visit descendants of nodes that 
define a new scope
//return FILTER_REJECT;
//}

Afterwards the mentioned itemprops are found, but the node value is used and 
not the content attribute.

> Nested Microdata are not extracted
> --
>
> Key: ANY23-131
> URL: https://issues.apache.org/jira/browse/ANY23-131
> Project: Apache Any23
>  Issue Type: Bug
>  Components: microdata
>Affects Versions: 0.7.0
>Reporter: Sebastien Richard
>Assignee: Lewis John McGibbney
> Fix For: 1.2
>
>
> Proposed patch:
> core/src/main/java/org/apache/any23/extractor/microdata/MicrodataParser.java:
> remove incorrect optim:
> L166
> - return getUnnestedNodes( topLevelItemScopes ); 
> + return topLevelItemScopes;



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (ANY23-131) Nested Microdata are not extracted

2016-01-06 Thread Daniel Sachse (JIRA)

[ 
https://issues.apache.org/jira/browse/ANY23-131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15085638#comment-15085638
 ] 

Daniel Sachse commented on ANY23-131:
-

I also modified the 1.1 version locally and edited the MicrodataParser by 
commenting this code:
// if (attributes.getNamedItem(ITEMSCOPE_ATTRIBUTE) != null) {
//// Don't visit descendants of nodes that 
define a new scope
//return FILTER_REJECT;
//}

Afterwards the mentioned itemprops are found, but the node value is used and 
not the content attribute.

> Nested Microdata are not extracted
> --
>
> Key: ANY23-131
> URL: https://issues.apache.org/jira/browse/ANY23-131
> Project: Apache Any23
>  Issue Type: Bug
>  Components: microdata
>Affects Versions: 0.7.0
>Reporter: Sebastien Richard
>Assignee: Lewis John McGibbney
> Fix For: 1.2
>
>
> Proposed patch:
> core/src/main/java/org/apache/any23/extractor/microdata/MicrodataParser.java:
> remove incorrect optim:
> L166
> - return getUnnestedNodes( topLevelItemScopes ); 
> + return topLevelItemScopes;



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)