Can't test now but you can try:
http://nutch.apache.org/apidocs-1.3/org/apache/nutch/protocol/Content.html#getMetadata%28%29


> Hello,
> 
> I have put some metadata to url seed files like;
> 
> http://mysite.com/mypage.html \t sid=10 mid=15
> 
> doing inject/generate and then readdb dump, I actually can confirm
> that these meta properties are indeed in crawldb.
> 
> Well, after this i have to fetch -> parse -> solrindex
> 
> I can not reach any of this data during parse nor solrindex. parse is
> fed with content, and solrindex is fed with a parse.
> 
> By parse I mean HtmlParser.java and by solrindex I mean
> BasicIndexingFilter.java
> 
> How could I do this?
> 
> Best Regards,
> C.B.

Reply via email to