I'm going down the route of patching nutch so I can use this ParseMetaTags plugin: https://issues.apache.org/jira/browse/NUTCH-809
Also wondering whether I will be able to use the XMLParser to allow me to parse well formed XHTML, using xpath would be bonus: https://issues.apache.org/jira/browse/NUTCH-185 Any thoughts appreciated... -- View this message in context: http://lucene.472066.n3.nabble.com/Crawling-with-nutch-and-mapping-fields-to-solr-tp1879060p1883295.html Sent from the Solr - User mailing list archive at Nabble.com.