[ https://issues.apache.org/jira/browse/NUTCH-1486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Markus Jelsma updated NUTCH-1486: --------------------------------- Attachment: NUTCH-1486-1.9-trunk.patch Patch for trunk! Cannot upgrade to HTTPClient 4.3.1 or 4.3.3, i ended up in some dependency hell, spewing out messages like: {code} java.lang.NoClassDefFoundError: org/apache/http/entity/mime/content/ContentBody at org.apache.nutch.indexwriter.solr.SolrUtils.getSolrServer(SolrUtils.java:58) at org.apache.nutch.indexwriter.solr.SolrIndexWriter.open(SolrIndexWriter.java:58) at org.apache.nutch.indexer.IndexWriters.open(IndexWriters.java:78) at org.apache.nutch.indexer.IndexerOutputFormat.getRecordWriter(IndexerOutputFormat.java:39) at org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.<init>(ReduceTask.java:449) at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:491) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) Caused by: java.lang.ClassNotFoundException: org.apache.http.entity.mime.content.ContentBody at java.net.URLClassLoader$1.run(URLClassLoader.java:366) at java.net.URLClassLoader$1.run(URLClassLoader.java:355) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:354) at java.lang.ClassLoader.loadClass(ClassLoader.java:425) at java.lang.ClassLoader.loadClass(ClassLoader.java:358) ... 8 more 2014-04-04 12:57:30,436 ERROR indexer.IndexingJob - Indexer: java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357) at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:114) at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:176) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:186) {code} or {code} Exception in thread "main" java.lang.NoSuchFieldError: INSTANCE at org.apache.http.entity.ContentType.parse(ContentType.java:229) at org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:442) at org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:199) at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:118) at org.apache.solr.client.solrj.SolrServer.commit(SolrServer.java:168) at org.apache.solr.client.solrj.SolrServer.commit(SolrServer.java:146) at org.apache.nutch.indexwriter.solr.SolrIndexWriter.commit(SolrIndexWriter.java:166) at org.apache.nutch.indexer.IndexWriters.commit(IndexWriters.java:128) at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:118) at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:176) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:186) {code} Horrible!! > Upgrade to the latest Solr 4.x > ------------------------------ > > Key: NUTCH-1486 > URL: https://issues.apache.org/jira/browse/NUTCH-1486 > Project: Nutch > Issue Type: Improvement > Affects Versions: 1.6, 2.1 > Environment: Solr 4.0, Nutch trunk 1.6-SNAPSHOT & Probably 2.2-SNAPHOT > Reporter: Lewis John McGibbney > Assignee: Lewis John McGibbney > Fix For: 1.9 > > Attachments: NUTCH-1486-1.8.patch, NUTCH-1486-1.9-trunk.patch, > NUTCH-1486-2.x-v3.patch, NUTCH-1486-2.x.patch, NUTCH-1486-2.x.v2.patch, > NUTCH-1486-nutchgora.patch, NUTCH-1486-trunk.patch, > NUTCH-1486-trunk.v2.patch, NUTCH-1486-trunk.v3.patch > > > When attempting to configure a 4 multicore 4.0 instance with Nutch > schema-solr4.xml file, I get the following exceptions. > This has been discussed previously. As I see it we have two options > 1. Keep maintaining both schema options > 2. Ditch the more complex schema-solr4.xml in favour of vanilla schema.xml > Thoughts? > {code} > SEVERE: Unable to create core: collection4 > org.apache.solr.common.SolrException: Unable to use updateLog: _version_field > must exist in schema, using indexed="true" stored="true" and > multiValued="false" (_version_ does not exist) > at org.apache.solr.core.SolrCore.<init>(SolrCore.java:721) > at org.apache.solr.core.SolrCore.<init>(SolrCore.java:566) > at org.apache.solr.core.CoreContainer.create(CoreContainer.java:850) > at org.apache.solr.core.CoreContainer.load(CoreContainer.java:534) > at org.apache.solr.core.CoreContainer.load(CoreContainer.java:356) > at > org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:308) > at > org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:107) > at org.eclipse.jetty.servlet.FilterHolder.doStart(FilterHolder.java:114) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.servlet.ServletHandler.initialize(ServletHandler.java:754) > at > org.eclipse.jetty.servlet.ServletContextHandler.startContext(ServletContextHandler.java:258) > at > org.eclipse.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1221) > at > org.eclipse.jetty.server.handler.ContextHandler.doStart(ContextHandler.java:699) > at > org.eclipse.jetty.webapp.WebAppContext.doStart(WebAppContext.java:454) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.deploy.bindings.StandardStarter.processBinding(StandardStarter.java:36) > at > org.eclipse.jetty.deploy.AppLifeCycle.runBindings(AppLifeCycle.java:183) > at > org.eclipse.jetty.deploy.DeploymentManager.requestAppGoal(DeploymentManager.java:491) > at > org.eclipse.jetty.deploy.DeploymentManager.addApp(DeploymentManager.java:138) > at > org.eclipse.jetty.deploy.providers.ScanningAppProvider.fileAdded(ScanningAppProvider.java:142) > at > org.eclipse.jetty.deploy.providers.ScanningAppProvider$1.fileAdded(ScanningAppProvider.java:53) > at org.eclipse.jetty.util.Scanner.reportAddition(Scanner.java:604) > at org.eclipse.jetty.util.Scanner.reportDifferences(Scanner.java:535) > at org.eclipse.jetty.util.Scanner.scan(Scanner.java:398) > at org.eclipse.jetty.util.Scanner.doStart(Scanner.java:332) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.deploy.providers.ScanningAppProvider.doStart(ScanningAppProvider.java:118) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.deploy.DeploymentManager.startAppProvider(DeploymentManager.java:552) > at > org.eclipse.jetty.deploy.DeploymentManager.doStart(DeploymentManager.java:227) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.util.component.AggregateLifeCycle.doStart(AggregateLifeCycle.java:63) > at > org.eclipse.jetty.server.handler.AbstractHandler.doStart(AbstractHandler.java:53) > at > org.eclipse.jetty.server.handler.HandlerWrapper.doStart(HandlerWrapper.java:91) > at org.eclipse.jetty.server.Server.doStart(Server.java:263) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.xml.XmlConfiguration$1.run(XmlConfiguration.java:1215) > at java.security.AccessController.doPrivileged(Native Method) > at > org.eclipse.jetty.xml.XmlConfiguration.main(XmlConfiguration.java:1138) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > at java.lang.reflect.Method.invoke(Method.java:597) > at org.eclipse.jetty.start.Main.invokeMain(Main.java:457) > at org.eclipse.jetty.start.Main.start(Main.java:602) > at org.eclipse.jetty.start.Main.main(Main.java:82) > Caused by: org.apache.solr.common.SolrException: Unable to use updateLog: > _version_field must exist in schema, using indexed="true" stored="true" and > multiValued="false" (_version_ does not exist) > at org.apache.solr.update.UpdateLog.init(UpdateLog.java:236) > at org.apache.solr.update.UpdateHandler.initLog(UpdateHandler.java:94) > at org.apache.solr.update.UpdateHandler.<init>(UpdateHandler.java:123) > at > org.apache.solr.update.DirectUpdateHandler2.<init>(DirectUpdateHandler2.java:97) > at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) > at > sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39) > at > sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27) > at java.lang.reflect.Constructor.newInstance(Constructor.java:513) > at org.apache.solr.core.SolrCore.createInstance(SolrCore.java:476) > at org.apache.solr.core.SolrCore.createUpdateHandler(SolrCore.java:544) > at org.apache.solr.core.SolrCore.<init>(SolrCore.java:705) > ... 45 more > Caused by: org.apache.solr.common.SolrException: _version_field must exist in > schema, using indexed="true" stored="true" and multiValued="false" (_version_ > does not exist) > at > org.apache.solr.update.VersionInfo.getAndCheckVersionField(VersionInfo.java:57) > at org.apache.solr.update.VersionInfo.<init>(VersionInfo.java:83) > at org.apache.solr.update.UpdateLog.init(UpdateLog.java:233) > ... 55 more > 01-Nov-2012 16:26:15 org.apache.solr.common.SolrException log > SEVERE: null:org.apache.solr.common.SolrException: Unable to use updateLog: > _version_field must exist in schema, using indexed="true" stored="true" and > multiValued="false" (_version_ does not exist) > at org.apache.solr.core.SolrCore.<init>(SolrCore.java:721) > at org.apache.solr.core.SolrCore.<init>(SolrCore.java:566) > at org.apache.solr.core.CoreContainer.create(CoreContainer.java:850) > at org.apache.solr.core.CoreContainer.load(CoreContainer.java:534) > at org.apache.solr.core.CoreContainer.load(CoreContainer.java:356) > at > org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:308) > at > org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:107) > at org.eclipse.jetty.servlet.FilterHolder.doStart(FilterHolder.java:114) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.servlet.ServletHandler.initialize(ServletHandler.java:754) > at > org.eclipse.jetty.servlet.ServletContextHandler.startContext(ServletContextHandler.java:258) > at > org.eclipse.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1221) > at > org.eclipse.jetty.server.handler.ContextHandler.doStart(ContextHandler.java:699) > at > org.eclipse.jetty.webapp.WebAppContext.doStart(WebAppContext.java:454) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.deploy.bindings.StandardStarter.processBinding(StandardStarter.java:36) > at > org.eclipse.jetty.deploy.AppLifeCycle.runBindings(AppLifeCycle.java:183) > at > org.eclipse.jetty.deploy.DeploymentManager.requestAppGoal(DeploymentManager.java:491) > at > org.eclipse.jetty.deploy.DeploymentManager.addApp(DeploymentManager.java:138) > at > org.eclipse.jetty.deploy.providers.ScanningAppProvider.fileAdded(ScanningAppProvider.java:142) > at > org.eclipse.jetty.deploy.providers.ScanningAppProvider$1.fileAdded(ScanningAppProvider.java:53) > at org.eclipse.jetty.util.Scanner.reportAddition(Scanner.java:604) > at org.eclipse.jetty.util.Scanner.reportDifferences(Scanner.java:535) > at org.eclipse.jetty.util.Scanner.scan(Scanner.java:398) > at org.eclipse.jetty.util.Scanner.doStart(Scanner.java:332) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.deploy.providers.ScanningAppProvider.doStart(ScanningAppProvider.java:118) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.deploy.DeploymentManager.startAppProvider(DeploymentManager.java:552) > at > org.eclipse.jetty.deploy.DeploymentManager.doStart(DeploymentManager.java:227) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.util.component.AggregateLifeCycle.doStart(AggregateLifeCycle.java:63) > at > org.eclipse.jetty.server.handler.AbstractHandler.doStart(AbstractHandler.java:53) > at > org.eclipse.jetty.server.handler.HandlerWrapper.doStart(HandlerWrapper.java:91) > at org.eclipse.jetty.server.Server.doStart(Server.java:263) > at > org.eclipse.jetty.util.component.AbstractLifeCycle.start(AbstractLifeCycle.java:59) > at > org.eclipse.jetty.xml.XmlConfiguration$1.run(XmlConfiguration.java:1215) > at java.security.AccessController.doPrivileged(Native Method) > at > org.eclipse.jetty.xml.XmlConfiguration.main(XmlConfiguration.java:1138) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > at java.lang.reflect.Method.invoke(Method.java:597) > at org.eclipse.jetty.start.Main.invokeMain(Main.java:457) > at org.eclipse.jetty.start.Main.start(Main.java:602) > at org.eclipse.jetty.start.Main.main(Main.java:82) > Caused by: org.apache.solr.common.SolrException: Unable to use updateLog: > _version_field must exist in schema, using indexed="true" stored="true" and > multiValued="false" (_version_ does not exist) > at org.apache.solr.update.UpdateLog.init(UpdateLog.java:236) > at org.apache.solr.update.UpdateHandler.initLog(UpdateHandler.java:94) > at org.apache.solr.update.UpdateHandler.<init>(UpdateHandler.java:123) > at > org.apache.solr.update.DirectUpdateHandler2.<init>(DirectUpdateHandler2.java:97) > at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) > at > sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39) > at > sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27) > at java.lang.reflect.Constructor.newInstance(Constructor.java:513) > at org.apache.solr.core.SolrCore.createInstance(SolrCore.java:476) > at org.apache.solr.core.SolrCore.createUpdateHandler(SolrCore.java:544) > at org.apache.solr.core.SolrCore.<init>(SolrCore.java:705) > ... 45 more > Caused by: org.apache.solr.common.SolrException: _version_field must exist in > schema, using indexed="true" stored="true" and multiValued="false" (_version_ > does not exist) > at > org.apache.solr.update.VersionInfo.getAndCheckVersionField(VersionInfo.java:57) > at org.apache.solr.update.VersionInfo.<init>(VersionInfo.java:83) > at org.apache.solr.update.UpdateLog.init(UpdateLog.java:233) > ... 55 more > 01-Nov-2012 16:26:15 org.apache.solr.servlet.SolrDispatchFilter init > INFO: user.dir=/home/lewis/ASF/solr/example > 01-Nov-2012 16:26:15 org.apache.solr.servlet.SolrDispatchFilter init > INFO: SolrDispatchFilter.init() done > 2012-11-01 16:26:15.228:INFO:oejs.AbstractConnector:Started > SocketConnector@0.0.0.0:8983 > {code} -- This message was sent by Atlassian JIRA (v6.2#6252)