Is this really a stumper? This is my first experience with Solr, and after only an hour or so with it I hit the barrier described below. I'm sure *I* am doing something completely wrong; I'm just hoping someone more familiar with the platform can help me identify and fix it.

For starters... what does "Could not initialize class ..." mean in Java, exactly? Maybe that the class (i.e. the code) itself doesn't exist, so perhaps I haven't downloaded all the pieces of the project? Or could it be a hint that my kit is just not configured correctly?

Sorry, I'm not a Java expert, but I would like to get this stabilized, if possible.

If this is the wrong mailing list then just tell me and I'll go away...

Thanks!

On Oct 20, 2011, at 2:54 PM, MBD wrote:

> Hi, I'm new to Solr and trying to get it to index PDFs. Having trouble
> getting started. Following examples in ExtractingRequestHandler wiki
> <http://wiki.apache.org/solr/ExtractingRequestHandler>.
>
> Got Solr running and it indexes html, xml & txt files just fine...but when I
> try to feed it a .pdf it spits out an "Error 500 Could not initialize class
> org.apache.pdfbox.pdmodel.PDPage":
>
> $ curl "http://localhost:8983/solr/update/extract?literal.id=doc1&commit=true" -F "myfile=@index.pdf"
> <html>
> <head>
> <meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1"/>
> <title>Error 500 Could not initialize class org.apache.pdfbox.pdmodel.PDPage
>
> java.lang.NoClassDefFoundError: Could not initialize class org.apache.pdfbox.pdmodel.PDPage
> ...
>
> I thought maybe it's because Tika isn't installed/included so I tried
> downloading and installing Tika separately...but even the Tika install fails
> with:
>
> -------------------------------------------------------------------------------
> Test set: org.apache.tika.parser.pdf.PDFParserTest
> -------------------------------------------------------------------------------
> Tests run: 5, Failures: 0, Errors: 5, Skipped: 0, Time elapsed: 0.63 sec <<< FAILURE!
> testVarious(org.apache.tika.parser.pdf.PDFParserTest) Time elapsed: 0.165 sec <<< ERROR!
> java.lang.NoClassDefFoundError: Could not initialize class org.apache.pdfbox.pdmodel.PDPage
>
> I don't know Java (but hopefully won't need to in order to get basic indexing
> up and running as ultimate goal is to query this via Sunspot from a Rails
> app) so go easy on me.
>
> Let me know if you want/need more of the error dump.
>
> Any help would be greatly appreciated!
> -Mike
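
A note on the "Could not initialize class" wording asked about above: in general this message does not mean the class file is missing. It usually means the JVM found the class, but its static setup failed on an earlier attempt, and every later use is then rejected with this error. The sketch below reproduces that behaviour in isolation. The names `InitFailureDemo`, `Fragile`, `compute` and `touch` are made up for illustration and have nothing to do with PDFBox or Solr; why `PDPage` fails to initialize on this particular machine is not something the snippet can show.

```java
public class InitFailureDemo {

    // Hypothetical stand-in for a class whose static setup fails at load time.
    static class Fragile {
        // Not a compile-time constant, so reading it forces class initialization.
        static final int VALUE = compute();

        static int compute() {
            throw new IllegalStateException("static setup failed");
        }
    }

    // Touches Fragile and reports which kind of error the JVM raised.
    static String touch() {
        try {
            return "ok " + Fragile.VALUE;
        } catch (Throwable t) {
            return t.getClass().getSimpleName();
        }
    }

    public static void main(String[] args) {
        // First use: the initializer runs and fails (ExceptionInInitializerError).
        System.out.println(touch());
        // Later uses: the class is marked unusable (NoClassDefFoundError,
        // with the message "Could not initialize class ...").
        System.out.println(touch());
    }
}
```

If the same pattern applies here, the first failure recorded in the log (typically an ExceptionInInitializerError with its own "Caused by" line) would be the more informative part of the error dump, since the later NoClassDefFoundError entries only repeat that the class is unusable.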