Hi Group,
I'm using the TIKA 0.6 as a bundle. It works find except by Office 2007 files.
I used the test files from the source code of TIKA but it doesn't. I also tried
to create own Office 2008 files, but also here the same problem.
Maybe a useful information, I use the line parser.parse(sourceFile, new
BodyContentHandler(), metadata, new ParseContext()); to parse the documents.
May somebody has an idea?
All regards
Stefan
!SESSION 2010-03-15 15:03:24.621 -----------------------------------------------
eclipse.buildId=unknown
java.version=1.6.0_18
java.vendor=Sun Microsystems Inc.
BootLoader constants: OS=win32, ARCH=x86, WS=win32, NL=de_DE
Framework arguments: -application
edu.emich.compsci.virtus.metadataguiextension.application
Command-line arguments: -application
edu.emich.compsci.virtus.metadataguiextension.application -data
C:\Users\User\workspace\Thesis/../runtime-edu.emich.compsci.virtus.metadataguiextension.application
-dev
file:C:/Users/User/workspace/Thesis/.metadata/.plugins/org.eclipse.pde.core/edu.emich.compsci.virtus.metadataguiextension.application/dev.properties
-os win32 -ws win32 -arch x86 -consoleLog
!ENTRY org.eclipse.ui 4 0 2010-03-15 15:03:34.544
!MESSAGE Unhandled event loop exception
!STACK 0
java.lang.NoClassDefFoundError:
org/openxmlformats/schemas/spreadsheetml/x2006/main/CTSheet
at
org.apache.poi.xssf.extractor.XSSFExcelExtractor.<init>(XSSFExcelExtractor.java:49)
at
org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:104)
at
org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:86)
at
org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:47)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:120)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:101)
at
edu.emich.compsci.virtus.metadata.extractor.MetaDataExtractor.getMetaData(MetaDataExtractor.java:56)
at
edu.emich.compsci.virtus.cab.CoreApplication.extractDirectory(CoreApplication.java:96)
at
edu.emich.compsci.virtus.cab.CoreApplication.startFileExtracting(CoreApplication.java:55)
at
edu.emich.compsci.virtus.metadataguiextension.commands.HandlerSelectDirectory.execute(HandlerSelectDirectory.java:37)
at
org.eclipse.ui.internal.handlers.HandlerProxy.execute(HandlerProxy.java:294)
at org.eclipse.core.commands.Command.executeWithChecks(Command.java:476)
at
org.eclipse.core.commands.ParameterizedCommand.executeWithChecks(ParameterizedCommand.java:508)
at
org.eclipse.ui.internal.handlers.HandlerService.executeCommand(HandlerService.java:169)
at
org.eclipse.ui.internal.handlers.SlaveHandlerService.executeCommand(SlaveHandlerService.java:241)
at
org.eclipse.ui.menus.CommandContributionItem.handleWidgetSelection(CommandContributionItem.java:770)
at
org.eclipse.ui.menus.CommandContributionItem.access$10(CommandContributionItem.java:756)
at
org.eclipse.ui.menus.CommandContributionItem$5.handleEvent(CommandContributionItem.java:746)
at org.eclipse.swt.widgets.EventTable.sendEvent(EventTable.java:84)
at org.eclipse.swt.widgets.Widget.sendEvent(Widget.java:1003)
at org.eclipse.swt.widgets.Display.runDeferredEvents(Display.java:3880)
at org.eclipse.swt.widgets.Display.readAndDispatch(Display.java:3473)
at org.eclipse.ui.internal.Workbench.runEventLoop(Workbench.java:2405)
at org.eclipse.ui.internal.Workbench.runUI(Workbench.java:2369)
at org.eclipse.ui.internal.Workbench.access$4(Workbench.java:2221)
at org.eclipse.ui.internal.Workbench$5.run(Workbench.java:500)
at
org.eclipse.core.databinding.observable.Realm.runWithDefault(Realm.java:332)
at
org.eclipse.ui.internal.Workbench.createAndRunWorkbench(Workbench.java:493)
at org.eclipse.ui.PlatformUI.createAndRunWorkbench(PlatformUI.java:149)
at
edu.emich.compsci.virtus.metadataguiextension.Application.start(Application.java:20)
at
org.eclipse.equinox.internal.app.EclipseAppHandle.run(EclipseAppHandle.java:194)
at
org.eclipse.core.runtime.internal.adaptor.EclipseAppLauncher.runApplication(EclipseAppLauncher.java:110)
at
org.eclipse.core.runtime.internal.adaptor.EclipseAppLauncher.start(EclipseAppLauncher.java:79)
at
org.eclipse.core.runtime.adaptor.EclipseStarter.run(EclipseStarter.java:368)
at
org.eclipse.core.runtime.adaptor.EclipseStarter.run(EclipseStarter.java:179)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
at java.lang.reflect.Method.invoke(Unknown Source)
at org.eclipse.equinox.launcher.Main.invokeFramework(Main.java:559)
at org.eclipse.equinox.launcher.Main.basicRun(Main.java:514)
at org.eclipse.equinox.launcher.Main.run(Main.java:1311)
at org.eclipse.equinox.launcher.Main.main(Main.java:1287)
Caused by: java.lang.ClassNotFoundException:
org.openxmlformats.schemas.spreadsheetml.x2006.main.CTSheet
at
org.eclipse.osgi.internal.loader.BundleLoader.findClassInternal(BundleLoader.java:494)
at
org.eclipse.osgi.internal.loader.BundleLoader.findClass(BundleLoader.java:410)
at
org.eclipse.osgi.internal.loader.BundleLoader.findClass(BundleLoader.java:398)
at
org.eclipse.osgi.internal.baseadaptor.DefaultClassLoader.loadClass(DefaultClassLoader.java:105)
at java.lang.ClassLoader.loadClass(Unknown Source)
... 43 more