woot, OK will roll 0.8.1 RC over the weekend. Watch for it!
Cheers,
Chris

------------------------
Chris Mattmann
chris.mattm...@gmail.com

-----Original Message-----
From: "Mallder, Valerie" <valerie.mall...@jhuapl.edu>
Reply-To: <dev@oodt.apache.org>
Date: Friday, January 23, 2015 at 1:02 PM
To: "dev@oodt.apache.org" <dev@oodt.apache.org>
Subject: RE: FW: Tyler - I may need your help

>Hi Tyler,
>
>Yes, this fix did take care of my problem. Thanks so much!
>
>Chris, if you want to make a new OODT 0.8.1, be sure to also include the
>fix for OODT-805 below in the radix installation. My system is back up
>and running now.
>
>Thanks,
>
>Val
>
>Valerie A. Mallder
>New Horizons Deputy Mission System Engineer
>Johns Hopkins University/Applied Physics Laboratory
>
>> -----Original Message-----
>> From: Tyler Palsulich [mailto:tpalsul...@gmail.com]
>> Sent: Thursday, January 22, 2015 10:35 PM
>> To: dev
>> Subject: Re: FW: Tyler - I may need your help
>>
>> Hi Val,
>>
>> Please see OODT-805 and
>> https://github.com/apache/oodt/commit/cf1220d4ac66ccefc8e510c62fb6b38cf529ffb2
>> for what I believe is the fix.
>>
>> Can you make the MimeTypeUtils changes locally or try out trunk?
>>
>> Let me know!
>> Tyler
>>
>> On Thu, Jan 22, 2015 at 5:40 PM, Tyler Palsulich <tpalsul...@gmail.com>
>> wrote:
>>
>> > Hi Val,
>> >
>> > Yes, I think you've hit the nail on the head -- if Tika isn't passed
>> > your updated mimetypes configuration file (with your custom types),
>> > then those files will not be properly identified. I'll look into this
>> > issue more tonight and hopefully find a fix. :)
>> >
>> > > by default tika only knows about xml files, text files,
>> > > application/octet-stream files.
>> > I'm not sure what you mean by this? Tika knows about much more than
>> > that, but is there an OODT config that overrides that?
>> >
>> > > I'm a newbie with Java and I can't guarantee I would be able to
>> > > build a JUnit test program very easily. But I will continue to
>> > > investigate and see what I can do.
>> >
>> > No worries! :) If you have time and want to try your hand at it, the
>> > best way to learn is by looking at the existing tests, like in
>> > https://github.com/apache/oodt/blob/trunk/metadata/src/test/org/apache/oodt/cas/metadata/util/TestMimeTypeUtils.java
>> >
>> > Have a good night,
>> > Tyler
>> >
>> > On Thu, Jan 22, 2015 at 2:22 PM, Mallder, Valerie
>> > <valerie.mall...@jhuapl.edu> wrote:
>> >
>> >> Hi Tyler,
>> >>
>> >> Can you tell me more about the tika-mimetypes.xml file? Is this a new
>> >> 'required' file? I'm not 100% sure about this yet, but it seems to
>> >> me that, since MimeTypeUtils.java instantiates Tika with the default
>> >> constructor, and never explicitly tells Tika which mime-types file to
>> >> use (even though the correct mime-types.xml file is passed to the
>> >> MimeTypeUtils constructor from MimeExtractorRepo), there is no place
>> >> where the contents of my mime-types.xml file are being read and stored
>> >> in Tika's MimeTypeRegistry, and by default tika only knows about
>> >> xml files, text files, application/octet-stream files.
>> >>
>> >> I will keep looking at this tomorrow and verify which file is
>> >> passed to Tika's MimeTypesFactory class, but I have to head home now.
>> >>
>> >> Val
>> >>
>> >> Valerie A. Mallder
>> >> New Horizons Deputy Mission System Engineer
>> >> Johns Hopkins University/Applied Physics Laboratory
>> >>
>> >> -----Original Message-----
>> >> From: Mallder, Valerie
>> >> Sent: Thursday, January 22, 2015 11:42 AM
>> >> To: dev
>> >> Subject: RE: Tyler - I may need your help
>> >>
>> >> Hi Tyler,
>> >>
>> >> I have defined a few custom mime types in my
>> >> filemgr/etc/mime-types.xml file. The contents of my file look
>> >> exactly like the contents of
>> >> http://svn.apache.org/viewvc/oodt/tags/0.8/filemgr/src/main/resources/mime-types.xml
>> >> with the addition of project-specific mime-types.
>> >> The tika-mimetypes.xml file you pointed me to has ~2000 additional
>> >> lines in it as compared to the
>> >> http://svn.apache.org/viewvc/oodt/tags/0.8/filemgr/src/main/resources/mime-types.xml
>> >> file and the
>> >> http://svn.apache.org/viewvc/oodt/tags/0.8/mvn/archetypes/radix/src/main/resources/archetype-resources/filemgr/src/main/resources/etc/mime-types.xml
>> >> file. So, it is definitely different than the one I've
>> >> been using. But, I copied it over and added my mime types to it, and
>> >> it didn't help. The mime types it is returning are 'reasonable'
>> >> mime-types to return, they are just not the mime-types that I defined
>> >> them as. For instance, I have *.sfdu files and *.out files that
>> >> contain binary data, and tika says they are
>> >> "application/octet-stream" files. I also have *.ecsv files that
>> >> contain text, and tika says they are "text/plain" files.
>> >>
>> >> But here are the mime-types I defined for these files for my project,
>> >> and these are the mime-types that I have defined extractors for. None
>> >> of these filename extensions "*.out, *.ecsv, and *.sfdu" are defined
>> >> elsewhere in the mime-types.xml file.
>> >>
>> >> <mime-type type="product/fei-out">
>> >>   <glob pattern="*.out"/>
>> >> </mime-type>
>> >>
>> >> <mime-type type="product/fei-ecsv">
>> >>   <glob pattern="*.ecsv"/>
>> >> </mime-type>
>> >>
>> >> <mime-type type="product/fei-sfdu">
>> >>   <glob pattern="*.sfdu"/>
>> >> </mime-type>
>> >>
>> >> I'm a newbie with Java and I can't guarantee I would be able to build
>> >> a JUnit test program very easily. But I will continue to investigate
>> >> and see what I can do.
>> >>
>> >> Thanks!
>> >>
>> >> Val
>> >>
>> >> Valerie A. Mallder
>> >> New Horizons Deputy Mission System Engineer
>> >> Johns Hopkins University/Applied Physics Laboratory
>> >>
>> >> > -----Original Message-----
>> >> > From: Tyler Palsulich [mailto:tpalsul...@gmail.com]
>> >> > Sent: Wednesday, January 21, 2015 5:13 PM
>> >> > To: dev
>> >> > Subject: Re: Tyler - I may need your help
>> >> >
>> >> > Hi Val,
>> >> >
>> >> > Hmm... Is there a particular (wrong) mime-type that keeps getting
>> >> > detected (like text/plain, or something)? I'm curious if the type
>> >> > is just returning a default. Or, is it a seemingly random file
>> >> > type? What are the contents of your mime-types.xml file?
>> >> > If it's different than
>> >> > https://raw.githubusercontent.com/apache/tika/trunk/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml,
>> >> > can you try copying it over?
>> >> >
>> >> > I'm not sure I'll be able to replicate your error on my computer
>> >> > without a bit of difficulty. Do you think there is any way you
>> >> > could create a JUnit test case with the problem?
>> >> >
>> >> > Tyler
>> >> >
>> >> > On Wed, Jan 21, 2015 at 1:26 PM, Mallder, Valerie
>> >> > <valerie.mall...@jhuapl.edu> wrote:
>> >> >
>> >> > > Hi Tyler,
>> >> > >
>> >> > > I have been looking into an issue that cropped up in my OODT
>> >> > > system when I upgraded to OODT 0.8. The issue is, my
>> >> > > AutoDetectProductCrawler, which is launched from a
>> >> > > PGETaskInstance, is unable to determine the mime-type for my
>> >> > > product files.
>> >> > > I am using the same filemgr/etc/mime-types.xml
>> >> > > file that I was using with OODT 0.7, and I am using the same
>> >> > > oodt/extensions/policy/mime-extractor-map.xml file that I was
>> >> > > using with OODT 0.7, but now, in
>> >> > > MimeTypeRepo::getExtractorSpecsForFile, the call to
>> >> > > this.mimeRepo.getMimeType(file) is returning the wrong mime-types
>> >> > > for all of my files, and so the AutoDetectProductCrawler is
>> >> > > telling me I have no extractor specs for my files.
>> >> > >
>> >> > > I noticed that you did some work on MimeTypeUtils for OODT-630 in
>> >> > > OODT 0.8. At first glance, it doesn't look like any of this work
>> >> > > would be directly responsible. Can you think of anything that
>> >> > > might be causing this to happen? I don't know anything about
>> >> > > tika. Do I need to make any changes to my policy files to remain
>> >> > > compatible? Just looking for clues on how to resolve this. I have
>> >> > > verified by adding log messages throughout the code that, prior to
>> >> > > launching the AutoDetectProductCrawler, all of the policy files
>> >> > > are read correctly. The MimeExtractorConfigReader is reading the
>> >> > > correct mime-extractor-map.xml file, and it is calling
>> >> > > setMimeRepoFile with the correct mime-types.xml file, and it is
>> >> > > setting the correct extractor config file, etc. But, once
>> >> > > AutoDetectProductCrawler starts crawling, it tries to
>> >> > > getExtractorSpecsForFile but determines the wrong mime type and
>> >> > > then can't find the extractor spec.
>> >> > >
>> >> > > Thanks,
>> >> > > Val
>> >> > >
>> >> > > Valerie A. Mallder
>> >> > > New Horizons Deputy Mission System Engineer
>> >> > > The Johns Hopkins University/Applied Physics Laboratory
>> >> > > 11100 Johns Hopkins Rd (MS 23-282), Laurel, MD 20723
>> >> > > 240-228-7846 (Office) 410-504-2233 (Blackberry)
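
For readers following the thread: the three glob definitions Val quotes are meant to map file-name extensions to project-specific types, with anything unmatched falling back to a generic type. The sketch below illustrates that expected mapping using only the Java standard library. It is not Tika's or OODT's implementation. The class name GlobMimeLookup, the SAMPLE string, and the <mime-info> wrapper element are assumptions made for illustration, and only simple "*.ext" globs are handled.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper for illustration only; not part of OODT or Tika.
class GlobMimeLookup {
    // Generic type returned when no glob matches the file name.
    static final String FALLBACK = "application/octet-stream";

    // The three custom definitions from the thread, wrapped in an
    // assumed <mime-info> root element so the snippet is well-formed XML.
    static final String SAMPLE =
        "<mime-info>"
        + "<mime-type type=\"product/fei-out\"><glob pattern=\"*.out\"/></mime-type>"
        + "<mime-type type=\"product/fei-ecsv\"><glob pattern=\"*.ecsv\"/></mime-type>"
        + "<mime-type type=\"product/fei-sfdu\"><glob pattern=\"*.sfdu\"/></mime-type>"
        + "</mime-info>";

    // Maps a file-name suffix such as ".out" to its declared mime type.
    private final Map<String, String> suffixToType = new LinkedHashMap<>();

    GlobMimeLookup(String mimeTypesXml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new ByteArrayInputStream(
                    mimeTypesXml.getBytes(StandardCharsets.UTF_8)));
            NodeList types = doc.getElementsByTagName("mime-type");
            for (int i = 0; i < types.getLength(); i++) {
                Element type = (Element) types.item(i);
                NodeList globs = type.getElementsByTagName("glob");
                for (int j = 0; j < globs.getLength(); j++) {
                    String pattern = ((Element) globs.item(j)).getAttribute("pattern");
                    // Only "*.ext" style globs are supported in this sketch.
                    if (pattern.startsWith("*.")) {
                        suffixToType.put(pattern.substring(1), type.getAttribute("type"));
                    }
                }
            }
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse mime-types XML", e);
        }
    }

    // Returns the declared type for the first matching suffix, else the fallback.
    String detect(String fileName) {
        for (Map.Entry<String, String> entry : suffixToType.entrySet()) {
            if (fileName.endsWith(entry.getKey())) {
                return entry.getValue();
            }
        }
        return FALLBACK;
    }

    public static void main(String[] args) {
        GlobMimeLookup lookup = new GlobMimeLookup(SAMPLE);
        System.out.println(lookup.detect("telemetry.sfdu"));
        System.out.println(lookup.detect("notes.txt"));
    }
}
```

With the custom definitions loaded, a *.sfdu file resolves to product/fei-sfdu. If the definitions are never loaded, every file falls through to the generic type, which matches the symptom described in the thread.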