Re: wiki access

2015-03-25 Thread Annie Burgess
Oops, user name: AnnieBurgess.

On Wed, Mar 25, 2015 at 12:13 PM, Annie Burgess 
wrote:

> Thanks Nick! About how long does it take for the permission to go through?
>
>
>
> On Wed, Mar 25, 2015 at 9:58 AM, Nick Burch  wrote:
>
>> On Wed, 25 Mar 2015, Annie Burgess wrote:
>>
>>> Could I please get access to edit the tika wiki page?
>>>
>>
>> Karma granted, enjoy!
>>
>> Nick
>>
>
>



-- 
--
Ann Bryant Burgess, PhD

Postdoctoral Fellow
Computer Science Department
Viterbi School of Engineering
University of Southern California

Phone:  (585) 738-7549
--


Re: wiki access

2015-03-25 Thread Annie Burgess
Thanks Nick! About how long does it take for the permission to go through?



On Wed, Mar 25, 2015 at 9:58 AM, Nick Burch  wrote:

> On Wed, 25 Mar 2015, Annie Burgess wrote:
>
>> Could I please get access to edit the tika wiki page?
>>
>
> Karma granted, enjoy!
>
> Nick
>





wiki access

2015-03-25 Thread Annie Burgess
Hi Dev list,

Could I please get access to edit the tika wiki page?

name: AnnieBryant
alias name: abburgess

Thanks!
Annie



Re: [jira] [Created] (TIKA-1562) Add examples from the Tika in Action book

2015-02-26 Thread Annie Burgess
+1


On Wed, Feb 25, 2015 at 8:20 PM, Chris A. Mattmann (JIRA) 
wrote:

> Chris A. Mattmann created TIKA-1562:
> ---
>
>  Summary: Add examples from the Tika in Action book
>  Key: TIKA-1562
>  URL: https://issues.apache.org/jira/browse/TIKA-1562
>  Project: Tika
>   Issue Type: Bug
>   Components: example
> Reporter: Chris A. Mattmann
> Assignee: Chris A. Mattmann
>  Fix For: 1.8
>
>
> Manning publications has granted permission for me to contribute the Tika
> in Action code to Apache Tika. Yay! I'll put it in the examples module and
> update it if needed.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.3.4#6332)
>



-- 
--
Ann Bryant Burgess, PhD

Postdoctoral Fellow
Computer Science Department
University of Southern California
Viterbi School of Engineering
Los Angeles, CA

Alaska Science Center/USGS
Anchorage, AK

Cell:  (585) 738-7549
Office:  (907) 786-7059
Fax:  (907) 786-7150
E-mail: anniebryant.burg...@gmail.com
Office Address: 4210 University Dr., Anchorage, AK 99508-4626
---


Re: Sonatype Docs

2014-12-09 Thread Annie Burgess
+1

This will really help us get a lot done in the new year!!

On Fri, Dec 5, 2014 at 2:50 PM, Lewis John Mcgibbney
 wrote:
> Hi Folks,
> Based on ongoing work and the effort to take Tika to the Xetabytes of
> scientific data out there, I've added a small document for all committers
> (or anyone really) to use. The document essentially describes how to
> publish third party artifacts to Sonatype OSSRH which will then be sucked
> into Maven Central.
>
> https://wiki.apache.org/tika/ThirdPartySonaType
>
> HUGE thank you to Annie Bryant Burgess and Chris Mattmann for ongoing
> correspondence with the Unidata developers. The approach has enabled us to
> work with them in getting their artifacts published to Maven Central and it
> is a HUGE win for Tika.
>
> Thanks
> Lewis
>
> --
> *Lewis*





Re: [DISCUSS] Give examples of Parser, Detector, and Translator usage

2014-08-07 Thread Annie Burgess
+1

I like it Tyler!


On Thu, Aug 7, 2014 at 1:37 PM, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:

> thanks Tyler, perfect
>
> ++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++
>
>
>
>
>
>
> -Original Message-
> From: Tyler Palsulich 
> Reply-To: "dev@tika.apache.org" 
> Date: Thursday, August 7, 2014 2:33 PM
> To: "dev@tika.apache.org" 
> Subject: Re: [DISCUSS] Give examples of Parser, Detector, and Translator
> usage
>
> >Sounds like the new module is a good idea. So, let's jump on it! I will
> >create a new 'example' JIRA tag and create issues for creating the module
> >and adding Parse, Detect, and Translate examples. Others should add
> >issues/desired examples as they see fit. How's that sound?
> >
> >Tyler
> >
> >
> >On Thu, Aug 7, 2014 at 1:08 PM, Mattmann, Chris A (3980) <
> >chris.a.mattm...@jpl.nasa.gov> wrote:
> >
> >> Great idea! This is what we did with Apache OODT RADiX, which you can scope
> >> here:
> >> https://cwiki.apache.org/confluence/display/OODT/RADiX+Powered+By+OODT
> >>
> >> Sent from my iPhone
> >>
> >> On Aug 7, 2014, at 12:56 PM, "Hong-Thai Nguyen" wrote:
> >>
> >> Nice idea.
> >>
> >> We could do more than samples. We could generate a parser, detector, or
> >> translator Maven archetype: a kind of template so that users can quickly
> >> have a project in which to develop a new one.
> >>
> >> Regards,
> >>
> >> Hong-Thai
> >>
> >> On 07 Aug 2014, at 18:56, Tyler Palsulich <tpalsul...@apache.org> wrote:
> >>
> >> Hi All,
> >>
> >> I think we should add some consolidated documentation on how to use Tika's
> >> Java API. It would be very helpful if we had short snippets of code that
> >> showed how exactly you can use Parser.parse(), for example. I think I
> >> remember a thread about testing example code a while back, but I'm not
> >> sure. We have some developer documentation on the site, but the user docs
> >> are somewhat lacking.
> >>
> >> I can think of a few options:
> >>
> >> *1) tika-example module*. This module would have example code of using each
> >> main interface of Tika. Simplicity and organization would be king, so new
> >> users can find exactly what they're looking for quickly. A big benefit of
> >> this is that unit tests would be baked in. I like this option. One downside
> >> is that reading source code in the browser is terrible (e.g. see [0]).
> >>
> >> *2)* Examples section on the *wiki*. My impression is that the wiki is not
> >> as popular as the root website. And, it's also very easy to forget about
> >> and let go out of date. But, formatting and explanations would be pretty.
> >>
> >> *3)* Examples section on the *website*. This has the benefit of pretty
> >> formatting and coloring, without the potential user having to check out the
> >> repo or view direct source in browser. Another benefit is this section
> >> would be perfect for showing how to use the tika-app jar.
> >>
> >> Right now, I think the best option is a combination of 1 and 3. We get some
> >> end to end examples running in the tika-example module and short snippets
> >> of usage on an examples page of the website.
> >>
> >> What do you guys think? What other options should we consider?
> >>
> >> Tyler
> >>
> >> [0] - http://svn.apache.org/repos/asf/tika/trunk/tika-core/src/main/java/org/apache/tika/parser/Parser.java
> >>
>
>




Re: NetCDF to Maven Central

2014-08-07 Thread Annie Burgess
Hi John and Chris,

This makes sense to me.  John, is this feasible?

Annie


On Tue, Aug 5, 2014 at 1:49 PM, Chris Mattmann  wrote:

> Thanks John and Annie, just looping dev@tika in directly.
>
>
> It would be nice to have:
>
> 1. individual modules published to the Central repo underneath
> the guise of the edu.ucar domain (thanks for pushing this Annie)
> 2. a netcdf-min module (like the one I did for 4.2, which is everything
> that you need to use the Java library for parsing HDF and NetCDF)
> 3. a netcdf-all module that includes everything.
>
> I think that would be consistent.
>
> Thoughts?
>
> Cheers,
> Chris
>
> -Original Message-
> From: Annie Burgess 
> Reply-To: 
> Date: Tuesday, August 5, 2014 12:07 PM
> To: John Caron , Chris Mattmann 
> Cc: John Caron , "support-net...@unidata.ucar.edu"
> 
> Subject: Re: NetCDF to Maven Central
>
> >Thanks for the info John,
> >I'll chat with the Tika-dev team about the best way to proceed, re:
> >getting what we need on Maven.   Is it feasible to get netcdfAll its own
> >module if that's what it takes?
> >
> >Annie
> >
> >
> >
> >
> >On Tue, Aug 5, 2014 at 10:51 AM, John Caron  wrote:
> >
> >Hi Annie:
> >All of our artifacts are here:
> >
> >https://artifacts.unidata.ucar.edu/content/repositories/unidata-releases/
> >
> >
> >however, the netcdfAll.jar is not part of the maven repository because of
> >maven's insistence that each artifact have its own module. netcdfAll is
> >the collection of all of the component modules. (Possibly we could/should
> >fix that by giving it its own module.) At this point we build it and put
> >it on our website.
> >
> >So, at this point you would
> >
> >1) include all the component libraries you need (grib and protobuf, jdom
> >and jodatime) according to:
> >
> >
> >
> >http://www.unidata.ucar.edu/software/thredds/v4.5/netcdf-java/reference/JarDependencies.html
> >
> >
> >2) grab the netcdfAll jar from our website
> >
> >  ftp://ftp.unidata.ucar.edu/pub/netcdf-java/v4.5/netcdfAll-4.5.jar
> >
> >
> >John
> >
> >
> >
> >
> >
> >
> >On Mon, Aug 4, 2014 at 2:54 PM, Annie Burgess 
> >wrote:
> >
> >Hi John,
> >I've been able to successfully upload 4.3.22 to Maven Central.  But, it
> >looks like I need the netcdfAll-4.3.jar to successfully parse grib2
> >files.  For some reason I thought 4.3.22 and netcdfAll-4.3.jar were the
> >same, but it looks like there are quite a few differences. Is it possible to
> >get the netcdfAll-4.3 Java artifacts so that I can also place those on
> >Maven Central?
> >
> >Any help or insight would be great.
> >
> >Best,
> >Annie
> >
> >
> >
> >
> >On Tue, May 27, 2014 at 10:31 AM, John Caron 
> >wrote:
> >
> >Hi Chris, Annie:
> >
> >we have a number of artifacts, but the java-netcdf is:
> >
> >
> >https://artifacts.unidata.ucar.edu/content/repositories/unidata-releases/edu/ucar/netcdf/4.3.22/
> >
> >How do you want to handle new releases when they come? Are you only
> >interested in stable releases?
> >
> >John
> >
> >On 5/22/2014 10:25 AM, Mattmann, Chris A (3980) wrote:
> >> Thanks John - Annie and I will take an action to do this. You may
> >> get a request from Sonatype OSS and I will point you to it when it
> >> comes so you can approve. Also can you point us to your latest Maven
> >> release?
> >>
> >> ++
> >> Chris Mattmann, Ph.D.
> >> Chief Architect
> >> Instrument Software and Science Data Systems Section (398)
> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> Office: 168-519, Mailstop: 168-5th floor
> >> Email: chris.a.mattm...@nasa.gov
> >> WWW:  http://sunset.usc.edu/~mattmann/
> >> ++
> >> Adjunct Associate Professor, Computer Science Department
> >> University of Southern California, Los Angeles, CA 90089 USA
> >> ++
> >>
> >>
> >>
> >>
> >>
> >>
> >> -Original Message-
> >> From: John Caron 
> >> Date: Wednesday, May 21, 2014 4:26 PM
> >> To: Chris Mattmann ,
> >>"dev@tika.apache.org"
> >
> >

Re: [jira] [Commented] (TIKA-1363) .mat files not parsing

2014-07-15 Thread Annie Burgess
I pulled the new trunk and looks like Tika is now successfully parsing
Matlab .mat files at the command line and in the GUI.  Thanks all for your
help on this new parser!


On Tue, Jul 15, 2014 at 12:05 AM, Hudson (JIRA)  wrote:

>
> [
> https://issues.apache.org/jira/browse/TIKA-1363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14061817#comment-14061817
> ]
>
> Hudson commented on TIKA-1363:
> --
>
> SUCCESS: Integrated in tika-trunk-jdk1.6 #93 (See [
> https://builds.apache.org/job/tika-trunk-jdk1.6/93/])
> updated patch for TIKA-1363 from Annie Burgess: enables Mat parser in
> META-INF and fixes unit test to use AutoDetectParser to validate it.
> (mattmann: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1610600)
> *
> /tika/trunk/tika-parsers/src/main/resources/META-INF/services/org.apache.tika.parser.Parser
> *
> /tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/mat/MatParserTest.java
>
>
> > .mat files not parsing
> > --
> >
> > Key: TIKA-1363
> > URL: https://issues.apache.org/jira/browse/TIKA-1363
> > Project: Tika
> >  Issue Type: Bug
> >  Components: parser
> >Affects Versions: 1.6
> >Reporter: Ann Burgess
> >Assignee: Chris A. Mattmann
> >  Labels: metadata, parser, snapshot
> > Fix For: 1.6
> >
> > Attachments: TIKA.1363.aburgess.140614.patch.txt, test_data_1.mat
> >
> >
> > We recently committed a parser for Matlab .mat files, however I've just
> > downloaded the most recent Tika and am not getting any parsed --text or
> > --metadata for the .mat file used in the unit test.  The steps I've used
> > are below.  Am I missing something at the command line?  Can anyone else
> > successfully get a text or metadata output for a .mat file?
> > Steps:
> > svn co https://svn.apache.org/repos/asf/tika/trunk tika
> > setenv MAVEN_OPTS "-Xms128m -Xmx256m"
> > cd tika
> > mvn install
> > java -jar tika-app/target/tika-app-1.6-SNAPSHOT.jar --text /Users/IGSWAHWSWBURGESS/Development/tika/tika-parsers/src/test/resources/test-documents/breidamerkurjokull_radar_profiles_2009.mat
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.2#6252)
>





Re: [jira] [Commented] (TIKA-1327) New parser for Matlab .mat files

2014-06-12 Thread Annie Burgess
I am working on a patch to add XHTML output per Nick's original suggestion.
Thanks for the input, all.


On Wed, Jun 11, 2014 at 5:27 AM, Ken Krugler 
wrote:

> In the past we'd discussed wanting to verify that all parsers generate
> valid XHTML 1.0.
>
> I forget where that particular thread died out, but Nick's comment on this
> issue points out that having some general way to validate the above would
> be good.
>
> -- Ken
>
> Begin forwarded message:
>
> > From: "Chris A. Mattmann (JIRA)" 
> > Subject: [jira] [Commented] (TIKA-1327) New parser for Matlab .mat files
> > Date: June 11, 2014 3:58:02am PDT
> > To: dev@tika.apache.org
> > Reply-To: dev@tika.apache.org
> >
> >
> >[
> https://issues.apache.org/jira/browse/TIKA-1327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14027624#comment-14027624
> ]
> >
> > Chris A. Mattmann commented on TIKA-1327:
> > -
> >
> > Nick agreed, I will re-open this until we get that part in there.
> >
> >> New parser for Matlab .mat files
> >> 
> >>
> >>Key: TIKA-1327
> >>URL: https://issues.apache.org/jira/browse/TIKA-1327
> >>Project: Tika
> >> Issue Type: Improvement
> >> Components: parser
> >>   Affects Versions: 1.5
> >>   Reporter: Ann Burgess
> >>   Assignee: Chris A. Mattmann
> >> Labels: parser
> >>Fix For: 1.6
> >>
> >>
> >> New parser for Matlab .mat files.
>
>
>
>
> --
> Ken Krugler
> +1 530-210-6378
> http://www.scaleunlimited.com
> custom big data solutions & training
> Hadoop, Cascading, Cassandra & Solr
>
>
>
>
>
>




Re: unit test error for new parser

2014-06-03 Thread Annie Burgess
Thanks for the input Tyler!  I've fixed the issue by adding a bit of code
to ensure a String is passed to MatFileReader when the unit test is run in
maven.  The test now successfully compiles - I'll add the code to
reviewboard soon for others to scope.

Annie
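
A minimal sketch of the kind of fix described above, for readers hitting the same compile error: when a reader class accepts only a File or a String file name, the incoming stream can be copied to a temporary file first and the file name handed over. The class name TempFileSpool, the method spoolToTempFile, and the sample bytes are hypothetical names for illustration; this is not the code that was committed to Tika, and the MatFileReader call is shown only as a comment because jmatio is not on the classpath here.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical helper: spools a stream to disk so that a file-name-only API can read it.
class TempFileSpool {

    // Copies the whole stream into a fresh temporary file and returns its path.
    static Path spoolToTempFile(InputStream in, String suffix) throws IOException {
        Path tmp = Files.createTempFile("tika-spool-", suffix);
        // createTempFile already created the file, so the copy must be allowed to replace it.
        Files.copy(in, tmp, StandardCopyOption.REPLACE_EXISTING);
        return tmp;
    }

    public static void main(String[] args) throws IOException {
        byte[] sampleBytes = {1, 2, 3, 4}; // stand-in for real .mat content
        Path tmp = spoolToTempFile(new ByteArrayInputStream(sampleBytes), ".mat");
        try {
            String fileName = tmp.toString();
            // In the parser, this String would be passed on, e.g.:
            // MatFileReader reader = new MatFileReader(fileName);
            System.out.println("Spooled " + Files.size(tmp) + " bytes to " + fileName);
        } finally {
            Files.deleteIfExists(tmp); // clean up the temporary copy
        }
    }
}
```

The trade-off of this approach is an extra copy of the data on disk, which is the price of using a library that cannot read directly from a stream.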



On Tue, Jun 3, 2014 at 2:16 PM, Tyler Palsulich 
wrote:

> Hi Annie,
>
> It looks like your problem is with
> `MatFileReader(org.apache.tika.io.CloseShieldInputStream)`.
> I think the MatFileReader constructor will only take a File or String
> filename (
> http://intra.csb.ethz.ch/javadoc/metabolic/com/jmatio/io/MatFileReader.html).
> If you switch the constructor call to use the filename, instead of the
> InputStream, I wonder if that will fix the issue.
>
> Hope that helps,
> Tyler
>
>
> On Tue, Jun 3, 2014 at 3:07 PM, Annie Burgess 
> wrote:
>
>> Hi dev group,
>>
>> I've put together a new parser for Matlab (.mat) files.  I successfully
>> compile and execute the new parser at the command line as:
>>
>> javac -classpath
>> ../../../../tika-core/target/tika-core-1.6-SNAPSHOT.jar:../../../../jmatio-1.2.jar
>>  edu/usc/sunset/burgess/tika/MatParser.java
>>
>> java -classpath
>> annie-parsers.jar:tika-app/target/tika-app-1.6-SNAPSHOT.jar:jmatio-1.2.jar
>> org.apache.tika.cli.TikaCLI --metadata
>> /Users/annbryant/POLARCYBER/breidamerkurjokull_radar_profiles_2009.mat
>>
>>
>> However, I've written a unit test for the new parser and have come to a
>> roadblock.
>>
>> The new parser uses a third-party jar file, JmatIO, available on Maven
>> Central (
>> http://search.maven.org/#artifactdetails%7Cnet.sourceforge.jmatio%7Cjmatio%7C1.0%7Cjar
>> ), which I have included as a dependency in the tika-parsers/pom.xml with
>> the following:
>>
>>  
>> <dependency>
>>   <groupId>net.sourceforge.jmatio</groupId>
>>   <artifactId>jmatio</artifactId>
>>   <version>1.0</version>
>> </dependency>
>> 
>>
>>
>> When I run the unit test in Maven, Maven seems to be able to find the
>> appropriate class files from the third-party jar, i.e. I don't get any
>> "missing package" or "can't find symbol" errors from the 'import'
>> commands:  import com.jmatio.io.MatFileHeader;  However, the test fails
>> when the first instance of one of the JmatIO classes is called within the
>> parser.
>>
>> Are there some steps I'm missing when integrating a third party jar in a
>> unit test?
>>
>> I have attached the new parser and the parser test.
>>
>> Please let me know if any other information would be useful.
>>
>> Annie
>>
>>
>> *
>>
>> abryant:tika-parsers abryant$ mvn -Dtest=MatParserTest compile
>>
>> [INFO]
>> 
>> [INFO] Building Apache Tika parsers 1.6-SNAPSHOT
>> [INFO]
>> 
>> [INFO]
>> [INFO] --- maven-remote-resources-plugin:1.2.1:process (default) @
>> tika-parsers ---
>> [INFO]
>> [INFO] --- maven-resources-plugin:2.5:resources (default-resources) @
>> tika-parsers ---
>> [debug] execute contextualize
>> [INFO] Using 'UTF-8' encoding to copy filtered resources.
>> [INFO] Copying 5 resources
>> [INFO] Copying 3 resources
>> [INFO]
>> [INFO] --- maven-compiler-plugin:2.3.2:compile (default-compile) @
>> tika-parsers ---
>> [INFO] Compiling 1 source file to
>> /Users/annbryant/TIKA/tika/tika-parsers/target/classes
>> [INFO] -
>> [ERROR] COMPILATION ERROR :
>> [INFO] -
>> [ERROR]
>> /Users/annbryant/TIKA/tika/tika-parsers/src/main/java/org/apache/tika/parser/mat/MatParser.java:[69,23]
>> cannot find symbol
>> symbol  : constructor
>> MatFileReader(org.apache.tika.io.CloseShieldInputStream)
>> location: class com.jmatio.io.MatFileReader
>> [INFO] 1 error
>> [INFO] -
>> [INFO]
>> 
>> [INFO] BUILD FAILURE
>> [INFO]
>> 
>> [INFO] Total time: 5.213s
>> [INFO] Finished at: Tue Jun 03 12:54:15 AKDT 2014
>> [INFO] Final Memory: 16M/81M
>> [INFO]
>> 
>> [ERROR] Re-run Ma

Fwd: unit test error for new parser

2014-06-03 Thread Annie Burgess
Hi dev group,

I've put together a new parser for Matlab (.mat) files.  I successfully
compile and execute the new parser at the command line as:

javac -classpath
../../../../tika-core/target/tika-core-1.6-SNAPSHOT.jar:../../../../jmatio-1.2.jar
 edu/usc/sunset/burgess/tika/MatParser.java

java -classpath
annie-parsers.jar:tika-app/target/tika-app-1.6-SNAPSHOT.jar:jmatio-1.2.jar
org.apache.tika.cli.TikaCLI --metadata
/Users/annbryant/POLARCYBER/breidamerkurjokull_radar_profiles_2009.mat


However, I've written a unit test for the new parser and have come to a
roadblock.

The new parser uses a third-party jar file, JmatIO, available on Maven
Central (
http://search.maven.org/#artifactdetails%7Cnet.sourceforge.jmatio%7Cjmatio%7C1.0%7Cjar
), which I have included as a dependency in the tika-parsers/pom.xml with
the following:

 
<dependency>
  <groupId>net.sourceforge.jmatio</groupId>
  <artifactId>jmatio</artifactId>
  <version>1.0</version>
</dependency>



When I run the unit test in Maven, Maven seems to be able to find the
appropriate class files from the third-party jar, i.e. I don't get any
"missing package" or "can't find symbol" errors from the 'import'
commands:  import com.jmatio.io.MatFileHeader;  However, the test fails
when the first instance of one of the JmatIO classes is called within the
parser.

Are there some steps I'm missing when integrating a third party jar in a
unit test?

I have attached the new parser and the parser test.

Please let me know if any other information would be useful.

Annie

*

abryant:tika-parsers abryant$ mvn -Dtest=MatParserTest compile

[INFO]

[INFO] Building Apache Tika parsers 1.6-SNAPSHOT
[INFO]

[INFO]
[INFO] --- maven-remote-resources-plugin:1.2.1:process (default) @
tika-parsers ---
[INFO]
[INFO] --- maven-resources-plugin:2.5:resources (default-resources) @
tika-parsers ---
[debug] execute contextualize
[INFO] Using 'UTF-8' encoding to copy filtered resources.
[INFO] Copying 5 resources
[INFO] Copying 3 resources
[INFO]
[INFO] --- maven-compiler-plugin:2.3.2:compile (default-compile) @
tika-parsers ---
[INFO] Compiling 1 source file to
/Users/annbryant/TIKA/tika/tika-parsers/target/classes
[INFO] -
[ERROR] COMPILATION ERROR :
[INFO] -
[ERROR]
/Users/annbryant/TIKA/tika/tika-parsers/src/main/java/org/apache/tika/parser/mat/MatParser.java:[69,23]
cannot find symbol
symbol  : constructor
MatFileReader(org.apache.tika.io.CloseShieldInputStream)
location: class com.jmatio.io.MatFileReader
[INFO] 1 error
[INFO] -
[INFO]

[INFO] BUILD FAILURE
[INFO]

[INFO] Total time: 5.213s
[INFO] Finished at: Tue Jun 03 12:54:15 AKDT 2014
[INFO] Final Memory: 16M/81M
[INFO]

[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR]
[ERROR] For more information about the errors and possible solutions,
please read the following articles:
[ERROR] [Help 1]
http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException




Re: tika install fail on os x 10.9.2

2014-05-21 Thread Annie Burgess
Following Lewis's advice, I installed:

  Mac OS X x64, 179.56 MB:
  jdk-7u55-macosx-x64.dmg <http://download.oracle.com/otn-pub/java/jdk/7u55-b13/jdk-7u55-macosx-x64.dmg>
I subsequently updated my bash profile to:

#export JAVA_HOME=/Library/Java/JavaVirtualMachines/jdk1.8.0_05.jdk/Contents/Home
export JAVA_HOME=/Library/Java/JavaVirtualMachines/jdk1.7.0_55.jdk/Contents/Home

After these adjustments, Tika installed without a problem.
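
For anyone repeating this switch, a small sketch for confirming which JDK the java launcher actually picks up after editing the profile. The class name JavaRuntimeCheck and the method describeRuntime are hypothetical; the two system properties are standard. Note the assumption that the java on the PATH follows JAVA_HOME, which may not hold on every setup, so `mvn --version` remains the check for what Maven itself uses.

```java
// Hypothetical helper: reports the version and home directory of the running JVM.
class JavaRuntimeCheck {

    // Builds a one-line summary from the standard java.version and java.home properties.
    static String describeRuntime() {
        return "java.version=" + System.getProperty("java.version")
                + ", java.home=" + System.getProperty("java.home");
    }

    public static void main(String[] args) {
        System.out.println(describeRuntime());
    }
}
```

If the reported java.home still points at the 1.8 directory, the profile change has probably not been loaded into the current shell.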

Thanks all.





On Thu, May 15, 2014 at 8:04 AM, Ramirez, Paul M (398J) <
paul.m.rami...@jpl.nasa.gov> wrote:

> Annie,
>
> I haven't built tika in a while but if it's a typical maven build the
> details of the test output will be captured in one of the files in the
> target directory. If you find those details and post them here that would
> help troubleshoot what is going on.
>
> Thanks,
> Paul Ramirez
>
> On May 8, 2014, at 10:02 AM, Annie Burgess 
>  wrote:
>
> > Hi all,
> >
> > I have a new computer running  OS X 10.9.2 (13C64).  I am attempting to get
> > Tika up and running, but am getting errors in the Maven install phase.  My
> > steps are as follows:
> >
> >
> > [annies-mbp:~/tika/] % svn co https://svn.apache.org/repos/asf/tika/trunk tmp
> > [annies-mbp:~/tika/tmp]% setenv MAVEN_OPTS "-Xms128m -Xmx256m"
> > [annies-mbp:~/tika/tmp]% mvn install
> >
> > Results :
> >
> > Tests in error:
> >
> >  testiBooksParser(org.apache.tika.parser.ibooks.iBooksParserTest):
> > Premature end of file.
> >
> > Tests run: 506, Failures: 0, Errors: 1, Skipped: 1
> >
> > [INFO]
> > 
> > [INFO] Reactor Summary:
> > [INFO] Apache Tika parent  SUCCESS [  0.626 s]
> > [INFO] Apache Tika core .. SUCCESS [  6.631 s]
> > [INFO] Apache Tika parsers ... FAILURE [ 23.323 s]
> >
> > .
> > .
> > .
> >
> > [INFO]
> > 
> > [INFO] BUILD FAILURE
> > [INFO]
> > 
> >
> > My Maven version is:
> >
> > [annies-mbp:~/Development/tika/tmp]% mvn --version
> > Apache Maven 3.2.1 (ea8b2b07643dbb1b84b6d16e1f08391b666bc1e9;
> > 2014-02-14T08:37:52-09:00)
> > Maven home: /usr/local/Cellar/maven/3.2.1/libexec
> > Java version: 1.8.0_05, vendor: Oracle Corporation
> > Java home:
> > /Library/Java/JavaVirtualMachines/jdk1.8.0_05.jdk/Contents/Home/jre
> > Default locale: en_US, platform encoding: UTF-8
> > OS name: "mac os x", version: "10.9.2", arch: "x86_64", family: "mac"--
> >
> >
> > Does anyone have any insight as to why this is failing at
> > 'iBooksParserTest'?
> > Thanks!
> > Annie
> >
> >
>
>




Fwd: tika install fail on os x 10.9.2

2014-05-15 Thread Annie Burgess
Hi all,

I have a new computer running  OS X 10.9.2 (13C64).  I am attempting to get
Tika up and running, but am getting errors in the Maven install phase.  My
steps are as follows:


[annies-mbp:~/tika/] % svn co https://svn.apache.org/repos/asf/tika/trunk tmp
[annies-mbp:~/tika/tmp]% setenv MAVEN_OPTS "-Xms128m -Xmx256m"
[annies-mbp:~/tika/tmp]% mvn install

Results :

Tests in error:

  testiBooksParser(org.apache.tika.parser.ibooks.iBooksParserTest):
Premature end of file.

Tests run: 506, Failures: 0, Errors: 1, Skipped: 1

[INFO]

[INFO] Reactor Summary:
[INFO] Apache Tika parent  SUCCESS [  0.626 s]
[INFO] Apache Tika core .. SUCCESS [  6.631 s]
[INFO] Apache Tika parsers ... FAILURE [ 23.323 s]

.
.
.

[INFO]

[INFO] BUILD FAILURE
[INFO]


My Maven version is:

[annies-mbp:~/Development/tika/tmp]% mvn --version
Apache Maven 3.2.1 (ea8b2b07643dbb1b84b6d16e1f08391b666bc1e9;
2014-02-14T08:37:52-09:00)
Maven home: /usr/local/Cellar/maven/3.2.1/libexec
Java version: 1.8.0_05, vendor: Oracle Corporation
Java home:
/Library/Java/JavaVirtualMachines/jdk1.8.0_05.jdk/Contents/Home/jre
Default locale: en_US, platform encoding: UTF-8
OS name: "mac os x", version: "10.9.2", arch: "x86_64", family: "mac"--


Does anyone have any insight as to why this is failing at
'iBooksParserTest'?
Thanks!
Annie



Re: NetCDF to Maven Central

2014-05-12 Thread Annie Burgess
Thank you for your response John.  I'm cc'ing the Tika dev list so we can
work towards an understanding of how best to move forward.

Best,
Annie


On Wed, May 7, 2014 at 4:31 PM, John Caron  wrote:

> Hi Annie:
>
> We find it difficult to keep Maven Central updated, and are maintaining
> our own Maven server here:
>
> https://artifacts.unidata.ucar.edu/content/repositories/unidata-releases/edu/ucar/
>
> is that sufficient for your project?
>
> John
>
>
>
> On 5/5/2014 12:41 PM, Annie Burgess wrote:
>
>> Hi John,
>>
>> My name is Annie Burgess, I work with Chris Mattmann at JPL and USC.
>> I'm working on a project that requires the latest version (4.3) of
>> NetCDF to be available on Maven Central. I've submitted a support
>> request for this issue on the Unidata site, but thought I'd also contact
>> you.
>>
>> Do you know if it's possible to get 4.3 on Maven anytime soon?
>>
>> Any information you can give is greatly appreciated.
>>
>> Best,
>> Annie
>>
>>
>>
>




Re: [jira] [Commented] (TIKA-1287) Update NetCDF .jar file on Maven Central

2014-05-02 Thread Annie Burgess
I submitted a request to Unidata yesterday for them to update on Maven,
though I did not submit it specifically to John Caron.  I will send an
email to John and make sure he is aware of the request.


On Fri, May 2, 2014 at 10:09 AM, Chris A. Mattmann (JIRA)
wrote:

>
> [
> https://issues.apache.org/jira/browse/TIKA-1287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13988034#comment-13988034]
>
> Chris A. Mattmann commented on TIKA-1287:
> -
>
> Yeah we should contact John Caron or someone from Unidata, they should be
> happy to update it. Also do we need to upgrade past NetCDF 4.2-min? Is
> there some functionality we need from them?
>
> > Update NetCDF .jar file on Maven Central
> > 
> >
> > Key: TIKA-1287
> > URL: https://issues.apache.org/jira/browse/TIKA-1287
> > Project: Tika
> >  Issue Type: Bug
> >Affects Versions: 1.5
> >Reporter: Ann Burgess
> >  Labels: jar, maven, netcdf, tika, unit-test, update
> >
> > I am working to update the NetCDFParser file.  When using the
> most-recent .jar file available from http://www.unidata.ucar.edu/ at the
> command line I receive a note about a deprecated API:
> > javac -classpath
> ../../../../tika-core/target/tika-core-1.6-SNAPSHOT.jar:../../../../toolsUI-4.3.jar
> org/apache/tika/parser/netcdf/NetCDFParser.java
> > Note: org/apache/tika/parser/netcdf/NetCDFParser.java uses or overrides
> a deprecated API.
> > Note: Recompile with -Xlint:deprecation for details.
> > After updating the NetCDFParser file with non-deprecated methods (e.g.,
> changing "dimension.getName()" to "dimension.getFullName()"), however, I get
> failed unit tests in Maven, which I assume is because the Maven Central
> Repo has an outdated version of the .jar file needed for NetCDF files (
> >
> http://search.maven.org/#search%7Cgav%7C1%7Cg%3A%22edu.ucar%22%20AND%20a%3A%22netcdf%22)
> .
> > Can anyone provide insight into how I get the updated .jar file into the
> Maven Central Repository? Is there an alternative method to update Tika so
> I can run my unit tests in Maven?
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.2#6252)
>





Fwd: [jira] [Created] (TIKA-1287) Update NetCDF .jar file on Maven Central

2014-05-01 Thread Annie Burgess
Ann Burgess created TIKA-1287:
-

 Summary: Update NetCDF .jar file on Maven Central
 Key: TIKA-1287
 URL: https://issues.apache.org/jira/browse/TIKA-1287
 Project: Tika
  Issue Type: Bug
Affects Versions: 1.5
Reporter: Ann Burgess


I am working to update the NetCDFParser file.  When I compile at the command
line against the most recent .jar file available from
http://www.unidata.ucar.edu/, I receive a note about a deprecated API:

javac -classpath
../../../../tika-core/target/tika-core-1.6-SNAPSHOT.jar:../../../../toolsUI-4.3.jar
org/apache/tika/parser/netcdf/NetCDFParser.java

Note: org/apache/tika/parser/netcdf/NetCDFParser.java uses or overrides a
deprecated API.
Note: Recompile with -Xlint:deprecation for details.

After updating the NetCDFParser file with non-deprecated methods (e.g.,
changing "dimension.getName()" to "dimension.getFullName()"), however, I get
failed unit tests in Maven, which I assume is because the Maven Central
Repo has an outdated version of the .jar file needed for NetCDF files (
http://search.maven.org/#search%7Cgav%7C1%7Cg%3A%22edu.ucar%22%20AND%20a%3A%22netcdf%22)
.

Can anyone provide insight into how I get the updated .jar file into the
Maven Central Repository? Is there an alternative method to update Tika so
I can run my unit tests in Maven?
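
The compiler note quoted above is what javac prints when code calls a method
marked @Deprecated. Below is a minimal sketch of that pattern, for
illustration only. Dim is a hypothetical stand-in class, not the real
NetCDF-Java Dimension, and the "group/shortName" format is invented for the
example.

```java
/** Hypothetical stand-in for a library type; NOT the NetCDF-Java API. */
final class Dim {
    private final String group;
    private final String shortName;

    Dim(String group, String shortName) {
        this.group = group;
        this.shortName = shortName;
    }

    /** Old accessor kept for compatibility; it simply delegates to the new one. */
    @Deprecated
    String getName() {
        return getFullName();
    }

    /** Replacement accessor (invented format: group + "/" + short name). */
    String getFullName() {
        return group.isEmpty() ? shortName : group + "/" + shortName;
    }
}

final class DeprecationSketch {
    public static void main(String[] args) {
        Dim d = new Dim("grid", "time");
        // Calls only the non-deprecated accessor.
        System.out.println(d.getFullName());
    }
}
```

Recompiling with -Xlint:deprecation, as the note suggests, should list each
call site of a deprecated method so it can be switched to its replacement.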





--
This message was sent by Atlassian JIRA
(v6.2#6252)





unit tests and classpaths

2014-04-24 Thread Annie Burgess
Hi dev group,

I'm working on a very simple starter unit test for a new parser and am
running into some roadblocks.  I suspect the problem may be classpath related,
but I have tried many variations without success.

My unit test:

package edu.usc.sunset.burgess.tika;

// JUnit imports
import static org.junit.Assert.assertEquals;
import static org.junit.Assert.assertTrue;
import junit.framework.TestCase;
import org.junit.Test;

// JDK imports
import java.io.IOException;
import java.io.InputStream;

// Tika and SAX imports
import org.apache.tika.metadata.Metadata;
import org.apache.tika.metadata.TikaCoreProperties;
import org.apache.tika.parser.ParseContext;
import org.apache.tika.parser.Parser;
import org.apache.tika.sax.BodyContentHandler;
import org.xml.sax.ContentHandler;

/**
 * Test cases to exercise the {@link EnviHeaderParser}.
 */
public class EnviHeaderParserTest extends TestCase
{
    public static final String TEST_STRING =
        "{GEO-TIFF File Imported into ENVI [Fri May 25 14:06:23 2012]}";

    @Test
    public void testParser() throws Exception {
        Parser parser = new EnviHeaderParser();
        ContentHandler handler = new BodyContentHandler();
        Metadata metadata = new Metadata();

        // Load the sample header from the test classpath
        InputStream stream = EnviHeaderParser.class
            .getResourceAsStream("/test-documents/envi_test_header.hdr");
        try {
            parser.parse(stream, handler, metadata, new ParseContext());
        } finally {
            stream.close();
        }

        // Check text
        String content = handler.toString();
        assertTrue(content.contains(TEST_STRING));
    }
}
---
Files are located as follows:

 
tika/tika-parsers/src/test/java/org/apache/tika/parser/envi/EnviHeaderParserTest.java

/tika/tika-parsers/src/test/resources/test-documents/envi_test_header.hdr

/tika/anniedev/src/main/java/edu/usc/sunset/burgess/tika/EnviHeaderParser.java


To compile and test the code, I run:

cd /tika/tika/tika-parsers
mvn -Dtest=EnviHeaderParserTest compile
mvn -Dtest=EnviHeaderParserTest test

-
I get the following output:

Running edu.usc.sunset.burgess.tika.EnviHeaderParserTest
Tests run: 1, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 0.172 sec <<< FAILURE!

Results :

Failed tests:   testParser(edu.usc.sunset.burgess.tika.EnviHeaderParserTest)

Tests run: 1, Failures: 1, Errors: 0, Skipped: 0
-

Please let me know if any additional information would be helpful.
Any insights are appreciated.

Annie
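
When a classpath problem is suspected, one quick check is whether the
resource path resolves at all, since Class.getResourceAsStream returns null
rather than throwing when nothing is found. Below is a minimal sketch using
only the JDK. The class name ResourceCheck and its helper are hypothetical,
and it is not a diagnosis of the failure reported above.

```java
import java.io.IOException;
import java.io.InputStream;

final class ResourceCheck {
    /** Returns true if the named resource is visible to the anchor class's class loader. */
    static boolean resourceExists(Class<?> anchor, String path) throws IOException {
        // getResourceAsStream yields null when the resource is missing;
        // try-with-resources skips close() for a null resource.
        try (InputStream in = anchor.getResourceAsStream(path)) {
            return in != null;
        }
    }

    public static void main(String[] args) throws IOException {
        String path = "/test-documents/envi_test_header.hdr";
        System.out.println(path + " found: " + resourceExists(ResourceCheck.class, path));
    }
}
```

Inside a JUnit test, the same idea could be expressed as an
assertNotNull(stream) before the parse call, so that a missing file is
reported separately from a content mismatch.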



Re: [VOTE] Apache Tika 1.5 RC2

2014-02-14 Thread Annie Burgess
Hi dev crew,

I also live in a sort-of removed location - Anchorage, AK.  If anyone knows
of any developers up north, I'd love to try to connect with the AK Apache
community.

Cheers,
Annie


On Fri, Feb 14, 2014 at 1:24 PM, Nick Burch  wrote:

> On Fri, 14 Feb 2014, David Meikle wrote:
>
>> Had a check on this and there isn't anyone local I can find for a quick
>> meet up.  I am based in Scotland but travel a bit, so will look out for an
>> opportunity to meet up with someone soon but doubt it will be in the coming
>> weeks.
>>
>
> Which bit of Scotland? I might know someone...
>
> Failing that, shout if you head down to the land of the Sassenachs, there
> are loads of us here who can help!
>
> Nick




-- 

*Annie Bryant Burgess, PhD*

Phone: 585.738.7549
E-mail: anniebryant.burg...@gmail.com