Re: [VOTE] Apache OpenNLP 2.3.1 Release Candidate

2023-11-24 Thread Suneel Marthi
+1

On Fri, Nov 24, 2023 at 7:25 AM Jeff Zemerick  wrote:

> +1
>
> Thanks,
> Jeff
>
>
> On Thu, Nov 23, 2023 at 9:03 AM Tommaso Teofili  >
> wrote:
>
> > +1
> >
> > Tommaso
> >
> > On Thu, 23 Nov 2023 at 12:05, Richard Zowalla  wrote:
> >
> > > +1 (binding)
> > >
> > >
> > > (We should create an issue for the year in the NOTICE file though)
> > >
> > > Am Mittwoch, dem 22.11.2023 um 15:12 +0100 schrieb Martin Wiesner:
> > > >
> > > > Hi folks,
> > > >
> > > > I have posted a 1st release candidate for the Apache OpenNLP 2.3.1
> > > > release and it is ready for testing.
> > > >
> > > > It is a maintenance release which provides some enhancements.
> > > > Some of these are related to sentences models and the use of
> > > > abbreviations, see OPENNLP-570 & OPENNLP-793.
> > > > Moreover, it switches the ONNX runtime for the 'opennlp-dl' component
> > > > from the GPU to the CPU-based variant, see OPENNLP-1515.
> > > > Several other (cleanup) tasks have also been completed.
> > > >
> > > > Thank you to everyone who contributed to this release, including all
> > > > of our users and the people who submitted bug reports, contributed
> > > > code or documentation enhancements.
> > > >
> > > > The release was made using the OpenNLP release process, documented on
> > > > the website:
> > > > https://opennlp.apache.org/release.html
> > > >
> > > > Maven Repo:
> > > >
> > https://repository.apache.org/content/repositories/orgapacheopennlp-1035
> > > >
> > > > 
> > > > 
> > > > opennlp-2.3.1-rc1
> > > > Testing OpenNLP 2.3.1 release candidate
> > > > 
> > > >
> > https://repository.apache.org/content/repositories/orgapacheopennlp-1035
> > > > 
> > > > 
> > > > 
> > > >
> > > > Binaries & Source:
> > > >
> > > > https://dist.apache.org/repos/dist/dev/opennlp/opennlp-2.3.1
> > > >
> > > > Tag:
> > > >
> > > > https://github.com/apache/opennlp/releases/tag/opennlp-2.3.1
> > > >
> > > > Release notes:
> > > >
> > > >
> > >
> >
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12311215=12353478
> > > >
> > > > The results of the eval tests for the aforementioned tag can be found
> > > > here:
> > > > https://ci-builds.apache.org/job/OpenNLP/job/eval-tests-releases/9/
> > > >
> > > > Reminder: The up-2-date KEYS file for signature verification can be
> > > > found here: https://dist.apache.org/repos/dist/release/opennlp/KEYS
> > > >
> > > > Please vote on releasing these packages as Apache OpenNLP 2.3.1. The
> > > > vote is open for at least the next 72 hours.
> > > >
> > > > Only votes from OpenNLP PMC are binding, but everyone is welcome to
> > > > check the release candidate and vote.
> > > > The vote passes if at least three binding +1 votes are cast.
> > > >
> > > > Please VOTE
> > > >
> > > > [+1] go ship it
> > > > [+0] meh, don't care
> > > > [-1] stop, there is a ${showstopper}
> > > >
> > > > Thanks!
> > > > mawiesne
> > >
> > >
> >
>


Re: Potential 2.3.1 release?

2023-10-31 Thread Suneel Marthi
+1

  I can volunteer to be the Release Manager after a 6 yrs hiatus since I
did this the last time.

On Tue, Oct 31, 2023 at 9:29 AM Eric Pugh 
wrote:

> +1
>
> > On Oct 31, 2023, at 9:20 AM, Richard Zowalla  wrote:
> >
> > +1
> >
> > Am Dienstag, dem 31.10.2023 um 09:15 -0400 schrieb Jeff Zemerick:
> >> Hi all,
> >>
> >> It looks like it might be a good time for a 2.3.1 release? We have
> >> had a
> >> few pull requests. Thoughts?
> >>
> >> Thanks,
> >> Jeff
> >
>
> ___
> Eric Pugh | Founder & CEO | OpenSource Connections, LLC | 434.466.1467 |
> http://www.opensourceconnections.com <
> http://www.opensourceconnections.com/> | My Free/Busy <
> http://tinyurl.com/eric-cal>
> Co-Author: Apache Solr Enterprise Search Server, 3rd Ed <
> https://www.packtpub.com/big-data-and-business-intelligence/apache-solr-enterprise-search-server-third-edition-raw>
>
> This e-mail and all contents, including attachments, is considered to be
> Company Confidential unless explicitly stated otherwise, regardless of
> whether attachments are marked as such.
>
>


Re: [VOTE] Apache OpenNLP 2.1.0 Release Candidate

2022-11-22 Thread Suneel Marthi
+1 binding 

On 2022/11/16 15:00:33 "Bruno P. Kinoshita" wrote:
> +1
> 
> tested on 
> 
> Apache Maven 3.8.6 (84538c9988a25aec085021c365c560670ad80f63)
> Maven home: /opt/apache-maven-3.8.6
> Java version: 19.0.1, vendor: Oracle Corporation, runtime: /usr/lib/jvm/jdk-19
> Default locale: en_US, platform encoding: UTF-8
> OS name: "linux", version: "5.14.0-1054-oem", arch: "amd64", family: "unix"
> 
> Thanks!
> 
> On 2022/11/07 13:39:32 Jeff Zemerick wrote:
> > Hi folks,
> > 
> > I have posted a release candidate for the Apache OpenNLP 2.1.0 release and
> > it is ready for testing.
> > 
> > Changes in this version:
> > https://issues.apache.org/jira/browse/OPENNLP-1370?jql=project%20%3D%20OPENNLP%20AND%20status%20in%20(Resolved%2C%20Closed)%20AND%20fixVersion%20in%20(2.1.0)%20ORDER%20BY%20priority%20DESC%2C%20updated%20DESC
> > 
> > The distributables can be downloaded from:
> > https://repository.apache.org/content/repositories/orgapacheopennlp-1030/org/apache/opennlp/opennlp-distr/2.1.0
> > 
> > The release was made from the Apache OpenNLP 2.1.0 tag at:
> > https://github.com/apache/opennlp/tree/opennlp-2.1.0
> > 
> > To use it in a maven build set the version for opennlp-tools or
> > opennlp-uima to 2.1.0 and add the following URL to your settings.xml file:
> > https://repository.apache.org/content/repositories/orgapacheopennlp-1030
> > 
> > The release was made using the OpenNLP release process, documented on the
> > website:
> > https://opennlp.apache.org/release.html
> > 
> > Please vote on releasing these packages as Apache OpenNLP 2.1.0. The vote
> > is open for at least the next 72 hours.
> > 
> > Only votes from OpenNLP PMC are binding, but everyone is welcome to check
> > the release candidate and vote.
> > The vote passes if at least three binding +1 votes are cast.
> > 
> > [ ] +1 Release the packages as Apache OpenNLP 2.1.0
> > [ ] -1 Do not release the packages because...
> > 
> > Thanks!
> > Jeff
> > 
> 


Re: "Reuters" data for the CONLL 2003 task

2022-07-24 Thread Suneel Marthi
Did we want to discontinue Reuters and go with a dataset like IMDB reviews etc… 
?

Sent from my iPhone

> On Jul 24, 2022, at 6:43 PM, Jeff Zemerick  wrote:
> 
> HI Bertrand,
> 
> This probably shouldn't be considered factual advice, but I think as an
> individual you can "accept" it yourself. The folks at NIST (
> reuters-requ...@nist.gov) can likely give a definitive answer to that.
> 
> Thanks,
> Jeff
> 
> 
>> On Sun, Jul 24, 2022 at 3:28 PM Bertrand Rigaldies 
>> wrote:
>> 
>> Hi folks, I’m working on
>> https://issues.apache.org/jira/browse/OPENNLP-1373, and I need to get the
>> “Reuters” data. For that, it appears that Reuters require this form to be
>> filled out to request authorization:
>> https://trec.nist.gov/data/reuters/ind_appl_reuters_v4.html
>> 
>> The form requires an “organization” information and signature:
>> 
>> Organization 
>> Corporation/Partnership/Legal Entity 
>> Official mail address __
>> _
>> _
>> Telephone _
>> Facsimile _
>> Electronic mail 
>> 
>> 
>> Accepted by the organization:
>> 
>> Signature_
>> 
>> Date 
>> 
>> Name (please print) __
>> 
>> Title ___
>> 
>> Institution/Agency
>> __
>> 
>> What do I fill out for OpenNLP-associated work? Also, how do I get an
>> “organization’s signature”?
>> 
>> Thanks.
>> Bertrand Rigaldies


Re: [VOTE] Apache OpenNLP 2.0.0 Release Candidate

2022-06-01 Thread Suneel Marthi
+1 binding

On Wed, Jun 1, 2022 at 3:12 PM Jeff Zemerick  wrote:

> Just pinging folks on the thread about the active vote. The project has a
> board report due in a week - it would be awesome to get this release in
> that report.
>
> Thanks,
> Jeff
>
> On Thu, May 26, 2022 at 9:39 AM Jeff Zemerick 
> wrote:
>
> > I created a JIRA task to update the NOTICE file.
> >
> > I re-ran build tests and eval tests and am +1 to release as 2.0.0
> >
> > Thanks,
> > Jeff
> >
> >
> > On Tue, May 10, 2022 at 8:37 AM Jeff Zemerick 
> > wrote:
> >
> >> Bruno,
> >>
> >> Good catch. Does updating the date require a new RC?
> >>
> >> Thanks for the reminder about the evaluation tests. Here's the output
> log
> >> from my run:
> >>
> https://gist.githubusercontent.com/jzonthemtn/02195c55a479c0c84102af0456331758/raw/a74aade3d605510f15c24098d13ebb9aa201c672/gistfile1.txt
> >> (This was run on 1.9.5-SNAPSHOT before I did the release steps and the
> >> version changed to 2.0.0.) I will also share how to run these tests.
> >>
> >> I am +1 for the release unless the NOTICE file is a blocker.
> >>
> >> Thanks,
> >> Jeff
> >>
> >> On Mon, May 9, 2022 at 7:23 AM Bruno P. Kinoshita
> >>  wrote:
> >>
> >>>  Hi Jeff,
> >>> I think the NOTICE file needs to be adjusted to 2022?
> >>>
> >>>
> >>>
> https://github.com/apache/opennlp/blob/804ad5579b829f3a9b7b2bf3af819c53d6bb4290/NOTICE#L2https://github.com/apache/opennlp/blob/2.0.0/NOTICE#L2
> >>>
> >>> I downloaded a ZIP from Maven (opennlp-distr-2.0.0-bin.zip) and its
> >>> NOTICE had 2017. At least in Apache Commons and Jena we try to keep the
> >>> NOTICE file up to date (I think it's an ASF policy?)
> >>>
> >>> Building OK on
> >>>
> >>> Apache Maven 3.8.2 (ea98e05a04480131370aa0c110b8c54cf726c06f)
> >>> Maven home: /opt/apache-maven-3.8.2
> >>> Java version: 11.0.15, vendor: Private Build, runtime:
> >>> /usr/lib/jvm/java-11-openjdk-amd64
> >>> Default locale: en_US, platform encoding: UTF-8
> >>> OS name: "linux", version: "5.4.0-109-generic", arch: "amd64", family:
> >>> "unix"
> >>>
> >>> I don't know how to run the more complete test/models that others used
> >>> to run for other releases. In case you know how to run that, it'd be
> good
> >>> if you could post in your vote saying whether everything worked fine.
> >>> Otherwise check with another PMC/committer about it. Since it's a 2.0
> >>> release I expect a few users curious about what's new trying out the
> new
> >>> code :)
> >>>
> >>> Thanks!
> >>> Bruno
> >>>
> >>>
> >>> On Monday, 9 May 2022, 12:26:50 am NZST, Jeff Zemerick <
> >>> jzemer...@apache.org> wrote:
> >>>
> >>>  Hi folks,
> >>>
> >>> I have posted a first release candidate for the Apache OpenNLP 2.0.0
> >>> release and it is ready for testing.
> >>>
> >>> The distributables can be downloaded from:
> >>>
> >>>
> https://repository.apache.org/content/repositories/orgapacheopennlp-1029/org/apache/opennlp/opennlp-distr/2.0.0/
> >>>
> >>> The release was made from the Apache OpenNLP 2.0.0 tag at:
> >>> https://github.com/apache/opennlp/tree/2.0.0
> >>>
> >>> To use it in a maven build set the version for opennlp-tools or
> >>> opennlp-uima to 2.0.0 and add the following URL to your settings.xml
> >>> file:
> >>>
> https://repository.apache.org/content/repositories/orgapacheopennlp-1029
> >>>
> >>> The release was made using the OpenNLP release process, documented on
> the
> >>> website:
> >>> https://opennlp.apache.org/release.html
> >>>
> >>> Please vote on releasing these packages as Apache OpenNLP 2.0.0. The
> vote
> >>> is open for at least the next 72 hours.
> >>>
> >>> Only votes from OpenNLP PMC are binding, but everyone is welcome to
> check
> >>> the release candidate and vote. The vote passes if at least three
> binding
> >>> +1 votes are cast.
> >>>
> >>> [ ] +1 Release the packages as Apache OpenNLP [VERSION]
> >>> [ ] -1 Do not release the packages because...
> >>>
> >>> Thanks!
> >>> Jeff
> >>>
> >>
> >>
>


Re: [VOTE] Apache OpenNLP 1.9.3 Release Candidate

2020-07-29 Thread Suneel Marthi
+1 binding

On Wed, Jul 29, 2020 at 6:27 PM Joern Kottmann  wrote:

> +1 Release the packages as Apache OpenNLP 1.9.3
>
> Jörn
>
> On Wed, Jul 29, 2020 at 1:08 PM Tommaso Teofili
>  wrote:
> >
> > +1 from me, build, sigs, tag look good.
> >
> > Regards,
> > Tommaso
> >
> > On Tue, 28 Jul 2020 at 10:48, Bruno P. Kinoshita 
> wrote:
> >
> > > It worked after I imported keys from
> > > https://dist.apache.org/repos/dist/release/opennlp/KEYS
> > >
> > > [x] +1 Release the packages as Apache OpenNLP 1.9.3
> > >
> > >
> > > Thanks!
> > > Bruno
> > >
> > >
> > > On Monday, 27 July 2020, 12:00:29 am NZST, Jeff Zemerick <
> > > jzemer...@apache.org> wrote:
> > >
> > >
> > >
> > >
> > >
> > > Looks like I'm in there as jzemerick. See if I'm doing this correctly:
> > >
> > > wget https://people.apache.org/keys/group/opennlp.asc
> > > gpg --import https://people.apache.org/keys/group/opennlp.asc
> > >
> > > wget
> > >
> > >
> https://repository.apache.org/content/repositories/orgapacheopennlp-1027/org/apache/opennlp/opennlp-distr/1.9.3/opennlp-distr-1.9.3-bin.tar.gz
> > > wget
> > >
> > >
> https://repository.apache.org/content/repositories/orgapacheopennlp-1027/org/apache/opennlp/opennlp-distr/1.9.3/opennlp-distr-1.9.3-bin.tar.gz.asc
> > >
> > > gpg --verify opennlp-distr-1.9.3-bin.tar.gz.asc
> > > gpg: assuming signed data in 'opennlp-distr-1.9.3-bin.tar.gz'
> > > gpg: Signature made Fri Jul 24 15:21:24 2020 UTC
> > > gpg:using RSA key
> 6786BCFFBD2AE66E737FE97760E63AD841EF12D8
> > > gpg: Good signature from "Jeff Zemerick (CODE SIGNING KEY) <
> > > jzemer...@apache.org>" [unknown]
> > > gpg: WARNING: This key is not certified with a trusted signature!
> > > gpg:  There is no indication that the signature belongs to the
> > > owner.
> > > Primary key fingerprint: 6786 BCFF BD2A E66E 737F  E977 60E6 3AD8 41EF
> 12D8
> > >
> > > Jeff
> > >
> > >
> > > On Sun, Jul 26, 2020 at 5:25 AM Bruno P. Kinoshita 
> > > wrote:
> > >
> > > > Hi,
> > > >
> > > >
> > > > Built successfully from tag with Java 8 on Ubuntu LTS. Had a look at
> one
> > > > file from the dist area, and the contents looked OK (license, notice,
> > > jars
> > > > were using the right version 1.9.3 too).
> > > >
> > > >
> > > > Also checked the signatures using some shell script I normally use,
> but
> > > it
> > > > failed to validate. I think it failed to find your key in
> > > > https://people.apache.org/keys/group/opennlp.asc. Have you added
> your
> > > key
> > > > there? I search for Jeff and jzonthemtn, but couldn't find it.
> > > >
> > > >
> > > > Cheers
> > > >
> > > > Bruno
> > > >
> > > >
> > > >
> > > > On Saturday, 25 July 2020, 11:08:12 pm NZST, Jeff Zemerick <
> > > > jzemer...@apache.org> wrote:
> > > >
> > > >
> > > >
> > > >
> > > >
> > > > Hi folks,
> > > >
> > > > I have posted a 1st release candidate for the Apache OpenNLP 1.9.3
> > > release
> > > > and it is ready for testing.
> > > >
> > > > The distributables can be downloaded from:
> > > >
> > > >
> > >
> https://repository.apache.org/content/repositories/orgapacheopennlp-1027/org/apache/opennlp/opennlp-distr/1.9.3/
> > > >
> > > > The release was made from the Apache OpenNLP 1.9.3 tag at:
> > > > https://github.com/apache/opennlp/tree/opennlp-1.9.3
> > > >
> > > > To use it in a maven build set the version for opennlp-tools or
> > > > opennlp-uima to 1.9.3 and add the following URL to your settings.xml
> > > file:
> > > >
> https://repository.apache.org/content/repositories/orgapacheopennlp-1027
> > > >
> > > > The release was made using the OpenNLP release process, documented
> on the
> > > > website:
> > > > https://opennlp.apache.org/release.html
> > > >
> > > > Please vote on releasing these packages as Apache OpenNLP 1.9.3. The
> vote
> > > > is open for at least the next 72 hours.
> > > >
> > > > Only votes from OpenNLP PMC are binding, but everyone is welcome to
> check
> > > > the release candidate and vote.
> > > > The vote passes if at least three binding +1 votes are cast.
> > > >
> > > > [ ] +1 Release the packages as Apache OpenNLP 1.9.3
> > > > [ ] -1 Do not release the packages because...
> > > >
> > > > Thanks!
> > > >
> > > > Jeff
> > > >
> > >
>


Re: [VOTE] Apache OpenNLP 1.9.2 Release Candidate

2019-12-23 Thread Suneel Marthi
+1 binding


On Mon, Dec 23, 2019 at 9:28 AM Tommaso Teofili 
wrote:

> +1 (binding)
>
> tag build succeeds (jdk 8), signatures ok.
>
> Regards,
> Tommaso
>
> On Mon, 23 Dec 2019 at 13:32, Jeff Zemerick  wrote:
>
> > +1 binding
> >
> > verified signatures
> > built and tested from opennlp-1.9.2 tag using openjdk 8
> >
> > On Fri, Dec 20, 2019 at 11:07 AM Jeff Zemerick 
> > wrote:
> >
> > > Hi folks,
> > >
> > > I have posted a 1st release candidate for the Apache OpenNLP 1.9.2
> > release
> > > and it is ready for testing.
> > >
> > > The distributables can be downloaded from:
> > >
> > >
> >
> https://repository.apache.org/content/repositories/orgapacheopennlp-1026/org/apache/opennlp/opennlp-distr/1.9.2/
> > >
> > > The release was made from the Apache OpenNLP 1.9.2 tag at:
> > > https://github.com/apache/opennlp/tree/opennlp-1.9.2
> > >
> > > To use it in a maven build set the version for opennlp-tools or
> > > opennlp-uima to 1.9.2 and add the following URL to your settings.xml
> > file:
> > >
> https://repository.apache.org/content/repositories/orgapacheopennlp-1026
> > >
> > > The release was made using the OpenNLP release process, documented on
> the
> > > website:
> > > https://opennlp.apache.org/release.html
> > >
> > > Please vote on releasing these packages as Apache OpenNLP 1.9.2. The
> vote
> > > is open for at least the next 72 hours.
> > >
> > > Only votes from OpenNLP PMC are binding, but everyone is welcome to
> check
> > > the release candidate and vote.
> > > The vote passes if at least three binding +1 votes are cast.
> > >
> > > [ ] +1 Release the packages as Apache OpenNLP 1.9.2
> > > [ ] -1 Do not release the packages because...
> > >
> > > Thanks!
> > >
> > > Jeff
> > >
> >
>


Re: [VOTE] Apache OpenNLP 1.9.1 Release Candidate 2

2018-12-27 Thread Suneel Marthi
+1 binding

On Thu, Dec 27, 2018 at 10:52 PM  wrote:

> Hi Folks,
>Just made a whole bunch of new models using universal-dependencies and
> openNLP 1.9.1.  So far so good.
>
> +1 from me.
>
> Daniel
>
>
>
> > On Dec 27, 2018, at 2:39 PM, Jeff Zemerick  wrote:
> >
> > Hi folks,
> >
> > I have posted a 2nd release candidate for the Apache OpenNLP 1.9.1
> release
> > and it is ready for testing.
> >
> > The distributables can be downloaded from:
> >
> https://repository.apache.org/content/repositories/orgapacheopennlp-1025/org/apache/opennlp/opennlp-distr/1.9.1/
> >
> > The release was made from the Apache OpenNLP 1.9.1 tag at:
> > https://github.com/apache/opennlp/tree/opennlp-1.9.1-rc2
> >
> > To use it in a maven build set the version for opennlp-tools or
> > opennlp-uima to 1.9.1 and add the following URL to your settings.xml
> file:
> > https://repository.apache.org/content/repositories/orgapacheopennlp-1025
> >
> > The release was made using the OpenNLP release process, documented on the
> > website:
> > https://opennlp.apache.org/release.html
> >
> > Please vote on releasing these packages as Apache OpenNLP 1.9.1. The vote
> > is open for at least the next 72 hours.
> >
> > Only votes from OpenNLP PMC are binding, but everyone is welcome to check
> > the release candidate and vote.
> > The vote passes if at least three binding +1 votes are cast.
> >
> > [ ] +1 Release the packages as Apache OpenNLP 1.9.1
> > [ ] -1 Do not release the packages because...
> >
> > Thanks!
> >
> > Jeff
>
>


Re: [VOTE] Apache OpenNLP 1.9.0 Release Candidate 2

2018-06-29 Thread Suneel Marthi
+1 binding

On Fri, Jun 29, 2018 at 9:02 AM, Joern Kottmann  wrote:

> +1
>
> Jörn
>
> On Fri, Jun 29, 2018 at 1:45 PM, Jeff Zemerick 
> wrote:
> > Hi folks,
> >
> > I have posted a 2nd release candidate for the Apache OpenNLP 1.9.0
> release
> > and it is ready for testing.
> >
> > The distributables can be downloaded from:
> > https://repository.apache.org/content/repositories/
> orgapacheopennlp-1022/org/apache/opennlp/opennlp-distr/1.9.0/
> >
> > The release was made from the Apache OpenNLP 1.9.0 RC2 tag at:
> > https://github.com/apache/opennlp/tree/opennlp-1.9.0-rc2
> >
> > To use it in a maven build set the version for opennlp-tools or
> > opennlp-uima to 1.9.0 and add the following URL to your settings.xml
> file:
> > https://repository.apache.org/content/repositories/orgapacheopennlp-1022
> >
> > The release was made using the OpenNLP release process, documented on the
> > website:
> > https://opennlp.apache.org/release.html
> >
> > Please vote on releasing these packages as Apache OpenNLP 1.9.0. The vote
> > is open for at least the next 72 hours.
> >
> > Only votes from OpenNLP PMC are binding, but everyone is welcome to check
> > the release candidate and vote.
> > The vote passes if at least three binding +1 votes are cast.
> >
> > [ ] +1 Release the packages as Apache OpenNLP 1.9.0
> > [ ] -1 Do not release the packages because...
> >
> > Thanks!
> > Jeff
>


Re: [VOTE] Apache OpenNLP 1.9.0 Release Candidate

2018-06-22 Thread Suneel Marthi
+1 binding

1. Clean build from src and all unit tests pass
2. Verified sigs and hashes


On Fri, Jun 22, 2018 at 12:16 PM, Jeff Zemerick 
wrote:

> Hi folks,
>
> I have posted a first release candidate for the Apache OpenNLP 1.9.0
> release and it is ready for testing.
>
> The distributables can be downloaded from:
> https://repository.apache.org/content/repositories/
> orgapacheopennlp-1021/org/apache/opennlp/opennlp-distr/1.9.0/
>
> The release was made from the Apache OpenNLP 1.9.0 tag at:
> https://github.com/apache/opennlp/tree/opennlp-1.9.0
>
> To use it in a maven build set the version for opennlp-tools or
> opennlp-uima to 1.9.0 and add the following URL to your settings.xml file:
> https://repository.apache.org/content/repositories/orgapacheopennlp-1021
>
> The release was made using the OpenNLP release process, documented on the
> website:
> https://opennlp.apache.org/release.html
>
> Please vote on releasing these packages as Apache OpenNLP 1.9.0. The vote
> is open for at least the next 72 hours.
>
> Only votes from OpenNLP PMC are binding, but everyone is welcome to check
> the release candidate and vote.
> The vote passes if at least three binding +1 votes are cast.
>
> [ ] +1 Release the packages as Apache OpenNLP 1.9.0
> [ ] -1 Do not release the packages because...
>
> Thanks!
> Jeff
>


Re: [DISCUSS] - (ONIP-1) Better language model support

2018-01-27 Thread Suneel Marthi
Thanks Tommaso.

Could u share a google doc with the design, we can post the same onto the
Wiki after the Google doc's been finalized.

Its easier to comment on and make changes to a Google doc.

On Sat, Jan 27, 2018 at 9:50 AM, Tommaso Teofili 
wrote:

> Hi all,
>
> recently I've created
> https://cwiki.apache.org/confluence/display/OPENNLP/
> ONIP-1+Better+language+model+support
> as
> a description of possible useful improvements to our ngram language model
> implementation.
> Feedback welcome.
>
> Regards,
> Tommaso
>
> p.s.:
> we created a wiki page containing possible such improvements at
> https://cwiki.apache.org/confluence/display/OPENNLP/
> OpenNLP+Improvement+Proposals,
> feel free to create other proposals
>


Re: [VOTE] Apache OpenNLP 1.8.4 Release Candidate

2017-12-21 Thread Suneel Marthi
+1 binding

On Thu, Dec 21, 2017 at 11:44 AM, Tommaso Teofili  wrote:

> +1 build ok, tag ok, sigs ok
>
> Tommaso
>
> Il giorno gio 21 dic 2017 alle ore 17:35 Dan Russ  ha
> scritto:
>
> > [ X] +1 Release the packages as Apache OpenNLP 1.8.4
> >
> > > On Dec 21, 2017, at 9:44 AM, Jeff Zemerick 
> wrote:
> > >
> > > Hi Folks,
> > >
> > > I have posted a first release candidate for the Apache OpenNLP 1.8.4
> > > release and it is ready for testing.
> > >
> > > The RC1 distributables can be downloaded from here:
> > >
> > https://repository.apache.org/content/repositories/
> orgapacheopennlp-1020/org/apache/opennlp/opennlp-distr/1.8.4
> > >
> > > The release was made from the Apache OpenNLP 1.8.4 tag at
> > > https://github.com/apache/opennlp/tree/opennlp-1.8.4
> > >
> > > To use it in a maven build set the version for opennlp-tools or
> > > opennlp-uima to 1.8.4 and add the following URL to your settings.xml
> > file:
> > > https://repository.apache.org/content/repositories/
> orgapacheopennlp-1020
> > >
> > > The release was made using the OpenNLP release process, documented on
> the
> > > Wiki here:
> > > https://cwiki.apache.org/confluence/display/OPENNLP/Release+Process
> > >
> > > The release contains quite some changes, please refer to the contained
> > > issue list for details.
> > >
> > > Please vote on releasing these packages as Apache OpenNLP 1.8.4. The
> > vote is
> > > open for at least the next 72 hours.
> > >
> > > Only votes from OpenNLP PMC are binding, but folks are welcome to check
> > the
> > > release candidate and voice their approval or disapproval. The vote
> > passes
> > > if at least three binding +1 votes are cast.
> > >
> > > [ ] +1 Release the packages as Apache OpenNLP 
> > > [ ] -1 Do not release the packages because...
> > >
> > > Thanks!
> > > Jeff Zemerick
> >
> >
>


Re: [VOTE] Language Detector model for Apache OpenNLP 1.8.3 Release Candidate 3

2017-10-31 Thread Suneel Marthi
+1 binding

Sent from my iPhone

> On Oct 31, 2017, at 3:04 PM, Rodrigo Agerri  wrote:
> 
> +1
> 
> Rodrigo
> 
> On Tue, Oct 31, 2017 at 2:37 AM, Koji Sekiguchi
>  wrote:
>> +1
>> 
>> - checked text files in the zipped model file
>> - verified signatures
>> - executed LanguageDetector using the model file
>> 
>> Koji
>> 
>> 
>>> On 2017/10/30 22:30, William Colen wrote:
>>> 
>>> The Apache OpenNLP PMC would like to call for a Vote on the Language
>>> Detector model for Apache OpenNLP 1.8.3 Release Candidate 3.
>>> 
>>> The Release artifacts can be downloaded from:
>>> 
>>> http://people.apache.org/~colen/models/langdetect-183/rc3/
>>> 
>>> The model was built with Apache OpenNLP 1.8.3 release, trained with a
>>> portion of the Leipzig corpus, which can be found under this  tag:
>>> 
>>> https://svn.apache.org/repos/bigdata/opennlp/tags/langdetect-183_RC3
>>> 
>>> The model binary includes the NOTICE, LICENSE and also a README with
>>> details of supported languages, how the Leipzig corpus was created and the
>>> model was trained. For your convenience the README is available here:
>>> 
>>> 
>>> https://svn.apache.org/repos/bigdata/opennlp/tags/langdetect-183_RC3/leipzig/resources/README.txt
>>> 
>>> A detailed evaluation report is available here:
>>> 
>>> 
>>> http://people.apache.org/~colen/models/langdetect-183/rc3/langdetect-183.bin.report.txt
>>> 
>>> To use Language Detector, please follow the documentation here:
>>> 
>>> http://opennlp.apache.org/docs/1.8.3/manual/opennlp.html#tools.langdetect
>>> 
>>> It is important to note that this model is trained for and works well with
>>> longer texts that have at least 2 sentences or more from the same
>>> language.
>>> 
>>> The artifacts have been signed with the Key - 524A9649
>>>  found at
>>> 
>>> http://people.apache.org/keys/group/opennlp.asc
>>> 
>>> Please vote on releasing the model as Apache OpenNLP Language Detector
>>> Model 1.8.3. The vote is open for either the next 72 hours or a minimum of
>>> 3 +1 PMC binding votes
>>> whichever happens earlier.
>>> 
>>> Only votes from OpenNLP PMC are binding, but folks are welcome to check
>>> the
>>> release candidate and voice their approval or disapproval. The vote passes
>>> if at least three binding +1 votes are cast.
>>> 
>>> [ ] +1 Release the packages as Apache OpenNLP Language Detector Model
>>> 1.8.3
>>> 
>>> [ ] -1 Do not release the packages because...
>>> 
>>> Thanks again to all the committers and contributors for their work over
>>> the
>>> past few weeks.
>>> 
>> 


[ANNOUNCE] Apache OpenNLP 1.8.3 Release

2017-10-27 Thread Suneel Marthi
The Apache OpenNLP team is pleased to announce the release of Apache
OpenNLP 1.8.3.


The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text.

It supports the most common NLP tasks, such as tokenization, sentence
segmentation, part-of-speech tagging, named entity extraction, chunking,
parsing, and coreference resolution.

Apache OpenNLP 1.8.3 binary and source distributions are available for
download from our download page:
http://opennlp.apache.org/cgi-bin/download.cgi

The OpenNLP library is distributed by Maven Central as well. See the Maven
Dependency page for more details:
http://opennlp.apache.org/maven-dependency.html

This release introduces new features, improvements and bug fixes. Java 1.8
and Maven 3.3.9 are required.

Additionally the release contains the following noteworthy changes:

- New experimental API for Word Vectors and support for Glove vector files

- Code cleanups and addition of test cases

- Java 9 module name is now set to org.apache.opennlp.tools

- All Sample objects now implement Serializable to better work with
distributed frameworks like Apache Flink

A detailed list of the issues related to this release can be found in the
release notes.

For a complete list of fixed bugs and improvements please see the README.html
file included in the distribution.

--The Apache OpenNLP Team


Re: [VOTE] Apache OpenNLP 1.8.3 Release Candidate

2017-10-27 Thread Suneel Marthi
Thanks for the Votes - we are past the 72 hrs and this vote is now closed.

Results:

10 +1 binding
1 +1 non-binding

This vote now passes, will send notice out once the release is finalized.


On Thu, Oct 26, 2017 at 1:54 PM, Koji Sekiguchi <koji.sekigu...@rondhuit.com
> wrote:

> +1 binding.
>
> Koji
>
>
> On 2017/10/24 18:29, Suneel Marthi wrote:
>
>> The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP
>> 1.8.3 Release Candidate.
>>
>> The Release artifacts can be downloaded from:
>>
>> https://repository.apache.org/content/repositories/orgapache
>> opennlp-1010/org/apache/opennlp/opennlp-distr/1.7.2/
>>
>> The release was made from the Apache OpenNLP 1.8.3 tag at
>>
>> https://github.com/apache/opennlp/tree/opennlp-1.8.3
>>
>> To use it in a maven build set the version for opennlp-tools or
>> opennlp-uima
>> to 1.8.3
>>
>> and add the following URL to your settings.xml file:
>>
>> https://repository.apache.org/content/repositories/orgapache
>> opennlp-1019/org/apache/opennlp/opennlp-distr/1.8.3/
>>
>> The artifacts have been signed with the Key - D3541808 found at
>>
>> http://people.apache.org/keys/group/opennlp.asc
>>
>> Please vote on releasing these packages as Apache OpenNLP 1.8.3. The vote
>> is
>>
>> open for either the next 72 hours or a minimum of 3 +1 PMC binding votes
>> whichever happens earlier.
>>
>> Only votes from OpenNLP PMC are binding, but folks are welcome to check
>> the
>>
>> release candidate and voice their approval or disapproval. The vote passes
>>
>> if at least three binding +1 votes are cast.
>>
>> [ ] +1 Release the packages as Apache OpenNLP 1.8.3
>>
>> [ ] -1 Do not release the packages because...
>>
>> Thanks again to all the committers and contributors for their work
>> over the past
>> few weeks.
>>
>>
>
> --
> 最新ブログ記事~LUCENE-6819: Good bye index-time boost
> http://lucene.jugem.jp/?eid=485
> ==
> 株式会社 ロンウイット
> 関口宏司
> 105-0003 東京都港区西新橋1-18-6
> クロスオフィス内幸町 11階
> TEL 03-5288-5927
> FAX 03-5288-5928
> http://www.rondhuit.com/
> ブログ http://lucene.jugem.jp/
>


Re: [VOTE] Apache OpenNLP 1.8.3 Release Candidate

2017-10-25 Thread Suneel Marthi
+1 binding

1. Verified Sigs and hashes
2. Ran a clean build from {src} * {zip, tar}
3. All unit tests pass

On Wed, Oct 25, 2017 at 3:08 PM, Bruno P. Kinoshita <
brunodepau...@yahoo.com.br.invalid> wrote:

> [ X ] +1 Release the packages as Apache OpenNLP 1.8.3
>
> `mvn clean test install` working fine, checked artefacts signatures,
> matching with what was in the vote e-mail.
>
> Currently on tag 1.8.3, commit b317159cb9857dc509c08a31a98dc61209f39bff
>
> Thanks for preparing this release.
>
> Cheers
> Bruno
>
>
>
> ____
> From: Suneel Marthi <smar...@apache.org>
> To: dev@opennlp.apache.org; us...@opennlp.apache.org
> Sent: Tuesday, 24 October 2017 10:29 PM
> Subject: [VOTE] Apache OpenNLP 1.8.3 Release Candidate
>
>
>
> The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP
>
> 1.8.3 Release Candidate.
>
>
> The Release artifacts can be downloaded from:
>
>
> https://repository.apache.org/content/repositories/orgapache
>
> opennlp-1010/org/apache/opennlp/opennlp-distr/1.7.2/
>
>
> The release was made from the Apache OpenNLP 1.8.3 tag at
>
>
> https://github.com/apache/opennlp/tree/opennlp-1.8.3
>
>
> To use it in a maven build set the version for opennlp-tools or
> opennlp-uima
>
> to 1.8.3
>
>
> and add the following URL to your settings.xml file:
>
>
> https://repository.apache.org/content/repositories/
> orgapacheopennlp-1019/org/apache/opennlp/opennlp-distr/1.8.3/
>
>
> The artifacts have been signed with the Key - D3541808 found at
>
>
> http://people.apache.org/keys/group/opennlp.asc
>
>
> Please vote on releasing these packages as Apache OpenNLP 1.8.3. The vote
> is
>
>
> open for either the next 72 hours or a minimum of 3 +1 PMC binding votes
>
> whichever happens earlier.
>
>
> Only votes from OpenNLP PMC are binding, but folks are welcome to check the
>
>
> release candidate and voice their approval or disapproval. The vote passes
>
>
> if at least three binding +1 votes are cast.
>
>
> [ ] +1 Release the packages as Apache OpenNLP 1.8.3
>
>
> [ ] -1 Do not release the packages because...
>
>
> Thanks again to all the committers and contributors for their work
>
> over the past
>
> few weeks.
>


[VOTE] Apache OpenNLP 1.8.3 Release Candidate

2017-10-24 Thread Suneel Marthi
The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP
1.8.3 Release Candidate.

The Release artifacts can be downloaded from:

https://repository.apache.org/content/repositories/orgapache
opennlp-1010/org/apache/opennlp/opennlp-distr/1.7.2/

The release was made from the Apache OpenNLP 1.8.3 tag at

https://github.com/apache/opennlp/tree/opennlp-1.8.3

To use it in a maven build set the version for opennlp-tools or opennlp-uima
to 1.8.3

and add the following URL to your settings.xml file:

https://repository.apache.org/content/repositories/orgapacheopennlp-1019/org/apache/opennlp/opennlp-distr/1.8.3/

The artifacts have been signed with the Key - D3541808 found at

http://people.apache.org/keys/group/opennlp.asc

Please vote on releasing these packages as Apache OpenNLP 1.8.3. The vote is

open for either the next 72 hours or a minimum of 3 +1 PMC binding votes
whichever happens earlier.

Only votes from OpenNLP PMC are binding, but folks are welcome to check the

release candidate and voice their approval or disapproval. The vote passes

if at least three binding +1 votes are cast.

[ ] +1 Release the packages as Apache OpenNLP 1.8.3

[ ] -1 Do not release the packages because...

Thanks again to all the committers and contributors for their work
over the past
few weeks.


[ANNOUNCE] Apache OpenNLP 1.8.2 Release

2017-09-16 Thread Suneel Marthi
The Apache OpenNLP team is pleased to announce the release of version
1.8.2 of Apache OpenNLP.

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text.

It supports the most common NLP tasks, such as tokenization, sentence
segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution.

The OpenNLP 1.8.2 binary and source distributions are available for
download from our download page:
http://opennlp.apache.org/cgi-bin/download.cgi

The OpenNLP library is distributed by Maven Central as well. See the
Maven Dependency page for more details:
http://opennlp.apache.org/maven-dependency.html
Requirements 


Java 8 is required to run OpenNLP, Maven 3.3.9 is required for building it
Building from the Source Distribution


To build everything execute the following command in the root folder:
mvn clean install

The results of the build will be placed in:
opennlp-distr/target/apache-opennlp-1.8.2-bin.tar-gz (or .zip)
What's new in Apache OpenNLP 1.8.2

This release introduces some minor improvements and bug fixes.

The release contains the following noteworthy changes:

- The Leipzig format support was improved to extract data for
langdetect model training
- Maxents loglikelihood treshold can be configured by the user
- Added data verification for the eval data
- Fixed handling of xml parsers used through out the package

A detailed list of the issues related to this release can be found in
the release notes.

Thanks again to all contributors and committers for their help.

--The Apache OpenNLP Team


Re: [VOTE] Apache OpenNLP 1.8.2 Release Candidate 2

2017-09-12 Thread Suneel Marthi
+1 binding

On Tue, Sep 12, 2017 at 8:10 AM, Tommaso Teofili 
wrote:

> +1
>
> Tommaso
>
> Il giorno lun 11 set 2017 alle ore 09:12 Joern Kottmann <
> kottm...@gmail.com>
> ha scritto:
>
> > Hi Folks,
> >
> >
> > I have posted a second release candidate for the Apache OpenNLP 1.8.2
> > release and it is ready for testing.
> >
> >
> > The RC 2 distributables can be downloaded from here:
> >
> > https://repository.apache.org/content/repositories/
> orgapacheopennlp-1018/org/apache/opennlp/opennlp-distr/1.8.2/
> >
> >
> > The release was made from the Apache OpenNLP 1.8.2 tag at
> > https://github.com/apache/opennlp/tree/opennlp-1.8.2
> >
> >
> > To use it in a maven build set the version for opennlp-tools or
> > opennlp-uima to 1.8.2 and add the following URL to your settings.xml
> > file:
> > https://repository.apache.org/content/repositories/orgapacheopennlp-1018
> >
> > The release was made using the OpenNLP release process, documented on
> > the Wiki here:
> > https://cwiki.apache.org/confluence/display/OPENNLP/Release+Process
> >
> > The release contains quite some changes, please refer to the contained
> > issue list for details.
> >
> >
> > Please vote on releasing these packages as Apache OpenNLP 1.8.2. The vote
> > is
> > open for at least the next 72 hours.
> >
> >
> > Only votes from OpenNLP PMC are binding, but folks are welcome to check
> the
> > release candidate and voice their approval or disapproval. The vote
> passes
> > if at least three binding +1 votes are cast.
> >
> >
> > [ ] +1 Release the packages as Apache OpenNLP 1.8.2
> > [ ] -1 Do not release the packages because...
> >
> >
> > Thanks!
> >
> > Jörn
> >
> > P.S. Here is my +1.
> >
>


Re: [VOTE] Apache OpenNLP 1.8.2 Release Candidate

2017-09-04 Thread Suneel Marthi
+1 binding

On Mon, Sep 4, 2017 at 5:41 PM, Joern Kottmann  wrote:

> Hi Folks,
>
>
> I have posted a first release candidate for the Apache OpenNLP 1.8.2
> release and it is ready for testing.
>
>
> The RC 1 distributables can be downloaded from here:
> https://repository.apache.org/content/repositories/
> orgapacheopennlp-1017/org/apache/opennlp/opennlp-distr/1.8.2/
>
>
> The release was made from the Apache OpenNLP 1.8.2 tag at
> https://github.com/apache/opennlp/tree/opennlp-1.8.2
>
>
> To use it in a maven build set the version for opennlp-tools or
> opennlp-uima to 1.8.2 and add the following URL to your settings.xml
> file:
> https://repository.apache.org/content/repositories/orgapacheopennlp-1017
>
> The release was made using the OpenNLP release process, documented on
> the Wiki here:
> https://cwiki.apache.org/confluence/display/OPENNLP/Release+Process
>
> The release contains quite some changes, please refer to the contained
> issue list for details.
>
>
> Please vote on releasing these packages as Apache OpenNLP 1.8.2. The vote
> is
> open for at least the next 72 hours.
>
>
> Only votes from OpenNLP PMC are binding, but folks are welcome to check the
> release candidate and voice their approval or disapproval. The vote passes
> if at least three binding +1 votes are cast.
>
>
> [ ] +1 Release the packages as Apache OpenNLP 1.8.2
> [ ] -1 Do not release the packages because...
>
>
> Thanks!
>
> Jörn
>
> P.S. Here is my +1.
>


Re: Releasing a Language Detection Model

2017-07-11 Thread Suneel Marthi
...one last point before wrapping up this discussion.  Is it possible to
that u could have more than one lang detect model but trained with
different algorithms - like say 'MaxEnt', 'Naive Bayes', ' Perceptron'

Questions:

1.   Do we just publish one model trained on a specific algorithm, if so
the metadata would have the algorithm information ?

2.  Do we publish multiple models for the same task, each trained on
different algorithms ?



On Tue, Jul 11, 2017 at 9:30 AM, Joern Kottmann  wrote:

> Hello,
>
> right, very good point, I also think that it is very important to load
> a model in one from the classpath.
>
> I propose we have the following setup:
> - One jar contains one or multiple model packages (thats the zip container)
> - A model name itself should be kind of unique  e.g. eng-ud-token.bin
> - A user loads the model via: new
> SentenceModel(getClass().getResource("eng-ud-sent.bin")) <- the stream
> gets then closed properly
>
>
> Lets take away three things from this discussion:
> 1) Store the data in a place where the community can access it
> 2) Offer models on our download page similar as it is done today on
> the SourceForge page
> 3) Release models packed inside a jar file via maven central
>
> Jörn
>
>
>
>
>
>
>
> On Tue, Jul 11, 2017 at 3:00 PM, Aliaksandr Autayeu
>  wrote:
> > To clarify on models and jars.
> >
> > Putting model inside jar might not be a good idea. I mean here things
> like
> > bla-bla.jar/en-sent.bin. Our models are already zipped, so they are
> "jars"
> > already in a sense. We're good. However, current packaging and metadata
> > might not be very classpath friendly.
> >
> > The use case I have in mind is being able to add needed models as
> > dependencies and load them by writing a line of code. For this case
> having
> > all models in a root with the same name might not be very convenient.
> Same
> > goes for manifest. The name "manifest.properties" is quite generic and
> it's
> > not too far-fetched to see some clashes because some other lib also
> > manifests something. It might be better to allow for some flexibility and
> > to adhere to classpath conventions. For example, having manifests in
> > something like org/apache/opennlp/models/manifest.properties. Or
> > opennlp/tools/manifest.properties. And perhaps even allowing to
> reference a
> > model in the manifest, so the model can be put elsewhere. Just in case
> > there are several custom models of the same kind for different pipelines
> in
> > the same app. For example, processing queries with one pipeline - one set
> > of models - and processing documents with another pipeline - another set
> of
> > models. In this case allowing for different classpaths is needed.
> >
> > Perhaps to illustrate my thinking, something like this (which still
> keeps a
> > lot of possibilities open):
> > en-sent.bin/opennlp/tools/sentdetect/manifest.properties (perhaps
> contains
> > a line with something like model =
> > /opennlp/tools/sentdetect/model/sent.model)
> > en-sent.bin/opennlp/tools/sentdetect/model/sent.model
> >
> > This allows including en-sent.bin as dependency. And then doing something
> > like
> > SentenceModel sdm = SentenceModel.getDefaultResourceModel(); // if we
> want
> > default models in this way. Seems verbose enough to allow for some safety
> > through explicitness. That's if we want any defaults at all.
> > Or something like:
> > SentenceModel sdm =
> > SentenceModel.getResourceModel("/opennlp/tools/sentdetect/manifest.
> properties");
> > Or
> > SentenceModel sdm =
> > SentenceModel.getResourceModel("/opennlp/tools/sentdetect/model/sent.
> model");
> > Or more in-line with a current style:
> > SentenceModel sdm = new
> > SentenceModel("/opennlp/tools/sentdetect/model/sent.model"); // though
> here
> > we commit to interpreting String as classpath reference. That's why I'd
> > prefer more explicit method names.
> > Or leave dealing with resources to the users, leave current code intact
> and
> > provide only packaging and distribution:
> > SentenceModel sdm = new
> > SentenceModel(this.getClass().getResourceAsStream("/.../.../manifest or
> > model"));
> >
> >
> > And to add to model metadata also F1\accuracy (at least CV-based, for
> > example 10-fold) for quick reference or quick understanding of what that
> > model is capable of. Could be helpful for those with a bunch of models
> > around. And for others as well to have a better insight about the model
> in
> > question.
> >
> >
> >
> > On 11 July 2017 at 06:37, Chris Mattmann  wrote:
> >
> >> Hi,
> >>
> >> FWIW, I’ve seen CLI tools – lots in my day – that can load from the CLI
> to
> >> override an
> >> internal classpath dependency. This is for people in environments who
> want
> >> a sensible
> >> / delivered internal classpath default and the ability for run-time, non
> >> zipped up/messing
> >> with JAR file override. Think about people who are using OpenNLP in both
> >> 

[Announce] Apache OpenNLP 1.8.1 Release

2017-07-08 Thread Suneel Marthi
The Apache OpenNLP team is pleased to announce the release of version
1.8.1 of Apache OpenNLP.

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text.

It supports the most common NLP tasks, such as tokenization, sentence
segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution.

The OpenNLP 1.8.1 binary and source distributions are available for
download from http://opennlp.apache.org/download.html.

The OpenNLP library is distributed by Maven Central as well. See
http://opennlp.apache.org/maven-dependency.html for more details.

Java 1.8 is required to run OpenNLP Maven 3.3.9 is required for
building it building from the Source Distribution.

# What's new in Apache OpenNLP 1.8.1

This release introduces many new features, improvements and bug fixes.
The API has been improved for a better consistency and many deprecated
methods were removed. Java 1.8 is required.

Additionally the release contains the following noteworthy changes:

- A new Language Detection Component
- Support for Irish Sentence Bank formats
- Support to train the sentence detector and tokenizer on the UD corpus
- Evaluation tests now support ISO-639-3 language codes
- Convenience methods to load models from a path
- Refactored the Data Indexer Code
- Optimized NGram creation loop to better leverage CPU cache
- Refactored BratNameSampleStream
- Remove deprecated code from util package
- Redesigned web site - https://opennlp.apache.org
- New logo for the project

A detailed list of the issues related to this release can be found in
the release notes.

Thanks again to all contributors and committers for their help.

--The Apache OpenNLP Team


Re: [VOTE] Apache OpenNLP 1.8.1 Release Candidate 3

2017-07-08 Thread Suneel Marthi
Thanks for all the votes, this vote has passed and is now closed, will send
a release announcement soon once the release is finalized.

We have 8 +1 binding votes and 3 +1 non-binding votes, with no -1s or 0s.

+1 binding

Rodrigo Agerri
Dan Russ
Tommaso Teofili
Jorn Kottmann
William Colen
Chris Mattmann
Suneel Marthi
Bruno Kinoshita

+1 non-binding

Jeff Zemerick
Richard Eckart de Castilho
Madhawa Gunasekara



On Sat, Jul 8, 2017 at 4:00 AM, Madhawa Kasun Gunasekara <
madhaw...@gmail.com> wrote:

> +1 non-binding.
>
>
> Thanks,
> Madhawa
>
> Madhawa
>
> On Sat, Jul 8, 2017 at 10:33 AM, Chris Mattmann <mattm...@apache.org>
> wrote:
>
> > +1 from me, SIGS check out, checksums look good!
> >
> > Cheers,
> > Chris
> >
> > LMC-053601:apache-opennlp-1.8.1-rc3 mattmann$ for class in \-bin \-src;
> do
> > > $HOME/bin/stage_apache_rc opennlp-distr 1.8.1$class
> > https://repository.apache.org/content/repositories/
> > orgapacheopennlp-1016/org/apache/opennlp/opennlp-distr/1.8.1/; done
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 100 10.4M  100 10.4M0 0   154k  0  0:01:09  0:01:09 --:--:--
> > 233k
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 100   842  100   8420 0   1103  0 --:--:-- --:--:-- --:--:--
> > 1103
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 10032  100320 0 20  0  0:00:01  0:00:01 --:--:--
> >   20
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 10040  100400 0 40  0  0:00:01 --:--:--  0:00:01
> >   40
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 100 12.8M  100 12.8M0 0   431k  0  0:00:30  0:00:30 --:--:--
> > 650k
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 100   842  100   8420 0   1059  0 --:--:-- --:--:-- --:--:--
> > 1060
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 10032  100320 0 32  0  0:00:01 --:--:--  0:00:01
> >   32
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 10040  100400 0 43  0 --:--:-- --:--:-- --:--:--
> >   43
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 100 2290k  100 2290k0 0   258k  0  0:00:08  0:00:08 --:--:--
> > 435k
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 100   842  100   8420 0303  0  0:00:02  0:00:02 --:--:--
> >  303
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 10032  100320 0 32  0  0:00:01 --:--:--  0:00:01
> >   32
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 10040  100400 0  8  0  0:00:05  0:00:04  0:00:01
> >   11
> >   % Total% Received % Xferd  Average Speed   TimeTime Time
> > Current
> >  Dload  Upload   Total   SpentLeft
> > Speed
> > 100 3322k  100 3322k0 0   304k  0  0:00:10  0:00:10 --:--:--
> > 405k
> >   % Total% Received % Xferd  Average Speed   TimeTime 

Re: [VOTE] Apache OpenNLP 1.8.1 Release Candidate 3

2017-07-05 Thread Suneel Marthi
+1 binding

1. Ran the complete suite of Eval tests - all passed
2. Built from {source} * {tar, zip} - all unit tests pass
3. verified sigs and hashes



On Wed, Jul 5, 2017 at 9:21 AM, Suneel Marthi <smar...@apache.org> wrote:

> The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP 1.8.1
> Release Candidate 3.
>
> The Release artifacts can be downloaded from:
>
> https://repository.apache.org/content/repositories/
> orgapacheopennlp-1016/org/apache/opennlp/opennlp-distr/1.8.1/
>
> The release was made from the Apache OpenNLP 1.8.1 tag at
>
> https://github.com/apache/opennlp/tree/opennlp-1.8.1
>
> To use it in a maven build set the version for opennlp-tools or opennlp-uima
> to 1.8.1
>
> and add the following URL to your settings.xml file:
>
> https://repository.apache.org/content/repositories/orgapacheopennlp-1016/
>
> The artifacts have been signed with the Key - D3541808 found at
>
> http://people.apache.org/keys/group/opennlp.asc
>
> Please vote on releasing these packages as Apache OpenNLP 1.8.1. The vote
>  is
>
> open for the next 72 hours *ending on Saturday, July 8AM EST *.
>
> Only votes from OpenNLP PMC are binding, but folks are welcome to check
> the
>
> release candidate and voice their approval or disapproval. The vote passes
>
> if at least three binding +1 votes are cast.
>
> [ ] +1 Release the packages as Apache OpenNLP 1.8.1
>
> [ ] -1 Do not release the packages because...
>
> Thanks again to all the committers and contributors for their work over
> the past few weeks.
>


[VOTE] Apache OpenNLP 1.8.1 Release Candidate 3

2017-07-05 Thread Suneel Marthi
The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP 1.8.1
Release Candidate 3.

The Release artifacts can be downloaded from:

https://repository.apache.org/content/repositories/orgapacheopennlp-1016/org/apache/opennlp/opennlp-distr/1.8.1/

The release was made from the Apache OpenNLP 1.8.1 tag at

https://github.com/apache/opennlp/tree/opennlp-1.8.1

To use it in a maven build set the version for opennlp-tools or opennlp-uima
to 1.8.1

and add the following URL to your settings.xml file:

https://repository.apache.org/content/repositories/orgapacheopennlp-1016/

The artifacts have been signed with the Key - D3541808 found at

http://people.apache.org/keys/group/opennlp.asc

Please vote on releasing these packages as Apache OpenNLP 1.8.1. The vote is

open for the next 72 hours *ending on Saturday, July 8AM EST *.

Only votes from OpenNLP PMC are binding, but folks are welcome to check the

release candidate and voice their approval or disapproval. The vote passes

if at least three binding +1 votes are cast.

[ ] +1 Release the packages as Apache OpenNLP 1.8.1

[ ] -1 Do not release the packages because...

Thanks again to all the committers and contributors for their work
over the past
few weeks.


Re: [VOTE] Apache OpenNLP 1.8.1 Release Candidate

2017-07-01 Thread Suneel Marthi
Here's my +1 binding

1. Verified the sigs and hashsums
2. Ran a clean build of {src} * {zip, tar} and all unit tests pass
3. Verified RAT check

On Sat, Jul 1, 2017 at 11:20 AM, Suneel Marthi <smar...@apache.org> wrote:

> The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP 1.8.1
> Release Candidate.
>
> The Release artifacts can be downloaded from:
>
> https://repository.apache.org/content/repositories/
> orgapacheopennlp-1014/org/apache/opennlp/opennlp-distr/1.8.1/
>
> The release was made from the Apache OpenNLP 1.8.1 tag at
>
> https://github.com/apache/opennlp/tree/opennlp-1.8.1
>
> To use it in a maven build set the version for opennlp-tools or opennlp-uima
> to 1.8.1
>
> and add the following URL to your settings.xml file:
>
> https://repository.apache.org/content/repositories/orgapacheopennlp-1014
>
> The artifacts have been signed with the Key - D3541808 found at
>
> http://people.apache.org/keys/group/opennlp.asc
>
> Please vote on releasing these packages as Apache OpenNLP 1.8.1. The vote
>  is
>
> open for the next 72 hours *ending on Monday, July 3 11AM EST *.
>
> Only votes from OpenNLP PMC are binding, but folks are welcome to check
> the
>
> release candidate and voice their approval or disapproval. The vote passes
>
> if at least three binding +1 votes are cast.
>
> [ ] +1 Release the packages as Apache OpenNLP 1.8.1
>
> [ ] -1 Do not release the packages because...
>
> Thanks again to all the committers and contributors for their work over
> the past few weeks.
>


[VOTE] Apache OpenNLP 1.8.1 Release Candidate

2017-07-01 Thread Suneel Marthi
The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP 1.8.1
Release Candidate.

The Release artifacts can be downloaded from:

https://repository.apache.org/content/repositories/orgapacheopennlp-1014/org/apache/opennlp/opennlp-distr/1.8.1/

The release was made from the Apache OpenNLP 1.8.1 tag at

https://github.com/apache/opennlp/tree/opennlp-1.8.1

To use it in a maven build set the version for opennlp-tools or opennlp-uima
to 1.8.1

and add the following URL to your settings.xml file:

https://repository.apache.org/content/repositories/orgapacheopennlp-1014

The artifacts have been signed with the Key - D3541808 found at

http://people.apache.org/keys/group/opennlp.asc

Please vote on releasing these packages as Apache OpenNLP 1.8.1. The vote is

open for the next 72 hours *ending on Monday, July 3 11AM EST *.

Only votes from OpenNLP PMC are binding, but folks are welcome to check the

release candidate and voice their approval or disapproval. The vote passes

if at least three binding +1 votes are cast.

[ ] +1 Release the packages as Apache OpenNLP 1.8.1

[ ] -1 Do not release the packages because...

Thanks again to all the committers and contributors for their work
over the past
few weeks.


Re: [GitHub] opennlp pull request #238: Revert merging of sentiment work, no consent to m...

2017-06-29 Thread Suneel Marthi
http://opennlp.apache.org/docs/1.8.0/manual/opennlp.html#tools.doccat

On Thu, Jun 29, 2017 at 1:51 PM, Chris Mattmann  wrote:

> Thanks I will investigate the below thanks Joern. Can someone send me some
> pointers
> to the Doc Cat API that I can find? Thanks.
>
>
>
>
> On 6/29/17, 10:18 AM, "Joern Kottmann"  wrote:
>
> For 2. I would like to suggest that we implement doccat format support
> to train on that data.
>
> 3. it would be best so think about how we want to test the doccat
> component, today we don't have any tests which use lots of data to
> evaluate it.
> Probably the sentitment data could solve this for us and a train and
> evaluate test could be included in the eval tests.
>
> +1 to revert and then do these steps after the 1.8.1 release.
>
> I can apply my PR myself if nobody objects.
>
> Jörn
>
> On Thu, Jun 29, 2017 at 7:10 PM, Chris Mattmann 
> wrote:
> > Hi Rodrigo,
> >
> > This is very useful feedback that I wish we would have had a long
> time ago.
> >
> > I will look into it and see if I can reproduce the CLI error. I did
> a full build and mvn
> > install (which I though would run tests?) before commiting and as I
> posted in JIRA
> > the tests passed for me? So I will have to look into that.
> >
> > That said, given your feedback that SentimentME and the Sentiment
> Component
> > doesn’t offer much over Document Classifier I agree with you, but
> wasn’t super
> > familiar with the Document Classifier API. That said, if we can get
> the same functionality
> > by just using Document Classifier why don’t we:
> >
> > 1. Remove the SentimentME and associated code (except for the unit
> tests)
> > 2. Use the sample datasets from NetFlix & Stanford Treebank
> sentiment and
> > build models using Document Classifier API.
> > 3. Rename and keep the unit tests that test against Netflix and
> Stanford tree bank.
> >
> > That way we get basic sentiment analysis (that is working for us
> internally at JPL decently),
> > for Apache OpenNLP, and then if we want to build something better
> than a Document
> > Classification approach to sentiment we can do so.
> >
> > Thoughts?
> >
> > Thanks for your useful feedback. If everyone agrees this is a plan I
> can back out the code
> > using Joern’s revert, and then try and execute 1-3 above in a branch
> first. Thanks.
> >
> > Cheers,
> > Chris
> >
> >
> >
> > On 6/29/17, 10:03 AM, "Rodrigo Agerri"  wrote:
> >
> > Hi Chris,
> >
> > I have been interested in the new sentiment component for a
> while,
> > although truth to be told, I did not follow that closely. I have
> today
> > looked at it and test it with some of the corpora you have
> mentioned.
> > In order to do that, I checkout master to work with from this
> commit
> > onwards
> >
> > https://github.com/apache/opennlp/commit/
> 56321aab51a470cd2004b76fb1f5330881b943c1
> >
> > 1. I tried to run it from the CLI. The Sentiment component did
> not
> > appear to be available.
> > 2. I added the SentimentTrainer and Evaluator to the cmdline.CLI
> (no
> > SentimentTool is implemented to tag with a trained model).
> > 3. After that, the CLI tests did not pass. So, the CLI is
> currently
> > non functional, unless I did something wrong, always possible, of
> > course. See if you can reproduce that error.
> >
> > I therefore did the tests via API. I implemented a little test
> for
> > training, evaluating and tagging here:
> >
> > https://github.com/ixa-ehu/ixa-pipe-doc/tree/test
> >
> > I run the training on the large movies review from Stanford for
> binary
> > polarity classification
> >
> > http://ai.stanford.edu/~amaas/data/sentiment/
> >
> > and on the two little samples multiclass files added in
> resources and
> > mentioned in the previous email, using the first one for
> training and
> > the second one for testing (maxent 100 iterations, cutoff 5).
> >
> > 2. Stanford results: 0.84264
> > 3. sample multiclass: 0.73
> >
> > Given that this is a standard document classification task, I
> decided
> > to train the doccat component from the CLI:
> >
> > 1. Stanford results: 0.84264 (BOW features by default).
> > 2. sample multiclass: 0.73
> >
> > I then looked at the code of the sentiment component and saw
> that it
> > is basically a document classifier working with bag of words
> features.
> > No added functionality. So, my conclusions are:
> >
> > 1. The CLI needs to be fixed.
> > 2. The Sentiment component, as it is, 

Re: [GitHub] opennlp pull request #238: Revert merging of sentiment work, no consent to m...

2017-06-29 Thread Suneel Marthi
On Thu, Jun 29, 2017 at 1:10 PM, Chris Mattmann  wrote:

> Hi Rodrigo,
>
> This is very useful feedback that I wish we would have had a long time ago.
>
> I will look into it and see if I can reproduce the CLI error. I did a full
> build and mvn
> install (which I though would run tests?) before commiting and as I posted
> in JIRA
> the tests passed for me? So I will have to look into that.
>
> That said, given your feedback that SentimentME and the Sentiment Component
> doesn’t offer much over Document Classifier I agree with you, but wasn’t
> super
> familiar with the Document Classifier API. That said, if we can get the
> same functionality
> by just using Document Classifier why don’t we:
>
> 1. Remove the SentimentME and associated code (except for the unit tests)
> 2. Use the sample datasets from NetFlix & Stanford Treebank sentiment and
> build models using Document Classifier API.
> 3. Rename and keep the unit tests that test against Netflix and Stanford
> tree bank.
>

+1 for the above 3 steps. Let's go ahead with Step 1 today - that way we
can start planning on cutting a 1.8.1 release candidate this weekend.



>
> That way we get basic sentiment analysis (that is working for us
> internally at JPL decently),
> for Apache OpenNLP, and then if we want to build something better than a
> Document
> Classification approach to sentiment we can do so.
>
> Thoughts?
>
> Thanks for your useful feedback. If everyone agrees this is a plan I can
> back out the code
> using Joern’s revert, and then try and execute 1-3 above in a branch
> first. Thanks.
>
> Cheers,
> Chris
>
>
>
> On 6/29/17, 10:03 AM, "Rodrigo Agerri"  wrote:
>
> Hi Chris,
>
> I have been interested in the new sentiment component for a while,
> although truth to be told, I did not follow that closely. I have today
> looked at it and test it with some of the corpora you have mentioned.
> In order to do that, I checkout master to work with from this commit
> onwards
>
> https://github.com/apache/opennlp/commit/
> 56321aab51a470cd2004b76fb1f5330881b943c1
>
> 1. I tried to run it from the CLI. The Sentiment component did not
> appear to be available.
> 2. I added the SentimentTrainer and Evaluator to the cmdline.CLI (no
> SentimentTool is implemented to tag with a trained model).
> 3. After that, the CLI tests did not pass. So, the CLI is currently
> non functional, unless I did something wrong, always possible, of
> course. See if you can reproduce that error.
>
> I therefore did the tests via API. I implemented a little test for
> training, evaluating and tagging here:
>
> https://github.com/ixa-ehu/ixa-pipe-doc/tree/test
>
> I run the training on the large movies review from Stanford for binary
> polarity classification
>
> http://ai.stanford.edu/~amaas/data/sentiment/
>
> and on the two little samples multiclass files added in resources and
> mentioned in the previous email, using the first one for training and
> the second one for testing (maxent 100 iterations, cutoff 5).
>
> 2. Stanford results: 0.84264
> 3. sample multiclass: 0.73
>
> Given that this is a standard document classification task, I decided
> to train the doccat component from the CLI:
>
> 1. Stanford results: 0.84264 (BOW features by default).
> 2. sample multiclass: 0.73
>
> I then looked at the code of the sentiment component and saw that it
> is basically a document classifier working with bag of words features.
> No added functionality. So, my conclusions are:
>
> 1. The CLI needs to be fixed.
> 2. The Sentiment component, as it is, provides the same functionality
> as the document classifier.
>
> I would therefore reconsider this commit until those two issues are
> addressed. Just my opinion.
>
> Best regards,
>
> Rodrigo
>
> On Thu, Jun 29, 2017 at 5:30 PM, Chris Mattmann 
> wrote:
> >
> > Hey Joern,
> >
> > Sure, you can find the model data links here, along with our
> evaluation of them.
> >
> > http://irds.usc.edu/SentimentAnalysisParser/datasets.html
> >
> > There are other evaluations here:
> >
> > http://irds.usc.edu/SentimentAnalysisParser/models.html
> >
> > The HT provider review I cannot contribute at this time and I
> question its broad
> > applicability since it’s related to human trafficking. In addition
> we are still working
> > on publishing our analysis & evaluation of it which is why I removed
> it from the
> > commit.
> >
> > Cheers,
> > Chris
> >
> >
> >
> >
> >
> > On 6/29/17, 7:36 AM, "Joern Kottmann"  wrote:
> >
> > Which data sets did you use to evaluate this?
> > I was looking for a bit more than a sample file to train it.
> >
> > I noticed that you checked in stanford and 

Re: Public datasets for Semantic Relationship Extraction

2017-06-28 Thread Suneel Marthi
Forced me to join that group first - so will patiently wait for the group
moderator to consider/rule out my application to join that group and then
maybe I get to read that post. 



On Wed, Jun 28, 2017 at 2:44 PM, Chris Mattmann  wrote:

> Hi Team,
>
> Anything here that we can use in OpenNLP?
>
> https://www.linkedin.com/groups/131222/131222-
> 6284423593917063169?midToken=AQGRDKND99GRHQ=eml-b2_
> anet_digest_of_digests-hero-11-discussion~subject&
> trkEmail=eml-b2_anet_digest_of_digests-hero-11-discussion~
> subject-null-uh2g~j4hb54j7~h2-null-communities~group~
> discussion=urn%3Ali%3Apage%3Aemail_b2_anet_digest_of_digests%
> 3BnYRsTix4QoG8YsVuU%2FryIg%3D%3D
>
> CC’ing dev@tika too.
>
> Cheers,
> Chris
>
>
>
>
>


Re: [VOTE] Migrate our main repositories to GitHub

2017-06-28 Thread Suneel Marthi
All of the active PMC and committers have already voted +1, so its ok to
close the vote and not waste 72 hrs.





On Wed, Jun 28, 2017 at 10:03 AM, Chris Mattmann <mattm...@apache.org>
wrote:

> Hi Joern,
>
> VOTEs need to stay open for at least 72 hours…for everyone and time zones,
> etc.
>
> Is there some rush here?
>
> Cheers,
> Chris
>
>
>
>
> On 6/28/17, 3:57 AM, "Joern Kottmann" <kottm...@gmail.com> wrote:
>
> The vote passes, only +1 votes have been received:
> +1 Mark G
> +1 Rodrigo Agerri
> +1 Jeff Zemerick
> +1 Suneel Marthi
> +1 Jörn Kottmann
> +1 William Colen
> +1 Dan Russ
> +1 Anthony Beylerian
> +1 Chris Mattmann
> +1 Oleg Tikhonov
> +1 Tommaso Teofili
>
> Jörn
>
> On Wed, Jun 28, 2017 at 10:27 AM, Tommaso Teofili
> <tommaso.teof...@gmail.com> wrote:
> > +1 to migrate to gitbox [1]
> >
> > Regards,
> > Tommaso
> >
> > [1] : https://gitbox.apache.org/
> >
> > Il giorno mar 27 giu 2017 alle ore 21:54 Oleg Tikhonov <
> o...@apache.org> ha
> > scritto:
> >
> >> [x] +1 Migrate all repositories to GitHub
> >>
> >>
> >>
> >> On Tue, Jun 27, 2017 at 10:48 PM, Chris Mattmann <
> mattm...@apache.org>
> >> wrote:
> >>
> >> > If you are talking about using Apache Gitbox, then yes I am +1
> for this.
> >> >
> >> > Thanks,
> >> > Chris
> >> >
> >> >
> >> >
> >> >
> >> > On 6/27/17, 3:30 AM, "Joern Kottmann" <kottm...@gmail.com> wrote:
> >> >
> >> > Hello all,
> >> >
> >> > lets decide here if we want to move our main repository,
> currently
> >> > hosted at Apache to GitHub instead. This will make our
> process a bit
> >> > easier because we can eliminate one remote from our workflow.
> >> >
> >> >  [ ] +1 Migrate all repositories to GitHub
> >> >  [ ] -1 Do not migrate,  because...
> >> >
> >> > Thanks,
> >> > Jörn
> >> >
> >> >
> >> >
> >> >
> >>
>
>
>
>


Re: [VOTE] Migrate our main repositories to GitHub

2017-06-27 Thread Suneel Marthi
+1

मेरे iPhone से प्रेषित

२७/०६/२०१७ को पू ८:२२ पर Jeff Zemerick  ने लिखा :

> +1
> 
> On Tue, Jun 27, 2017 at 6:53 AM, Rodrigo Agerri 
> wrote:
> 
>> +1
>> 
>> R
>> 
>>> On Tue, Jun 27, 2017 at 12:46 PM, Mark G  wrote:
>>> 
>>> +1
>>> 
>>> Sent from my iPhone
>>> 
 On Jun 27, 2017, at 6:30 AM, Joern Kottmann 
>> wrote:
 
 +1
 
 Jörn
 
> On Tue, Jun 27, 2017 at 12:30 PM, Joern Kottmann 
>>> wrote:
> Hello all,
> 
> lets decide here if we want to move our main repository, currently
> hosted at Apache to GitHub instead. This will make our process a bit
> easier because we can eliminate one remote from our workflow.
> 
>[ ] +1 Migrate all repositories to GitHub
>[ ] -1 Do not migrate,  because...
> 
> Thanks,
> Jörn
>>> 
>> 


Re: [GitHub] opennlp-site pull request #20: OPENNLP-1093: bump up version of jbake-maven-...

2017-06-13 Thread Suneel Marthi
+1 lgtm

Sent from my iPhone

> On Jun 13, 2017, at 12:45 PM, kinow  wrote:
> 
> GitHub user kinow opened a pull request:
> 
>https://github.com/apache/opennlp-site/pull/20
> 
>OPENNLP-1093: bump up version of jbake-maven-plugin, and update groupId
> 
>Tested locally both running the site, and generating the files during the 
> package version.
> 
> You can merge this pull request into a Git repository by running:
> 
>$ git pull https://github.com/kinow/opennlp-site OPENNLP-1093
> 
> Alternatively you can review and apply these changes as the patch at:
> 
>https://github.com/apache/opennlp-site/pull/20.patch
> 
> To close this pull request, make a commit to your master/trunk branch
> with (at least) the following in the commit message:
> 
>This closes #20
> 
> 
> commit c04795c9032481a245afb5b4d269a1ef10831bec
> Author: Bruno P. Kinoshita 
> Date:   2017-06-13T10:44:45Z
> 
>OPENNLP-1093: bump up version of jbake-maven-plugin, and update groupId
> 
> 
> 
> 
> ---
> If your project is set up for it, you can reply to this email and have your
> reply appear on GitHub as well. If your project does not have this feature
> enabled and wishes so, or if the feature is enabled but not working, please
> contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
> with INFRA.
> ---


Berlin Buzzwords 2017, Berlin Germany

2017-06-05 Thread Suneel Marthi
A good number of Apache OpenNLP committers will be attending and speaking
at Berlin Buzzwords next week in Berlin, Germany [1].

Look out for Jorn Kottmann, Tommaso Teofili, Grant Ingersoll, Isabel
Drost-Fromm, Suneel Marthi, Uwe Schindler who will be speaking.

Scheduled Events:
. BM25 is so yesterday: Modern techniques for better search relevance -
Grant Ingersoll
  Time: Monday, 6/12, 2pm

. Embracing Diversity: Searching over multiple languages
  Time: Monday, 6/12, 12:20pm

Everybody is welcome, hope to meet with OpenNLP users there!

[1] http://berlinbuzzwords.de


Re: [VOTE] Apache OpenNLP 1.8.0 Release Candidate 3

2017-05-17 Thread Suneel Marthi
+1 binding



On Wed, May 17, 2017 at 5:48 PM, Joern Kottmann  wrote:

> The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP
> 1.8.0 Release Candidate 3.
>
> The RC 3 distributables can be downloaded from here:
> https://repository.apache.org/content/repositories/orgapacheopennlp-101
> 3/org/apache/opennlp/opennlp-distr/1.8.0/
>
> The release was made from the Apache OpenNLP 1.8.0 tag at
> https://github.com/apache/opennlp/tree/opennlp-1.8.0
>
> To use it in a maven build set the version for opennlp-tools or
> opennlp-uima to 1.8.0 and add the following URL to your settings.xml
> file:
> https://repository.apache.org/content/repositories/orgapacheopennlp-101
> 3
>
> The release was made using the OpenNLP release process, documented on
> the Wiki here:
> https://cwiki.apache.org/confluence/display/OPENNLP/Release+Process
>
> The release contains quite some changes, please refer to the contained
> issue list for details.
>
> Please vote on releasing these packages as Apache OpenNLP 1.8.0. The
> vote is open for at least the next 72 hours.
>
> Only votes from OpenNLP PMC are binding, but folks are welcome to check
> the release candidate and voice their approval or disapproval. The vote
> passes if at least three binding +1 votes are cast.
>
> [ ] +1 Release the packages as Apache OpenNLP 1.8.0
> [ ] -1 Do not release the packages because...
>
>
> Thanks!
>
> Jörn
>
> P.S. Here is my +1.
>


Re: Update web site layout

2017-03-03 Thread Suneel Marthi
There are few that r doing that presently, check out the Flink project as
also Pirk.



http://flink.apache.org
http://pirk.incubator.apache.org



On Fri, Mar 3, 2017 at 4:38 PM, Bruno P. Kinoshita <
brunodepau...@yahoo.com.br.invalid> wrote:

> Hi William,
> Would it be an option? I haven't seen any ASF project hosted primarily on
> GitHub (site or src).
>
> I'd be fine with that as well, or even the current ASF cms.
> Bruno
> Sent from Yahoo Mail on Android
>
>   On Sat, Mar 4, 2017 at 9:57, William Colen
> wrote:   Hi, Bruno,
>
> What do you think if we instead of using maven site we do it using Jekyll +
> github?
> That way we don't need to separate the site and documentation deploy.
>
> Thank you
> William
>
> 2017-03-03 10:03 GMT-03:00 Bruno P. Kinoshita <
> brunodepau...@yahoo.com.br.invalid>:
>
> > Hi all,
> >
> > Didn't find an issue for that, so thought about asking here before
> > creating a ticket in JIRA.
> >
> >
> > Normally I find what I need using the current site (normally models,
> > version, and manual) but thought maybe an update in the web site layout
> > could be a good idea?
> >
> > I thought combined with OPENNLP-6 and OPENNLP-504 (and maybe later
> > OPENNLP-48 as well) this could attract more users / developers.
> >
> > Here's an example, using the Maven Site plug-in, with the Fluid skin:
> > https://kinow.github.io/opennlp/
> >
> > Cheers
> > Bruno
> >
>
>


Re: Help Required in Code

2017-02-08 Thread Suneel Marthi
In Java both are valid syntaxes to represent an array of chars, the
preferred syntax should have been char[] eosCharacters.

The getter method actually returns a char[]

public char[] getEndOfSentenceCharacters() {
  return eosCharacters;
}


On Wed, Feb 8, 2017 at 10:01 AM, ABHISHEK MAITI 
wrote:

> Hi!
> I was going through the codebase which is present on Github. I found a line
> in this
>  tools/src/main/java/opennlp/tools/sentdetect/DefaultEndOfSentenceScanner.
> java>
> file which I couldn't understand (line no. 31).Is it supposed to be char
> eosCharacters[]? I was expecting it to be char[] eosCharacters.
>
> Thanks!
>
> *Abhishek Maiti*
> *B.Tech 2016 | Computer Science and Engineering*
> Indraprastha Institute Of Information Technology
> New Delhi
> +918447549121
>


[ANNOUNCE] Apache OpenNLP 1.7.2 Release

2017-02-04 Thread Suneel Marthi
The Apache OpenNLP team is pleased to announce the release of version 1.7.2
of Apache OpenNLP.

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text.

It supports the most common NLP tasks, such as tokenization, sentence
segmentation, part-of-speech tagging, named entity extraction, chunking,
parsing, and coreference resolution.

The OpenNLP 1.7.2 binary and source distributions are available for
download from our download page:
http://opennlp.apache.org/cgi-bin/download.cgi

The OpenNLP library is distributed by Maven Central as well. See the Maven
Dependency page for more details:
http://opennlp.apache.org/maven-dependency.html
Requirements


Java 1.8 is required to run OpenNLP Maven 3.3.9 is required for building it
Building from the Source Distribution


To build everything execute the following command in the root folder: mvn
clean install

The results of the build will be placed in:
opennlp-distr/target/apache-opennlp-1.7.2-bin.tar-gz (or .zip)
What's new in Apache OpenNLP 1.7.2


This release introduces many new features, improvements and bug fixes. The
API has been improved for a better consistency and 1.4 deprecated methods
were removed. Now Java 1.8 is required.

Additionally the release contains the following noteworthy changes:

   - Name Finder evaluation can now show a confusion matrix
   - The default evaluation output contains more details
   - Added a Language Model CLI tool
   - Add Moses format support
   - More refactoring and cleanup, specially in Machine Learning package
   and Dictionary
   - Removed deprecated trainers from UIMA integration
   - Fixed potential localization issues and added maven plugin to prevent
   it (ForbiddenAPI)
   - Fixed issues with the BRAT corpus reader
   - Deprecated GIS class, will be removed in a future 1.8.x release

A detailed list of the issues related to this release can be found in the
release notes.

Thanks again to all contributors and committers for their help.


Re: [VOTE] Apache OpenNLP 1.7.2 Release Candidate

2017-02-04 Thread Suneel Marthi
Its been past 72 hrs and below are the Vote Results:

4 +1 binding - Joern, Rodrigo, William, Suneel
2 +1 non-binding - Daniel Russ, Jeffrey Zemenick

The VOTE has passed and is now officially closed, thanks again to all
committers and contributors.

On Fri, Feb 3, 2017 at 10:43 AM, William Colen <co...@apache.org> wrote:

> +1 binding
>
> I did run the eval tests and they all run through, including the one that
> needs more memory.
>
> William
>
> 2017-02-03 13:35 GMT-02:00 Suneel Marthi <smar...@apache.org>:
>
> > +1 binding
> >
> > Verified {src, bin} * {zip, tar} and all tests pass.
> >
> > On Fri, Feb 3, 2017 at 10:08 AM, Russ, Daniel (NIH/CIT) [E] <
> > dr...@mail.nih.gov> wrote:
> >
> > > +1 (non-binding)  Have not run across problems with external code that
> > > uses OpenNLP
> > >
> > > On 2/3/17, 9:57 AM, "Rodrigo Agerri" <rodrigo.age...@ehu.eus> wrote:
> > >
> > > +1 also pass tests
> > >
> > > On Fri, Feb 3, 2017 at 3:34 PM, Jeffrey Zemerick <
> > jzemer...@apache.org
> > > >
> > > wrote:
> > >
> > > > +1 (non-binding) Build and tests pass with no issues.
> > > >
> > > >
> > > >
> > > > On Fri, Feb 3, 2017 at 4:15 AM, Joern Kottmann <
> kottm...@gmail.com
> > >
> > > wrote:
> > > >
> > > > > +1
> > > > >
> > >     > > I did run the eval tests and they all run through except one
> test
> > > which
> > > > > needed more memory, that test case has to be adapted to run
> fast
> > > and with
> > > > > much less memory, we should do that for the 1.7.3 release.
> > > > >
> > > > > Jörn
> > > > >
> > > > > On Wed, Feb 1, 2017 at 5:52 PM, Suneel Marthi <
> > smar...@apache.org>
> > > > wrote:
> > > > >
> > > > > > The Apache OpenNLP PMC would like to call for a Vote on
> Apache
> > > OpenNLP
> > > > > > 1.7.2
> > > > > > Release Candidate.
> > > > > >
> > > > > > The Release artifacts can be downloaded from:
> > > > > >
> > > > > > https://repository.apache.org/content/repositories/
> > > > > > orgapacheopennlp-1010/org/apache/opennlp/opennlp-distr/
> 1.7.2/
> > > > > >
> > > > > > The release was made from the Apache OpenNLP 1.7.2 tag at
> > > > > >
> > > > > > https://github.com/apache/opennlp/tree/opennlp-1.7.2
> > > > > >
> > > > > > To use it in a maven build set the version for opennlp-tools
> or
> > > > > > opennlp-uima
> > > > > > to 1.7.2
> > > > > >
> > > > > > and add the following URL to your settings.xml file:
> > > > > >
> > > > > > https://repository.apache.org/content/repositories/
> > > > orgapacheopennlp-1010
> > > > > >
> > > > > > The artifacts have been signed with the Key - D3541808 found
> at
> > > > > >
> > > > > > http://people.apache.org/keys/group/opennlp.asc
> > > > > >
> > > > > > Please vote on releasing these packages as Apache OpenNLP
> > 1.7.2.
> > > The
> > > > vote
> > > > > > is
> > > > > >
> > > > > > open for either the next 72 hours or a minimum of 3 +1 PMC
> > > binding
> > > > votes
> > > > > > whichever happens earlier.
> > > > > >
> > > > > > Only votes from OpenNLP PMC are binding, but folks are
> welcome
> > > to check
> > > > > the
> > > > > >
> > > > > > release candidate and voice their approval or disapproval.
> The
> > > vote
> > > > > passes
> > > > > >
> > > > > > if at least three binding +1 votes are cast.
> > > > > >
> > > > > > [ ] +1 Release the packages as Apache OpenNLP 1.7.2
> > > > > >
> > > > > > [ ] -1 Do not release the packages because...
> > > > > >
> > > > > > Thanks again to all the committers and contributors for their
> > > work
> > > > > > over the past
> > > > > > few weeks.
> > > > > >
> > > > >
> > > >
> > >
> > >
> > >
> >
>


Re: [VOTE] Apache OpenNLP 1.7.2 Release Candidate

2017-02-03 Thread Suneel Marthi
+1 binding

Verified {src, bin} * {zip, tar} and all tests pass.

On Fri, Feb 3, 2017 at 10:08 AM, Russ, Daniel (NIH/CIT) [E] <
dr...@mail.nih.gov> wrote:

> +1 (non-binding)  Have not run across problems with external code that
> uses OpenNLP
>
> On 2/3/17, 9:57 AM, "Rodrigo Agerri" <rodrigo.age...@ehu.eus> wrote:
>
> +1 also pass tests
>
> On Fri, Feb 3, 2017 at 3:34 PM, Jeffrey Zemerick <jzemer...@apache.org
> >
> wrote:
>
> > +1 (non-binding) Build and tests pass with no issues.
> >
> >
> >
> > On Fri, Feb 3, 2017 at 4:15 AM, Joern Kottmann <kottm...@gmail.com>
> wrote:
> >
> > > +1
> > >
> > > I did run the eval tests and they all run through except one test
> which
> > > needed more memory, that test case has to be adapted to run fast
> and with
> > > much less memory, we should do that for the 1.7.3 release.
> > >
> > > Jörn
> > >
> > > On Wed, Feb 1, 2017 at 5:52 PM, Suneel Marthi <smar...@apache.org>
> > wrote:
> > >
> > > > The Apache OpenNLP PMC would like to call for a Vote on Apache
> OpenNLP
> > > > 1.7.2
> > > > Release Candidate.
> > > >
> > > > The Release artifacts can be downloaded from:
> > > >
> > > > https://repository.apache.org/content/repositories/
> > > > orgapacheopennlp-1010/org/apache/opennlp/opennlp-distr/1.7.2/
> > > >
> > > > The release was made from the Apache OpenNLP 1.7.2 tag at
> > > >
> > > > https://github.com/apache/opennlp/tree/opennlp-1.7.2
> > > >
> > > > To use it in a maven build set the version for opennlp-tools or
> > > > opennlp-uima
> > > > to 1.7.2
> > > >
> > > > and add the following URL to your settings.xml file:
> > > >
> > > > https://repository.apache.org/content/repositories/
> > orgapacheopennlp-1010
> > > >
> > > > The artifacts have been signed with the Key - D3541808 found at
> > > >
> > > > http://people.apache.org/keys/group/opennlp.asc
> > > >
> > > > Please vote on releasing these packages as Apache OpenNLP 1.7.2.
> The
> > vote
> > > > is
> > > >
> > > > open for either the next 72 hours or a minimum of 3 +1 PMC
> binding
> > votes
> > > > whichever happens earlier.
> > > >
> > > > Only votes from OpenNLP PMC are binding, but folks are welcome
> to check
> > > the
> > > >
> > > > release candidate and voice their approval or disapproval. The
> vote
> > > passes
> > > >
> > > > if at least three binding +1 votes are cast.
> > > >
> > > > [ ] +1 Release the packages as Apache OpenNLP 1.7.2
> > > >
> > > > [ ] -1 Do not release the packages because...
> > > >
> > > > Thanks again to all the committers and contributors for their
> work
> > > > over the past
> > > > few weeks.
> > > >
> > >
> >
>
>
>


[VOTE] Apache OpenNLP 1.7.2 Release Candidate

2017-02-01 Thread Suneel Marthi
The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP 1.7.2
Release Candidate.

The Release artifacts can be downloaded from:

https://repository.apache.org/content/repositories/
orgapacheopennlp-1010/org/apache/opennlp/opennlp-distr/1.7.2/

The release was made from the Apache OpenNLP 1.7.2 tag at

https://github.com/apache/opennlp/tree/opennlp-1.7.2

To use it in a maven build set the version for opennlp-tools or opennlp-uima
to 1.7.2

and add the following URL to your settings.xml file:

https://repository.apache.org/content/repositories/orgapacheopennlp-1010

The artifacts have been signed with the Key - D3541808 found at

http://people.apache.org/keys/group/opennlp.asc

Please vote on releasing these packages as Apache OpenNLP 1.7.2. The vote is

open for either the next 72 hours or a minimum of 3 +1 PMC binding votes
whichever happens earlier.

Only votes from OpenNLP PMC are binding, but folks are welcome to check the

release candidate and voice their approval or disapproval. The vote passes

if at least three binding +1 votes are cast.

[ ] +1 Release the packages as Apache OpenNLP 1.7.2

[ ] -1 Do not release the packages because...

Thanks again to all the committers and contributors for their work
over the past
few weeks.


[VOTE] Apache OpenNLP 1.7.2 Release Candidate

2017-01-31 Thread Suneel Marthi
The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP 1.7.2
Release Candidate.

The Release artifacts can be downloaded from:

https://repository.apache.org/content/repositories/orgapacheopennlp-1009/org/apache/opennlp/opennlp-distr/1.7.2/


The release was made from the Apache OpenNLP 1.7.2 tag at

https://github.com/apache/opennlp/tree/opennlp-1.7.2


To use it in a maven build set the version for opennlp-tools or opennlp-uima
to 1.7.2

and add the following URL to your settings.xml file:

https://repository.apache.org/content/repositories/orgapacheopennlp-1009


The artifacts have been signed with the Key - D3541808 found at

http://people.apache.org/keys/group/opennlp.asc

Please vote on releasing these packages as Apache OpenNLP 1.7.2. The vote is

open for either the next 72 hours or a minimum of 3 +1 PMC binding votes.


Only votes from OpenNLP PMC are binding, but folks are welcome to check the

release candidate and voice their approval or disapproval. The vote passes

if at least three binding +1 votes are cast.


[ ] +1 Release the packages as Apache OpenNLP 1.7.2

[ ] -1 Do not release the packages because...

Thanks again to all the committers and contributors for their work
over the past
few weeks.

Suneel Marthi


Re: [VOTE] Apache OpenNLP 1.7.1 Release Candidate 1

2017-01-23 Thread Suneel Marthi
Thanks all for voting, its past 72 hrs and below are the vote results:

5 +1 binding - Joern, Rodrigo, William, Tommaso, Suneel
2 +1 non-binding - Richard, Jeffrey

This VOTE is now closed and the OpenNLP 1.7.1 release passes.

On Mon, Jan 23, 2017 at 10:02 AM, Rodrigo Agerri <rage...@apache.org> wrote:

> +1 to release
>
> nice
>
> R
>
> On Mon, Jan 23, 2017 at 9:33 AM, Joern Kottmann <kottm...@gmail.com>
> wrote:
>
> > +1 binding
> >
> > Jörn
> >
> > On Jan 21, 2017 12:18 AM, "Suneel Marthi" <smar...@apache.org> wrote:
> >
> > The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP
> > 1.7.1 Release Candidate.
> >
> > The Release artifacts can be downloaded from:
> > https://repository.apache.org/content/repositories/
> > orgapacheopennlp-1008/org/apache/opennlp/opennlp-distr/1.7.1/
> >
> > The release was made from the Apache OpenNLP 1.7.1 tag at
> > https://github.com/apache/opennlp/tree/opennlp-1.7.1
> >
> > To use it in a maven build set the version for opennlp-tools or
> > opennlp-uima to 1.7.1
> > and add the following URL to your settings.xml file:
> > https://repository.apache.org/content/repositories/orgapacheopennlp-1008
> >
> > The artifacts have been signed with the Key - D3541808 found at
> > http://people.apache.org/keys/group/opennlp.asc
> >
> > Please vote on releasing these packages as Apache OpenNLP 1.7.1. The vote
> > is
> > open for either the next 72 hours or a minimum of 3 +1 PMC binding votes.
> >
> > Only votes from OpenNLP PMC are binding, but folks are welcome to check
> the
> > release candidate and voice their approval or disapproval. The vote
> passes
> > if at least three binding +1 votes are cast.
> >
> > [ ] +1 Release the packages as Apache OpenNLP 1.7.1
> > [ ] -1 Do not release the packages because...
> > [ ]  0 I Care Less/I Don't Care
> >
> > Thanks again to all the committers and contributors for their work over
> the
> > past few weeks.
> >
> > Suneel Marthi
> >
>


Re: [VOTE] Apache OpenNLP 1.7.1 Release Candidate 1

2017-01-22 Thread Suneel Marthi
+1 binding

Verified Sigs and Checksums
Verified Rat check
Verified and Built {src} * {zip, tar}


On Sun, Jan 22, 2017 at 3:45 PM, William Colen 
wrote:

> +1 binding
>
> - signature
> - complete test suit OK (except for Maxent QN config minor issue)
> - package OK (except for the README minor issue)
> - CLI tools OK
>
> 2017-01-22 18:41 GMT-02:00 Tommaso Teofili :
>
> > +1
> >
> > - checked sigs
> > - build ok
> > - license ok
> >
> > Regards,
> > Tommaso
> >
> >
> > Il giorno dom 22 gen 2017 alle ore 18:18 Joern Kottmann <
> > kottm...@gmail.com>
> > ha scritto:
> >
> > > On Sat, 2017-01-21 at 21:09 -0500, Jeffrey Zemerick wrote:
> > > > I went to the opennlp-distr/README for a summary of changes in 1.7.1
> > > > but I
> > > > think it is the same as it was for 1.7.0. Is that file typically
> > > > updated
> > > > for revision releases? The link at the bottom of the RELEASE_NOTES to
> > > > the
> > > > fixed JIRA issues is issuesFixed/jira-report.html. Minor stuff but
> > > > thought
> > > > I'd ask.
> > > >
> > > >
> > >
> > > Yes, this file should be updated. And usually we do this, just this
> > > time we didn't, I think we should release anyway and if we have to do
> > > RC 2 we can update it.
> > >
> > > There is also another minor thing with a test for maxent qn which is
> > > not configured correctly.
> > >
> > > Anyway, beside that, which will be perfect in 1.7.2 I didn't know of
> > > anything which would keep us from taking RC 1 for the 1.7.1 release, I
> > > will have a more detailed look at it now.
> > >
> > > Jörn
> > >
> >
>


[VOTE] Apache OpenNLP 1.7.1 Release Candidate 1

2017-01-20 Thread Suneel Marthi
The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP
1.7.1 Release Candidate.

The Release artifacts can be downloaded from:
https://repository.apache.org/content/repositories/orgapacheopennlp-1008/org/apache/opennlp/opennlp-distr/1.7.1/

The release was made from the Apache OpenNLP 1.7.1 tag at
https://github.com/apache/opennlp/tree/opennlp-1.7.1

To use it in a maven build set the version for opennlp-tools or
opennlp-uima to 1.7.1
and add the following URL to your settings.xml file:
https://repository.apache.org/content/repositories/orgapacheopennlp-1008

The artifacts have been signed with the Key - D3541808 found at
http://people.apache.org/keys/group/opennlp.asc

Please vote on releasing these packages as Apache OpenNLP 1.7.1. The vote is
open for either the next 72 hours or a minimum of 3 +1 PMC binding votes.

Only votes from OpenNLP PMC are binding, but folks are welcome to check the
release candidate and voice their approval or disapproval. The vote passes
if at least three binding +1 votes are cast.

[ ] +1 Release the packages as Apache OpenNLP 1.7.1
[ ] -1 Do not release the packages because...
[ ]  0 I Care Less/I Don't Care

Thanks again to all the committers and contributors for their work over the
past few weeks.

Suneel Marthi


Fwd: text classification in portuguese

2017-01-19 Thread Suneel Marthi
Fyi folks

Attn: @Wcolen


-- Forwarded message --
From: Gustavo Frederico 
Date: Thu, Jan 19, 2017 at 9:59 AM
Subject: Re: text classification in portuguese
To: u...@predictionio.incubator.apache.org


Marcus, at first sight this looks like a correct Json encoding. Json itself
encodes the UTF-8 characters.

Abraço
Gustavo

On Thu, Jan 19, 2017 at 8:54 AM, Marcus Vinicius 
wrote:

> Hello guys,
>
> I`m again. I`m trying to classify a portuguese text following the demo
> tutorial (http://predictionio.incubator.apache.org/demo/textclassific
> ation/).
>
> Someone already perform this with predictionIo? How could be the better
> way to i lead with stemming and stop portuguese words?
>
> Allow me to take this opportunity to do another question. Someone has
> problem with encoding? My csv load file is in ISO-8859 and in python script
> i`m transforming my text to utf-8.
>
> text_utf8 = text.decode('iso-8859-1').encode('utf-8')
> client.create_event(
>   event="documents",
>   entity_type="source",
>   entity_id=str(count), # use the count num as user ID
>   properties= {
> "text" : text_utf8,
> "category" : attr[2],
> "label" : int(attr[3])
>   }
> )
>
> When i retrive event from http://localhost:7070/events.json i got  a
> encoded word. Is it right?
>
> {"eventId":"x","event":"documents","entityType":"source","entityId":"73","properties":{"category":"A","text":"Gest\u008bo
>  de 
> Caixa","label":2},"eventTime":"2017-01-19T12:31:27.863Z","creationTime":"2017-01-19T12:31:27.867Z"}
>
>
> I really appreciate your attention.
>
>
> --
>
> Marcus Vinicius A. Silva
>
> *P*  *ANTES DE IMPRIMIR pense em sua responsabilidade e compromisso
> com o MEIO AMBIENTE.*
>


Re: Commit message style

2017-01-09 Thread Suneel Marthi
On Mon, Jan 9, 2017 at 5:02 PM, Jeffrey Zemerick <jzemer...@apache.org>
wrote:

> I'm personally a fan of the issue number being the first thing on the
> subject line, like "OPENNLP-xxx: commit message." For me it gives a
> consistent place to look for the issue without having to read the full
> message. (That way you can also see the issue number in GitHub's commit
> list without having to expand the commit.)
>

+1


>
> On Mon, Jan 9, 2017 at 1:48 PM, Joern Kottmann <kottm...@gmail.com> wrote:
>
> > It doesn't matter where the jira# is placed, as long as it is there.
> >
> > Can be in the first line or occur somewhere later in the message,
> > for example see OPENNLP-914. There it was placed in the body.
> >
> > Jörn
> >
> > On Mon, 2017-01-09 at 13:20 -0500, Suneel Marthi wrote:
> > > I guess the reason to include the jira# at the beginning of the
> > > message is
> > > because the same would be reflected in the corresponding jira (i
> > > could be
> > > wrong here).
> > >
> > > I am not sure if omitting the issue# in the git subject line would
> > > still
> > > reflect the git convo in jira or not.
> > >
> > >
> > >
> > > On Mon, Jan 9, 2017 at 8:26 AM, Joern Kottmann <kottm...@gmail.com>
> > > wrote:
> > >
> > > > Hello all,
> > > >
> > > > we are using different styles for commit messages. It would be good
> > > > to have
> > > > a short discussion on how we think they should be and agree all on
> > > > how to
> > > > write the subject line.
> > > >
> > > > Here are few points from me:
> > > > - Good commit messages are important to understand what happened in
> > > > the
> > > > project and motivate to produce well thought through commits
> > > > - In git we have a subject line, first line in the commit message,
> > > > should
> > > > be around 50 chars, GH cuts after 72 chars and knows this
> > > > convention
> > > > - Subject line is usually written in imperative (git convention)
> > > > - Capitalize the first word (like in a new sentence)
> > > > - Commit message should contain the issue symbol
> > > >
> > > > Open questions:
> > > > - Should the issue symbol be in the subject line? Or in the body?
> > > > - Everyone fine with writing subject line in imperative?
> > > >
> > > > Here is an interesting article about it:
> > > > http://chris.beams.io/posts/git-commit/
> > > >
> > > > Jörn
> > > >
> >
>


Re: Trunk vs. Master

2017-01-09 Thread Suneel Marthi
ITs the 'master' going forward, we'll be filing an infra request to delete
the 'trunk' branch.

On Mon, Jan 9, 2017 at 1:18 PM, Russ, Daniel (NIH/CIT) [E] <
dr...@mail.nih.gov> wrote:

> Hello,
>I am a little confused by the fact we have both a trunk and a master
> branch.  Which branch should be the baseline?  Can we remove the other?
> Daniel
>
> Daniel Russ, Ph.D.
> Staff Scientist, Office of Intramural Research
> Center for Information Technology
> National Institutes of Health
> U.S. Department of Health and Human Services
> 12 South Drive
> Bethesda, MD 20892-5624
>
>


Re: OpenNLP 1.7.0 RC 2 is ready for testing

2016-12-31 Thread Suneel Marthi
The release has been finalized - please find the 1.7.0 release artifacts at
http://www.apache.org/dist/opennlp/opennlp-1.7.0/


On Sat, Dec 31, 2016 at 8:38 PM, Richard Eckart de Castilho <r...@apache.org>
wrote:

> Was the RC2 cancelled? The staging repo doesn't seem to exist (anymore)?
>
> Best,
>
> -- Richard
>
> > On 31.12.2016, at 22:16, William Colen <william.co...@gmail.com> wrote:
> >
> > +1
> >
> >
> > 2016-12-31 19:01 GMT-02:00 Suneel Marthi <smar...@apache.org>:
> >
> >> +1 non-binding
> >>
> >> 1. Verified Sigs and Hashes
> >> 2. Ran clean build from Source and all tests pass
> >> 3. Verified RAT check
> >>
> >> On Sat, Dec 31, 2016 at 3:58 PM, Joern Kottmann <kottm...@gmail.com>
> >> wrote:
> >>
> >>> +1, looks good
> >>>
> >>> Jörn
> >>>
> >>> On Dec 31, 2016 8:54 PM, "William Colen" <co...@apache.org> wrote:
> >>>
> >>>> Hi all,
> >>>>
> >>>> Apache OpenNLP 1.7.0 RC 2 is ready for testing. The RC 1 failed due to
> >>>> missing files and it failed to run 1.6.0 models. There is no new
> >> features
> >>>> since RC 1.
> >>>>
> >>>> The RC 2 can be downloaded from here:
> >>>> http://people.apache.org/~colen/releases/opennlp-1.7.0/rc2/
> >>>>
> >>>> To use it in a maven build set the version for opennlp-tools or
> >>>> opennlp-uima to 1.7.0 and add the following URL to your settings.xml
> >>> file:
> >>>> https://repository.apache.org/content/repositories/
> >> orgapacheopennlp-1007
> >>>>
> >>>> The current test plan can be found here:
> >>>> https://cwiki.apache.org/confluence/display/OPENNLP/TestPlan1.7.0
> >>>>
> >>>> The release artifacts were signed by KEY - 524A9649.
> >>>>
> >>>> Please sign up for tasks in the test plan.
> >>>>
> >>>> The release plan can be found here:
> >>>> https://cwiki.apache.org/confluence/display/OPENNLP/
> >>>> ReleasePlanAndTasks1.7.0
> >>>>
> >>>> The release contains quite some changes, please refer to the contained
> >>>> issue list for details.
> >>>>
> >>>> For your convenience, a copy of the issue list, as well as the release
> >>>> notes and the readme, can be found in the following link:
> >>>>
> >>>> http://people.apache.org/~colen/releases/opennlp-1.7.0/
> >>>> rc2/RELEASE_NOTES.html
> >>>>
> >>>>
> >>>> Thank you,
> >>>> William
>
>


Re: OpenNLP 1.7.0 RC 2 is ready for testing

2016-12-31 Thread Suneel Marthi
+1 non-binding

1. Verified Sigs and Hashes
2. Ran clean build from Source and all tests pass
3. Verified RAT check

On Sat, Dec 31, 2016 at 3:58 PM, Joern Kottmann  wrote:

> +1, looks good
>
> Jörn
>
> On Dec 31, 2016 8:54 PM, "William Colen"  wrote:
>
> > Hi all,
> >
> > Apache OpenNLP 1.7.0 RC 2 is ready for testing. The RC 1 failed due to
> > missing files and it failed to run 1.6.0 models. There is no new features
> > since RC 1.
> >
> > The RC 2 can be downloaded from here:
> > http://people.apache.org/~colen/releases/opennlp-1.7.0/rc2/
> >
> > To use it in a maven build set the version for opennlp-tools or
> > opennlp-uima to 1.7.0 and add the following URL to your settings.xml
> file:
> > https://repository.apache.org/content/repositories/orgapacheopennlp-1007
> >
> > The current test plan can be found here:
> > https://cwiki.apache.org/confluence/display/OPENNLP/TestPlan1.7.0
> >
> > The release artifacts were signed by KEY - 524A9649.
> >
> > Please sign up for tasks in the test plan.
> >
> > The release plan can be found here:
> > https://cwiki.apache.org/confluence/display/OPENNLP/
> > ReleasePlanAndTasks1.7.0
> >
> > The release contains quite some changes, please refer to the contained
> > issue list for details.
> >
> > For your convenience, a copy of the issue list, as well as the release
> > notes and the readme, can be found in the following link:
> >
> > http://people.apache.org/~colen/releases/opennlp-1.7.0/
> > rc2/RELEASE_NOTES.html
> >
> >
> > Thank you,
> > William
> >
>


Re: Pull request

2016-12-21 Thread Suneel Marthi
Not seeing it, these are the present PRs out there -
https://github.com/apache/opennlp/pulls



On Wed, Dec 21, 2016 at 3:31 PM, Russ, Daniel (NIH/CIT) [E] <
dr...@mail.nih.gov> wrote:

> Ok I created a repository on github and attempted a pull request.  Did
> anyone get it?
> The repository is :
> https://github.com/danielruss/openNLP
>
> Thanks
> Daniel
>
>


Re: Update to Java 8

2016-12-19 Thread Suneel Marthi
+1 to move to Java 8


  From: Joern Kottmann 
 To: "dev@opennlp.apache.org"  
 Sent: Monday, December 19, 2016 8:45 AM
 Subject: Update to Java 8
   
Hello all,

Java 7 is already EOL.

Should we update OpenNLP to Java 8 for the 1.7.0 release, any opinions?

Jörn

   

Re: DeepLearning4J as a ML for OpenNLP

2016-06-28 Thread Suneel Marthi
Are u looking at using ND4J (from Deeplearning4j project) as the Math backend 
for ML work? If so, yes.


  From: William Colen 
 To: "dev@opennlp.apache.org"  
 Sent: Tuesday, June 28, 2016 5:23 PM
 Subject: DeepLearning4J as a ML for OpenNLP
   
Hi,

Do you think it would be possible to implement a ML based on DL4J?

http://deeplearning4j.org/

Thank you
William