Re: [VOTE] Release OpenNLP Models trained on UD

2021-03-12 Thread Rodrigo Agerri
+1 Cheers, R On Fri, 12 Mar 2021 at 14:39, Jeff Zemerick wrote: > > All, > > I am calling a vote to release the Apache OpenNLP models trained on the > Universal Dependencies corpus. The models were described in a previous > thread you can see at >

Re: Crucial parts about parser training in Apache OpenNLP Manual are missing

2020-06-08 Thread Rodrigo Agerri
Hello, You can find examples of header files in the repo: https://github.com/apache/opennlp/tree/master/opennlp-tools/lang/en/parser https://github.com/apache/opennlp/tree/master/opennlp-tools/lang/es/parser The format of the header rules comes from Collins thesis, Appendix A:

Re: [VOTE] Apache OpenNLP 1.9.0 Release Candidate 2

2018-07-02 Thread Rodrigo Agerri
+1 Rodrigo On Sun, Jul 1, 2018 at 12:42 AM, Koji Sekiguchi wrote: > I tested mvn install and some Eval tests (OntoNotes4NameFinderEval, > Conll02NameFinderEval, OntoNotes4PosTaggerEval) which use > FeatureGeneratorUtil. > > +1 > > Koji > > > > On 2018/06/29 20:45, Jeff Zemerick wrote: >> >> Hi

Re: [VOTE] Apache OpenNLP 1.9.0 Release Candidate

2018-06-25 Thread Rodrigo Agerri
+1 binding Cheers, Rodrigo On Mon, Jun 25, 2018 at 12:25 AM, Bruno P. Kinoshita wrote: > +1 > > Building from tag passing OK, alas not enough time to look at all changes and > check signatures. > > > Apache Maven 3.5.2 (138edd61fd100ec658bfa2d307c43b76940a5d7d; > 2017-10-18T20:58:13+13:00) >

Re: Word sense disambiguation

2018-02-27 Thread Rodrigo Agerri
Hello, Babelfy is not open source software. DBpedia Spotlight performs Named Entity Disambiguation (APL 2.0), UKB (GPL) does WSD and obtains very good results, and the IMS system is available for download. There will be others, I am sure, but just talking off the top of my head. HTH R On Tue,

Re: [VOTE] Apache OpenNLP 1.8.4 Release Candidate

2017-12-22 Thread Rodrigo Agerri
+1 binding R On Fri, Dec 22, 2017 at 2:02 AM, Koji Sekiguchi wrote: > +1 > > I checked files included in the -src package, built successfully, etc. > > Koji > > > On 2017/12/21 23:44, Jeff Zemerick wrote: >> >> Hi Folks, >> >> I have posted a first release candidate

Re: [VOTE] Language Detector model for Apache OpenNLP 1.8.3 Release Candidate 3

2017-10-31 Thread Rodrigo Agerri
+1 Rodrigo On Tue, Oct 31, 2017 at 2:37 AM, Koji Sekiguchi wrote: > +1 > > - checked text files in the zipped model file > - verified signatures > - executed LanguageDetector using the model file > > Koji > > > On 2017/10/30 22:30, William Colen wrote: >> >> The

Re: [VOTE] Apache OpenNLP 1.8.3 Release Candidate

2017-10-26 Thread Rodrigo Agerri
+1 (binding) -eval and unit tests OK On Wed, Oct 25, 2017 at 7:01 PM, William Colen wrote: > +1 binding > > - eval tests ok > - unit test ok > - build from tag ok > - distribution execution ok > - distribution ok > > > > 2017-10-25 14:46 GMT-02:00 Tommaso Teofili

Re: [VOTE] Apache OpenNLP 1.8.1 Release Candidate 3

2017-07-07 Thread Rodrigo Agerri
+1 builds on a clean machine, all tests passed. thanks Rodrigo On Fri, Jul 7, 2017 at 3:56 PM, Tommaso Teofili wrote: > +1 sigs and build ok, langdetect ok. > > Regards, > Tommaso > > Il giorno ven 7 lug 2017 alle ore 15:55 William Colen ha >

Re: Document Categorizer based on Glove + LSTM (powered by DL4J)

2017-07-05 Thread Rodrigo Agerri
Hello again, @Thamme, out of curiosity, do you have evaluation numbers on the Stanford Large Movie Review dataset? Best, Rodrigo On Wed, Jul 5, 2017 at 9:25 AM, Rodrigo Agerri <rage...@apache.org> wrote: > +1 to Tommaso's comment. This would be very nice to have in the proje

Re: [GitHub] opennlp pull request #238: Revert merging of sentiment work, no consent to m...

2017-06-29 Thread Rodrigo Agerri
Hi Chris, On Thu, Jun 29, 2017 at 7:10 PM, Chris Mattmann wrote: > Hi Rodrigo, > > This is very useful feedback that I wish we would have had a long time ago. > > I will look into it and see if I can reproduce the CLI error. I did a full > build and mvn > install (which I

Re: [GitHub] opennlp pull request #238: Revert merging of sentiment work, no consent to m...

2017-06-29 Thread Rodrigo Agerri
Hi Chris, I have been interested in the new sentiment component for a while, although truth to be told, I did not follow that closely. I have today looked at it and test it with some of the corpora you have mentioned. In order to do that, I checkout master to work with from this commit onwards

Re: [VOTE] Migrate our main repositories to GitHub

2017-06-27 Thread Rodrigo Agerri
+1 R On Tue, Jun 27, 2017 at 12:46 PM, Mark G wrote: > +1 > > Sent from my iPhone > > > On Jun 27, 2017, at 6:30 AM, Joern Kottmann wrote: > > > > +1 > > > > Jörn > > > >> On Tue, Jun 27, 2017 at 12:30 PM, Joern Kottmann >

Re: [VOTE] Apache OpenNLP 1.8.0 Release Candidate 3

2017-05-18 Thread Rodrigo Agerri
+1 binding R On Wed, May 17, 2017 at 11:50 PM, Suneel Marthi wrote: > +1 binding > > > > On Wed, May 17, 2017 at 5:48 PM, Joern Kottmann > wrote: > > > The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP > > 1.8.0 Release Candidate 3.

Re: [VOTE] Apache OpenNLP 1.8.0 Release Candidate 2

2017-05-15 Thread Rodrigo Agerri
Hello Richard, I have tried with various corpora, including GUM, but I cannot reproduce that error. https://github.com/apache/opennlp/commit/8a3b3b537a30b14c4ffb5eb32ffa41 d5027bddad Please note that commit O-904 changed (broke) the lemmatizer API substantially to make it uniform between

Re: Update web site layout

2017-03-03 Thread Rodrigo Agerri
I think the new design looks great, and I would definitely like to find a way to incorporate the new design.. Thanks Bruno, Rodrigo On Fri, Mar 3, 2017 at 10:43 PM, Suneel Marthi wrote: > There are few that r doing that presently, check out the Flink project as > also

Re: Tokenizer for NER training

2017-03-02 Thread Rodrigo Agerri
f the generator: > https://github.com/apache/opennlp/blob/master/opennlp- > tools/src/main/java/opennlp/tools/util/featuregen/ > AdditionalContextFeatureGenerator.java#L38 > ) > > In this way i do not care if a date has or not a whitespace (or other > separators) i simply use BIO encoding.

Re: Tokenizer for NER training

2017-03-02 Thread Rodrigo Agerri
Hi Damiano, Maybe I am not understanding your question, but if you just give the NameFinder tokenized annotated data that should be fine: word O 2017 B-DATE 03 I-DATE 02 I-DATE word O Then at testing time, if you tokenize the dates like that, the NameFinder should still try to find the

Re: Name Finder trainer default settings

2017-02-07 Thread Rodrigo Agerri
Hi Daniel, Previous publications suggest features are more important than learning methods. Before last year, the trend seemed to go towards CRFs, nowadays, it goes towards deep learning (LSTM, CNN, RNN, etc. and so on). However, if we do a very quick review of English results for CoNLL 2003

Re: Name Finder trainer default settings

2017-02-06 Thread Rodrigo Agerri
+1 R On Mon, Feb 6, 2017 at 7:17 PM, William Colen wrote: > +1 > The times I tried, it was always better with Perceptron cutoff 0. > > 2017-02-06 15:40 GMT-02:00 Joern Kottmann : > > > Hello all, > > > > I would like to propose to switch the default

Re: [VOTE] Apache OpenNLP 1.7.2 Release Candidate

2017-02-03 Thread Rodrigo Agerri
+1 also pass tests On Fri, Feb 3, 2017 at 3:34 PM, Jeffrey Zemerick wrote: > +1 (non-binding) Build and tests pass with no issues. > > > > On Fri, Feb 3, 2017 at 4:15 AM, Joern Kottmann wrote: > > > +1 > > > > I did run the eval tests and they all run

Re: word2vec

2017-01-25 Thread Rodrigo Agerri
It might, I forgot that :) R On Wed, Jan 25, 2017 at 9:43 PM, Damiano Porta <damianopo...@gmail.com> wrote: > Could It help ? https://github.com/ragerri/cluster-preprocessing :) > > Il 25/Gen/2017 21:30, "Rodrigo Agerri" <rage...@apache.org> ha scritto: >

Re: word2vec

2017-01-25 Thread Rodrigo Agerri
Hello, Yes, you need to induce clusters using word2vec (see word2vec documentation for that) or any other clustering algorithm and then pass it as explained in the manual: http://opennlp.apache.org/documentation/1.7.1/manual/opennlp.html#tools.namefind.training HTH, R On Wed, Jan 25, 2017 at

Re: [VOTE] Apache OpenNLP 1.7.1 Release Candidate 1

2017-01-23 Thread Rodrigo Agerri
+1 to release nice R On Mon, Jan 23, 2017 at 9:33 AM, Joern Kottmann wrote: > +1 binding > > Jörn > > On Jan 21, 2017 12:18 AM, "Suneel Marthi" wrote: > > The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP > 1.7.1 Release Candidate.

Re: Sample data counters / stats

2017-01-16 Thread Rodrigo Agerri
+1 R On Mon, Jan 16, 2017 at 4:37 PM, William Colen wrote: > +1 > > It would be nice, specially because it allows understanding the data we are > using for training. > > > > 2017-01-14 10:18 GMT-02:00 Joern Kottmann : > > > Hello all, > > > > I

Re: Commit message style

2017-01-10 Thread Rodrigo Agerri
+1 for the OPENNLP-xxx: commit message. On Tue, Jan 10, 2017 at 12:51 AM, William Colen wrote: > +1 for the OPENNLP-xxx: commit message. > Fast to find a commit. > > > 2017-01-09 21:24 GMT-02:00 Joern Kottmann : > > > On Mon, 2017-01-09 at 17:02

Re: Release Process on wiki

2017-01-09 Thread Rodrigo Agerri
great! R On Thu, Jan 5, 2017 at 8:12 PM, Chris Mattmann wrote: > Hi, > > Thanks to the team, we now have a release process on the wiki. > > https://cwiki.apache.org/confluence/display/OPENNLP/Release+Process > > @Community > > Please check it out and you can follow on for

Re: OpenNLP 1.7.0 RC 2 is ready for testing

2017-01-02 Thread Rodrigo Agerri
Sorry, These days I am away... but after looking at it +1 on the release Cheers, Rodrigo On Sun, Jan 1, 2017 at 7:18 PM, Joern Kottmann wrote: > Sorry, this was closed too early and should have been open longer, > you are right. We will do better for the releases to

Re: Lemmatizer BUG

2016-12-05 Thread Rodrigo Agerri
nd the > use of the permuations? Why can we not have the lemma directly? > > Thanks for the clarification > Damiano > > > 2016-12-05 12:12 GMT+01:00 Rodrigo Agerri <rage...@apache.org>: > > > Hello, > > > > The String[] lemmatize(String[] toks, String[]

Re: Lemmatizer BUG

2016-12-05 Thread Rodrigo Agerri
Hello, The String[] lemmatize(String[] toks, String[] tags) method will give you predicted "lemma class" which consists of the number of permutations required to go from the word form to the lemma. If the output is O that means that no permutation is required, namely, the lemma and the word form

Re: Next release

2016-11-23 Thread Rodrigo Agerri
which >> > could >> > > be removed and is really old. >> > > It is a bit of a boring task, if anyone has some spare cycles help >> would >> > be >> > > welcome. >> > > >> > > Jörn >> > > >> > > On Tue, Nov 8, 20

Re: Moving brat annotator to opennlp.git

2016-10-19 Thread Rodrigo Agerri
+1 On Tue, Oct 18, 2016 at 6:46 PM, Joern Kottmann wrote: > Hello all, > > what do you think about including the brat ner annotator in the 1.6.1 > release? > > I believe it is important that we include it to allow our users to easier > run custom annotation projects, as part

Re: OpenNLP Github Mirror out of sync

2016-10-04 Thread Rodrigo Agerri
Hello, Actually the commits were pushed to https://git-wip-us.apache.org/repos/asf/opennlp.git but they do not appear in the read-only git mirror git://git.apache.org/opennlp.git or in the svn or github mirrored repo. Best regards, Rodrigo On Mon, Oct 3, 2016 at 9:30 AM, Daniel Gruno

Re: Morfologik Addon

2016-07-15 Thread Rodrigo Agerri
Great stuff, William. I have been using Morfologik stemming for a long time and when we included it we put it as an addon. I assume that the reason was its license, but reading Morfologik license it is not clear to me why is is not Apache compatible. If it is, it would be nice to include it

Re: Migrate to Git?

2016-07-04 Thread Rodrigo Agerri
+1 r On Mon, Jul 4, 2016 at 7:20 PM, Joern Kottmann wrote: > Thanks for your advice, if there are no concerns I will follow Chris > suggestion. > > The first step is to get us setup on git-wp. I will fill an issue with > infra to do this for us. > > Jörn > > On Mon,

Re: Question about deprecated NameFinderME constructors

2016-03-07 Thread Rodrigo Agerri
Hi, You can do all those tasks by using the create method in the TokenNameFinderFactory: http://svn.apache.org/viewvc/opennlp/trunk/opennlp-tools/src/main/java/opennlp/tools/namefind/TokenNameFinderFactory.java?revision=1712553=markup#l100 For that you need to: 1. Provide the name of the

Re: mallet addon

2015-10-20 Thread Rodrigo Agerri
; > I will push the changes and then you can experiment with it too. > > Jörn > > > On Mon, Oct 5, 2015 at 4:45 PM, Rodrigo Agerri <rage...@apache.org> wrote: > >> Hi, >> >> On Tue, Sep 29, 2015 at 3:41 PM, Joern Kottmann <kottm...@gmail.com> >&g

Re: mallet addon

2015-10-05 Thread Rodrigo Agerri
Hi, On Tue, Sep 29, 2015 at 3:41 PM, Joern Kottmann wrote: > We can also move > it to the sandbox, releasing it at Apache might be more difficult since > mallet pulls in incompatible licensed dependencies. But maybe that changed, > we can check. Mallet is released under

Re: mallet addon

2015-09-30 Thread Rodrigo Agerri
t; mallet pulls in incompatible licensed dependencies. But maybe that changed, > we can check. > > What do you think? > > Jörn > > > > On Tue, Sep 29, 2015 at 2:34 PM, Rodrigo Agerri <rage...@apache.org> > wrote: > > > Hello, > > > > I hav

mallet addon

2015-09-29 Thread Rodrigo Agerri
Hello, I have seen that there is a mallet addon here https://github.com/kottmann/opennlp-mallet-addon is this currently being used or integrated in opennlp? I have not seen with the rest of the addons. Cheers, Rodrigo

Re: WSD-Supervised

2015-08-11 Thread Rodrigo Agerri
sincerely, Mondher On Monday, August 10, 2015, Rodrigo Agerri agerri.rodr...@gmail.com wrote: Hello Mondher, How is the all words IMS disambiguation progressing? We really need to focus on this to have a good candidate for integration in opennlp tools. The evaluator, the CLI and the all words

Re: WSD-Supervised

2015-08-10 Thread Rodrigo Agerri
Hello Mondher, How is the all words IMS disambiguation progressing? We really need to focus on this to have a good candidate for integration in opennlp tools. The evaluator, the CLI and the all words supervised disambiguation should be the focus. Cheers, Rodrigo On Sat, Jul 18, 2015 at 5:40

Re: WSD - Supervised techniques

2015-07-13 Thread Rodrigo Agerri
Hello, It has been few public activity these last days. We believe that it is very important to step up in two directions wrt what is already commited in svn: 1. Finishing the WSDEvaluator 2. Provide the classes required to run the WSD tools from the CLI as any other component. 3. Formats: it

Re: GSoC 2015 - WSD Module

2015-06-19 Thread Rodrigo Agerri
Thanks for the update and the updated patch. With respect to the licensing of BabelNet, I do not think we can redistribute CC BY-NC-SA resources here, but others in this project and Apache in general will probably know better than me. Best, Rodrigo On Sun, Jun 14, 2015 at 2:47 PM, Anthony

Re: GSoC 2015 - WSD Module

2015-06-08 Thread Rodrigo Agerri
Hello, +1 for using extJWNL instead of JWNL, I use it in some other projects too and it is very nice IMHO. R On Sat, Jun 6, 2015 at 12:55 PM, Aliaksandr Autayeu aliaksa...@autayeu.com wrote: Thinking of impartiality... Anyway, I'm the author of extJWNL in case you have questions. Aliaksandr

Re: OpenNLP 1.6.0 RC 4 ready for testing

2015-06-01 Thread Rodrigo Agerri
Hello, I tested the pos tagger, namefinder and constituent parser in my projects and no problems reported. I am not aware of any issues left for next RC. Cheers, R On Fri, May 29, 2015 at 5:49 PM, Joern Kottmann kottm...@gmail.com wrote: I accidentally didn't include the dev list in my

Re: GSoC 2015 - WSD Module

2015-05-22 Thread Rodrigo Agerri
Hello Mondher (my response is about supervised WSD), Thanks for the info, it is quite interesting. Apart from the comment by Jörn, which I think is very important if we want to achieve something given the time constrains of the GSOC, I have a couple of recommendations/comments from my part: 1.

Re: W2VClassesDictionary class

2015-05-22 Thread Rodrigo Agerri
Hello, You are right I kept it there while I was doing the test with the WordClusterFeatureGenerator. I will remove it. Best, R On Fri, May 22, 2015 at 1:51 PM, Joern Kottmann kottm...@gmail.com wrote: Hello, looks like this class was renamed into WordClusterDictionary. Can the class

Re: Is it ok to post a job offer here?

2015-04-28 Thread Rodrigo Agerri
In my opinion it is quite good :) R On Mon, Apr 27, 2015 at 9:31 PM, Joern Kottmann kottm...@gmail.com wrote: Should be fine. Any objections? Jörn On Thu, 2015-04-23 at 17:26 +0200, Thilo Goetz wrote: Is it acceptable to post a job offer (NLP related) to this list? Thanks. --Thilo

Re: the sentence splitter at openNLP

2015-03-12 Thread Rodrigo Agerri
Hi, You can check the Ratnapharki's thesis. http://user.phil-fak.uni-duesseldorf.de/~rumpf/WS2005/Klassifikation/Rat98.pdf HTH, R On Thu, Mar 12, 2015 at 8:22 AM, phdapple applec...@qq.com wrote: Hi, I am a student from Shanghai, China. I am interesting in the sentence splitter ‍at

Re: [GSoC2015] OPENNLP-758

2015-03-05 Thread Rodrigo Agerri
Hello Jörn, I am sorry, I seem to have missed all the (great) news about the GSOC. If new ideas are required for other students, I have wanted to add a probabilistic lemmatizer in OpenNLP for some time. As you know, the current lemmatizer is only dictionary based. There is an issue about adding

Re: Word Sense Disambiguation

2015-01-22 Thread Rodrigo Agerri
+1 On Mon, Jan 19, 2015 at 10:30 PM, Mark G ma...@apache.org wrote: +1 On Mon, Jan 19, 2015 at 1:49 PM, Tommaso Teofili tommaso.teof...@gmail.com wrote: +1 Tommaso 2015-01-19 19:10 GMT+01:00 Joern Kottmann kottm...@gmail.com: Hello, +1 from me to just go ahead and implement the

Re: Text Summarization module?

2015-01-22 Thread Rodrigo Agerri
+1 On Thu, Jan 22, 2015 at 9:06 AM, Tommaso Teofili tommaso.teof...@gmail.com wrote: Hi Ram, since your proposal got positive feedback, maybe you could create an issue in Jira and attach the code / patch for discussion / review. Regards, Tommaso

Re: Need to speed up the model creation process of OpenNLP

2014-11-26 Thread Rodrigo Agerri
, String type, ObjectStreamNameSample samples,AdaptiveFeatureGenerator generator, MapString,Object resources, int iterations, int cutoff) Am I missing something, Could you please tell me how can I do so? Thanks Nikhil From: Rodrigo Agerri rage...@apache.org

Re: Next release

2014-11-24 Thread Rodrigo Agerri
+1 for William as RM R On Mon, Nov 24, 2014 at 3:18 PM, Jörn Kottmann kottm...@gmail.com wrote: On 11/21/2014 01:26 PM, William Colen wrote: +1 to start the release process I candidate myself as release manager for the 1.6.0. +1 for William as RM Jörn

Re: Build changed opennlp/pom.xml moved to root directory

2014-11-21 Thread Rodrigo Agerri
I agree. Very good. R On 20 Nov 2014 21:23, Tommaso Teofili tommaso.teof...@gmail.com wrote: IMHO it was about time, thanks Jörn :-) Regards, Tommaso 2014-11-20 21:11 GMT+01:00 Joern Kottmann kottm...@gmail.com: Hello everybody, we changed the structure of the project slightly. The

Re: 1.6.0 maven repo

2014-11-19 Thread Rodrigo Agerri
at 21:20 +0100, Rodrigo Agerri wrote: Hi Any chance to release snapshot repos to maven central? Or to an apache snapshots repo? It would make the use of current trunk via API much easier. Cheers Rodrigo

Re: 1.6.0 maven repo

2014-11-19 Thread Rodrigo Agerri
of 1.6.0, I think most issues are already solved and remaining bugs we will uncover during the manual testing phase. Jörn On Wed, 2014-11-19 at 21:20 +0100, Rodrigo Agerri wrote: Hi Any chance to release snapshot repos to maven central? Or to an apache snapshots repo? It would make

NameFinder Perceptron Sequence Trainer

2014-11-12 Thread Rodrigo Agerri
Hi, Has anyone managed to train NER models with the PerceptronSequenceTrainer? Whenever I try bin/opennlp TokenNameFinderTrainer -featuregen lang/en/namefinder/en-namefinder.xml -params lang/ml/PerceptronSequenceTrainerParams.txt -lang en -data

Re: What should we do with the SF models?

2014-10-29 Thread Rodrigo Agerri
In my opinion the models should be documented. In some cases it is said the training corpus used but in others it's not. We should also said which features were used and the results obtained on which dataset. If default features are used we should also said so. If we cannot provide such info we

Re: Build failed in Jenkins: OpenNLP #476

2014-10-27 Thread Rodrigo Agerri
Hi, This is not caused by my latest commit, is it not? R On Mon, Oct 27, 2014 at 6:40 PM, Apache Jenkins Server jenk...@builds.apache.org wrote: See https://builds.apache.org/job/OpenNLP/476/changes Changes: [ragerri] OPENNLP-725 now the serializer is chosen from dict attribute and

Re: Parsing with PCFGs

2014-10-16 Thread Rodrigo Agerri
/documentation/1.5.3/manual/opennlp.html#tools.parser.parsing 2014-10-15 9:38 GMT+02:00 Rodrigo Agerri rage...@apache.org: Hi, The main algorithm (called chunking in the trunk) is based on Ratnapharki's work. It is best to directly read the paper. http://link.springer.com/article/10.1023

Re: [opennlp-dev] TokenNameFinderFactory new features and extension

2014-10-08 Thread Rodrigo Agerri
Hello, On Wed, Oct 8, 2014 at 8:17 AM, Jörn Kottmann kottm...@gmail.com wrote: +1 for the first option. Great, I have commit and close the issue. Thanks! Rodrigo

Re: How to sanitize and parse noisy text

2014-07-15 Thread Rodrigo Agerri
Hi Carlos, In my opinion, you would need to properly segment that sentence. It is virtually impossible the parser will get anything right if you pass it such sentences. Perhaps you can use the newlines in your cleaned text to create shorter more grammatical sentences. Also, if I had to deal with

Re: interest in new parser?

2014-06-23 Thread Rodrigo Agerri
Hello, Ratnapharki's (1999) is a shift-reduced parser. Others like Stanford NLP are now releasing shift-reduced parsers. There are differences between them, though. For example, Zhang and Clark (2009)'s parser (cited by Stanford's new parser) is similar except that they use a global

Re: MarkableFileInputStreamFactory

2014-05-20 Thread Rodrigo Agerri
+1 to move it to the util package. Rodrigo On Tue, May 20, 2014 at 10:11 AM, Jörn Kottmann kottm...@gmail.com wrote: On 05/19/2014 11:24 PM, Mark G wrote: OK, thanks Carlos, I think I will commit the change, seems like it wouldn't hurt. Anybody else? We can't do it like that, the cmdline

Re: OPENNLP-683

2014-05-15 Thread Rodrigo Agerri
+1 rule-based complements well dictionary lookup lemmatizer. Rodrigo On 2014/05/06 at 20:50, Joern Kottmann wrote: Hello, we got a question in https://issues.apache.org/jira/browse/OPENNLP-683 if it would be interesting to implement a rule based lemmatizer as explained in the issue.

Re: TokenNameFinder and Span probs

2014-05-14 Thread Rodrigo Agerri
+1 to the second solution too, and to use this solution everywhere where a Span object is returned. Rodrigo On 2014/05/07 at 09:22, Joern Kottmann wrote: Hello Mark, +1 for your second solution. I believe that is much more intuitive than calling a method afterwards to retrieve the prob for

Re: svn buid fails

2014-04-09 Thread Rodrigo Agerri
is being pulled in by mvn in the local repo. Thanks, James On 4/7/2014 10:28 AM, Rodrigo Agerri wrote: Hi all, After co the current svn trunk repo, I try to build the project and it fails at the opennlp dir. As the first errors are related with the tests, I first tried to compile

Re: svn buid fails

2014-04-09 Thread Rodrigo Agerri
On 4/7/2014 10:28 AM, Rodrigo Agerri wrote: Hi all, After co the current svn trunk repo, I try to build the project and it fails at the opennlp dir. As the first errors are related with the tests, I first tried to compile ignoring tests: mvn clean install -Dmaven.test.skip=true

Re: Next release

2013-02-18 Thread Rodrigo Agerri
Hello, If it is of any help, I have been using the coreference module with jwnl 1.4_rc3 and it works. Cheers, Rodrigo On Mon, Feb 18, 2013 at 1:56 PM, William Colen william.co...@gmail.com wrote: With jwnl 1.4_rc3 the code at least compiles. Also, it would be nice if someone familiar with