Just wanted to pipe up on this topic. I completed the Java port as a GSoC project back in 2010, and yes the design of the modules is to just pass around large strings/buffers. This makes it easier to use as a library, yes. Though a bit of a heads-up, I believe there may still be some places where the code uses System.exit() instead of just throwing an error, which can be an issue when using it as a library.
-- Stephen (p.s. Yes, I'm still around, and still need to get my Apertium dev environment re-built, thanks for the reminder. ^_^) On Wed, Feb 8, 2012 at 11:21 AM, stevens35 <steven...@llnl.gov> wrote: > Hi All, > > Thanks for the quick and really awesome responses. The java version of > Lttoolbox is /*exactly*/ what I was hoping to find (and not have to > write myself)! Looking through the source code for the processing main, > it doesn't look too hard to use the internal classes as a library and > feed it strings. > > On a related note, if i just want to do morphological analysis of > English, which language pair should I start with? Or is there an all > encompassing English morphological dictionary that someone maintains? > If not, how troublesome would it be to merge the existing different > English dictionary files? > > Cheers and Thanks, > --Keith > > Kevin Brubeck Unhammer wrote: >> stevens35 <steven...@llnl.gov> writes: >> >> >>> Hello, >>> >>> I've been searching around for a good morphological analyzer for a while >>> and came across Lttoolbox. The analyzer step does exactly what I want >>> for words in a language, it splits the word into it's lexical base and >>> then adds in morphological tags based on how the word was formed. Up >>> until now, I've just been using the Porter Stemmer to get the root word, >>> but it's always been displeasing because it throws away the rest of the >>> surface form. >>> >>> However, most of the text processing code I work with is in Java, and if >>> possible, I'd like to keep everything within Java. Had anyone had any >>> experience linking to Lttoolbox from Java? Or does anyone know of any >>> java versions of Lttoolbox that utilize the existing dictionaries, or a >>> similar tool for java? >>> >> >> lttoolbox-java works fine with all the existing dictionaries, and should >> be feature-complete with the C++ version. lttoolbox and lttoolbox-java >> are completely independent of each other, so you don't need the C++ >> version to use the Java version and vice versa, so keeping everything >> within Java should work fine. >> >> >> > > > ------------------------------------------------------------------------------ > Keep Your Developer Skills Current with LearnDevNow! > The most comprehensive online learning library for Microsoft developers > is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3, > Metro Style Apps, more. Free future releases when you subscribe now! > http://p.sf.net/sfu/learndevnow-d2d > _______________________________________________ > Apertium-stuff mailing list > Apertium-stuff@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/apertium-stuff ------------------------------------------------------------------------------ Virtualization & Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also focuses on allowing computing to be delivered as a service. http://www.accelacomm.com/jaw/sfnl/114/51521223/ _______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff