Branch: refs/heads/master
Home: https://github.com/languagetool-org/languagetool
Commit: 1a7419d1b9c1a914ab91910edc704b7273bc2bcd
https://github.com/languagetool-org/languagetool/commit/1a7419d1b9c1a914ab91910edc704b7273bc2bcd
Author: Daniel Naber <na...@danielnaber.de>
Date: 2015-04-28 (Tue, 28 Apr 2015)
Changed paths:
M languagetool-dev/pom.xml
A
languagetool-dev/src/main/java/org/languagetool/dev/errorcorpus/MachineLearning.java
A
languagetool-dev/src/main/java/org/languagetool/dev/errorcorpus/WikipediaTrainingDataGenerator.java
A
languagetool-dev/src/test/java/org/languagetool/dev/errorcorpus/MachineLearningTest.java
M
languagetool-wikipedia/src/main/java/org/languagetool/dev/dumpcheck/WikipediaSentenceSource.java
Log Message:
-----------
machine learning tools, copied from branch confusion-rule
Commit: 9346c05807c1b1f349918a0db40784342574647b
https://github.com/languagetool-org/languagetool/commit/9346c05807c1b1f349918a0db40784342574647b
Author: Daniel Naber <na...@danielnaber.de>
Date: 2015-04-28 (Tue, 28 Apr 2015)
Changed paths:
M
languagetool-dev/src/main/java/org/languagetool/dev/errorcorpus/WikipediaTrainingDataGenerator.java
A
languagetool-wikipedia/src/main/java/org/languagetool/dev/dumpcheck/PlainTextSentenceSource.java
M
languagetool-wikipedia/src/main/java/org/languagetool/dev/dumpcheck/SentenceSource.java
M
languagetool-wikipedia/src/main/java/org/languagetool/dev/dumpcheck/TatoebaSentenceSource.java
Log Message:
-----------
training class can now work on pre-extracted sentences (much faster than
parsing the Wikipedia XML on each run)
Commit: f32f3829a63c5df9f8b4d607a596de24ba5b6b37
https://github.com/languagetool-org/languagetool/commit/f32f3829a63c5df9f8b4d607a596de24ba5b6b37
Author: Daniel Naber <na...@danielnaber.de>
Date: 2015-04-28 (Tue, 28 Apr 2015)
Changed paths:
A
languagetool-dev/src/main/java/org/languagetool/dev/errorcorpus/TrainingDataGenerator.java
R
languagetool-dev/src/main/java/org/languagetool/dev/errorcorpus/WikipediaTrainingDataGenerator.java
Log Message:
-----------
fix name - it's not Wikipedia-specific anymore
Commit: a0d7e694650fd02bd0ee785b84da03f3e777b9c1
https://github.com/languagetool-org/languagetool/commit/a0d7e694650fd02bd0ee785b84da03f3e777b9c1
Author: Daniel Naber <na...@danielnaber.de>
Date: 2015-04-28 (Tue, 28 Apr 2015)
Changed paths:
M
languagetool-dev/src/main/java/org/languagetool/dev/errorcorpus/TrainingDataGenerator.java
Log Message:
-----------
improve output formatting
Commit: 5b1ed29fbfafe578f3dda4bb80a007b9112094b2
https://github.com/languagetool-org/languagetool/commit/5b1ed29fbfafe578f3dda4bb80a007b9112094b2
Author: Daniel Naber <na...@danielnaber.de>
Date: 2015-04-28 (Tue, 28 Apr 2015)
Changed paths:
M
languagetool-language-modules/en/src/main/java/org/languagetool/tokenizers/en/EnglishWordTokenizer.java
Log Message:
-----------
[en] override getTokenizingCharacters() instead of using our own member
Commit: 9262e5f8c0f32b70dbb25615bfcf2562a993962a
https://github.com/languagetool-org/languagetool/commit/9262e5f8c0f32b70dbb25615bfcf2562a993962a
Author: Daniel Naber <na...@danielnaber.de>
Date: 2015-04-28 (Tue, 28 Apr 2015)
Changed paths:
M
languagetool-dev/src/main/java/org/languagetool/dev/errorcorpus/TrainingDataGenerator.java
Log Message:
-----------
output and tokenizing fixes
Commit: 55c4ce6be6519aef968c701b78bc85505bd30754
https://github.com/languagetool-org/languagetool/commit/55c4ce6be6519aef968c701b78bc85505bd30754
Author: Daniel Naber <na...@danielnaber.de>
Date: 2015-04-28 (Tue, 28 Apr 2015)
Changed paths:
M
languagetool-dev/src/main/java/org/languagetool/dev/errorcorpus/TrainingDataGenerator.java
A
languagetool-dev/src/test/java/org/languagetool/dev/errorcorpus/TrainingDataGeneratorTest.java
Log Message:
-----------
fix ngram logic
Commit: 04db83a703ea81bbc3c5b3d0d23e72c16f19d636
https://github.com/languagetool-org/languagetool/commit/04db83a703ea81bbc3c5b3d0d23e72c16f19d636
Author: Daniel Naber <na...@danielnaber.de>
Date: 2015-04-28 (Tue, 28 Apr 2015)
Changed paths:
M
languagetool-dev/src/main/java/org/languagetool/dev/errorcorpus/TrainingDataGenerator.java
Log Message:
-----------
small cleanup / introduce training constant
Compare:
https://github.com/languagetool-org/languagetool/compare/e0919c7916d4...04db83a703ea
------------------------------------------------------------------------------
One dashboard for servers and applications across Physical-Virtual-Cloud
Widest out-of-the-box monitoring support with 50+ applications
Performance metrics, stats and reports that give you Actionable Insights
Deep dive visibility with transaction tracing using APM Insight.
http://ad.doubleclick.net/ddm/clk/290420510;117567292;y
_______________________________________________
Languagetool-commits mailing list
Languagetool-commits@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-commits