svn commit: r940252 - /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/benchmark/VectorBenchmarks.java

2010-05-02 Thread robinanil
Author: robinanil Date: Sun May 2 16:05:16 2010 New Revision: 940252 URL: http://svn.apache.org/viewvc?rev=940252&view=rev Log: Serialize/Deserialize Benchmarks Modified: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/benchmark/VectorBenchmarks.java Modified: lucene/ma

svn commit: r930564 - in /lucene/mahout/site: publish/index.html publish/index.pdf src/documentation/content/xdocs/index.xml

2010-04-03 Thread robinanil
Author: robinanil Date: Sat Apr 3 19:10:41 2010 New Revision: 930564 URL: http://svn.apache.org/viewvc?rev=930564&view=rev Log: Seems we are not very user friendly for new students. So I went ahead and linked the wiki page as a news item. Modified: lucene/mahout/site/publish/index.

svn commit: r923322 - /lucene/mahout/site/src/documentation/content/xdocs/index.xml

2010-03-15 Thread robinanil
Author: robinanil Date: Mon Mar 15 16:14:01 2010 New Revision: 923322 URL: http://svn.apache.org/viewvc?rev=923322&view=rev Log: Setting release announcement date as 16th March. Changes in release notes Modified: lucene/mahout/site/src/documentation/content/xdocs/index.xml Modified: lu

svn commit: r921777 - in /lucene/mahout/trunk/core/src: main/java/org/apache/mahout/fpm/pfpgrowth/ParallelFPGrowthReducer.java main/java/org/apache/mahout/fpm/pfpgrowth/fpgrowth/Pattern.java test/java

2010-03-11 Thread robinanil
Author: robinanil Date: Thu Mar 11 10:22:34 2010 New Revision: 921777 URL: http://svn.apache.org/viewvc?rev=921777&view=rev Log: Bug created in the last commit. the id wasn't incrementing in the ParallelFPGrowthReducer Modified: lucene/mahout/trunk/core/src/main/java/org/apache/m

svn commit: r918863 - /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/clustering/lda/LDAPrintTopics.java

2010-03-03 Thread robinanil
Author: robinanil Date: Thu Mar 4 06:09:56 2010 New Revision: 918863 URL: http://svn.apache.org/viewvc?rev=918863&view=rev Log: MAHOUT-320 LDAPrintTopics missed out from the previous commit Modified: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/clustering

svn commit: r918860 - in /lucene/mahout/trunk/core/src: main/java/org/apache/mahout/clustering/lda/ main/java/org/apache/mahout/common/ test/java/org/apache/mahout/clustering/lda/ test/java/org/apache

2010-03-03 Thread robinanil
Author: robinanil Date: Thu Mar 4 05:40:03 2010 New Revision: 918860 URL: http://svn.apache.org/viewvc?rev=918860&view=rev Log: MAHOUT-320 Improvements in LDA Added: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/common/IntPairWritable.java - copied, changed from r91

svn commit: r918493 - /lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/DenseVector.java

2010-03-03 Thread robinanil
Author: robinanil Date: Wed Mar 3 14:43:30 2010 New Revision: 918493 URL: http://svn.apache.org/viewvc?rev=918493&view=rev Log: Introduced a bug, Missed out resetting length squared on fast densevector assign Modified: lucene/mahout/trunk/math/src/main/java/org/apache/mahout/

svn commit: r918468 - in /lucene/mahout/trunk/math/src: main/java/org/apache/mahout/math/ main/java/org/apache/mahout/math/function/ test/java/org/apache/mahout/math/

2010-03-03 Thread robinanil
Author: robinanil Date: Wed Mar 3 13:36:02 2010 New Revision: 918468 URL: http://svn.apache.org/viewvc?rev=918468&view=rev Log: Some improvements in DenseVector and DenseMatrix and cleanup Modified: lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/DenseMatrix.java lu

svn commit: r917939 - /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/clustering/lda/LDAInference.java

2010-03-02 Thread robinanil
Author: robinanil Date: Tue Mar 2 10:00:36 2010 New Revision: 917939 URL: http://svn.apache.org/viewvc?rev=917939&view=rev Log: Performance fix: Matrix in LDA was getting re-allocated for everydocument, added a reset code to reset/create the matrix values. Modified: lucene/mahout/t

svn commit: r917742 - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/clustering/lda: LDADriver.java LDAInference.java

2010-03-01 Thread robinanil
Author: robinanil Date: Mon Mar 1 21:50:25 2010 New Revision: 917742 URL: http://svn.apache.org/viewvc?rev=917742&view=rev Log: Cleanup for 0.3 release. LDA was using HashMap, changed it to OpenIntIntHashmap for better performance. plus function was taking more time due to excessive c

svn commit: r917589 - /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/clustering/lda/LDAPrintTopics.java

2010-03-01 Thread robinanil
Author: robinanil Date: Mon Mar 1 16:51:37 2010 New Revision: 917589 URL: http://svn.apache.org/viewvc?rev=917589&view=rev Log: Fix for 0.3: LDAPrintTopics will match seqdump, clusterdump behaviour by printing output to std out when output file is not specified Modified: lucene/ma

svn commit: r917577 - in /lucene/mahout/trunk: bin/mahout utils/src/main/java/org/apache/mahout/clustering/lda/LDAPrintTopics.java

2010-03-01 Thread robinanil
Author: robinanil Date: Mon Mar 1 16:37:26 2010 New Revision: 917577 URL: http://svn.apache.org/viewvc?rev=917577&view=rev Log: Adding LDA print topics to the shell script and some cleanup of style Modified: lucene/mahout/trunk/bin/mahout lucene/mahout/trunk/utils/src/main/java

svn commit: r916108 - in /lucene/mahout/trunk/math/src: main/java/org/apache/mahout/math/AbstractVector.java test/java/org/apache/mahout/math/TestDenseVector.java test/java/org/apache/mahout/math/Test

2010-02-24 Thread robinanil
Author: robinanil Date: Thu Feb 25 03:41:07 2010 New Revision: 916108 URL: http://svn.apache.org/viewvc?rev=916108&view=rev Log: Bug is minus was causing inverse result. Added tests Modified: lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/AbstractVector.java lu

svn commit: r915068 - /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/common/distance/CosineDistanceMeasure.java

2010-02-22 Thread robinanil
Author: robinanil Date: Mon Feb 22 21:14:06 2010 New Revision: 915068 URL: http://svn.apache.org/viewvc?rev=915068&view=rev Log: Cosine Distance measure should use smaller vector to iterate Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/common/dist

svn commit: r915032 - /lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/RandomAccessSparseVector.java

2010-02-22 Thread robinanil
Author: robinanil Date: Mon Feb 22 19:38:39 2010 New Revision: 915032 URL: http://svn.apache.org/viewvc?rev=915032&view=rev Log: MAHOUT-300 RandomAccessSparseVector will start at INITIAL_SIZE 11 or cardinality whichever is min Modified: lucene/mahout/trunk/math/src/main/java/org/ap

svn commit: r915026 - /lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/RandomAccessSparseVector.java

2010-02-22 Thread robinanil
Author: robinanil Date: Mon Feb 22 19:28:32 2010 New Revision: 915026 URL: http://svn.apache.org/viewvc?rev=915026&view=rev Log: MAHOUT-300 RandomAccessSparseVector will start at INITIAL_SIZE 11 Modified: lucene/mahout/trunk/math/src/main/java/org/apache/mahout/

svn commit: r915021 - /lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/DenseVector.java

2010-02-22 Thread robinanil
Author: robinanil Date: Mon Feb 22 19:03:43 2010 New Revision: 915021 URL: http://svn.apache.org/viewvc?rev=915021&view=rev Log: MAHOUT-300 DenseVector Tweaks Modified: lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/DenseVector.java Modified: lucene/mahout/trunk/math

svn commit: r915012 - /lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/RandomAccessSparseVector.java

2010-02-22 Thread robinanil
Author: robinanil Date: Mon Feb 22 18:42:55 2010 New Revision: 915012 URL: http://svn.apache.org/viewvc?rev=915012&view=rev Log: MAHOUT-300 removing redundant check might speed things a bit Modified: lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/RandomAccessSparseVector.

svn commit: r915007 - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/clustering: canopy/Canopy.java kmeans/Cluster.java

2010-02-22 Thread robinanil
Author: robinanil Date: Mon Feb 22 18:26:17 2010 New Revision: 915007 URL: http://svn.apache.org/viewvc?rev=915007&view=rev Log: MAHOUT-297 First cut changes for kmeans and canopy for 0.3, rest for 0.4 Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/clustering/ca

svn commit: r912655 - in /lucene/mahout/trunk: core/src/main/java/org/apache/mahout/common/ math/src/main/java/org/apache/mahout/math/ math/src/test/java/org/apache/mahout/math/ utils/src/main/java/or

2010-02-22 Thread robinanil
Author: robinanil Date: Mon Feb 22 17:01:56 2010 New Revision: 912655 URL: http://svn.apache.org/viewvc?rev=912655&view=rev Log: MAHOUT-300 First wave of perf improvements Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/common/TimingStatistics.java lucene/ma

svn commit: r912585 - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/common/distance: CosineDistanceMeasure.java ManhattanDistanceMeasure.java SquaredEuclideanDistanceMeasure.java Tanimo

2010-02-22 Thread robinanil
Author: robinanil Date: Mon Feb 22 14:38:48 2010 New Revision: 912585 URL: http://svn.apache.org/viewvc?rev=912585&view=rev Log: Distance Measure improvements Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/common/distance/CosineDistanceMeasure.java lucene/ma

svn commit: r912190 - /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/vectors/text/term/TermCountReducer.java

2010-02-20 Thread robinanil
Author: robinanil Date: Sat Feb 20 18:54:33 2010 New Revision: 912190 URL: http://svn.apache.org/viewvc?rev=912190&view=rev Log: FindBugs: Prevent edit of static var in configure Modified: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/vectors/text/

svn commit: r911434 - /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/classifier/bayes/TestClassifier.java

2010-02-18 Thread robinanil
Author: robinanil Date: Thu Feb 18 15:14:44 2010 New Revision: 911434 URL: http://svn.apache.org/viewvc?rev=911434&view=rev Log: MAHOUT-296 Style issues with TestClassifier Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/classifier/bayes/TestClassifier.java Modi

svn commit: r911432 - in /lucene/mahout/trunk: bin/mahout core/src/main/java/org/apache/mahout/classifier/bayes/TestClassifier.java

2010-02-18 Thread robinanil
Author: robinanil Date: Thu Feb 18 15:12:14 2010 New Revision: 911432 URL: http://svn.apache.org/viewvc?rev=911432&view=rev Log: MAHOUT-296 testclassifier and trainclassifier added to shell script. Test classifier now uses correct label from the key Modified: lucene/mahout/trunk/bin/ma

svn commit: r910926 - /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/benchmark/VectorBenchmarks.java

2010-02-17 Thread robinanil
Author: robinanil Date: Wed Feb 17 11:38:17 2010 New Revision: 910926 URL: http://svn.apache.org/viewvc?rev=910926&view=rev Log: Set the default loop as 200 Modified: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/benchmark/VectorBenchmarks.java Modified: lucene/mahout/t

svn commit: r910925 - /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/benchmark/VectorBenchmarks.java

2010-02-17 Thread robinanil
Author: robinanil Date: Wed Feb 17 11:37:34 2010 New Revision: 910925 URL: http://svn.apache.org/viewvc?rev=910925&view=rev Log: Better looking Benchmark output with comparison across impls Modified: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/benchmark/VectorBenchmarks.

svn commit: r910876 - in /lucene/mahout/trunk: core/src/main/java/org/apache/mahout/common/TimingStatistics.java utils/src/main/java/org/apache/mahout/benchmark/ utils/src/main/java/org/apache/mahout/

2010-02-17 Thread robinanil
Author: robinanil Date: Wed Feb 17 09:19:01 2010 New Revision: 910876 URL: http://svn.apache.org/viewvc?rev=910876&view=rev Log: Basic Benchmarking Code Added: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/benchmark/ lucene/mahout/trunk/utils/src/main/java/org/apache/ma

svn commit: r910285 - in /lucene/mahout/trunk/core/src: main/java/org/apache/mahout/classifier/bayes/mapreduce/bayes/ test/java/org/apache/mahout/classifier/ test/java/org/apache/mahout/classifier/bay

2010-02-15 Thread robinanil
Author: robinanil Date: Mon Feb 15 18:27:31 2010 New Revision: 910285 URL: http://svn.apache.org/viewvc?rev=910285&view=rev Log: MAHOUT-293 Classifier TestData and BayesClassifier Self Test Added: lucene/mahout/trunk/core/src/test/java/org/apache/mahout/classifier/ClassifierData.

svn commit: r910282 [6/6] - in /lucene/mahout/trunk: core/src/main/java/org/apache/mahout/cf/taste/hadoop/ core/src/main/java/org/apache/mahout/cf/taste/hadoop/cooccurence/ core/src/main/java/org/apac

2010-02-15 Thread robinanil
Modified: lucene/mahout/trunk/utils/src/test/java/org/apache/mahout/utils/nlp/collocations/llr/CollocReducerTest.java URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/utils/src/test/java/org/apache/mahout/utils/nlp/collocations/llr/CollocReducerTest.java?rev=910282&r1=910281&r2=910282&view=d

svn commit: r910067 - /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/clustering/ClusterDumper.java

2010-02-14 Thread robinanil
Author: robinanil Date: Sun Feb 14 20:23:42 2010 New Revision: 910067 URL: http://svn.apache.org/viewvc?rev=910067&view=rev Log: Adding configuration parameter in cluster dumper to set the number of top words returned and provide total score of the term Modified: lucene/mahout/trunk/u

svn commit: r909938 - in /lucene/mahout/trunk/core/src: main/java/org/apache/mahout/clustering/canopy/ test/java/org/apache/mahout/clustering/canopy/

2010-02-13 Thread robinanil
Author: robinanil Date: Sun Feb 14 00:19:31 2010 New Revision: 909938 URL: http://svn.apache.org/viewvc?rev=909938&view=rev Log: CanopyClusterer now reports status information Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/clustering/canopy/CanopyClusterer.

svn commit: r909920 - /lucene/mahout/trunk/core/src/test/java/org/apache/mahout/clustering/canopy/TestCanopyCreation.java

2010-02-13 Thread robinanil
Author: robinanil Date: Sat Feb 13 21:45:41 2010 New Revision: 909920 URL: http://svn.apache.org/viewvc?rev=909920&view=rev Log: TestCanopyClustering to use DummyReporter instead of null Modified: lucene/mahout/trunk/core/src/test/java/org/apache/mahout/clustering/ca

svn commit: r909916 - /lucene/mahout/trunk/core/src/test/java/org/apache/mahout/clustering/canopy/TestCanopyCreation.java

2010-02-13 Thread robinanil
Author: robinanil Date: Sat Feb 13 21:22:29 2010 New Revision: 909916 URL: http://svn.apache.org/viewvc?rev=909916&view=rev Log: MAHOUT-291 Missed out changed test for CanopyClustering Modified: lucene/mahout/trunk/core/src/test/java/org/apache/mahout/clustering/ca

svn commit: r909914 [5/5] - in /lucene/mahout/trunk/core/src: main/java/org/apache/mahout/clustering/ main/java/org/apache/mahout/clustering/canopy/ main/java/org/apache/mahout/clustering/dirichlet/ m

2010-02-13 Thread robinanil
Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/clustering/meanshift/MeanShiftCanopyClusterer.java URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/core/src/main/java/org/apache/mahout/clustering/meanshift/MeanShiftCanopyClusterer.java?rev=909914&r1=909913&r2=909914&view=d

svn commit: r909912 [10/10] - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste: common/ eval/ hadoop/ hadoop/cooccurence/ hadoop/item/ hadoop/pseudo/ hadoop/slopeone/ impl/common/

2010-02-13 Thread robinanil
Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/model/Preference.java URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/model/Preference.java?rev=909912&r1=909911&r2=909912&view=diff

svn commit: r909910 - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/ga/watchmaker: EvalMapper.java MahoutEvaluator.java MahoutFitnessEvaluator.java OutputUtils.java STEvolutionEngine.ja

2010-02-13 Thread robinanil
Author: robinanil Date: Sat Feb 13 20:33:13 2010 New Revision: 909910 URL: http://svn.apache.org/viewvc?rev=909910&view=rev Log: MAHOUT-291 mahout-ga code style changes Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/ga/watchmaker/EvalMapper.java lucene/mahout/t

svn commit: r909900 [4/4] - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/df: ./ builder/ callback/ data/ data/conditions/ mapred/ mapred/inmem/ mapred/partial/ mapreduce/ mapreduce/inm

2010-02-13 Thread robinanil
Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/df/tools/FrequenciesJob.java URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/core/src/main/java/org/apache/mahout/df/tools/FrequenciesJob.java?rev=909900&r1=909899&r2=909900&view=diff

svn commit: r909873 - in /lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/cf/taste/ejb: RecommenderEJB.java RecommenderEJBBean.java RecommenderEJBHome.java RecommenderEJBLocal.java Recomm

2010-02-13 Thread robinanil
Author: robinanil Date: Sat Feb 13 19:12:14 2010 New Revision: 909873 URL: http://svn.apache.org/viewvc?rev=909873&view=rev Log: MAHOUT-291 Few more changes in examples Modified: lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/cf/taste/ejb/RecommenderEJB.java lu

svn commit: r909871 [7/7] - in /lucene/mahout/trunk/examples/src/main/java/org/apache/mahout: analysis/ cf/taste/ejb/ cf/taste/example/ cf/taste/example/bookcrossing/ cf/taste/example/grouplens/ cf/ta

2010-02-13 Thread robinanil
Modified: lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/text/WikipediaToSequenceFile.java URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/text/WikipediaToSequenceFile.java?rev=909871&r1=909870&r2=909871&view=diff ==

svn commit: r909861 [4/4] - in /lucene/mahout/trunk/utils/src: main/java/org/apache/mahout/clustering/lda/ main/java/org/apache/mahout/text/ main/java/org/apache/mahout/utils/ main/java/org/apache/mah

2010-02-13 Thread robinanil
Modified: lucene/mahout/trunk/utils/src/test/java/org/apache/mahout/utils/nlp/collocations/llr/GramTest.java URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/utils/src/test/java/org/apache/mahout/utils/nlp/collocations/llr/GramTest.java?rev=909861&r1=909860&r2=909861&view=diff ==

svn commit: r909781 - /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/SequenceFileDumper.java

2010-02-13 Thread robinanil
Author: robinanil Date: Sat Feb 13 09:24:06 2010 New Revision: 909781 URL: http://svn.apache.org/viewvc?rev=909781&view=rev Log: LDAPrintTopics helpOpt fix Modified: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/SequenceFileDumper.java Modified: lucene/mahout/t

svn commit: r909696 - in /lucene/mahout/trunk: core/src/main/java/org/apache/mahout/clustering/lda/ utils/src/main/java/org/apache/mahout/clustering/ utils/src/main/java/org/apache/mahout/clustering/l

2010-02-12 Thread robinanil
Author: robinanil Date: Sat Feb 13 02:17:01 2010 New Revision: 909696 URL: http://svn.apache.org/viewvc?rev=909696&view=rev Log: Moved LDAPrintTopics to utils added functionality to read DictionaryVectorizer dictionary.file-* Added: lucene/mahout/trunk/utils/src/main/java/org/apache/ma

svn commit: r909611 - /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/text/SparseVectorsFromSequenceFiles.java

2010-02-12 Thread robinanil
Author: robinanil Date: Fri Feb 12 21:24:08 2010 New Revision: 909611 URL: http://svn.apache.org/viewvc?rev=909611&view=rev Log: adding sequentialaccess option in main filesrc/main/java/org/apache/mahout/text/SparseVectorsFromSequenceFiles.java Modified: lucene/mahout/trunk/utils/src/

svn commit: r909598 - in /lucene/mahout/trunk: core/src/main/java/org/apache/mahout/clustering/lda/ core/src/test/java/org/apache/mahout/clustering/lda/ utils/src/main/java/org/apache/mahout/utils/clu

2010-02-12 Thread robinanil
Author: robinanil Date: Fri Feb 12 20:23:55 2010 New Revision: 909598 URL: http://svn.apache.org/viewvc?rev=909598&view=rev Log: MAHOUT-289 LDAMapper was using Vector instead of VectorWritable in Mapper defn Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/clustering

svn commit: r909434 - /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/fpm/pfpgrowth/fpgrowth/FPGrowth.java

2010-02-12 Thread robinanil
Author: robinanil Date: Fri Feb 12 14:44:19 2010 New Revision: 909434 URL: http://svn.apache.org/viewvc?rev=909434&view=rev Log: pruneFPTree to use OpenIntIntHashMap Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/fpm/pfpgrowth/fpgrowth/FPGrowth.java Modified: lu

svn commit: r909383 - /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/fpm/pfpgrowth/FPGrowthDriver.java

2010-02-12 Thread robinanil
Author: robinanil Date: Fri Feb 12 12:42:42 2010 New Revision: 909383 URL: http://svn.apache.org/viewvc?rev=909383&view=rev Log: FPGrowth driver edits Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/fpm/pfpgrowth/FPGrowthDriver.java Modified: lucene/mahout/trunk/

svn commit: r909063 - in /lucene/mahout/trunk/core/src: main/java/org/apache/mahout/classifier/bayes/algorithm/ main/java/org/apache/mahout/classifier/bayes/datastore/ main/java/org/apache/mahout/clas

2010-02-11 Thread robinanil
Author: robinanil Date: Thu Feb 11 16:32:44 2010 New Revision: 909063 URL: http://svn.apache.org/viewvc?rev=909063&view=rev Log: Bayes Classifier some classes modified to use math collections Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/classifier/bayes/algor

svn commit: r909008 - /lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/SparseMatrix.java

2010-02-11 Thread robinanil
Author: robinanil Date: Thu Feb 11 14:58:59 2010 New Revision: 909008 URL: http://svn.apache.org/viewvc?rev=909008&view=rev Log: SparseMatrix uses OpenIntObjectHashMap Modified: lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/SparseMatrix.java Modified: lucene/mahout/t

svn commit: r908859 - in /lucene/mahout/trunk/utils/src: main/java/org/apache/mahout/text/ main/java/org/apache/mahout/utils/nlp/collocations/llr/ test/java/org/apache/mahout/utils/nlp/collocations/ll

2010-02-10 Thread robinanil
Author: robinanil Date: Thu Feb 11 07:12:48 2010 New Revision: 908859 URL: http://svn.apache.org/viewvc?rev=908859&view=rev Log: MAHOUT-285 CollocMapper optimisations (Now reduces number of subgrams in output) Modified: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/

svn commit: r908851 - in /lucene/mahout/trunk/utils/src: main/java/org/apache/mahout/text/ main/java/org/apache/mahout/utils/nlp/collocations/llr/ main/java/org/apache/mahout/utils/vectors/text/ main/

2010-02-10 Thread robinanil
Author: robinanil Date: Thu Feb 11 06:01:19 2010 New Revision: 908851 URL: http://svn.apache.org/viewvc?rev=908851&view=rev Log: Checking in Drew's fixes. Functional Vectorizer Added: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/nlp/collocations/llr/NGramColle

svn commit: r908841 - in /lucene/mahout/trunk: examples/src/main/java/org/apache/mahout/text/SparseVectorsFromSequenceFiles.java utils/src/main/java/org/apache/mahout/text/SparseVectorsFromSequenceFil

2010-02-10 Thread robinanil
Author: robinanil Date: Thu Feb 11 04:35:24 2010 New Revision: 908841 URL: http://svn.apache.org/viewvc?rev=908841&view=rev Log: MAHOUT-285 Moving the main class to utils Added: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/text/SparseVectorsFromSequenceFiles.java - co

svn commit: r908839 - /lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/text/SparseVectorsFromSequenceFiles.java

2010-02-10 Thread robinanil
Author: robinanil Date: Thu Feb 11 04:34:04 2010 New Revision: 908839 URL: http://svn.apache.org/viewvc?rev=908839&view=rev Log: MAHOUT-285 Missed out the main class Modified: lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/text/SparseVectorsFromSequenceFiles.java Modi

svn commit: r908463 - in /lucene/mahout/trunk/math: pom.xml src/main/java/org/apache/mahout/math/matrix/linalg/Algebra.java src/main/java/org/apache/mahout/math/matrix/linalg/Smp.java src/main/java/or

2010-02-10 Thread robinanil
Author: robinanil Date: Wed Feb 10 12:08:44 2010 New Revision: 908463 URL: http://svn.apache.org/viewvc?rev=908463&view=rev Log: Kicking out concurrent library. And with it the SMP implementation of Blas Removed: lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/matrix/li

svn commit: r908367 - /lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/clustering/dirichlet/DisplayDirichlet.java

2010-02-09 Thread robinanil
Author: robinanil Date: Wed Feb 10 06:54:08 2010 New Revision: 908367 URL: http://svn.apache.org/viewvc?rev=908367&view=rev Log: Trunk wasnt compiling due to this bug. Temporary fix for now Modified: lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/clustering/diric

svn commit: r908040 - in /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils: nlp/collocations/llr/ vectors/text/term/

2010-02-09 Thread robinanil
Author: robinanil Date: Tue Feb 9 14:11:50 2010 New Revision: 908040 URL: http://svn.apache.org/viewvc?rev=908040&view=rev Log: Adding minSupport and minLLRValue parameter for pruning low frequency ngrams in Collocations Map/Reduce Job Modified: lucene/mahout/trunk/utils/src/main/java

svn commit: r908024 - in /lucene/mahout/trunk: core/src/main/java/org/apache/mahout/fpm/pfpgrowth/ examples/src/main/java/org/apache/mahout/fpm/pfpgrowth/ examples/src/main/java/org/apache/mahout/fpm/

2010-02-09 Thread robinanil
Author: robinanil Date: Tue Feb 9 13:45:48 2010 New Revision: 908024 URL: http://svn.apache.org/viewvc?rev=908024&view=rev Log: Moving FpGrowthJob to core and renamed as FPGrowthDriver and some other refactor Added: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/fpm/pfpgr

svn commit: r908014 - in /lucene/mahout/trunk: core/src/main/java/org/apache/mahout/classifier/bayes/ examples/src/main/java/org/apache/mahout/classifier/bayes/ examples/src/main/java/org/apache/mahou

2010-02-09 Thread robinanil
Author: robinanil Date: Tue Feb 9 12:44:33 2010 New Revision: 908014 URL: http://svn.apache.org/viewvc?rev=908014&view=rev Log: Moving Test and Train Classifier to core Added: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/classifier/bayes/TestClassifier.java - co

svn commit: r907938 - in /lucene/mahout/trunk/utils: ./ src/main/java/org/apache/mahout/utils/nlp/ src/main/java/org/apache/mahout/utils/nlp/collocations/ src/main/java/org/apache/mahout/utils/nlp/col

2010-02-08 Thread robinanil
Author: robinanil Date: Tue Feb 9 05:49:18 2010 New Revision: 907938 URL: http://svn.apache.org/viewvc?rev=907938&view=rev Log: MAHOUT-242 NGram Collocation using LLR (Drew Farris) Added: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/nlp/ lucene/mahout/trunk/utils

svn commit: r907925 - in /lucene/mahout/trunk: core/pom.xml maven/pom.xml

2010-02-08 Thread robinanil
Author: robinanil Date: Tue Feb 9 04:08:55 2010 New Revision: 907925 URL: http://svn.apache.org/viewvc?rev=907925&view=rev Log: MAHOUT-282 Committing for Drew Modified: lucene/mahout/trunk/core/pom.xml lucene/mahout/trunk/maven/pom.xml Modified: lucene/mahout/trunk/core/pom.xml

svn commit: r907675 - in /lucene/mahout/trunk: examples/pom.xml utils/src/main/java/org/apache/mahout/utils/vectors/tfidf/TFIDFPartialVectorReducer.java

2010-02-08 Thread robinanil
Author: robinanil Date: Mon Feb 8 15:06:09 2010 New Revision: 907675 URL: http://svn.apache.org/viewvc?rev=907675&view=rev Log: reverting changes to jets3 Modified: lucene/mahout/trunk/examples/pom.xml lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/vectors/t

svn commit: r907642 - in /lucene/mahout/trunk: core/src/main/java/org/apache/mahout/fpm/pfpgrowth/ core/src/main/java/org/apache/mahout/fpm/pfpgrowth/fpgrowth/ examples/src/main/java/org/apache/mahout

2010-02-08 Thread robinanil
Author: robinanil Date: Mon Feb 8 12:51:51 2010 New Revision: 907642 URL: http://svn.apache.org/viewvc?rev=907642&view=rev Log: Transforming code to use Mahout-math collections instead of HashMap. Only the easier ones. No Changes made in public functions Modified: lucene/mahout/trunk/

svn commit: r907637 - in /lucene/mahout/trunk: collections-codegen-plugin/pom.xml core/pom.xml examples/pom.xml taste-web/pom.xml

2010-02-08 Thread robinanil
Author: robinanil Date: Mon Feb 8 12:40:00 2010 New Revision: 907637 URL: http://svn.apache.org/viewvc?rev=907637&view=rev Log: Removing dependency to unused jars Modified: lucene/mahout/trunk/collections-codegen-plugin/pom.xml lucene/mahout/trunk/core/pom.xml lucene/mahout/t

svn commit: r907625 - /lucene/mahout/trunk/maven/build.xml

2010-02-08 Thread robinanil
Author: robinanil Date: Mon Feb 8 11:59:39 2010 New Revision: 907625 URL: http://svn.apache.org/viewvc?rev=907625&view=rev Log: Job jars were having duplicate classes/jar bundled Modified: lucene/mahout/trunk/maven/build.xml Modified: lucene/mahout/trunk/maven/build.xml URL:

svn commit: r907466 - in /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils: clustering/ClusterDumper.java vectors/VectorDumper.java vectors/VectorHelper.java

2010-02-07 Thread robinanil
Author: robinanil Date: Sun Feb 7 19:49:16 2010 New Revision: 907466 URL: http://svn.apache.org/viewvc?rev=907466&view=rev Log: MAHOUT-278 Cluster dumper reads DictionaryVectorizer dictionary chunks Modified: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/cluste

svn commit: r907465 - in /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/vectors/text: DictionaryVectorizer.java term/TFPartialVectorReducer.java

2010-02-07 Thread robinanil
Author: robinanil Date: Sun Feb 7 18:55:29 2010 New Revision: 907465 URL: http://svn.apache.org/viewvc?rev=907465&view=rev Log: MAHOUT-277 Increase number of entries in memory per chunk of dictionary Modified: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/vectors/

svn commit: r907442 - /lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/RandomAccessSparseVector.java

2010-02-07 Thread robinanil
Author: robinanil Date: Sun Feb 7 16:43:38 2010 New Revision: 907442 URL: http://svn.apache.org/viewvc?rev=907442&view=rev Log: RandomAccessSparseVector addTo made a inner static class Modified: lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/RandomAccessSparseVector.

svn commit: r907417 - in /lucene/mahout/trunk/core/src: main/java/org/apache/mahout/clustering/meanshift/ test/java/org/apache/mahout/clustering/fuzzykmeans/ test/java/org/apache/mahout/clustering/kme

2010-02-07 Thread robinanil
Author: robinanil Date: Sun Feb 7 12:08:26 2010 New Revision: 907417 URL: http://svn.apache.org/viewvc?rev=907417&view=rev Log: Dummy Reporter to aid Map/Reduce unit testing Added: lucene/mahout/trunk/core/src/test/java/org/apache/mahout/common/DummyReporter.java Modified: lu

svn commit: r907217 - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/clustering: fuzzykmeans/FuzzyKMeansReducer.java kmeans/KMeansReducer.java

2010-02-06 Thread robinanil
Author: robinanil Date: Sat Feb 6 13:56:31 2010 New Revision: 907217 URL: http://svn.apache.org/viewvc?rev=907217&view=rev Log: Report number of converged clusters for kmeans and fuzzy kmeans, as a feedback to the user Modified: lucene/mahout/trunk/core/src/main/java/org/apache/ma

svn commit: r907203 - in /lucene/mahout/trunk/math/src/main/java-templates/org/apache/mahout/math: map/OpenKeyTypeObjectHashMap.java.t map/OpenKeyTypeValueTypeHashMap.java.t map/OpenObjectValueTypeHas

2010-02-06 Thread robinanil
Author: robinanil Date: Sat Feb 6 11:50:37 2010 New Revision: 907203 URL: http://svn.apache.org/viewvc?rev=907203&view=rev Log: Small changes: Redundant code removed and some tweaks Modified: lucene/mahout/trunk/math/src/main/java-templates/org/apache/mahout/math

svn commit: r907020 - in /lucene/mahout/trunk/math/src/main/java-templates/org/apache/mahout/math/list: AbstractValueTypeList.java.t ValueTypeArrayList.java.t

2010-02-05 Thread robinanil
Author: robinanil Date: Fri Feb 5 17:59:14 2010 New Revision: 907020 URL: http://svn.apache.org/viewvc?rev=907020&view=rev Log: null check bug in List Modified: lucene/mahout/trunk/math/src/main/java-templates/org/apache/mahout/math/list/AbstractValueTypeList.java.t lucene/ma

svn commit: r906255 - /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/clustering/kmeans/RandomSeedGenerator.java

2010-02-03 Thread robinanil
Author: robinanil Date: Wed Feb 3 21:36:51 2010 New Revision: 906255 URL: http://svn.apache.org/viewvc?rev=906255&view=rev Log: MAHOUT-273 RandomSeedGenerator doesnt estimate cluster centers when input path is a directory. Now iterates over the all the files in the input directory to gene

svn commit: r905387 - /lucene/mahout/trunk/maven/build.xml

2010-02-01 Thread robinanil
Author: robinanil Date: Mon Feb 1 19:34:31 2010 New Revision: 905387 URL: http://svn.apache.org/viewvc?rev=905387&view=rev Log: Examples Job bundles utils classes Modified: lucene/mahout/trunk/maven/build.xml Modified: lucene/mahout/trunk/maven/build.xml URL: http://svn.apache.org/vi

svn commit: r905232 - /lucene/mahout/trunk/core/src/test/java/org/apache/mahout/clustering/kmeans/TestKmeansClustering.java

2010-02-01 Thread robinanil
Author: robinanil Date: Mon Feb 1 10:08:29 2010 New Revision: 905232 URL: http://svn.apache.org/viewvc?rev=905232&view=rev Log: TestKMeansWithCanopyClusterInput updated to use the lastest code. It was never updated since the first checkin Modified: lucene/mahout/trunk/core/src/test/

svn commit: r905089 - /lucene/mahout/trunk/core/src/test/java/org/apache/mahout/clustering/kmeans/TestKmeansClustering.java

2010-01-31 Thread robinanil
Author: robinanil Date: Sun Jan 31 18:10:33 2010 New Revision: 905089 URL: http://svn.apache.org/viewvc?rev=905089&view=rev Log: Spell Error caused test not to run Modified: lucene/mahout/trunk/core/src/test/java/org/apache/mahout/clustering/kmeans/TestKmeansClustering.java Modi

svn commit: r900034 - in /lucene/mahout/trunk: examples/src/main/java/org/apache/mahout/text/ utils/src/main/java/org/apache/mahout/utils/vectors/text/ utils/src/test/java/org/apache/mahout/utils/vect

2010-01-16 Thread robinanil
Author: robinanil Date: Sat Jan 16 23:10:31 2010 New Revision: 900034 URL: http://svn.apache.org/viewvc?rev=900034&view=rev Log: MAHOUT-237 Dictionary Vectorizer: running version which was tested over Wikipedia article dumps Added: lucene/mahout/trunk/utils/src/main/java/org/apache/ma

svn commit: r898653 - /lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/vectors/text/PartialVectorGenerator.java

2010-01-12 Thread robinanil
Author: robinanil Date: Wed Jan 13 05:40:24 2010 New Revision: 898653 URL: http://svn.apache.org/viewvc?rev=898653&view=rev Log: MAHOUT-237. Remove assertion(was used for debug) Modified: lucene/mahout/trunk/utils/src/main/java/org/apache/mahout/utils/vectors/

svn commit: r897994 - in /lucene/mahout/trunk/math/src/main/java/org/apache/mahout/math/map: OpenDoubleIntHashMap.java OpenIntDoubleHashMap.java OpenIntIntHashMap.java

2010-01-11 Thread robinanil
Author: robinanil Date: Mon Jan 11 18:32:02 2010 New Revision: 897994 URL: http://svn.apache.org/viewvc?rev=897994&view=rev Log: Deleting OpenDoubleIntHashMap OpenIntDoubleHashMap OpenIntIntHashMap from source and use generated classes instead Removed: lucene/mahout/trunk/math/src/

svn commit: r897134 - in /lucene/mahout/trunk: core/src/test/java/org/apache/mahout/fpm/pfpgrowth/fpgrowth/ examples/src/main/java/org/apache/mahout/fpm/pfpgrowth/example/dataset/

2010-01-08 Thread robinanil
Author: robinanil Date: Fri Jan 8 08:23:22 2010 New Revision: 897134 URL: http://svn.apache.org/viewvc?rev=897134&view=rev Log: MAHOUT-221 Missed out two files while checking in FP-Bonsai Added: lucene/mahout/trunk/core/src/test/java/org/apache/mahout/fpm/pfpgrowth/fpgrowth/ lu

svn commit: r896922 [3/3] - in /lucene/mahout/trunk: core/src/main/java/org/apache/mahout/common/ core/src/main/java/org/apache/mahout/fpm/pfpgrowth/ core/src/main/java/org/apache/mahout/fpm/pfpgrowth

2010-01-07 Thread robinanil
Added: lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/fpm/pfpgrowth/example/dataset/KeyBasedStringTupleGrouper.java URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/fpm/pfpgrowth/example/dataset/KeyBasedStringTupleGrouper.java?rev=896922

svn commit: r896311 [4/4] - in /lucene/mahout/trunk: core/src/main/java/org/apache/mahout/classifier/ core/src/main/java/org/apache/mahout/classifier/bayes/algorithm/ core/src/main/java/org/apache/mah

2010-01-05 Thread robinanil
Modified: lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/classifier/bayes/WikipediaDatasetCreatorDriver.java URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/examples/src/main/java/org/apache/mahout/classifier/bayes/WikipediaDatasetCreatorDriver.java?rev=896311&r1=896310&r2=896

svn commit: r894033 - /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/common/distance/WeightedDistanceMeasure.java

2009-12-26 Thread robinanil
Author: robinanil Date: Sat Dec 26 22:56:22 2009 New Revision: 894033 URL: http://svn.apache.org/viewvc?rev=894033&view=rev Log: Found a NULLPointer Exception bug: the check was happening after the use Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/common/dist

svn commit: r828932 - /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/fpm/pfpgrowth/fpgrowth/FPGrowth.java

2009-10-22 Thread robinanil
Author: robinanil Date: Fri Oct 23 04:41:23 2009 New Revision: 828932 URL: http://svn.apache.org/viewvc?rev=828932&view=rev Log: loop index wasn't getting updated(due to previous checkin for-while loop) FPGrowth Modified: lucene/mahout/trunk/core/src/main/java/org/apache/m

svn commit: r827438 - /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/fpm/pfpgrowth/fpgrowth/FPGrowth.java

2009-10-20 Thread robinanil
Author: robinanil Date: Tue Oct 20 13:44:21 2009 New Revision: 827438 URL: http://svn.apache.org/viewvc?rev=827438&view=rev Log: Changing MAHOUT FPGrowth:growth for-loop to while-loop Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/fpm/pfpgrowth/fpgrowth/FPGrowth.

svn commit: r826841 - in /lucene/mahout/trunk: core/src/main/java/org/apache/mahout/classifier/bayes/algorithm/ core/src/main/java/org/apache/mahout/classifier/bayes/datastore/ core/src/main/java/org/

2009-10-19 Thread robinanil
Author: robinanil Date: Mon Oct 19 22:26:27 2009 New Revision: 826841 URL: http://svn.apache.org/viewvc?rev=826841&view=rev Log: MAHOUT-188 Cleanup of Bayes/CBayes Classifier Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/classifier/bayes/algorithm/BayesAlgorithm.

svn commit: r826562 - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/fpm/pfpgrowth: ParallelFPGrowthReducer.java fpgrowth/FPTreeDepthCache.java fpgrowth/Pattern.java

2009-10-18 Thread robinanil
Author: robinanil Date: Mon Oct 19 00:05:02 2009 New Revision: 826562 URL: http://svn.apache.org/viewvc?rev=826562&view=rev Log: removed some findbugs warnings from PFPGrowth Modified: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/fpm/pfpgrowth/ParallelFPGrowthReducer.

svn commit: r826561 - in /lucene/mahout/trunk/core/src/main/java/org/apache/mahout/classifier/bayes: algorithm/BayesAlgorithm.java algorithm/CBayesAlgorithm.java common/ByScoreLabelResultComparator.ja

2009-10-18 Thread robinanil
Author: robinanil Date: Mon Oct 19 00:03:55 2009 New Revision: 826561 URL: http://svn.apache.org/viewvc?rev=826561&view=rev Log: MAHOUT-186 Removed ClassifierProrityQueue custom class replaced by PriorityQueue Added: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/classi

svn commit: r826546 - in /lucene/mahout/trunk: core/pom.xml examples/pom.xml taste-web/pom.xml utils/pom.xml

2009-10-18 Thread robinanil
Author: robinanil Date: Sun Oct 18 22:28:59 2009 New Revision: 826546 URL: http://svn.apache.org/viewvc?rev=826546&view=rev Log: MAHOUT-170 adding optimize=true flag during java compiles stage Modified: lucene/mahout/trunk/core/pom.xml lucene/mahout/trunk/examples/pom.xml lu

svn commit: r808808 [3/3] - in /lucene/mahout/trunk: core/ core/lib/ core/src/main/java/org/apache/mahout/classifier/ core/src/main/java/org/apache/mahout/classifier/bayes/ core/src/main/java/org/apac

2009-08-28 Thread robinanil
Added: lucene/mahout/trunk/core/src/main/java/org/apache/mahout/common/Parameters.java URL: http://svn.apache.org/viewvc/lucene/mahout/trunk/core/src/main/java/org/apache/mahout/common/Parameters.java?rev=808808&view=auto ===