Hi Jing, I tried to run the same command on my distribution, and got the following results - one difference you might notice is that I used a --token option, which is normally required. So, I wonder if you could try using the --token option? Also, are you able to run the tests, as in:
cd Testing csh all-tests.sh Does that work? Below are the results when I run on the sample data for begin... lincoln:~/.cpan/build/Text-SenseClusters-1.03-2dUNe9 # discriminate.pl samples/Data/begin.v-test.xml --token samples/Regexs/token.regex ================================================================= Output when #clusters = 2 (Set manually) ================================================================= ******************************************************************************** vcluster (CLUTO 2.1.2) Copyright 2001-06, Regents of the University of Minnesota Matrix Information ----------------------------------------------------------- Name: expr1383616949.vectors, #Rows: 255, #Columns: 4242, #NonZeros: 417931 Options ---------------------------------------------------------------------- CLMethod=RB, CRfun=I2, SimFun=Cosine, #Clusters: 2 RowModel=None, ColModel=None, GrModel=SY-DIR, NNbrs=40 Colprune=1.00, EdgePrune=-1.00, VtxPrune=-1.00, MinComponent=5 CSType=Best, AggloFrom=0, AggloCRFun=I2, NTrials=10, NIter=10 Solution --------------------------------------------------------------------- ------------------------------------------------------------------------ 2-way clustering: [I2=2.47e+02] [255 of 255] ------------------------------------------------------------------------ cid Size ISim ISdev ESim ESdev | ------------------------------------------------------------------------ 0 162 +0.948 +0.027 +0.893 +0.047 | 1 93 +0.930 +0.046 +0.893 +0.088 | ------------------------------------------------------------------------ ------------------------------------------------------------------------------ Hierarchical Tree that optimizes the I2 criterion function... ------------------------------------------------------------------------------ ----------------- 2 |---0 |---1 ----------------- ------------------------------------------------------------------------------ Timing Information ----------------------------------------------------------- I/O: 0.153 sec Clustering: 0.126 sec Reporting: 0.033 sec Memory Usage Information ----------------------------------------------------- Maximum memory used: 11472896 bytes Current memory used: 3541224 bytes ******************************************************************************** Clusters of given contexts can be found in file: expr1383616949.clusters Good luck, and please let us know what happens! Thanks, Ted On Mon, Nov 4, 2013 at 5:41 PM, Jing Wang <[email protected]> wrote: > Hello, > > I am a new user to the senseclusters, and I try to run the discriminate.pl > on the sample file under the samples directory, however, I cannot make it > work. > > I simply type the command: perl discriminate.pl > ./samples/Data/begin.v-test.xml > > And the error is: ERROR(discriminate.pl): > Only 2 FEATURES found in the <expr1383608414.bigrams> file. > At least 10 FEATURES required to proceed with context > representation. > > This might be a silly mistake, but I cannot figure it out. Can someone help > me on this issue? Thank you very much! > > > Best, > Safari > > ------------------------------------------------------------------------------ > November Webinars for C, C++, Fortran Developers > Accelerate application performance with scalable programming models. Explore > techniques for threading, error checking, porting, and tuning. Get the most > from the latest Intel processors and coprocessors. See abstracts and > register > http://pubads.g.doubleclick.net/gampad/clk?id=60136231&iu=/4140/ostg.clktrk > _______________________________________________ > senseclusters-users mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/senseclusters-users > -- Ted Pedersen http://www.d.umn.edu/~tpederse ------------------------------------------------------------------------------ November Webinars for C, C++, Fortran Developers Accelerate application performance with scalable programming models. Explore techniques for threading, error checking, porting, and tuning. Get the most from the latest Intel processors and coprocessors. See abstracts and register http://pubads.g.doubleclick.net/gampad/clk?id=60136231&iu=/4140/ostg.clktrk _______________________________________________ senseclusters-users mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/senseclusters-users
