Hi Jing,

I tried to run the same command on my distribution, and got the
following results - one difference you might notice is that I used a
--token option, which is normally required. So, I wonder if you could
try using the --token option? Also, are you able to run the tests, as
in:

cd Testing
csh all-tests.sh

Does that work?

Below are the results when I run on the sample data for begin...

lincoln:~/.cpan/build/Text-SenseClusters-1.03-2dUNe9 # discriminate.pl
samples/Data/begin.v-test.xml --token samples/Regexs/token.regex

=================================================================
Output when #clusters = 2 (Set manually)
=================================================================
********************************************************************************
vcluster (CLUTO 2.1.2) Copyright 2001-06, Regents of the University of Minnesota

Matrix Information -----------------------------------------------------------
  Name: expr1383616949.vectors, #Rows: 255, #Columns: 4242, #NonZeros: 417931

Options ----------------------------------------------------------------------
  CLMethod=RB, CRfun=I2, SimFun=Cosine, #Clusters: 2
  RowModel=None, ColModel=None, GrModel=SY-DIR, NNbrs=40
  Colprune=1.00, EdgePrune=-1.00, VtxPrune=-1.00, MinComponent=5
  CSType=Best, AggloFrom=0, AggloCRFun=I2, NTrials=10, NIter=10

Solution ---------------------------------------------------------------------

------------------------------------------------------------------------
2-way clustering: [I2=2.47e+02] [255 of 255]
------------------------------------------------------------------------
cid  Size  ISim  ISdev   ESim  ESdev  |
------------------------------------------------------------------------
  0   162 +0.948 +0.027 +0.893 +0.047 |
  1    93 +0.930 +0.046 +0.893 +0.088 |
------------------------------------------------------------------------

------------------------------------------------------------------------------
Hierarchical Tree that optimizes the I2 criterion function...
------------------------------------------------------------------------------

-----------------
2
|---0
|---1
-----------------
------------------------------------------------------------------------------

Timing Information -----------------------------------------------------------
   I/O:                                   0.153 sec
   Clustering:                            0.126 sec
   Reporting:                             0.033 sec
Memory Usage Information -----------------------------------------------------
   Maximum memory used:                11472896 bytes
   Current memory used:                 3541224 bytes
********************************************************************************

Clusters of given contexts can be found in file: expr1383616949.clusters

Good luck, and please let us know what happens!

Thanks,
Ted

On Mon, Nov 4, 2013 at 5:41 PM, Jing Wang <[email protected]> wrote:
> Hello,
>
> I am a new user to the senseclusters, and I try to run the discriminate.pl
> on the sample file under the samples directory, however, I cannot make it
> work.
>
> I simply type the command: perl discriminate.pl
> ./samples/Data/begin.v-test.xml
>
> And the error is: ERROR(discriminate.pl):
> Only 2 FEATURES found in the <expr1383608414.bigrams> file.
> At least 10 FEATURES required to proceed with context
> representation.
>
> This might be a silly mistake, but I cannot figure it out. Can someone help
> me on this issue? Thank you very much!
>
>
> Best,
> Safari
>
> ------------------------------------------------------------------------------
> November Webinars for C, C++, Fortran Developers
> Accelerate application performance with scalable programming models. Explore
> techniques for threading, error checking, porting, and tuning. Get the most
> from the latest Intel processors and coprocessors. See abstracts and
> register
> http://pubads.g.doubleclick.net/gampad/clk?id=60136231&iu=/4140/ostg.clktrk
> _______________________________________________
> senseclusters-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/senseclusters-users
>



-- 
Ted Pedersen
http://www.d.umn.edu/~tpederse

------------------------------------------------------------------------------
November Webinars for C, C++, Fortran Developers
Accelerate application performance with scalable programming models. Explore
techniques for threading, error checking, porting, and tuning. Get the most 
from the latest Intel processors and coprocessors. See abstracts and register
http://pubads.g.doubleclick.net/gampad/clk?id=60136231&iu=/4140/ostg.clktrk
_______________________________________________
senseclusters-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/senseclusters-users

Reply via email to