>
>
> To get started, I am looking for some tutorial/documentation on setting up
> Mahout and configuring it on a Windows machine.
>
>
Hi Buddy,
Install Cygwin and Configure Mahout inside Cygwin
Best regards
--
******
JAGANADH G
http://jaganadhg
and Chapter 14 in Mahout in Action talks about the
> code. You can get the code from http://github.com/tdunning/MiA
>
>
Hi Ted
Thanks for the pointer.
It works.
Sorry to shoot another question.
Is there any way get lable for classifier result as of 0.7 API
Best regards
--
- Doc 3 is ZZZ similar*
> *
> *
> Can you please help?
>
>
Hi
Try the tutorial given at
http://sujitpal.blogspot.com/search?updated-max=2011-11-04T11:21:00-07:00&max-results=1
--
**
JAGANADH G
http://jaganadhg.in
*ILUGCBE*
http://ilugcbe.org.in
and even your own data set can be used there .
Some times u may need to do some how work that is all .
Hopes this helps you
--
******
JAGANADH G
http://jaganadhg.in
*ILUGCBE*
http://ilugcbe.org.in
va coding skill relates things.
Simple tip : Convert your classifier code to an executable jar file with
necessary libraries. Then run it like java -jar classifier.jar
That is all
--
**
JAGANADH G
http://jaganadhg.in
*ILUGCBE*
http://ilugcbe.org.in
preprocessing before the text passed to classifier .
--
******
JAGANADH G
http://jaganadhg.in
*ILUGCBE*
http://ilugcbe.org.in
n pass it to the
classifier . Make code change accordingly
--
******
JAGANADH G
http://jaganadhg.in
*ILUGCBE*
http://ilugcbe.org.in
..
--
**
JAGANADH G
http://jaganadhg.in
*ILUGCBE*
http://ilugcbe.org.in
lled listDir which returns all the .txt files in the
directory. Itreate the list and open each files and pass to classifier .
that is all . There is no need to got to mahout-core etc.. Still if you
feel it hard please show your code
--
******
JAGANADH G
http://jaganadhg
method. return class with file name .
Quite simple
--
**
JAGANADH G
http://jaganadhg.in
*ILUGCBE*
http://ilugcbe.org.in
automatically
--
**
JAGANADH G
http://jaganadhg.in
*ILUGCBE*
http://ilugcbe.org.in
ou will get a clear insight on how to use Mahout (in java in your case
)
--
******
JAGANADH G
http://jaganadhg.freeflux.net/blog
*ILUGCBE*
http://ilugcbe.psgkriya.org
Hi All
I just found that Mahout is available in the clouders repo.
If cloudera hadoop repositories are enabled you can isnatll it using your
distros package manager.
I just installed it with "yum install mahout" in my Fedora machine :-)
--
**
JAGAN
to classify network packets using Mahout Classification algorithm. Can
> anyone make some comments. Thanks in advance
>
>
Hi
I think you need to look some un-supervised algos for prediction .
--
******
JAGANADH G
http://jaganadhg.freeflux.ne
On Tue, Sep 20, 2011 at 3:25 PM, Paritosh Ranjan wrote:
> I think you will have a hard time applying clustering to it. I will suggest
> that Classification will be a better choice for this use case.
>
>
@Paritosh
Thanks for the reply
--
******
JAG
Classification or clustering ?
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
*ILUGCBE*
http://ilugcbe.techstud.org
Hi All
Is there any way to train and use a binary classifier in Mahout.
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
*ILUGCBE*
http://ilugcbe.techstud.org
tion" published by Manning
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
*ILUGCBE*
http://ilugcbe.techstud.org
many tutorial and talk slides and related materials
Also you can checkout Mahout in Action from Manning Publishers
--
******
JAGANADH G
http://jaganadhg.freeflux.net/blog
*ILUGCBE*
http://ilugcbe.techstud.org
his mailing list will find it useful, but at
> least you can check it and potentially direct there the newbies asking
> questions.
>
>
Some samples of Recommendation and Document Classification is available at
my repo
https://bitbucket.org/jaganadhg/blog/src/tip/bck9/java/src/org/bc/kl/
-
be used
> in the project. I got to know that there is a CF engine called taste in
> Mahout. Can anyone tell me how to get the information about Taste in my
> project.
>
>
Checkout the book Mahout in Action
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
>
> hdfs://localhost:9000/user/root/examples/bin/work/20news-bydate/bayes-train-input"
>
> I digged into the code, it seems that trainclassifier only accepts HDFS or
> HBASE, is there a way to read file directly from a directory?
>
>
>
I did my experiment in Fedora
nd generate a model. If I try this on a new set of data, will it classify
> something that is not bad? If yes, what would be its label??
>
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
On Tue, Nov 23, 2010 at 11:09 AM, Divya wrote:
> I am following same steps
> But no success...
>
>
Are you using cygiwin or GNU/Linux
--
******
JAGANADH G
http://jaganadhg.freeflux.net/blog
h I am looking for Mahout's API for classification.
>
A sample program is given below
http://bitbucket.org/jaganadhg/blog/src/995fa52d4fbc/bck9/java/src/org/bc/kl/ClassifierDemo.java
For working it in real-time system you have to some more work . Find it :-)
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
he output.
>
> How can we make use of this output?
>
Are you looking for Mahout API usgae for classification ?
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
On Tue, Nov 16, 2010 at 9:05 AM, Lance Norskog wrote:
> Great!
>
> (The link for the code is 404 at bitbucket.)
>
>
Try this link
http://bitbucket.org/jaganadhg/blog/src/tip/bck9/java/src/org/bc/kl/
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
On Fri, Nov 12, 2010 at 3:36 PM, JAGANADH G wrote:
>
>
> On Fri, Nov 12, 2010 at 3:15 PM, tog wrote:
>
>> Sorry if this is a naïve question but where exactly the talk is taking
>> place ?
>>
>
> Oh I forgot to include the event details
> It is hap
tAnalyzer
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
o present something in Java :-))
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
Dear All
I will be giving a talk on Machine Learning in the BarCap Kerala9 . I have
included Mahout in the talk too.
I will give demo of recommendation and Classification with Mahout.
Some Pythonic and non-mahout stuff is also there in my talk
--
**
JAGANADH G
r after performing
preprocessing (As like prepare20newsgroup does) . Then only I can attain
reasonable result out the trained model.
--
******
JAGANADH G
http://jaganadhg.freeflux.net/blog
sing a Python basesd NaiveBayes
clsiiifer prepared by me. But due to many reasons it was too slow :-) .
Mahout makes life brighter
--
******
JAGANADH G
http://jaganadhg.freeflux.net/blog
On Thu, Oct 21, 2010 at 4:20 AM, Ted Dunning wrote:
> If this is testing on held-out data, then this is a pretty respectable
> result for an untuned system.
>
This result is obtained from the training set .
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
ns accuracy of ngram?)
--
******
JAGANADH G
http://jaganadhg.freeflux.net/blog
result . That is what he told. Once again I
will recheck and compare the results .
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
;> > 995 5 | 1000 a = pos
>> > 0 1000 | 1000 b = neg
>> > Default Category: unknown: 2
>> >
>> >
>> > With some pruning, you will have a decent enough classifier for
>> sentiments
Wow this is an amazing result :-)
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
On Mon, Oct 18, 2010 at 9:11 PM, JAGANADH G wrote:
>
>
> On Mon, Oct 18, 2010 at 9:03 PM, Robin Anil wrote:
>
>> bin/mahout prepare20newsgroups -p
>> /Users/robinanil/Downloads/movie_reviews/
>> -o movie -c UTF-8 -a org.apache.mahout.vectorizer.DefaultAnalyzer
&g
ry to
classify the document with my java code .
I will update one it is done I will update u regarding the status
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
uot;\t" text like .
After that the pos.txt and neg.txt placed in to a dir called "training"
This directory was given to the trainer as input
--
******
JAGANADH G
http://jaganadhg.freeflux.net/blog
/movie_reviews.zipdata.
When i run the prediction module it says that all the reviews are positive .
Any thoughts !!!
I think I am posting the same question 3rd time here :-)
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
views.zip
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
rect format to be used with the job.
>
Thanks to all for the help.
--
******
JAGANADH G
http://jaganadhg.freeflux.net/blog
Dear All
Can somebody tell how to prepare the movie lenses data for the Mahout
recommendation job
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
On Tue, Oct 12, 2010 at 4:45 PM, Mat Kelcey wrote:
> On 12 October 2010 21:57, JAGANADH G wrote:
> > Dear all
> > Can somebody tell how to train a cbayes classifier in Mhout.
> > Is it required to have the input in a hadoop file system ?
>
> https://cwiki.apache.or
Dear all
Can somebody tell how to train a cbayes classifier in Mhout.
Is it required to have the input in a hadoop file system ?
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
ted is available in the 'good' category only.
All the bad document it is telling that 'bad'. But all the 'good' it is
telling that 'bad' :-(
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
outputDir, params);
System.out.println("Finished Training");
// Training the system
}
When I tried the model against the development 1300 'Good' it is ok. But the
'Bad' 1300 document also the model classfied as 'Good document'. What may be
the reason
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
On Tue, Sep 28, 2010 at 8:23 PM, Grant Ingersoll wrote:
>
> On Sep 28, 2010, at 7:25 AM, JAGANADH G wrote:
>
> > On Tue, Sep 28, 2010 at 4:35 PM, Grant Ingersoll >wrote:
> >
> > I tried the classifier with two class documents - "good" and "bad".
re info from my side.
>
I tried the classifier with two class documents - "good" and "bad". But the
system identified all Good documents as well as bad documents as "Good
Documents"
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
- one pass only or multiple passes
>
> I got the same idea from some research papers.
Some where I saw that LSI will be also useful for the same. But I dont know
the details
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
rant
I am trying to make a strict match.
I will try Solar and Nutch.
Thanks and Regards
--
******
JAGANADH G
http://jaganadhg.freeflux.net/blog
Dear All
Is there any way or algo available to compare tow documents.
Eg. Check if doc "A" is a copy (palagirised version) of document "B".
With regards
--
******
JAGANADH G
http://jaganadhg.freeflux.net/blog
8 I should put up a
> new
> patch some time today.
>
> Dear Ted Dunning
Thanks for the info .
I tried implementing a classifier based on Mahout BayesClassifier .
Now I have to do incremental training.
I will wait till the gsd relases to make the experiment
--
***
Dear All
Is it possible to train the Mahout classifier in an incremental way.
For exampel
Train 100 Document in day 1
Add 10 more documents in day 2
Also is it possible to store the training data to some database ?
--
**
JAGANADH G
http://jaganadhg.freeflux.net
added this code fragment
BayesParameters params = new BayesParameters(2);
params.set("alpha_i", "1.0");
params.set("dataSource", "hdfs");
I got this idea from the test code of TrainClassifier.
Thanks to all
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
am a novice in Java I am not able to follow all the things. I will try.
Up to that I will play with my Python classifier :-)
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
an you give some clue or tutorial or documentation regarding the same.
> TrainClassifier main function and set everything it does by default
>
>
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
Dear All
I was trying the Mahout classifier .
The program which I used for classifcation is given here
http://pastebin.com/6rALqCbN
The error message is http://pastebin.com/NwL93Kjx
Can anybody help me to to solve this
--
**
JAGANADH G
http
java class
--
**
JAGANADH G
http://jaganadhg.freeflux.net/blog
60 matches
Mail list logo