+1
On Mon, Apr 12, 2010 at 4:50 AM, Ted Dunning ted.dunn...@gmail.com wrote:
+1 (on trust, really)
On Sun, Apr 11, 2010 at 6:49 PM, Benson Margulies
bimargul...@gmail.com wrote:
https://repository.apache.org/content/repositories/orgapachemahout-015/
contains (this time for sure) all the
+1
On Thu, Apr 8, 2010 at 2:57 AM, Drew Farris drew.far...@gmail.com wrote:
+1
On Tue, Apr 6, 2010 at 9:08 PM, Benson Margulies bimargul...@gmail.com
wrote:
In order to decouple the mahout-collections library from the rest of
Mahout, to allow more frequent releases and other good things,
:
• Deneche Abdelhakim adene...@...
• Isabel Drost (isa...@...)
• Ted Dunning (tdunn...@...)
• Jeff Eastman (jeast...@...)
• Drew Farris (d...@...)
• Grant Ingersoll (gsing...@...)
• Benson Margulies (bimargul...@...)
• Sean Owen (sro
close, actually:
عبد الحكيم
=D
On Thu, Mar 18, 2010 at 6:41 PM, Benson Margulies bimargul...@gmail.com wrote:
Or perhaps:
عبدل حكيم
?
On Thu, Mar 18, 2010 at 1:34 PM, deneche abdelhakim adene...@gmail.com
wrote:
should be Abdelhakim Deneche ... cause my first name is 'Abdelhakim
just to get it right: not being in the PMC doesn't mean I'm no longer a
committer, right?
On Mon, Mar 15, 2010 at 6:08 PM, Jake Mannix jake.man...@gmail.com wrote:
+1 and I'm in (my email @apache is just jmannix btw, for some reason it's not
listed on those resolutions)
On Mar 15, 2010 9:07 AM,
oops, will attach it as soon as possible. I really wonder why submitting a
patch and attaching a patch are two different operations in JIRA?
On Sat, Mar 6, 2010 at 10:08 PM, Robin Anil (JIRA) j...@apache.org wrote:
[
yes, I'm planning to make DF look more like a Mahout classifier. I
will take a look at bayes.
On Sun, Mar 7, 2010 at 7:09 PM, Robin Anil (JIRA) j...@apache.org wrote:
[
Welcome Drew
=D
On Fri, Feb 19, 2010 at 5:02 AM, Grant Ingersoll gsing...@apache.org wrote:
On Feb 18, 2010, at 8:32 PM, Drew Farris wrote:
There's lots more stuff I'd like to get in there,
now I only need to figure how to squeeze 48 hours of consciousness
into a day.
I believe there is
One important question in my mind here is how this affects 0.20-based
jobs and pre-0.20-based jobs. I had written pfpgrowth in the pure 0.20 API, and
deneche is also maintaining two versions, it seems. I will check the
AbstractJob and see
although I maintain two versions of Decision Forests,
The only example that actually uses watchmaker-swing is Travelling
Salesman, mainly because it was a direct port of an existing
watchmaker example. And if I remember well, it does not actually use
JFreeChart...so I think it's safe to exclude it.
On Sat, Jan 30, 2010 at 5:19 AM, Drew Farris
Yeah, it's probably due to the way I used to generate random data...the
problem is that I never get this error =P so it's very difficult to
fix...I'll try my best as soon as I have some time. In the meantime,
rerunning 'mvn clean install' generally does the trick.
On Sat, Jan 16, 2010 at
I'm getting similar slowdowns with my VirtualBox Ubuntu 9.04
I'm suspecting that the problem is not -only- caused by RandomUtils because:
1. I'm familiar with MersenneTwisterRNG slowdowns (I use it a lot) but
the test time used to be reported accurately by maven. Now maven
reports that a test
(Sat, 09 Jan 2010) | 1 line
Code style adjustments; enabled/fixed TestSamplingIterator
On Sun, Jan 17, 2010 at 5:47 AM, deneche abdelhakim adene...@gmail.com wrote:
I'm getting similar slowdowns with my VirtualBox Ubuntu 9.04
I'm suspecting that the problem is not -only- caused by RandomUtils
Welcome =D
On Wed, Jan 13, 2010 at 10:36 PM, Drew Farris drew.far...@gmail.com wrote:
Congratulations Benson. It is wonderful to see your great work in the
mahout-math (and the future mahout-collections?) come together quickly.
On Wed, Jan 13, 2010 at 3:28 PM, Grant Ingersoll
the build is successful, thanks =D
On Fri, Jan 8, 2010 at 9:23 AM, Robin Anil robin.a...@gmail.com wrote:
Try Now
yep :p
On Sun, Jan 3, 2010 at 4:41 PM, Sean Owen (JIRA) j...@apache.org wrote:
[
https://issues.apache.org/jira/browse/MAHOUT-71?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved MAHOUT-71.
-
Resolution: Later
last time I tried, running Hadoop 0.20 on Windows was impossible for
me...should we still try to support Windows ? I found that installing
Ubuntu on Windows using Virtual Box is the easiest way to use Hadoop
inside Windows
On Mon, Dec 28, 2009 at 8:47 PM, Benson Margulies bimargul...@gmail.com
I'm not planning to make new changes to 'mapred'; my new code should go
to 'mapreduce'
On Thu, Dec 3, 2009 at 3:34 PM, Isabel Drost isa...@apache.org wrote:
On Thu Sean Owen sro...@gmail.com wrote:
I suggest our current stance be that we use 0.20.x, with the old APIs.
When 0.21 comes out and
df/mapred works with the old hadoop API
df/mapreduce works with hadoop 0.20 API
On Saturday, November 28, 2009, Sean Owen sro...@gmail.com wrote:
I'm all for generating and publishing this.
The CPD results highlight a question I had: what's up with the amount
of duplication between
please use Decision Forests instead of Random Forests
On Thu, Nov 12, 2009 at 9:01 AM, Robin Anil robin.a...@gmail.com wrote:
Please edit/add stuff.
Robin
==
Apache Mahout 0.2 has been released and is now available for public
download. Apache Mahout
Sure.
On Fri, Oct 2, 2009 at 8:59 AM, Isabel Drost (JIRA) j...@apache.org wrote:
[
https://issues.apache.org/jira/browse/MAHOUT-184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761501#action_12761501
]
Isabel Drost commented on MAHOUT-184:
I'm trying to commit [MAHOUT-122 |
https://issues.apache.org/jira/browse/MAHOUT-122], but I'm getting the
following error:
svn: Commit failed (details follow):
svn: Server sent unexpected return value (403 Forbidden) in response
to MKACTIVITY request for
On Sep 27, 2009, at 6:47 AM, Simon Willnauer wrote:
Are you committing into an http or https path? You must check out via
https in order to commit. This has been an issue for many new
committers.
Simon
On Sun, Sep 27, 2009 at 8:49 AM, deneche abdelhakim adene...@apache.org
wrote:
I'm trying
Yes, it's meant to be run twice: once selecting the training samples
and the next time the testing samples. It assumes that the RNG will return
the exact same numbers twice.
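A minimal sketch of the two-pass idea described above, not the actual DatasetSplit code: it uses java.util.Random in place of MersenneTwisterRNG, and the class name, method names and the 0.7 threshold are hypothetical. The point is only that both passes must be driven by the same explicit seed; an RNG obtained without that seed would break the assumption.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

/** Sketch: a two-pass train/test split that relies on replaying the same seed. */
final class SeededSplitSketch {

    /** First pass: keeps the rows whose random draw falls below the threshold. */
    static List<Integer> selectTraining(int numRows, double threshold, long seed) {
        Random rng = new Random(seed); // same seed, same sequence of draws
        List<Integer> selected = new ArrayList<Integer>();
        for (int row = 0; row < numRows; row++) {
            if (rng.nextDouble() < threshold) {
                selected.add(row);
            }
        }
        return selected;
    }

    /** Second pass: replays the same seed and keeps the rows the first pass skipped. */
    static List<Integer> selectTesting(int numRows, double threshold, long seed) {
        Random rng = new Random(seed);
        List<Integer> selected = new ArrayList<Integer>();
        for (int row = 0; row < numRows; row++) {
            if (rng.nextDouble() >= threshold) {
                selected.add(row);
            }
        }
        return selected;
    }

    public static void main(String[] args) {
        long seed = 42L; // hypothetical seed; in the thread it comes from the split
        List<Integer> train = selectTraining(20, 0.7, seed);
        List<Integer> test = selectTesting(20, 0.7, seed);
        System.out.println("training rows: " + train);
        System.out.println("testing rows:  " + test);
        System.out.println("every row used once: " + (train.size() + test.size() == 20));
    }
}
```

If the second pass were given a different seed, the two lists could overlap or leave rows out, which is the bug the message warns about.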
On Mon, Sep 21, 2009 at 1:54 PM, Sean Owen sro...@gmail.com wrote:
I rolled it back. So the reader depends on the seed and
The change in
examples/src/main/java/org/apache/mahout/ga/watchmaker/cd/hadoop/DatasetSplit.java
could lead to a bug. The problem is in the following modification:
- rng = new MersenneTwisterRNG(split.getSeed());
+ rng = RandomUtils.getRandom();
rng is supposed to use the seed given
proper permissions to write on the forrest install. I believe Forrest
downloads stuff to its directories. I recall seeing similar things.
Very annoying.
On Sep 15, 2009, at 7:12 AM, deneche abdelhakim wrote:
I'm already using Java 1.5 !
--- On Tue 15.9.09, Grant Ingersoll
Ingersoll gsing...@apache.org
Subject: Re: Updating the Web site
To: mahout-dev@lucene.apache.org
Date: Wednesday 16 September 2009, 15:35
What's the full log say?
On Sep 16, 2009, at 7:15 AM, deneche abdelhakim wrote:
forrest is installed in my home directory :(
--- On Tue 15.9.09
, Grant Ingersoll wrote:
Now when I did a forrest clean I get the same error.
On Sep 16, 2009, at 9:44 AM, deneche abdelhakim
wrote:
'forrest site' gives me:
**
Apache Forrest. Run 'forrest -projecthelp'
to list
that the
Lucene PMC has voted to add Deneche Abdelhakim, Robin Anil
and David Hall as Mahout committers. Deneche, Robin
and David have all made significant contributions to Mahout
in regards to classification, clustering, evolutionary
programming and general usage and utilities.
Furthermore
I followed the instructions available here:
http://cwiki.apache.org/MAHOUT/howtoupdatethewebsite.html
in order to add my name to the committer list =P
but when running 'forrest run' I'm getting broken links:
X [0] skin/images/current.gif
BROKEN:
, Isabel Drost isa...@apache.org wrote:
From: Isabel Drost isa...@apache.org
Subject: Re: Re : Welcome the newest Mahouts!
To: mahout-dev@lucene.apache.org
Date: Tuesday 15 September 2009, 12:29
On Tue, 15 Sep 2009 10:11:56 +
(GMT)
deneche abdelhakim a_dene...@yahoo.fr
wrote:
Got my Apache
to 1.5 for it and it should
work.
On Sep 15, 2009, at 6:24 AM, deneche abdelhakim wrote:
I followed the instructions available here:
http://cwiki.apache.org/MAHOUT/howtoupdatethewebsite.html
in order to add my name to the committer list =P
when running 'forrest run' but I'm
Thanks!
--- On Tue 15.9.09, Isabel Drost isa...@apache.org wrote:
From: Isabel Drost isa...@apache.org
Subject: Re: JIRA permission ?
To: mahout-dev@lucene.apache.org
Date: Tuesday 15 September 2009, 17:23
On Tue, 15 Sep 2009 14:52:28 +
(GMT)
deneche abdelhakim a_dene
Thanks Robin =D
--- On Mon 14.9.09, Robin Anil robin.a...@gmail.com wrote:
From: Robin Anil robin.a...@gmail.com
Subject: Comprehensive study on Java Memory Optimization
To: mahout-dev mahout-dev@lucene.apache.org
Date: Monday 14 September 2009, 9:08
Hope it would be useful.
Link:
done.
--- On Tue 8.9.09, Grant Ingersoll gsing...@apache.org wrote:
From: Grant Ingersoll gsing...@apache.org
Subject: [GSOC] Code Submissions
To: Mahout Dev List mahout-dev@lucene.apache.org
Date: Tuesday 8 September 2009, 13:09
Hi Robin, David and Deneche,
You will need to submit
just got the same error, nuking .m2 AND installing maven 2.2.1 solved the
problem
--- On Tue 25.8.09, Ted Dunning ted.dunn...@gmail.com wrote:
From: Ted Dunning ted.dunn...@gmail.com
Subject: Re: build failure
To: mahout-dev@lucene.apache.org, isa...@apache.org
Date: Tuesday 25 August
I recently moved some of the Decision Forest examples from the core project
to the examples project. In core they worked perfectly on Hadoop 0.19.1
(pseudo-distributed), but now they don't!!!
For example, running my org.apache.mahout.df.BuildForest gives the following
exception:
I'm getting it too when building from the base directory
- Original message
From: Robin Anil robin.a...@gmail.com
To: mahout-dev mahout-dev@lucene.apache.org
Sent: Wednesday, 22 July 2009, 19:15:38
Subject: Error building Mahout
I am getting this error on building mahout.
maven 2.1.0
Deleting the local repository solves the problem; I just hope I won't have to do
it often
- Original message
From: Grant Ingersoll gsing...@apache.org
To: mahout-dev@lucene.apache.org
Sent: Wednesday, 22 July 2009, 19:42:04
Subject: Re: Error building Mahout
Actually, I'm not using any reducer at all; the output of the mappers is
collected and handled by the main program after the end of the job.
Running the job with 10 map tasks on a 10-instance (c1.medium) cluster takes
0h 11m 39s 209; speculative execution is on, so 12 map tasks have been
:
From: Grant Ingersoll gsing...@apache.org
Subject: Re: problems downloading lucene-analyzers
To: mahout-dev@lucene.apache.org
Date: Tuesday 30 June 2009, 15:20
FWIW, it works for me.
On Jun 30, 2009, at 6:54 AM, deneche abdelhakim wrote:
I'm having problems with lucene-analyzers
(2.9
I'm having problems with the lucene-analyzers (2.9-SNAPSHOT) dependency. Because
it's a snapshot, mvn install downloads a new version every day, and most of the
time I get checksum failures!!! Is anybody else having the same problem?
mvn -version :
Maven version: 2.0.9
Java version: 1.6.0_0
OS
Hi,
=D
I've been accepted. And I'll be working on Random Forests
=P
Given it's my second participation, I have one piece of advice: don't be shy to ask
about anything related to your project on this list (starting from now); it's
the fastest way to learn about Mahout.
Who else has been accepted ?
...@cs.stanford.edu wrote:
From: David Hall d...@cs.stanford.edu
Subject: Re: [GSOC] Accepted Students
To: mahout-dev@lucene.apache.org
Date: Tuesday 21 April 2009, 8:30
On Mon, Apr 20, 2009 at 11:18 PM,
deneche abdelhakim a_dene...@yahoo.fr
wrote:
Hi,
=D
I've been accepted. And I'll
Here is a draft of my proposal
**
Title/Summary: [Apache Mahout] Implement parallel Random/Regression Forests
Student: AbdelHakim Deneche
Student e-mail: ...
Student Major: PhD in Computer Science
Student Degree: Master in Computer Science
, then your approach (3) will work no matter
the data size.
If scaling shows an evil memory size effect, then your
approach (2) would be
required for large data sets.
On Sat, Mar 28, 2009 at 8:14 AM, deneche abdelhakim a_dene...@yahoo.fr wrote:
My question is : when Mahout.RF will be used
you should read in . 2a
. This implementation is, relatively, easy given...
--- On Sat 28.3.09, deneche abdelhakim a_dene...@yahoo.fr wrote:
From: deneche abdelhakim a_dene...@yahoo.fr
Subject: Re: [gsoc] random forests
To: mahout-dev@lucene.apache.org
Date: Saturday 28 March 2009
Talking about Random Forests, I think there are two possible ways to actually
implement them:
The first implementation is useful when the dataset is not that big (<= 2 GB
perhaps) and thus can be distributed via Hadoop's DistributedCache. In this
case each mapper has access to the whole dataset
for a long time is whether the choice of variables to use for splits is
chosen once per tree or again at each split.
I think that the latter interpretation is actually the correct one. You
should check my thought.
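The two readings discussed above can be contrasted in a small sketch. This is only an illustration, not Mahout code: the class and method names are hypothetical, and a "split" is reduced to its list of candidate attribute indices.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

/** Sketch: two readings of "random attribute selection" in a random forest. */
final class SplitAttributeSamplingSketch {

    /** Draws m distinct attribute indices out of numAttributes, without replacement. */
    static List<Integer> sampleAttributes(int numAttributes, int m, Random rng) {
        List<Integer> all = new ArrayList<Integer>();
        for (int i = 0; i < numAttributes; i++) {
            all.add(i);
        }
        Collections.shuffle(all, rng);
        return new ArrayList<Integer>(all.subList(0, m));
    }

    /** "Once per tree": a single draw is reused as the candidate set of every split. */
    static List<List<Integer>> oncePerTree(int numSplits, int numAttributes, int m, Random rng) {
        List<Integer> fixed = sampleAttributes(numAttributes, m, rng);
        List<List<Integer>> candidates = new ArrayList<List<Integer>>();
        for (int split = 0; split < numSplits; split++) {
            candidates.add(fixed);
        }
        return candidates;
    }

    /** "Again at each split": every split gets its own fresh draw. */
    static List<List<Integer>> atEachSplit(int numSplits, int numAttributes, int m, Random rng) {
        List<List<Integer>> candidates = new ArrayList<List<Integer>>();
        for (int split = 0; split < numSplits; split++) {
            candidates.add(sampleAttributes(numAttributes, m, rng));
        }
        return candidates;
    }

    public static void main(String[] args) {
        System.out.println("once per tree: " + oncePerTree(3, 10, 3, new Random(1L)));
        System.out.println("at each split: " + atEachSplit(3, 10, 3, new Random(1L)));
    }
}
```

Under the second reading a tree can, over its whole depth, consider many more attributes than m, which is the practical difference between the two interpretations.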
On Sun, Mar 15, 2009 at 1:53 AM, deneche abdelhakim a_dene...@yahoo.fr wrote
I added a page to the wiki that describes how to build a random forest and how
to use it to classify new cases.
http://cwiki.apache.org/confluence/display/MAHOUT/Random+Forests
The following classes use the Deque interface, which is not available in Java
1.5:
. org.apache.mahout.classifier.bayes.BayesClassifier
. org.apache.mahout.classifier.cbayes.CBayesClassifier
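For context, java.util.Deque was added in Java 6, while java.util.LinkedList has offered addFirst/addLast/removeFirst/removeLast since long before 1.5. A minimal sketch of a Java 1.5-compatible replacement follows; how the two classifiers actually use Deque is not shown in the message, so the bounded-window use below and all names in it are assumptions.

```java
import java.util.LinkedList;

/** Sketch: double-ended queue operations without the Java 6 Deque interface. */
final class Java5DequeSketch {

    /** Keeps only the last `capacity` values, using LinkedList methods that exist in Java 1.5. */
    static LinkedList<String> keepLast(String[] values, int capacity) {
        // Declared as LinkedList rather than Deque so the code compiles on 1.5.
        LinkedList<String> window = new LinkedList<String>();
        for (String value : values) {
            window.addLast(value);
            if (window.size() > capacity) {
                window.removeFirst(); // drop the oldest entry
            }
        }
        return window;
    }

    public static void main(String[] args) {
        String[] labels = {"a", "b", "c", "d"};
        System.out.println(keepLast(labels, 2));
    }
}
```

The trade-off is that callers are tied to a concrete class instead of an interface, which is usually acceptable for a private field or local variable.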
--- On Mon 9.3.09, Sean Owen sro...@gmail.com wrote:
From: Sean Owen sro...@gmail.com
I'm seriously considering Random Forests (RF) as my GSoC project. They seem
interesting and, judging by how often they have been suggested, they are very
useful to Mahout. I found the following discussion:
http://markmail.org/message/dancn3n76ken6thb
that gives much useful information about
Hi,
I'm planning to participate, again, in GSoC and I want to do it, again, with
Mahout.
This year, let's make Mahout run over Amazon EC2. This means building the proper
AMIs, running some Mahout projects (the GA examples) over EC2, giving feedback and
writing simple, clear How-Tos about running a
focus to me should be on demoing/documenting
Mahout's capabilities, versus showing how to run Mahout
on any particular platform.
On Feb 26, 2009, at 9:58 AM, deneche abdelhakim wrote:
Hi,
I'm planning to participate, again, in GSoC and I want
to do it, again, with Mahout
About MAHOUT-102 (https://issues.apache.org/jira/browse/MAHOUT-102), the patch
is already available; could someone just commit it?
Also, I'm not able to make my patches delete files (or directories) when
applied. Is it because I'm not a committer or because I'm using TortoiseSVN?
--- On
: @Override annotations
To: mahout-dev@lucene.apache.org
Date: Thursday 22 January 2009, 10:05
I think mahout should compile with both 1.5 and 1.6.
On Wed, Jan 21, 2009 at 11:23 PM, deneche abdelhakim
a_dene...@yahoo.fr wrote:
Last time I tried to compile the Mahout trunk, I got a
similar problem
Last time I tried to compile the Mahout trunk, I got a similar problem. In my
case, I'm using Eclipse and the errors were caused by the JDK Compliance Level
(in the project properties). In short, I was using a 1.6 JRE but with the 5.0
compliance level (forgot to change it!).
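One well-known way a 5.0 compliance level produces errors on otherwise valid code is @Override on a method that implements an interface method: this is accepted at source level 6 and later but rejected at level 5. Whether that was the exact error in this thread is an assumption based on the subject line; the names below are hypothetical.

```java
/** Sketch: where @Override is accepted depends on the compiler compliance level. */
final class OverrideComplianceSketch {

    interface Scorer {
        double score(double x);
    }

    static final class Doubler implements Scorer {
        // Implements an interface method: accepted at source level 6+,
        // reported as an error at source level 5.
        @Override
        public double score(double x) {
            return 2 * x;
        }

        // Overrides a method inherited from a class (Object): accepted at both levels.
        @Override
        public String toString() {
            return "Doubler";
        }
    }

    public static void main(String[] args) {
        Scorer scorer = new Doubler();
        System.out.println(scorer + " -> " + scorer.score(2.5));
    }
}
```

Code meant to compile at both levels would have to leave the annotation off interface implementations.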
I found the answer
5. BruteForceTravellingSalesman says copyright Daniel Dwyer -- can
this be replaced by the standard copyright header?
Oops, I thought I changed them all! Yes, you can replace it.
--- On Sun 19.10.08, Grant Ingersoll [EMAIL PROTECTED] wrote:
From: Grant Ingersoll [EMAIL PROTECTED]
Subject: Re: More proposed changes across code
To: mahout-dev@lucene.apache.org
Date: Sunday 19 October 2008, 18:30
On Oct 19, 2008, at 11:16 AM, Sean Owen wrote:
On Sun,
Swing client code
from the core logic.
On 9/20/08, deneche abdelhakim
[EMAIL PROTECTED] wrote:
Sounds cool :)
I'll do the TSP part, but it may take some
time because I'm a bit busy
(PhD's administrative stuff).
There are many available large TSP benchmarks,
and it seems
path, yes. If it
is easier to adapt
to Maven's location, OK.
On 9/22/08, deneche abdelhakim
[EMAIL PROTECTED] wrote:
Dumb question: why does example code
depend on test code?
Can this be solved by severing that
dependency?
It's not from the example code but from
the example's
Sounds cool :)
I'll do the TSP part, but it may take some time because I'm a bit busy (PhD's
administrative stuff).
There are many available large TSP benchmarks, and it seems that there is a
common file format for them TSPLIB
I came across the following competition
http://www.netflixprize.com/index
It's about recommender systems, so I think it's something for Taste. The training
dataset consists of more than 100M ratings.
- Original message
From: Josh Myer [EMAIL PROTECTED]
To: mahout-dev@lucene.apache.org
Go on, I will do my part, I just hope GA likes Java 6 :P
- Original message
From: Sean Owen [EMAIL PROTECTED]
To: mahout-dev@lucene.apache.org
Sent: Saturday, 30 August 2008, 21:26:45
Subject: Re: Going to move us to Hadoop 0.18.0, Java 6
So I should hold off on committing changes
You should run the job target in the examples directory (ant job); it will
generate a file (in examples/build) called
apache-mahout-examples-0.1-dev.job. This is the jar (even though it ends with
.job) that contains both the examples and the core.
- Original message
From: Robin Anil [EMAIL
Now that the Class Discovery (CD) example is up and running, it's time to think
about what to do next. I already have some ideas, but I want to check with the
community first.
I see two possible ways ahead of me:
A. Enhance the (CD) example
a1. handle categorical attributes
a2. generate
256m to get the tests to run without heap problems.
Jeff
deneche abdelhakim wrote:
I've been using Eclipse for all my testing and everything works fine.
But now I want to build and test the examples using ant. I managed to modify
the build.xml to generate the examples job. But when I run one
I just did a fresh checkout and all the tests are successful!!!
--- On Sat 21.6.08, Allen Day [EMAIL PROTECTED] wrote:
From: Allen Day [EMAIL PROTECTED]
Subject: getting started with mahout, failing tests
To: mahout-dev@lucene.apache.org
Date: Saturday 21 June 2008, 8:00
Hi,
I
can go do
some background
reading.
I will try to get to MAHOUT-56 this week, but others can
jump in and
review as well.
-Grant
On May 27, 2008, at 4:52 AM, deneche abdelhakim wrote:
In a GA there are many things that can be distributed,
and one
should always start
I checked out the latest version of Mahout (rev. 662372) and got the following
exception with many tests (the list of these tests is at the end of this post):
java.io.IOException: Job failed!
the following message is printed in System.err :
java.lang.OutOfMemoryError: Java heap space
I think it's
might want to coordinate with Deneche
Abdelhakim who is working in
GA for GSoC - as I understand, Gene Expression
Programming is related to GA?
Isabel
--
#if _FP_W_TYPE_SIZE < 32
#error Here's a nickel kid. Go buy yourself a real computer.
#endif
-- linux/arch
Ted Dunning [EMAIL PROTECTED] wrote:
Conceptually, at least, it would be good to have the option for fitness
functions to be expressed as map-reduce programs. Unfortunately, having
mappers spawn MR programs runs the real risk of dead-lock, especially on
less than grandiose clusters.
To
UCI : http://archive.ics.uci.edu/ml/
--- On Wed 21.5.08, Jeff Eastman [EMAIL PROTECTED] wrote:
From: Jeff Eastman [EMAIL PROTECTED]
Subject: Re: Thoughts on timeline for first release?
To: mahout-dev@lucene.apache.org
Date: Wednesday 21 May 2008, 17:10
Does anybody have some links
As part of my GSoC project I started adapting one of the Watchmaker examples (TSP)
for use with Mahout. I believe that the next step is to start a Jira issue and
post an svn patch, isn't it?
I also did a fresh checkout of Mahout and ran ant test in the core
directory and got a wonderful Tests
Hi Robin,
I am very happy that I've been accepted, thanks to the Mahout Community that
kindly commented on my draft.
So we are four students, that's cool. I wish us good work and great fun this
summer.
Hakim
Robin Anil [EMAIL PROTECTED] wrote: Hi Everyone,
This is
its own set of
operators.
Abdel Hakim
Ted Dunning wrote :
I think it is a very bad idea to tie the algorithm to the number of
processors being used in this way. A program should produce identical
results on any machine, subject only to PRNG seeding issues.
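One common way to get results that do not depend on the number of workers is to tie each random stream to the item being processed rather than to the worker processing it. The sketch below only illustrates that idea and is not taken from the thread: the names are hypothetical, workers are simulated with a plain loop, and baseSeed + index is a deliberately simple seeding scheme.

```java
import java.util.Arrays;
import java.util.Random;

/** Sketch: random values that do not depend on how work is divided among workers. */
final class WorkerIndependentRandomSketch {

    /** Produces one random value per individual; numWorkers only changes who computes what. */
    static double[] mutateAll(int populationSize, int numWorkers, long baseSeed) {
        double[] result = new double[populationSize];
        for (int worker = 0; worker < numWorkers; worker++) {
            // Worker w handles individuals w, w + numWorkers, w + 2 * numWorkers, ...
            for (int i = worker; i < populationSize; i += numWorkers) {
                Random rng = new Random(baseSeed + i); // seed tied to the individual, not the worker
                result[i] = rng.nextGaussian();
            }
        }
        return result;
    }

    public static void main(String[] args) {
        double[] oneWorker = mutateAll(10, 1, 99L);
        double[] fourWorkers = mutateAll(10, 4, 99L);
        System.out.println("identical: " + Arrays.equals(oneWorker, fourWorkers));
    }
}
```

With this arrangement only the base seed affects the outcome, which matches the "subject only to PRNG seeding issues" condition.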
On 4/11/08 8:52 PM, deneche
I don't know the exact term, but maybe I should have said "computing process",
so each processor (or computing node) can run many computing processes...
Ted Dunning [EMAIL PROTECTED] wrote:
How is computing node not a processor?
On 4/12/08 9:26 PM, deneche abdelhakim wrote:
The number
Hi Grant,
You wrote the following comment on my GSoC proposal:
Could someone w/ a little more GA knowledge comment on the use of
WatchMaker? What I wonder is if it is possible to distribute some of the
watchmaker functionality?
Do you want to know if there are other ways to
I've written my proposal and, because I can no longer change it after I submit
it to GSoC, I'm posting it here first.
If someone has suggestions, you are welcome.
I will wait until Saturday morning to post it to the GSoC
Hi
I'm a PhD student in AI and adaptive systems. I have been working on
evolutionary algorithms for the last 4 years. I implemented my own Artificial
Immune System with Matlab and as a Java extension to Yale; I also worked with a
C++ framework for multi-objective optimization.
My project is to