spark

2015-08-17 Thread Publius
Hello
what is the difference between Hadoop and Spark?
How is Spark better? 


Re: Any working VM of Apache Hadoop ?

2015-01-19 Thread Publius
Cloudera works on VM 





KMG 365

Putin for President :O

The most wretched device devised by man ..is a Computer :)

Fight back SPAM: http://www.spamcop.net
  From: Artem Ervits artemerv...@gmail.com
 To: user@hadoop.apache.org 
 Sent: Monday, January 19, 2015 5:37 AM
 Subject: Re: Any working VM of Apache Hadoop ?
   
+1 for BigtopArtem ErvitsOn Jan 18, 2015 4:10 PM, Jay Vyas 
jayunit100.apa...@gmail.com wrote:

Also BigTop has a very flexible vagrant infrastructure:
https://github.com/apache/bigtop/tree/master/bigtop-deploy/vm/vagrant-puppet
On Jan 18, 2015, at 3:37 PM, Andre Kelpe ake...@concurrentinc.com wrote:


Try our vagrant setup: 
https://github.com/Cascading/vagrant-cascading-hadoop-cluster

- André

On Sat, Jan 17, 2015 at 10:07 PM, Krish Donald gotomyp...@gmail.com wrote:

Hi,
I am looking for working VM of Apache Hadoop.Not looking for cloudera or Horton 
works VMs.If anybody has it and if they can share that would be great .
ThanksKrish



-- 
André Kelpe
an...@concurrentinc.com
http://concurrentinc.com



  

future of labor with machine learning

2015-01-04 Thread Publius
 Hello
is there any hope for human jobs wit the future of computerized bot and machine 
learning taking over huge swaths of the labor pool?
will Hadoop jobs come and go in the blink of an eye?




Re: Self Study Project in Hadoop -----

2015-01-03 Thread Publius
What is POC, please? 





KMG 365

Putin for President :O

The most wretched device devised by man ..is a Computer :)

Fight back SPAM: http://www.spamcop.net
  From: Abhishek Singh 23singhabhis...@gmail.com
 To: user@hadoop.apache.org 
 Sent: Saturday, January 3, 2015 12:06 PM
 Subject: Re: Self Study Project in Hadoop -
   
Hi Krish,

You could use Hortonworks tutorials available online 
http://hortonworks.com/tutorials/ and perform POC on your own. Also, one of the 
widely practiced project by beginner is Twitter Sentiment Analysis .I would 
suggest you to perform POC for any general issues which has enough 
stackoverflow contents and later choose your own problem domain(Healthcare, 
Demographics, Geography, Music, web logs, etc)

This is how I am trying to make myself learn.
Hope it gives you a rough image.

Regards,
Abhishek Singh
On Fri, Jan 2, 2015 at 11:21 PM, Gotomy PC gotomyp...@gmail.com wrote:

Hi,
I am new to hadoop and I would like to do some small projects which I can put 
in my resume for the job interview.
I have fair understanding of Hadoop ecosystem.
Can somebody please guide me?
ThanksKrish Donald




  

Re: Jr. Hadoop positions or internships

2014-07-01 Thread Publius
you poor soul
 
there are internships out there, 
hadoop internship, san francisco - Google Search

 
 hadoop internship, san francisco - Google Search
Search Options Any time Past hour Past 24 hours Past week Past month Past year 
All results Verbatim About 162,000 results 10 Myths About Hadoop - TDWI Best 
Practices Report - SAS.com   
View on www.google.com Preview by Yahoo  

but If you know hadoop I would apply for entry level jobs; 
even making low wages like $25/hr

I am also trying to learn hadoop







 From: Adam Pritchard apritchard...@gmail.com
To: user@hadoop.apache.org 
Sent: Tuesday, July 1, 2014 11:01 AM
Subject: Jr. Hadoop positions or internships
 


Hi everyone,

Does anyone know of any Jr. Hadoop roles or big data related internships in the 
Bay Area?


I am very motivated to learn Hadoop and big data related technologies and have 
quit my full time job as a web developer and been teaching myself for the last 
5 months.


I have been studying Hadoop, Map Reduce and Hbase.


I am willing to work for free for up to six weeks.  I just need experience and 
to be in an environment where I can learn.



Thanks for any information you can provide.



Adam Pritchard

apritchard...@gmail.com
650-847-8518

Re: Fwd: Trying to build Hadoop on Windows 8

2014-06-13 Thread Publius
maybe just download hortonworks for windows

Download Hadoop

 
   Download Hadoop
Download Apache Hadoop for the enterprise with Hortonworks Data Platform. Data 
access, storage, governance, security and operations across Linux and...  
View on hortonworks.com Preview by Yahoo  
 
 






KMG 365 



 From: Néstor Boscán nesto...@gmail.com
To: user@hadoop.apache.org 
Sent: Friday, June 13, 2014 7:31 AM
Subject: Fwd: Trying to build Hadoop on Windows 8
 


Hi

I'm trying to build Hadoop on WIndows 8. I have:

Java 1.6.0_45 (JAVA_HOME set using old DOS name)
Maven 3.2 (M2_HOME and PATH set)
Protoc 2.5.0 (The 32-bit build that I found, PATH set)
Cygwin 64-bit (PATH set)
Windows SDK

When I try to run the build I get an error that it cannot build using the 
Windows tools. Lookig throught the Internet I found that with Visual Studio 
2010 there is a batch file that sets the Windows SDK enviroment variables for 
this but I don't know how to do this if I only use Windows SDK.

Regards,

Néstor

Re: Running and Logging multinode clusters on Windows with Hadoop 2.2

2014-06-04 Thread Publius


 How did you do this?  I've gotten Hadoop built from source using maven and 
working on windows

virtual box?




KMG 365 



 From: kashif.o...@instinet.com kashif.o...@instinet.com
To: user@hadoop.apache.org 
Sent: Wednesday, June 4, 2014 1:28 PM
Subject: Running and Logging multinode clusters on Windows with Hadoop 2.2
 


Hello,

I've gotten Hadoop built from source using maven and working on windows in 
single node form.  I'm currently attempting to connect with a server that also 
has the same built version of Hadoop.  I've set up my host file and slave file 
accordingly.  Attempting to run the cluster from the master node get doesn't 
give me any sort of response from my slave node, nor can I find any sort of 
sign that my master node even tried to connect.  Are there any logs I can look 
at so that I may begin working through this or is what I'm attempting to do not 
implemented at all?

-Kashif


=
 
 Disclaimer  
This message is intended solely for use by the named addressee(s). If you 
receive this transmission in error, please immediately notify the sender and 
destroy this message in its entirety, whether in electronic or hard copy 
format. Any unauthorized use (and reliance thereon), copying, disclosure, 
retention, or distribution of this transmission or the material in this 
transmission is forbidden.  We reserve the right to monitor and archive 
electronic communications. This material does not constitute an offer or 
solicitation with respect to the purchase or sale of any security. It should 
not be construed to contain any recommendation regarding any security or 
strategy. Any views expressed are those of the individual sender, except where 
the message states otherwise and the sender is authorized to state them to be 
the views of any such entity. This communication is provided on an “as is” 
basis. It contains material that is owned by Instinet Incorporated,
 its subsidiaries or its or their licensors, and may not, in whole or in part, 
be (i) copied, photocopied or duplicated in any form, by any means, or (ii) 
redistributed, posted, published, excerpted, or quoted without Instinet 
Incorporated's prior written consent. Please access the following link for 
important information and instructions: 
http://instinet.com/includes/index.jsp?thePage=/html/le_index.txt 
Securities products and services are provided by locally registered brokerage 
subsidiaries of Instinet Incorporated: Instinet Australia Pty Limited (ACN: 131 
253 686 AFSL No: 327834), regulated by the Australian Securities  Investments 
Commission; Instinet Canada Limited, member IIROC/CIPF; Instinet Pacific 
Limited, authorized and regulated by the Securities and Futures Commission of 
Hong Kong; Instinet Singapore Services Private Limited, regulated by the 
Monetary Authority of Singapore, trading member of The Singapore Exchange 
Securities Trading Private Limited and clearing member of The Central 
Depository (Pte) Limited; and Instinet, LLC, member SIPC. 


=
  

Re: How do I get started with hadoop on windows system

2014-04-30 Thread Publius
oracle virtual box, and hortonworks sandbox for virtual box also work
 
watch you tube videos on the steps





 From: unmesha sreeveni unmeshab...@gmail.com
To: User Hadoop user@hadoop.apache.org 
Sent: Wednesday, April 30, 2014 6:51 AM
Subject: Re: How do I get started with hadoop on windows system
 


In order to get started with hadoop you need to install cgywin (Provides an 
exact look and feel as linux)
Or else u can run ubundu in a vmPlayer
Once you done this
You can download hadoop directly from Apache or from other vendors
And Follow thiese steps: 
http://www.unmeshasreeveni.blogspot.in/2014/04/hadoop-installation-for-beginners.html







On Fri, Apr 25, 2014 at 11:47 AM, 破千 997626...@qq.com wrote:

Hi everyone,
I have subscribed hadoop mail list this morning. How do I get started with 
hadoop on my windows 7 PC.
Thanks!




-- 

Thanks  Regards 


Unmesha Sreeveni U.B

Hadoop, Bigdata Developer
Center for Cyber Security | Amrita Vishwa Vidyapeetham

http://www.unmeshasreeveni.blogspot.in/







Question on Book

2014-03-26 Thread Publius
Hello

is there a good book on the particulars of learning mapreduce java programming 
for dummies :O
 



Questions from a newbie to Hadoop

2014-02-21 Thread Publius


 Hello

I am new to hadoop and trying to learn it

I want to set up Psuedo distributed mode on my windows vista (32 bit) machine 
to experiment with?

I am having great difficulty locating the correct software(s) to do this?

JDK1.6 or JDK1.7, exclipse?
oracle VM box, VMplayer, etc
CH4, hortonworks, or just apache hadoop?


I have tried to load and run VMplayer and hortonworks Hadoop so far, 
but the horton works version is set up for 64 bit and my poor vista machine is 
32 bit :(


can someone offer advice on how to do this?

Please help IF able 

If somone wanst to email offline that if fine also


Troy



KMG 365 

Re: Questions from a newbie to Hadoop

2014-02-21 Thread Publius
I wish to run pseudo machine on a virtual box

I have it almost running on oracle virtual box with Hortonworks sandbox 2.0, 
but the virtual appliance is wanting a 64 bit CPU, and mine is only 32 bit

still looking for a 32 bit version of Hortonworks sand box 2.0

Hortonworks seems to be very stable and good so far, very easy to set up

what does this mean;  set up P-D mode on it?

:O

thanks for the reply

From: Devin Suiter RDX dsui...@rdx.com
To: user@hadoop.apache.org 
Sent: Friday, February 21, 2014 11:30 AM
Subject: Re: Questions from a newbie to Hadoop
 


You should also clarify for the group:


Do you want to make a virtual machine to run a pseudo-distributed Hadoop 
cluster on?


Or


Do you want to install Hadoop directly onto the Vista machine and run it there?


If the former, you should be able to set up a VM just fine with a Linux 
version of your choice - we use CentOS for that sort of thing here - and set 
up P-D mode on it, just by using VirtualBox.


If the latter...


You make want to invest in some extra RAM - no, I'm kidding. :-) HortonWorks 
has the stable Windows builds as Arpit has already directed you to.


You could also consider the Amazon and Microsoft Azure cloud versions of 
Hadoop, if you can afford to pay for a few cycles every month. They're pretty 
affordable, and those Amazon $100 gift cards for EC2 are popular giveaway 
items at trade shows and whatnot...


Devin Suiter
Jr. Data Solutions Software Engineer
100 Sandusky Street | 2nd Floor | Pittsburgh, PA 15212
Google Voice: 412-256-8556 | www.rdx.com


On Fri, Feb 21, 2014 at 2:11 PM, Arpit Agarwal aagar...@hortonworks.com 
wrote:

You can try building Apache Hadoop with these instructions:

https://wiki.apache.org/hadoop/Hadoop2OnWindows


32-bit Windows has not been tested.

CONFIDENTIALITY NOTICE
NOTICE: This message is intended for the use of the individual or entity to 
which it is addressed and may contain information that is confidential, 
privileged and exempt from disclosure under applicable law. If the reader of 
this message is not the intended recipient, you are hereby notified that any 
printing, copying, dissemination, distribution, disclosure or forwarding of 
this communication is strictly prohibited. If you have received this 
communication in error, please contact the sender immediately and delete it 
from your system. Thank You.