Hi,

 

I would like to know if hadoop will be of help to me? Let me explain you
guys my scenario:

 

I have a windows server based single machine server having 16 Cores and 48
GB of Physical Memory. In addition, I have 120 GB of virtual memory.

 

I am running a query with statistical calculation on large data of over 1
billion rows, on SAS. In this case, SAS is acting like a database on which
both source and target tables are residing. For storage, I can keep the
source and target data on Teradata as well but the query containing a patent
can only be run on SAS interface.

 

The problem is that SAS is taking many days (25 days) to run it (a single
query with statistical function) and not all cores all the time were used
and rather merely 5% CPU was utilized on average. However memory utilization
was high, very high, and that's why large virtual memory was used. 

 

Can I have a hadoop interface in place to do it all so that I may end up
running the query in lesser time that is in 1 or 2 days. Anything squeezing
my run time will be very helpful. 

 

Thanks

 

Ali Jooan Rizvi

Reply via email to