[Blog] Doubts On CCD-410 Sample Dumps on Ecosystem Projects

2014-10-06 Thread unmesha sreeveni
http://www.unmeshasreeveni.blogspot.in/2014/09/what-do-you-think-of-these-three.html -- *Thanks Regards * *Unmesha Sreeveni U.B* *Hadoop, Bigdata Developer* *Center for Cyber Security | Amrita Vishwa Vidyapeetham* http://www.unmeshasreeveni.blogspot.in/

Re: [Blog] Doubts On CCD-410 Sample Dumps on Ecosystem Projects

2014-10-06 Thread unmesha sreeveni
Hi 5 th question can it be SQOOP? On Mon, Oct 6, 2014 at 1:24 PM, unmesha sreeveni unmeshab...@gmail.com wrote: Yes On Mon, Oct 6, 2014 at 1:22 PM, Santosh Kumar skumar.bigd...@hotmail.com wrote: Are you preparing g for Cloudera certification exam? Thanks and Regards, Santosh

Re: [Blog] Doubts On CCD-410 Sample Dumps on Ecosystem Projects

2014-10-06 Thread unmesha sreeveni
what about the last one? The answer is correct. Pig. Is nt it? On Mon, Oct 6, 2014 at 4:29 PM, adarsh deshratnam adarsh.deshrat...@gmail.com wrote: For question 3 answer should be B and for question 4 answer should be D. Thanks, Adarsh D Consultant - BigData and Cloud [image: View my

Optimizing Pig script

2014-10-06 Thread Ankur Kasliwal
Hi, I have written a ‘Pig Script’ which is processing Sequence files given as input. It is working fine but there is one problem mentioned below. I have repetitive statements in my pig script, as shown below: - Filtered_Data _1= FILTER BagName BY ($0 matches 'RegEx-1'); -

Re: Optimizing Pig script

2014-10-06 Thread Russell Jurney
Load the regex patterns from a file (one pattern per line), CROSS their relation with BagName, and then use SelectFieldByName UDF to summon the regex pattern from the regex relation. https://issues.apache.org/jira/plugins/servlet/mobile#issue/DATAFU-69 I believe you can use a field name against

Re: Optimizing Pig script

2014-10-06 Thread Prashant Kommireddi
Are these regex static? If yes, this is easily achieved with embedding your script in Java or any other language that Pig supports http://pig.apache.org/docs/r0.13.0/cont.html You could also possibly write a UDF that loops through all the regex and returns result. On Mon, Oct 6, 2014 at 12:44

Re: Optimizing Pig script

2014-10-06 Thread Pradeep Gollakota
Hi Ankur, Is the list of regular expressions static or dynamic? If it's a static list, you can collapse all the filter operators into a single operator and use the AND keyword to combine them. E.g. Filtered_Data = FILTER BagName BY ($0 matches 'RegEx-1') AND ($0 matches 'RegEx-2') AND ($0

Re: Optimizing Pig script

2014-10-06 Thread Russell Jurney
Actually, I don't think you need SelectFieldByValue. Just use the name of the field directly. On Monday, October 6, 2014, Prashant Kommireddi prash1...@gmail.com wrote: Are these regex static? If yes, this is easily achieved with embedding your script in Java or any other language that Pig

Re: Optimizing Pig script

2014-10-06 Thread Pradeep Gollakota
In case you haven't seen this already, take a look at http://pig.apache.org/docs/r0.13.0/perf.html for some basic strategies on optimizing your pig scripts. On Mon, Oct 6, 2014 at 1:08 PM, Russell Jurney russell.jur...@gmail.com wrote: Actually, I don't think you need SelectFieldByValue. Just

Re: Optimizing Pig script

2014-10-06 Thread Ankur Kasliwal
Thanks for replying everyone. Few comments to everyone's suggestion. 1 I am processing sequence file which consist of many CSV files. I need to extract only few among all CSV'S. So that is the reason I am doing 'SelectFieldByValue' which is file name in my case not by field directly. 2 All

Re: Optimizing Pig script

2014-10-06 Thread Pradeep Gollakota
It looks like the best option at this point is to write a custom UDF that takes loads a set of regular expressions from file and runs the data through all of them. On Mon, Oct 6, 2014 at 1:44 PM, Ankur Kasliwal ankur.kasliwal...@gmail.com wrote: Thanks for replying everyone. Few comments to

Re: Optimizing Pig script

2014-10-06 Thread Russell Jurney
If you can describe the layout of your input files more thoroughly, it would help. On Monday, October 6, 2014, Pradeep Gollakota pradeep...@gmail.com wrote: It looks like the best option at this point is to write a custom UDF that takes loads a set of regular expressions from file and runs the

Re: ERROR 2017: Internal error creating job configuration

2014-10-06 Thread Sunil S Nandihalli
The input file-directory tarred and gzipped is here https://transfer.sh/Nmnkk/rawlogs.tgz . The Jar file which contains all the udfs is here https://transfer.sh/JpSKg/pigpen.jar On Tue, Oct 7, 2014 at 9:07 AM, Sunil S Nandihalli sunil.nandiha...@gmail.com wrote: Hi Everybody, The pig script

Re: [Blog] Doubts On CCD-410 Sample Dumps on Ecosystem Projects

2014-10-06 Thread unmesha sreeveni
Hi Pradeep You are right. Updated the right answers in the blog. This may help anyone thinking about investing in that particular test package. On Tue, Oct 7, 2014 at 9:25 AM, Pradeep Gollakota pradeep...@gmail.com wrote: That's not exactly what the question is asking for... It's saying that