Re: Optionally Enclosed By in PIG

2011-10-17 Thread Gheorghe Muresan
If some columns may contain the separator, you can escape their content before writing them into the table, and unescape them after you split the row, before you use the content. You can use URL escape characters (e.g. http://www.werockyourweb.com/url-escape-characters) or something more reader-fri

Optionally Enclosed By in PIG

2011-10-17 Thread kiranprasad
Hi How can I ignore the seperator character in middle of a column value. eg : Seperator char is ‘|’. The Record values are | seperated xyz|1234|98798|”xyz|abc”| Regards Kiran.G

python modules

2011-10-17 Thread Stan Rosenberg
Hi, What's a proper way to deploy python udfs? I've dropped the latest version of jython.jar in $PIG_HOME/lib. Things work in "local" mode, but when I run on a cluster, built-in python modules cannot be found. E.g., urlparse cannot be located: ImportError: No module named urlparse at org

Re: How to escape if Seperator charector in a record

2011-10-17 Thread Dmitriy Ryaboy
Write a custom loader. You can check out CSV loader in the piggybank, it does something like this. D On Mon, Oct 17, 2011 at 5:03 AM, kiranprasad wrote: > Hi > > How can I ignore the character in a record if it contains the seperator > char in the record. > eg : Seperator char is ‘|’. > > The R

Re: calling python udfs with varargs

2011-10-17 Thread Julien Le Dem
https://issues.apache.org/jira/browse/PIG-2322 On Mon, Oct 17, 2011 at 12:38 PM, Stan Rosenberg < srosenb...@proclivitysystems.com> wrote: > Hi Julien, > > Thanks for a quick reply. I patched my local version of > JythonFunction to pass the input parameters when 'varargs' is true. > > stan > > O

Re: calling python udfs with varargs

2011-10-17 Thread Stan Rosenberg
Hi Julien, Thanks for a quick reply. I patched my local version of JythonFunction to pass the input parameters when 'varargs' is true. stan On Mon, Oct 17, 2011 at 2:26 PM, Julien Le Dem wrote: > Hi, > I'm looking into it. Internally varargs advertise themselves as having 0 > args so I need to

Re: calling python udfs with varargs

2011-10-17 Thread Julien Le Dem
Hi, I'm looking into it. Internally varargs advertise themselves as having 0 args so I need to add a special case in the JythonFunction to handle varargs. I'll create a JIRA for this. For now you can not use varargs as they will always be called with no parameters. Julien On Mon, Oct 17, 2011 at 9

calling python udfs with varargs

2011-10-17 Thread Stan Rosenberg
Hi, I have a simple python udf which takes a variable number of (string) arguments and returns the first non-empty one. I can see that the udf is invoked from pig but no arguments are being passed. Here is the script: = #!/usr/bin/python f

Re: pig Latin the usage of limit

2011-10-17 Thread Dmitriy Ryaboy
Number of reducers for a limit should be 1. If you are seeing 2, there's a bug. What version of Pig are you using? I can't duplicate this in my setup On Sun, Oct 16, 2011 at 8:04 PM, China Alice wrote: > Recently, I use pig Latin language ,when it comes to limit clause,for > example > > t = load

pig Latin the usage of limit

2011-10-17 Thread China Alice
Recently, I use pig Latin language ,when it comes to limit clause,for example  t = load 'input.txt' using PigStorage(','); t2 = order t1 by $1; t3 = limit t2 5; in the process ,the number of reduce is 2,so the amount of data is ten rather than five,how can I to let  the amount of data is five.