If some columns may contain the separator, you can escape their
content before writing them into the table, and unescape them after
you split the row, before you use the content.
You can use URL escape characters (e.g.
http://www.werockyourweb.com/url-escape-characters) or something more
reader-fri
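A minimal sketch of the escape-then-unescape approach described above, using URL escaping from Python 3's standard library. The helper names `join_row` and `split_row` are hypothetical, and the sample row is the one from the question in this thread.

```python
from urllib.parse import quote, unquote

SEP = "|"  # the separator used in the thread's example


def join_row(fields):
    """Escape each field, then join with the separator.

    safe="" tells quote() to escape every reserved character,
    so "|" becomes "%7C" and a literal "%" becomes "%25".
    """
    return SEP.join(quote(field, safe="") for field in fields)


def split_row(line):
    """Split on the separator first, then unescape each field."""
    return [unquote(field) for field in line.split(SEP)]


row = ["xyz", "1234", "98798", "xyz|abc"]
line = join_row(row)        # the embedded "|" is written as %7C
restored = split_row(line)  # the original four fields come back
```

Because the escape character itself is also escaped, the round trip stays lossless even when a field already contains a percent sign.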
Hi
How can I ignore the separator character in the middle of a column value?
E.g., the separator character is '|'.
The record values are '|'-separated:
xyz|1234|98798|"xyz|abc"|
Regards
Kiran.G
Hi,
What's a proper way to deploy python udfs? I've dropped the latest
version of jython.jar in $PIG_HOME/lib.
Things work in "local" mode, but when I run on a cluster, built-in
python modules cannot be found. E.g., urlparse cannot be located:
ImportError: No module named urlparse
at org
Write a custom loader.
You can check out CSV loader in the piggybank, it does something like this.
D
On Mon, Oct 17, 2011 at 5:03 AM, kiranprasad wrote:
> Hi
>
> How can I ignore the character in a record if it contains the separator
> char in the record?
> E.g., the separator char is '|'.
>
> The R
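To illustrate what a quote-aware loader has to do with such a record, here is a small sketch using Python's standard `csv` module. It is not the piggybank loader itself, only an illustration of the same idea: a separator inside a quoted field is treated as data. The function name `parse_rows` is hypothetical.

```python
import csv
import io


def parse_rows(text, sep="|", quote='"'):
    """Split each line on `sep`, ignoring separators inside quoted fields."""
    reader = csv.reader(io.StringIO(text), delimiter=sep, quotechar=quote)
    return [row for row in reader]


# The record from the question; the trailing "|" yields an empty last field.
rows = parse_rows('xyz|1234|98798|"xyz|abc"|\n')
```

A custom Pig loader would apply the same rule while reading each input line, before the fields are turned into a tuple.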
https://issues.apache.org/jira/browse/PIG-2322
On Mon, Oct 17, 2011 at 12:38 PM, Stan Rosenberg <
srosenb...@proclivitysystems.com> wrote:
> Hi Julien,
>
> Thanks for a quick reply. I patched my local version of
> JythonFunction to pass the input parameters when 'varargs' is true.
>
> stan
>
> O
Hi Julien,
Thanks for a quick reply. I patched my local version of
JythonFunction to pass the input parameters when 'varargs' is true.
stan
On Mon, Oct 17, 2011 at 2:26 PM, Julien Le Dem wrote:
> Hi,
> I'm looking into it. Internally varargs advertise themselves as having 0
> args so I need to
Hi,
I'm looking into it. Internally varargs advertise themselves as having 0
args so I need to add a special case in the JythonFunction to handle
varargs. I'll create a JIRA for this.
For now you cannot use varargs, as they will always be called with no
parameters.
Julien
On Mon, Oct 17, 2011 at 9
Hi,
I have a simple python udf which takes a variable number of (string)
arguments and returns the first non-empty one.
I can see that the udf is invoked from pig but no arguments are being passed.
Here is the script:
=
#!/usr/bin/python
f
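The script above is cut off, so the following is only a sketch of what such a function might look like, not the poster's code. The names `first_non_empty` and `first_non_empty_in_tuple` are hypothetical, and the Pig-side schema decorator is omitted so the code runs as plain Python. The second function is one possible way to avoid varargs: it assumes the caller packs the fields into a single tuple on the Pig side (for example with `TOTUPLE`) and that the tuple reaches the function as a Python sequence.

```python
def first_non_empty(*args):
    """Return the first argument that is neither None nor an empty string."""
    for value in args:
        if value is not None and value != "":
            return value
    return None


def first_non_empty_in_tuple(values):
    """Single-argument variant: takes one sequence instead of varargs.

    Assumption: the caller passes all candidate fields as one tuple.
    """
    if values is None:
        return None
    return first_non_empty(*values)
```

The single-argument form has a fixed arity of one, so it does not depend on how varargs functions are dispatched.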
Number of reducers for a limit should be 1. If you are seeing 2, there's a
bug. What version of Pig are you using? I can't duplicate this in my setup.
On Sun, Oct 16, 2011 at 8:04 PM, China Alice wrote:
> Recently I have been using Pig Latin and ran into a problem with the limit
> clause. For example:
>
> t = load
Recently I have been using Pig Latin and ran into a problem with the limit clause. For example:
t = load 'input.txt' using PigStorage(',');
t2 = order t by $1;
t3 = limit t2 5;
When this runs, the number of reducers is 2, so the output has ten rows rather
than five. How can I make the output contain only five rows?
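The symptom described here can be pictured with a toy model in Python. This is not how Pig implements limit; it only shows why applying a limit independently in two reducers can yield ten rows, and why a final pass through a single reducer brings the result back to five. All names and the sample data are hypothetical.

```python
import heapq
import itertools


def limit_per_reducer(partitions, n):
    """Each reducer keeps the first n rows of its own sorted partition."""
    return [partition[:n] for partition in partitions]


def final_limit(partial_results, n):
    """A single final step merges the sorted partial results and keeps n rows."""
    merged = heapq.merge(*partial_results)
    return list(itertools.islice(merged, n))


# Two reducers, each holding a sorted slice of the data.
partitions = [[1, 3, 5, 7, 9, 11], [2, 4, 6, 8, 10, 12]]

partial = limit_per_reducer(partitions, 5)
rows_without_final_step = sum(len(p) for p in partial)  # 10 rows in total
rows_with_final_step = final_limit(partial, 5)          # exactly 5 rows
```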