Hi All,
I have the following question:
Snippet of my sample.txt. First column is id, however each row can have 
variable number of columns. 

id1 100 200 300 400 500
id2 10 20 30id1 800 900 600
id3 10 20 30 40 50 60 70 80 90 100
id1 1 2 3 4 5 6 7 8 9id2 40 50 60 70 80 90
id3 200
sample = LOAD 'sample.txt' [how should I specify schema here]sample_grpd = 
GROUP sample by $0;sample_result = FOREACH sample_grpd generate group, 
FLATTEN(TOBAG([what should go here]))
group by id so that the result is:
id1 100 200 300 400 500 800 900 600 1 2 3 4 5 6 7 8 9
id2 10 20 30 40 50 60 70 80 90
id3 10 20 30 40 50 60 70 80 90 100 200


Any help with this, will be greatly appreciated!

                                          

Reply via email to