> One last thing, hbase shell (relational) operators running mapreduce jobs 
> begins to impinge on PIG territory. We should be careful and avoid 
> overlap/duplicating work. Would it make sense doing further operators as PIG 
> user defined functions? (I suppose we'll be able to tell better after PIG-6, 
> the hbase load/store, is done. Smile).

I don't think so.
These are just the basics of relational algebra.
Without this work there can be no SQL-like HQL, RDF, or matrix work.
What exactly is overlapping?
I need a precise explanation.

Also, if Pig gets around to discussing matrix operations or linear systems, 
can I say, "No, that's hbase shell territory"?

I feel stifled because I have just been going round in circles discussing the 
plans for the shell operators on the wiki 
(for almost the past two months).

I have staked my name on the shell work, and other members have too.
At the very least, I have the right to build it.

Any advice is welcome.

Edward J. Yoon,
Thanks.
------------------------------
B. Regards,
Edward yoon @ NHN, corp.
Home : http://www.udanax.org


> Date: Mon, 5 Nov 2007 14:04:51 -0800
> From: [EMAIL PROTECTED]
> To: hadoop-dev@lucene.apache.org
> Subject: [jira] Commented: (HADOOP-1608) [Hbase Shell] Relational Algrebra 
> Operators
>
>
> [ 
> https://issues.apache.org/jira/browse/HADOOP-1608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12540287
>  ]
>
> stack commented on HADOOP-1608:
> -------------------------------
>
> Edward:
>
> I do not understand what you mean by the following in your 02/Nov/07 06:09 PM 
> comment: "I'll implement relational algebra operators to the tentative 
> language and HQL by Sub-Tasks List sequence."
>
> In org.apache.hadoop.hbase.shell.algebra, test for output table presence and 
> creation if missing is duplicated code in Selection, Projection, 
> DuplicateTable, etc. I tried to move this duplicated code back up into the 
> RelationalOperation class as a utility but noticed then if table already 
> exists, we don't call initJob because we return early (See end of the getConf 
> in DuplicateTable for example). Is running one of these operators a second 
> time, after table has been created, a problem? Have you tried it?
>
> I did not have a mapreduce cluster running and so the last line hung forever 
> (You should add the need for a mapreduce cluster to the help).
> {code}
> Hbase> aaa = table('y');
> Hbase> aaa;
> Syntax error : Type 'help;' for usage.
> Message : Encountered ";" at line 1, column 5.
> Hbase> show aaa;
> Missing parameters. Please check 'Show' syntax
> Hbase> bbb = group aaa by ('x');
> Hbase> save bbb into table ('a');
> {code}
>
> Note, it would be nice if entering just a variable name printed a 
> description of the variable's content, or perhaps if 'show VARIABLE_NAME' 
> output some kind of description.
>
> I then started up a cluster and did the simplest of operations:
> {code}
> Hbase> aaa = table('x');
> Hbase> save aaa into table(aaaaaa);
> 07/11/05 21:26:57 WARN mapred.JobClient: No job jar file set. User classes 
> may not be found. See JobConf(Class) or JobConf#setJar(String).
> Job job_200711052125_0001 is still running........Job failed.
> {code}
>
> Job failed because of 'Caused by: java.lang.ClassNotFoundException: 
> org.apache.hadoop.hbase.mapred.TableSplit'. How do you run on a cluster 
> Edward? Do you copy the hbase jar into the hadoop lib dir all over the 
> cluster? Does HADOOP-1622 help here?
>
> On job failure, should the table be removed? Currently, I have a new table 
> 'aaaaaa' with nothing in it.
>
> The output of Group, Selection, etc. is saved to an 'output' table?
>
> It would be good if we could get into the help listing somewhere the 
> operators and types of conditions allowed on selection (>, 
>
> One last thing, hbase shell (relational) operators running mapreduce jobs 
> begins to impinge on PIG territory. We should be careful and avoid 
> overlap/duplicating work. Would it make sense doing further operators as PIG 
> user defined functions? (I suppose we'll be able to tell better after PIG-6, 
> the hbase load/store, is done. Smile).
>
> Otherwise, patch looks good Edward.
>
>
>
>
>
>> [Hbase Shell] Relational Algrebra Operators
>> -------------------------------------------
>>
>> Key: HADOOP-1608
>> URL: https://issues.apache.org/jira/browse/HADOOP-1608
>> Project: Hadoop
>> Issue Type: Improvement
>> Components: contrib/hbase
>> Affects Versions: 0.14.1
>> Environment: All environments
>> Reporter: Edward Yoon
>> Priority: Minor
>> Fix For: 0.16.0
>>
>> Attachments: patch_v01.txt, patch_v02.txt, patch_v03.txt, patch_v04.txt, 
>> patch_v05.txt, patch_v06.txt, patch_v07.txt, patch_v08.txt, patch_v09.txt, 
>> patch_v10.txt, patch_v11.txt, patch_v12.txt, patch_v13.txt, patch_v14.txt, 
>> patch_v15.txt
>>
>>
>> Development of relational algebra operators has begun.
>> * Projection
>> ** selects a subset of the column families of a relation
>> ** Result = π ~column_list~ (Relation)
>> {code}
>> Hbase> Result = Relation.Projection('year','length');
>> Hbase> save Result into table('result');
>> {code}
>> * Selection
>> ** See : HADOOP-2003 issue's description
>> {code}
>> Hbase> Result = Relation.Selection(length> 100 and studioName = 'Fox');
>> Hbase> save Result into table('result');
>> {code}
>> * Group
>> ** more details about 'GROUP' operation will be handled in HADOOP-1658 issue.
>> * θ Join
>> ** The join of two relations R1(A ~1~,A ~2~,...,A ~n~) and R2(B ~1~,B 
>> ~2~,...,B ~m~) is a relation with degree k=n+m and attributes (A ~1~,A 
>> ~2~,...,A ~n~, B ~1~,B ~2~,...,B ~m~) that satisfy the join condition
>> {code}
>> Hbase> R1 = table('movieLog_table');
>> Hbase> R2 = table('personInfo_table');
>> Hbase> Result = R1.join(R1.producer: = R2.ROW) and R2;
>> or Result = R1.join(R1.actor:hero = R2.Row) and R2;
>> or Result = R1.join(R1.actor:hero = R2.Row and R1.studioName = 'Fox' and 
>> R2.occupation = 'singer') and R2;
>> {code}
>
> --
> This message is automatically generated by JIRA.
> -
> You can reply to this email to add a comment to the issue online.
>
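
For reference, the three operators described in the quoted issue (projection, selection, θ join) can be sketched in a few lines of plain Python. This is only an in-memory illustration of the algebra, not the hbase shell implementation and not a mapreduce job. The function names, the 'R1.'/'R2.' prefixes, and the sample rows are all made up for the example; only the column names are borrowed from the issue text.

```python
from typing import Callable, Dict, Iterable, List

Row = Dict[str, object]


def projection(relation: Iterable[Row], columns: List[str]) -> List[Row]:
    """Keep only the named columns of every row."""
    return [{c: row[c] for c in columns if c in row} for row in relation]


def selection(relation: Iterable[Row], predicate: Callable[[Row], bool]) -> List[Row]:
    """Keep only the rows for which the predicate holds."""
    return [row for row in relation if predicate(row)]


def theta_join(r1: Iterable[Row], r2: Iterable[Row],
               condition: Callable[[Row, Row], bool]) -> List[Row]:
    """Nested-loop join: pair every row of r1 with every row of r2 and keep
    the pairs that satisfy the condition. Columns are prefixed with 'R1.' and
    'R2.' so the result has n + m attributes."""
    r2_rows = list(r2)  # materialize so r2 can be scanned once per r1 row
    joined = []
    for a in r1:
        for b in r2_rows:
            if condition(a, b):
                row = {f"R1.{k}": v for k, v in a.items()}
                row.update({f"R2.{k}": v for k, v in b.items()})
                joined.append(row)
    return joined


# Hypothetical sample data, invented for this sketch.
movies = [
    {"title": "A", "year": 1999, "length": 120, "studioName": "Fox", "producer": "p1"},
    {"title": "B", "year": 2001, "length": 90, "studioName": "Fox", "producer": "p2"},
    {"title": "C", "year": 2003, "length": 130, "studioName": "Other", "producer": "p1"},
]
people = [
    {"ROW": "p1", "occupation": "singer"},
    {"ROW": "p2", "occupation": "director"},
]

projected = projection(movies, ["year", "length"])
selected = selection(movies, lambda r: r["length"] > 100 and r["studioName"] == "Fox")
joined = theta_join(movies, people, lambda a, b: a["producer"] == b["ROW"])
```

The join here compares every pair of rows, which is fine for showing the semantics but says nothing about how the work would be split across map and reduce tasks.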
