[
https://issues.apache.org/jira/browse/GORA-109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16462909#comment-16462909
]
Alfonso Nishikawa edited comment on GORA-109 at 5/3/18 6:43 PM:
----------------------------------------------------------------
Hi, Pals.
It has been a long time since this issue was opened. I finally have a
_quite_ successful implementation of a GoraStorage for Apache Pig. For now it
lives in my own branch, since I implemented it for my degree project.
[https://github.com/alfonsonishikawa/gora/tree/goraexplorer/gora-pig]
It allows loading data from -any- HBase (the only backend I can vouch for)
through Gora in Pig scripts like:
{code:java}
set job.name 'GoraPig LOAD test';
register gora/*.jar;
webpage = LOAD '.' USING org.apache.gora.pig.GoraStorage('{
"persistentClass": "admin.WebPage",
"fields": "baseUrl,contentType"
}') ;
DUMP webpage;
{code}
I will update my page at
[http://people.apache.org/~alfonsonishikawa/gora-109.html] with details in the
coming days (how fast depends on my spare time), but please send me your first
questions in advance so I can write a FAQ too. If I have to write from my own
ideas, I tend to write EVERYTHING and NEVER finish :)
If anyone is a Pig user and wants to test it, please ask too. Questions help me
prioritize what I explain :)
I am thinking about some Nutch jobs in Pig :P
Sorry for the immense delay :(. Just know that this _"complete"_ implementation
(actually nothing is ever complete) took me more than 300 hours :(. It has many
details everywhere.
> Pig Adapter for Gora
> --------------------
>
> Key: GORA-109
> URL: https://issues.apache.org/jira/browse/GORA-109
> Project: Apache Gora
> Issue Type: New Feature
> Components: gora-pig
> Affects Versions: 0.2
> Reporter: Lewis John McGibbney
> Assignee: Alfonso Nishikawa
> Priority: Minor
> Fix For: 0.9
>
>
> This has been on the agenda from the start and I suppose will be purely use
> case driven. Personally I haven't touched Pig as I don't have a use-case
> driven requirement to do so, however there is no harm in this being logged if
> in the future it attracts interest.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)