[
https://issues.apache.org/jira/browse/PIG-3225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13604325#comment-13604325
]
Dishara Wijewardana commented on PIG-3225:
------------------------------------------
Hi Gianmarco
I am Dishara who took part in previous GSoC 2012 in Apache Velocity project and
successfully completed the JSR 223 implementation. I would like to contribute
to the PIG project since it seems pretty interesting. As far as I understand
this project idea is basically to implement a tolerable Stratified sampling
algorithm on top of PIG. Correct me If I am wrong. Can you provide a bit more
details of what aspects I need to look in and get in to this. (like what
exactly expected eventually, so that may be I can provide potential algorithm
as a patch to simulate this probably before the proposal)
> Stratified sampling
> -------------------
>
> Key: PIG-3225
> URL: https://issues.apache.org/jira/browse/PIG-3225
> Project: Pig
> Issue Type: New Feature
> Reporter: Gianmarco De Francisci Morales
> Labels: gsoc2013
>
> Implement a stratified sampling option (
> http://en.wikipedia.org/wiki/Stratified_sampling ) in Pig's SAMPLE operator.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira