[ 
https://issues.apache.org/jira/browse/KYLIN-1641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15283870#comment-15283870
 ] 

hongbin ma commented on KYLIN-1641:
-----------------------------------

It seems this was posted in the wrong project.

> Spark - pagination
> ------------------
>
>                 Key: KYLIN-1641
>                 URL: https://issues.apache.org/jira/browse/KYLIN-1641
>             Project: Kylin
>          Issue Type: Improvement
>            Reporter: Dileep
>
> Issue: We have inserted around 10 million records into Hive and show the 
> results in a web interface through a Spark DataFrame. We cannot fetch all 10 
> million records and paginate them in the front end, so we paginate in the 
> Spark DataFrame using the following approach:
>   df1 = df.limit(rowsperPage * pagenumer)
>   df2 = df1.limit(rowsperPage * (pagenumer - 1))
>   df1.subtract(df2).collect()
> This works fine, but as the page number goes up (toward the last page) it 
> slows down and the results do not get back to the front end. 
> We just want to check whether what we are doing is right, or whether there is 
> another solution for this problem.
> Thanks
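
For reference, the page arithmetic in the quoted approach can be sketched in plain Python, with no Spark involved. The difference between the first rowsperPage * pagenumer rows and the first rowsperPage * (pagenumer - 1) rows is the half-open row range computed below. The names page_bounds and fetch_page are hypothetical and used only for illustration. The sketch assumes the rows have a stable order and 1-based page numbers. It is not a Spark implementation and not a recommendation from the Kylin or Spark projects.

```python
from itertools import islice


def page_bounds(rows_per_page, page_number):
    """Return the half-open row range [start, end) for a 1-based page number."""
    if rows_per_page < 1 or page_number < 1:
        raise ValueError("rows_per_page and page_number must be >= 1")
    start = rows_per_page * (page_number - 1)
    end = rows_per_page * page_number
    return start, end


def fetch_page(rows, rows_per_page, page_number):
    """Take one page from any iterable of rows that has a stable order."""
    start, end = page_bounds(rows_per_page, page_number)
    # islice skips the first `start` rows and stops before row `end`.
    return list(islice(rows, start, end))


if __name__ == "__main__":
    print(page_bounds(100, 3))          # (200, 300)
    print(fetch_page(range(10), 3, 4))  # [9]
```

Two caveats about the quoted snippet, as far as I know: limit without an explicit ordering is not guaranteed to return the same rows on every run, and subtract is a set difference, so duplicate rows within a page would be dropped.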



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
