[ 
https://issues.apache.org/jira/browse/DATAFU-164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17805538#comment-17805538
 ] 

Eyal Allweil commented on DATAFU-164:
-------------------------------------

Looks like [~arpitbhardwaj] isn't working on this, so it's available if anyone 
wants to try it.

> Improve test cases
> ------------------
>
>                 Key: DATAFU-164
>                 URL: https://issues.apache.org/jira/browse/DATAFU-164
>             Project: DataFu
>          Issue Type: Test
>            Reporter: Eyal Allweil
>            Priority: Major
>              Labels: good-first-issue, newbie, up-for-grabs
>
> We can get better code coverage and cover edge cases that are currently 
> missing in our main tests file, 
> [TestSparkDFUtils|https://github.com/apache/datafu/blob/master/datafu-spark/src/test/scala/datafu/spark/TestSparkDFUtils.scala].
>  
> For example, another test for 
> [joinWithRange|https://github.com/apache/datafu/blob/master/datafu-spark/src/main/scala/datafu/spark/SparkDFUtils.scala#L419]
>  that includes the case where a record falls into 
> {{decreased_range_single}}, but {{range_start}} and {{range_end}} do not 
> contain {{single}}.
> Another case for the 
> [flatten|https://github.com/apache/datafu/blob/master/datafu-spark/src/main/scala/datafu/spark/SparkDFUtils.scala#L256]
>  API could also be good.
> Or for 
> [dedupRandomN|https://github.com/apache/datafu/blob/master/datafu-spark/src/main/scala/datafu/spark/SparkDFUtils.scala#L567].
> Or adding to our 
> [randomJoinSkewedTests|https://github.com/apache/datafu/blob/main/datafu-spark/src/test/scala/datafu/spark/TestSparkDFUtils.scala#L256]
>  a test that verifies that _joinSkewed_ gives the same results as a regular 
> join (_broadcastJoinSkewed_ is already checked).
> Or for anything else, for that matter.
>  
> It's perfectly alright to only do one of them - either as a patch or GitHub 
> PR.
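
For anyone picking this up, the "same results as a regular join" check from the description can be sketched roughly as follows. This is only an illustration in plain Python, not Spark or the DataFu API: regular_join and salted_join are hypothetical helpers, and the salting scheme (a random shard on the left side, the right side copied once per shard) is an assumed, generic way to handle skew rather than a statement of how joinSkewed is implemented. The point is the assertion at the end: compare the two outputs as multisets, since row order is not guaranteed.

```python
import random
from collections import Counter


def regular_join(left, right):
    """Plain inner join of (key, value) pairs on key."""
    return [(k, lv, rv) for k, lv in left for rk, rv in right if k == rk]


def salted_join(left, right, num_shards=10, seed=0):
    """Inner join that salts the left side and replicates the right side.

    Hypothetical stand-in for a skew-aware join; not DataFu code.
    """
    rng = random.Random(seed)
    # Each left row is assigned exactly one random shard in 1..num_shards.
    salted_left = [(k, rng.randint(1, num_shards), lv) for k, lv in left]
    # Each right row is copied once per shard.
    replicated_right = [
        (k, shard, rv)
        for k, rv in right
        for shard in range(1, num_shards + 1)
    ]
    # Join on key AND shard, then drop the shard from the output.
    return [
        (k, lv, rv)
        for k, ls, lv in salted_left
        for rk, rs, rv in replicated_right
        if k == rk and ls == rs
    ]


# Skewed input: key "a" dominates the left side; "c" and "d" have no match.
left = [("a", i) for i in range(50)] + [("b", 100), ("c", 101)]
right = [("a", "x"), ("a", "y"), ("b", "z"), ("d", "w")]

expected = Counter(regular_join(left, right))
actual = Counter(salted_join(left, right, num_shards=4))

# Order-insensitive comparison: same rows with the same multiplicities.
assert actual == expected
```

A real test in TestSparkDFUtils would presumably do the equivalent comparison on DataFrames, using whatever assertion helper the existing tests in that file already use.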



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
