-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/34636/
-----------------------------------------------------------

Review request for DataFu.


Repository: datafu


Description
-------

This updates DataFu to use Hadoop 2.7.0 and Pig 0.14.0.  I've fixed all issues 
that I could find.  All the unit tests pass.  Some of the issues were already 
addressed by Daniel Dai's earlier patch, but I found some more problems.

Summary of issues:

* Hourglass tests failed because we were pulling in avro-tools, which contains 
Hadoop classes and therefore conflicts.  I removed the dependency.  I'm not 
sure why we had this.  I also added a handy script to help with this analysis, 
which uses jarfish.
* Hourglass called .toString() on many Path instances.  This caused a problem 
because the resulting string is prefixed with "file:" and therefore isn't a 
valid path to pass to java.io.File.  The general fix is to use getName() 
instead, which returns the simple name and is usually suitable.
* Some Pig unit tests assumed a particular tuple ordering in bags.  I applied a 
sort to the actual and expected bags to ensure a consistent comparison.
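
To illustrate the last two points, here is a minimal, self-contained Java 
sketch.  It is not the code from this patch: java.net.URI stands in for 
Hadoop's Path, plain string lists stand in for Pig bags, and the class name, 
method names and example file location are all made up for illustration.

```java
import java.io.File;
import java.net.URI;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class ReviewSketch {

    // Hypothetical helper: return only the last component of a location,
    // similar in spirit to asking a path object for its simple name.
    public static String simpleName(URI location) {
        // getPath() drops the "file:" scheme, File.getName() drops the parent directories.
        return new File(location.getPath()).getName();
    }

    // Hypothetical helper: compare two collections of tuples without
    // depending on their order, by sorting copies of both sides first.
    public static <T extends Comparable<T>> boolean sameElements(List<T> expected, List<T> actual) {
        List<T> sortedExpected = new ArrayList<>(expected);
        List<T> sortedActual = new ArrayList<>(actual);
        Collections.sort(sortedExpected);
        Collections.sort(sortedActual);
        return sortedExpected.equals(sortedActual);
    }

    public static void main(String[] args) {
        // Assumed example location; the string form keeps the "file:" scheme prefix.
        URI location = URI.create("file:/tmp/output/part-00000.avro");
        System.out.println(location.toString()); // prints "file:/tmp/output/part-00000.avro"
        System.out.println(simpleName(location)); // prints "part-00000.avro"

        // Same tuples in a different order compare as equal once both sides are sorted.
        List<String> expected = Arrays.asList("(1,a)", "(2,b)", "(3,c)");
        List<String> actual = Arrays.asList("(3,c)", "(1,a)", "(2,b)");
        System.out.println(sameElements(expected, actual)); // prints "true"
    }
}
```

Sorting copies rather than the inputs keeps the comparison free of side 
effects, and unlike a set-based comparison it still distinguishes bags that 
differ only in how many times a tuple appears.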


Diffs
-----

  README.md 8e1b67d
  build-plugin/src/main/java/org/adrianwalker/multilinestring/MultilineProcessor.java 9abdba5
  datafu-hourglass/.gitignore 942515e
  datafu-hourglass/build.gradle 75a2876
  datafu-hourglass/find_dupes.rb PRE-CREATION
  datafu-hourglass/src/main/java/datafu/hourglass/fs/PathUtils.java c270c7b
  datafu-hourglass/src/main/java/datafu/hourglass/mapreduce/DistributedCacheHelper.java 62975d1
  datafu-hourglass/src/test/java/datafu/hourglass/demo/Examples.java 039822c
  datafu-hourglass/src/test/java/datafu/hourglass/test/PartitionCollapsingExecutionPlannerTests.java d68ea83
  datafu-hourglass/src/test/java/datafu/hourglass/test/PartitionCollapsingJoinTest.java 02aa342
  datafu-hourglass/src/test/java/datafu/hourglass/test/PartitionCollapsingTests.java fff1cfd
  datafu-hourglass/src/test/java/datafu/hourglass/test/PartitionPreservingCollapsingIntegrationTests.java a8f020b
  datafu-hourglass/src/test/java/datafu/hourglass/test/PartitionPreservingJoinTests.java c41fd39
  datafu-hourglass/src/test/java/datafu/hourglass/test/PartitionPreservingTests.java acae96c
  datafu-hourglass/src/test/java/datafu/hourglass/test/TestAvroJob.java b428003 
  datafu-hourglass/src/test/java/datafu/hourglass/test/TestBase.java bc52977 
  datafu-pig/build.gradle ea385d2 
  datafu-pig/src/test/java/datafu/test/pig/bags/BagTests.java 9bcc384 
  gradle/dependency-versions.gradle 3b0835f 

Diff: https://reviews.apache.org/r/34636/diff/


Testing
-------

./gradlew test


Thanks,

Matthew Hayes
