GitHub user AnatoliShein opened a pull request:

    https://github.com/apache/orc/pull/134

    Orc 17

    This pull request adds the **Libhdfs++** library to the **ORC** project 
for reading files from HDFS. 
    
    Libhdfs++ is located in orc/c++/lib/libhdfspp and by default builds as a 
lightweight library without examples, tests, and tools, which avoids 
dependencies on the JDK, valgrind, and gmock. If the flag 
**-DHDFSPP_LIBRARY_ONLY=FALSE** is passed to cmake, the examples, tests, and 
tools are built as well.
    
    Libhdfs++ depends on the protobuf libraries in orc/c++/libs/protobuf-2.6.0 
and dynamically searches the system for the packages Doxygen, OpenSSL, 
CyrusSASL, GSasl, and Threads (only OpenSSL and Threads are required).
    
    The libhdfspp folder also includes a script, pull_hdfs.sh, which pulls the 
latest changes from the Libhdfs++ Hadoop branch into ORC and generates the file 
'imported_timestamp' containing the timestamp and information about the latest 
commit.
    
    I also updated all the ORC tools to automatically use Libhdfs++ to read ORC 
files on HDFS if their path begins with 'hdfs://'.
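
    As a rough illustration of the path check described above, a tool could 
select its reader by testing the scheme prefix. This is only a sketch: the 
names isHdfsPath, ReaderKind, and chooseReader are hypothetical and are not 
taken from the ORC or Libhdfs++ sources.

```cpp
#include <string>

// Hypothetical helper (not from the ORC sources): reports whether a path
// names a file on HDFS by checking for the "hdfs://" scheme prefix.
bool isHdfsPath(const std::string& path) {
  static const std::string kHdfsScheme = "hdfs://";
  // compare() returns 0 only when the leading characters of path equal
  // the scheme; shorter or different paths yield a non-zero result.
  return path.compare(0, kHdfsScheme.size(), kHdfsScheme) == 0;
}

// Hypothetical labels for the two read paths a tool could choose between.
enum class ReaderKind { LocalFile, Hdfs };

// Picks the HDFS reader for "hdfs://" paths and the local reader otherwise.
ReaderKind chooseReader(const std::string& path) {
  return isHdfsPath(path) ? ReaderKind::Hdfs : ReaderKind::LocalFile;
}
```

    In this sketch, any path that does not start with 'hdfs://' falls back to 
the local file reader, so existing invocations of the tools are unaffected.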
    
    Please review.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/AnatoliShein/orc ORC-17

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/orc/pull/134.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #134
    
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
