[jira] [Comment Edited] (ARROW-11075) [Python] Getting reference not found with OCR enabled pyarrow

2021-01-24 Thread Kandarpa (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-11075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17271070#comment-17271070
 ] 

Kandarpa edited comment on ARROW-11075 at 1/25/21, 5:20 AM:


Hello [~uwe], [~wesm], thanks for looking into this issue.

 

Ran cmake as follows:

{code:java}
# Configure the C++ build; every continued line ends with a backslash
cmake -DCMAKE_INSTALL_PREFIX=$ARROW_HOME \
      -DCMAKE_INSTALL_LIBDIR=lib \
      -DARROW_WITH_BZ2=ON \
      -DARROW_WITH_ZLIB=ON \
      -DARROW_WITH_ZSTD=ON \
      -DARROW_WITH_LZ4=ON \
      -DARROW_WITH_SNAPPY=ON \
      -DARROW_WITH_BROTLI=ON \
      -DARROW_PARQUET=ON \
      -DARROW_PYTHON=ON \
      -DARROW_BUILD_TESTS=ON \
      -DARROW_CUDA=ON \
      -DCUDA_CUDA_LIBRARY=/usr/local/cuda/lib64/stubs/libcuda.so \
      -DARROW_ORC=ON \
      -DARROW_JEMALLOC=ON \
      -DARROW_DATASET=ON \
      ..
make -j
{code}
nm -D ./release/libarrow.so | c++filt | grep orc
 00bf21d0 u guard variable for arrow::adapters::orc::ArrowInputFile::getName[abi:cxx11]() const::filename
 U orc::ParseError::ParseError(char const*)
 U orc::ParseError::ParseError(std::__cxx11::basic_string, std::allocator > const&)
 U orc::ParseError::~ParseError()
 U orc::InputStream::~InputStream()
 U orc::createReader(std::unique_ptr >, orc::ReaderOptions const&)
 U orc::ReaderOptions::ReaderOptions()
 U orc::ReaderOptions::~ReaderOptions()
 U orc::RowReaderOptions::includeTypes(std::__cxx11::list > const&)
 U orc::RowReaderOptions::range(unsigned long, unsigned long)
 U orc::RowReaderOptions::RowReaderOptions(orc::RowReaderOptions const&)
 U orc::RowReaderOptions::RowReaderOptions()
 U orc::RowReaderOptions::~RowReaderOptions()
 00474110 T arrow::io::internal::LibHdfsShim::BuilderSetForceNewInstance(hdfsBuilder*)
 009cfda0 T arrow::adapters::orc::AppendBatch(orc::Type const*, orc::ColumnVectorBatch*, long, long, arrow::ArrayBuilder*)
 009cb720 T arrow::adapters::orc::GetArrowType(orc::Type const*, std::shared_ptr*)
 009c2ad0 T arrow::adapters::orc::ORCFileReader::ReadSchema(std::shared_ptr*)
 009c4140 T arrow::adapters::orc::ORCFileReader::ReadStripe(long, std::shared_ptr*)
 009c4690 T arrow::adapters::orc::ORCFileReader::ReadStripe(long, std::vector > const&, std::shared_ptr*)
 009c2a80 T arrow::adapters::orc::ORCFileReader::NumberOfRows()
 009c2a50 T arrow::adapters::orc::ORCFileReader::NumberOfStripes()
 009c4e50 T arrow::adapters::orc::ORCFileReader::NextStripeReader(long, std::shared_ptr*)
 009c4f60 T arrow::adapters::orc::ORCFileReader::NextStripeReader(long, std::vector > const&, std::shared_ptr*)
 009c2ba0 T arrow::adapters::orc::ORCFileReader::Open(std::shared_ptr const&, arrow::MemoryPool*, std::unique_ptr >*)
 {color:#0747a6}*009c32a0 T arrow::adapters::orc::ORCFileReader::Read(std::shared_ptr*)*{color}
 {color:#0747a6}*009c3630 T arrow::adapters::orc::ORCFileReader::Read(std::shared_ptr const&, std::shared_ptr*)*{color}
 {color:#0747a6}*009c3d80 T arrow::adapters::orc::ORCFileReader::Read(std::shared_ptr const&, std::vector > const&, std::shared_ptr*)*{color}
 009c3760 T arrow::adapters::orc::ORCFileReader::Read(std::vector > const&, std::shared_ptr*)
 009c2810 T arrow::adapters::orc::ORCFileReader::Seek(long)
 009c2600 T arrow::adapters::orc::ORCFileReader::ORCFileReader()
 009c2600 T arrow::adapters::orc::ORCFileReader::ORCFileReader()
 009c2770 T arrow::adapters::orc::ORCFileReader::~ORCFileReader()
 009c2770 T arrow::adapters::orc::ORCFileReader::~ORCFileReader()
 009d0fc0 T arrow::adapters::orc::AppendMapBatch(orc::Type const*, orc::ColumnVectorBatch*, long, long, arrow::ArrayBuilder*)
 009ca990 T arrow::adapters::orc::AppendBoolBatch(orc::ColumnVectorBatch*, long, long, arrow::ArrayBuilder*)
 009d0770 T arrow::adapters::orc::AppendListBatch(orc::Type const*, orc::ColumnVectorBatch*, long, long, arrow::ArrayBuilder*)
 009d1610 W arrow::Status arrow::adapters::orc::AppendBinaryBatch(orc::ColumnVectorBatch*, long, long, arrow::ArrayBuilder*)
 009d1df0 W arrow::Status arrow::adapters::orc::AppendBinaryBatch(orc::ColumnVectorBatch*, long, long, arrow::ArrayBuilder*)
 009d0490 T arrow::adapters::orc::AppendStructBatch(orc::Type const*, orc::ColumnVectorBatch*, long, long, arrow::ArrayBuilder*)
 009cfa60 T arrow::adapters::orc::AppendDecimalBatch(orc::Type const*, orc::ColumnVectorBatch*, long, long, arrow::ArrayBuilder*)
 009cb160 T arrow::adapters::orc::AppendTimestampBatch(orc::ColumnVectorBatch*, long, long, arrow::ArrayBuilder*)
 009cf7d0 T arrow::adapters::orc::AppendFixedBinaryBatch(orc::ColumnVectorBatch*, long, long, arrow::ArrayBuilder*)
 009d46c0 W arrow::Status arrow::adapters::orc::AppendNumericBatchCast, int, orc::LongVe
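
For reading the listing above: in nm output, U marks a symbol that the library references but does not define, while T and W mark symbols it defines itself (W being weak). The following is a minimal sketch for sorting such a listing by type, assuming the output has been unwrapped to one symbol per line. The helper name group_by_type is illustrative only and is not part of Arrow or binutils; the sample lines are copied from the listing.

```python
import re
from typing import Dict, List

# Optional hex address, a one-letter symbol type, then the symbol name.
_NM_LINE = re.compile(r"^\s*(?:[0-9a-fA-F]+\s+)?([A-Za-z])\s+(\S.*)$")


def group_by_type(nm_output: str) -> Dict[str, List[str]]:
    """Group `nm -D` lines by their one-letter symbol type."""
    groups: Dict[str, List[str]] = {}
    for line in nm_output.splitlines():
        match = _NM_LINE.match(line)
        if match:  # lines that do not look like nm entries are skipped
            groups.setdefault(match.group(1), []).append(match.group(2).strip())
    return groups


# Three entries copied from the listing above.
sample = """\
                 U orc::ReaderOptions::ReaderOptions()
009c2810 T arrow::adapters::orc::ORCFileReader::Seek(long)
009c2a80 T arrow::adapters::orc::ORCFileReader::NumberOfRows()
"""

groups = group_by_type(sample)
print("undefined:", groups.get("U", []))
print("defined in text section:", groups.get("T", []))
```

Running the same grouping over the nm output of the libarrow.so that the installed wheel actually loads, rather than the one in the build tree, may help show whether the two differ.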


[jira] [Comment Edited] (ARROW-11075) [Python] Getting reference not found with OCR enabled pyarrow

2021-01-10 Thread Kandarpa (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-11075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17262386#comment-17262386
 ] 

Kandarpa edited comment on ARROW-11075 at 1/11/21, 4:31 AM:


Hello [~uwe]

Please find the following:
|Conda list|[^conda_list.txt]|
|Arrow cpp build logs (includes cmake, make, make install)|[^arrow_cpp_build.log]|
|Arrow python build logs|[^arrow_python_build.log]|

Please let me know if you need any further information.

Regards,

Kandarpa

 



> [Python] Getting reference not found with OCR enabled pyarrow
> -------------------------------------------------------------
>
> Key: ARROW-11075
> URL: https://issues.apache.org/jira/browse/ARROW-11075
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 1.0.1
> Environment: PPC64LE
> Reporter: Kandarpa
> Priority: Major
> Attachments: arrow_cpp_build.log, arrow_python_build.log, conda_list.txt
>
>
> Generated pyarrow with ORC enabled on Power using the following steps:
> {code:java}
> export ARROW_HOME=$CONDA_PREFIX
> mkdir cpp/build
> cd cpp/build
> cmake -DCMAKE_INSTALL_PREFIX=$ARROW_HOME \
>       -DCMAKE_INSTALL_LIBDIR=lib \
>       -DARROW_WITH_BZ2=ON \
>       -DARROW_WITH_ZLIB=ON \
>       -DARROW_WITH_ZSTD=ON \
>       -DARROW_WITH_LZ4=ON \
>       -DARROW_WITH_SNAPPY=ON \
>       -DARROW_WITH_BROTLI=ON \
>       -DARROW_PARQUET=ON \
>       -DARROW_PYTHON=ON \
>       -DARROW_BUILD_TESTS=ON \
>       -DARROW_CUDA=ON \
>       -DCUDA_CUDA_LIBRARY=/usr/local/cuda/lib64/stubs/libcuda.so \
>       -DARROW_ORC=ON \
>   ..
> make -j
> make install
> cd ../../python
> python setup.py build_ext --bundle-arrow-cpp --with-orc --with-cuda --with-parquet bdist_wheel
> {code}
>  
>  
> With the generated wheel installed, I ran the cuDF tests and observed the following error:
> *_ERROR cudf - ImportError: /conda/envs/rmm/lib/python3.7/site-packages/pyarrow/_orc.cpython-37m-powerpc64le-linux-gnu.so: undefined symbol: _ZN5arrow8adapters3orc13OR..._*
> Please find the whole error log below:
> === ERRORS ===
> ___ ERROR collecting test session ___
> /conda/envs/rmm/lib/python3.7/importlib/__init__.py:127: in import_module
>     return _bootstrap._gcd_import(name[level:], package, level)
> <frozen importlib._bootstrap>:1006: in _gcd_import
>     ???
> <frozen importlib._bootstrap>:983: in _find_and_load
>     ???
> <frozen importlib._bootstrap>:953: in _find_and_load_unlocked
>     ???
> <frozen importlib._bootstrap>:219: in _call_with_frames_removed
>     ???
> <frozen importlib._bootstrap>:1006: in _gcd_import
>     ???
> <frozen importlib._bootstrap>:983: in _find_and_load
>     ???
> <frozen importlib._bootstrap>:953: in _find_and_load_unlocked
>     ???
> <frozen importlib._bootstrap>:219: in _call_with_frames_removed
>     ???
> <frozen importlib._bootstrap>:1006: in _gcd_import
>     ???
> <frozen importlib._bootstrap>:983: in _find_and_load
>     ???
> <frozen importlib._bootstrap>:967: in _find_and_load_unlocked
>     ???
> <frozen importlib._bootstrap>:677: in _load_unlocked
>     ???
> <frozen importlib._bootstrap_external>:728: in exec_module
>     ???
> <frozen importlib._bootstrap>:219: in _call_with_frames_removed
>     ???
> cudf/cudf/__init__.py:60: in <module>
>     from cudf.io import (
> cudf/cudf/io/__init__.py:8: in <module>
>     from cudf.io.orc import read_orc, read_orc_metadata, to_orc
> cudf/cudf/io/orc.py:6: in <module>
>     from pyarrow import orc as orc
> /conda/envs/rmm/lib/python3.7/site-packages/pyarrow/orc.py:24: in <module>
>     import pyarrow._orc as _orc
> {color:#de350b}E   ImportError: /conda/envs/rmm/lib/python3.7/site-packages/pyarrow/_orc.cpython-37m-powerpc64le-linux-gnu.so: undefined symbol: _ZN5arrow8adapters3orc13ORCFileReader4ReadEPSt10shared_ptrINS_5TableEE{color}
> === short test summary info ===
> *_ERROR cudf - ImportError: /conda/envs/rmm/lib/python3.7/site-packages/pyarrow/_orc.cpython-37m-powerpc64le-linux-gnu.so: undefined symbol: _ZN5arrow8adapters3orc13OR..._*
> Interrupted: 1 error during collection
> === 1 error in 1.54s ===
> Fatal Python error: Segmentation fault
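
The undefined symbol in the ImportError is a mangled C++ name. Under the Itanium C++ ABI scheme used by GCC on Linux, a nested name is written as _ZN followed by length-prefixed components and a closing E. The sketch below splits only that leading part to show which function the loader could not find. It is not a full demangler (c++filt does that job properly), and nested_name_parts is an illustrative helper name, not an existing API.

```python
from typing import List


def nested_name_parts(mangled: str) -> List[str]:
    """Split the length-prefixed components that follow the `_ZN` prefix.

    Limitation: qualifiers such as `K` (const member functions), template
    arguments and the parameter list are not decoded.
    """
    if not mangled.startswith("_ZN"):
        raise ValueError("expected a symbol starting with _ZN")
    parts: List[str] = []
    i = 3  # skip the "_ZN" prefix
    while i < len(mangled) and mangled[i].isdigit():
        j = i
        while j < len(mangled) and mangled[j].isdigit():
            j += 1  # read the decimal length prefix
        length = int(mangled[i:j])
        parts.append(mangled[j:j + length])
        i = j + length
    return parts


# The full symbol from the ImportError above.
symbol = "_ZN5arrow8adapters3orc13ORCFileReader4ReadEPSt10shared_ptrINS_5TableEE"
print("::".join(nested_name_parts(symbol)))
```

The joined components name arrow::adapters::orc::ORCFileReader::Read, which is the function family highlighted in the nm output posted in the 1/25/21 comment on this issue.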



