[X10-users] Confused with CUDATopology

Richard Gomes Mon, 08 Nov 2010 15:34:10 -0800

Hi guys,

I've executed the procedures necessary to install the Nvidia Driver, the 
development kit and examples. Everything seems to be working fine.


Running the NVidia's deviceQuery program, it says:

./deviceQuery Starting...

  CUDA Device Query (Runtime API) version (CUDART static linking)

There is 1 device supporting CUDA

Device 0: "GeForce 8300 GS"
   CUDA Driver Version:                           3.20
   CUDA Runtime Version:                          3.20
   CUDA Capability Major/Minor version number:    1.1
   Total amount of global memory:                 133496832 bytes
   Multiprocessors x Cores/MP = Cores:            1 (MP) x 8 (Cores/MP) 
= 8 (Cores)
   Total amount of constant memory:               65536 bytes
   Total amount of shared memory per block:       16384 bytes
   Total number of registers available per block: 8192
   Warp size:                                     32
   Maximum number of threads per block:           512
   Maximum sizes of each dimension of a block:    512 x 512 x 64
   Maximum sizes of each dimension of a grid:     65535 x 65535 x 1
   Maximum memory pitch:                          2147483647 bytes
   Texture alignment:                             256 bytes
   Clock rate:                                    0.92 GHz
   Concurrent copy and execution:                 No
   Run time limit on kernels:                     Yes
   Integrated:                                    No
   Support host page-locked memory mapping:       Yes
   Compute mode:                                  Default (multiple host 
threads can use this device simultaneously)
   Concurrent kernel execution:                   No
   Device has ECC support enabled:                No
   Device is using TCC driver mode:               No

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 3.20, CUDA 
Runtime Version = 3.20, NumDevs = 1, Device = GeForce 8300 GS


OK. Next step is CUDATopology. Let's see if it recognizes the board.


I executed the instructions regarding compilation and standalone 
execution parameters. In a nutshell:

$ x10c++ -O -NO_CHECKS -STATIC_CALLS CUDATopology.x10 -o CUDATopology
$ X10RT_ACCELS=ALL ../../bin/runx10 CUDATopology


... and it says:


Dumping places at place: (Place 0)
Place: (Place 0)
   Parent: (Place 0)
   NumChildren: 0
   Is a Host place



What am I doing wrong?


I've tried to recompile the distribution files as recipe below


$ cd x10.dist
$ ant dist -DX10RT_CUDA=true -DNO_CHECKS=true -Doptimize=true


... and I've got this:


build-x10rt:
      [exec] g++ -g -DNDEBUG -DNO_PLACE_CHECKS -O2 -finline-functions 
-ansi -pedantic -Wall -Wextra -Wno-long-long -Wno-unused-parameter 
-pthread -Iinclude -Icommon 
-I"/opt/JavaIDE/jdk1.6.0_21-linux-x86_64"/include 
-I"/opt/JavaIDE/jdk1.6.0_21-linux-x86_64"/include/linux -DENABLE_CUDA 
-isystem/usr/local/cuda/include -shared -fPIC -c common/x10rt_cuda.cc -o 
common/x10rt_cuda.o
      [exec] common/x10rt_cuda.cc: In function 
‘void<unnamed>::cu_safe(CUresult, const char*, int)’:
      [exec] common/x10rt_cuda.cc:112: error: 
‘CUDA_ERROR_POINTER_IS_64BIT’ was not declared in this scope
      [exec] common/x10rt_cuda.cc:114: error: ‘CUDA_ERROR_SIZE_IS_64BIT’ 
was not declared in this scope
      [exec] common/x10rt_cuda.cc:42: warning: enumeration value 
‘CUDA_ERROR_UNSUPPORTED_LIMIT’ not handled in switch
      [exec] common/x10rt_cuda.cc:42: warning: enumeration value 
‘CUDA_ERROR_SHARED_OBJECT_SYMBOL_NOT_FOUND’ not handled in switch
      [exec] common/x10rt_cuda.cc:42: warning: enumeration value 
‘CUDA_ERROR_SHARED_OBJECT_INIT_FAILED’ not handled in switch
      [exec] common/x10rt_cuda.cc:42: warning: enumeration value 
‘CUDA_ERROR_OPERATING_SYSTEM’ not handled in switch
      [exec] common/x10rt_cuda.cc: In function ‘void 
x10rt_cuda_register_msg_receiver(x10rt_cuda_ctx*, x10rt_msg_type, void 
(*)(const x10rt_msg_params*, size_t*, size_t*, size_t*, size_t*, char**, 
size_t*, char**), void (*)(const x10rt_msg_params*, size_t, size_t, 
size_t, size_t, char*, size_t, char*), const char*, const char*)’:
      [exec] common/x10rt_cuda.cc:375: error: cannot convert ‘unsigned 
int*’ to ‘size_t*’ for argument ‘2’ to ‘CUresult 
cuModuleGetGlobal_v2(CUdeviceptr*, size_t*, CUmod_st*, const char*)’
      [exec] common/x10rt_cuda.cc: In function ‘void 
x10rt_cuda_probe(x10rt_cuda_ctx*)’:
      [exec] common/x10rt_cuda.cc:751: warning: dereferencing 
type-punned pointer will break strict-aliasing rules
      [exec] common/x10rt_cuda.cc:752: warning: dereferencing 
type-punned pointer will break strict-aliasing rules
      [exec] make: *** [common/x10rt_cuda.o] Error 1

BUILD FAILED
/home/rgomes/developer/sources/x10/x10-trunk/x10.dist/build.xml:214: The 
following error occurred while executing this line:
/home/rgomes/developer/sources/x10/x10-trunk/x10.runtime/build.xml:281: 
exec returned: 2


Any idea ???


Some more information:


$ which nvcc
/usr/local/cuda/bin/nvcc

$ echo $LD_LIBRARY_PATH
:/usr/local/cuda/lib64:/usr/local/cuda/lib

$ ant -version
Apache Ant version 1.7.1 compiled on June 27 2008

$ mvn -version
Apache Maven 2.2.1 (r801777; 2009-08-06 20:16:01+0100)
Java version: 1.6.0_21
Java home: /opt/JavaIDE/jdk1.6.0_21-linux-x86_64/jre
Default locale: en_GB, platform encoding: UTF-8
OS name: "linux" version: "2.6.32-5-amd64" arch: "amd64" Family: "unix"


Thanks a lot :)


-- 
Richard Gomes
M: +44(77)9955-6813
http://tinyurl.com/frgomes
twitter: frgomes

JQuantLib is a library for Quantitative Finance written in Java.
http://www.jquantlib.org/
twitter: jquantlib

------------------------------------------------------------------------------
The Next 800 Companies to Lead America's Growth: New Video Whitepaper
David G. Thomson, author of the best-selling book "Blueprint to a 
Billion" shares his insights and actions to help propel your 
business during the next growth cycle. Listen Now!
http://p.sf.net/sfu/SAP-dev2dev
_______________________________________________
X10-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/x10-users

[X10-users] Confused with CUDATopology

Reply via email to