Hi guys,
I've executed the procedures necessary to install the Nvidia Driver, the
development kit and examples. Everything seems to be working fine.
Running the NVidia's deviceQuery program, it says:
./deviceQuery Starting...
CUDA Device Query (Runtime API) version (CUDART static linking)
There is 1 device supporting CUDA
Device 0: "GeForce 8300 GS"
CUDA Driver Version: 3.20
CUDA Runtime Version: 3.20
CUDA Capability Major/Minor version number: 1.1
Total amount of global memory: 133496832 bytes
Multiprocessors x Cores/MP = Cores: 1 (MP) x 8 (Cores/MP)
= 8 (Cores)
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 16384 bytes
Total number of registers available per block: 8192
Warp size: 32
Maximum number of threads per block: 512
Maximum sizes of each dimension of a block: 512 x 512 x 64
Maximum sizes of each dimension of a grid: 65535 x 65535 x 1
Maximum memory pitch: 2147483647 bytes
Texture alignment: 256 bytes
Clock rate: 0.92 GHz
Concurrent copy and execution: No
Run time limit on kernels: Yes
Integrated: No
Support host page-locked memory mapping: Yes
Compute mode: Default (multiple host
threads can use this device simultaneously)
Concurrent kernel execution: No
Device has ECC support enabled: No
Device is using TCC driver mode: No
deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 3.20, CUDA
Runtime Version = 3.20, NumDevs = 1, Device = GeForce 8300 GS
OK. Next step is CUDATopology. Let's see if it recognizes the board.
I executed the instructions regarding compilation and standalone
execution parameters. In a nutshell:
$ x10c++ -O -NO_CHECKS -STATIC_CALLS CUDATopology.x10 -o CUDATopology
$ X10RT_ACCELS=ALL ../../bin/runx10 CUDATopology
... and it says:
Dumping places at place: (Place 0)
Place: (Place 0)
Parent: (Place 0)
NumChildren: 0
Is a Host place
What am I doing wrong?
I've tried to recompile the distribution files as recipe below
$ cd x10.dist
$ ant dist -DX10RT_CUDA=true -DNO_CHECKS=true -Doptimize=true
... and I've got this:
build-x10rt:
[exec] g++ -g -DNDEBUG -DNO_PLACE_CHECKS -O2 -finline-functions
-ansi -pedantic -Wall -Wextra -Wno-long-long -Wno-unused-parameter
-pthread -Iinclude -Icommon
-I"/opt/JavaIDE/jdk1.6.0_21-linux-x86_64"/include
-I"/opt/JavaIDE/jdk1.6.0_21-linux-x86_64"/include/linux -DENABLE_CUDA
-isystem/usr/local/cuda/include -shared -fPIC -c common/x10rt_cuda.cc -o
common/x10rt_cuda.o
[exec] common/x10rt_cuda.cc: In function
‘void<unnamed>::cu_safe(CUresult, const char*, int)’:
[exec] common/x10rt_cuda.cc:112: error:
‘CUDA_ERROR_POINTER_IS_64BIT’ was not declared in this scope
[exec] common/x10rt_cuda.cc:114: error: ‘CUDA_ERROR_SIZE_IS_64BIT’
was not declared in this scope
[exec] common/x10rt_cuda.cc:42: warning: enumeration value
‘CUDA_ERROR_UNSUPPORTED_LIMIT’ not handled in switch
[exec] common/x10rt_cuda.cc:42: warning: enumeration value
‘CUDA_ERROR_SHARED_OBJECT_SYMBOL_NOT_FOUND’ not handled in switch
[exec] common/x10rt_cuda.cc:42: warning: enumeration value
‘CUDA_ERROR_SHARED_OBJECT_INIT_FAILED’ not handled in switch
[exec] common/x10rt_cuda.cc:42: warning: enumeration value
‘CUDA_ERROR_OPERATING_SYSTEM’ not handled in switch
[exec] common/x10rt_cuda.cc: In function ‘void
x10rt_cuda_register_msg_receiver(x10rt_cuda_ctx*, x10rt_msg_type, void
(*)(const x10rt_msg_params*, size_t*, size_t*, size_t*, size_t*, char**,
size_t*, char**), void (*)(const x10rt_msg_params*, size_t, size_t,
size_t, size_t, char*, size_t, char*), const char*, const char*)’:
[exec] common/x10rt_cuda.cc:375: error: cannot convert ‘unsigned
int*’ to ‘size_t*’ for argument ‘2’ to ‘CUresult
cuModuleGetGlobal_v2(CUdeviceptr*, size_t*, CUmod_st*, const char*)’
[exec] common/x10rt_cuda.cc: In function ‘void
x10rt_cuda_probe(x10rt_cuda_ctx*)’:
[exec] common/x10rt_cuda.cc:751: warning: dereferencing
type-punned pointer will break strict-aliasing rules
[exec] common/x10rt_cuda.cc:752: warning: dereferencing
type-punned pointer will break strict-aliasing rules
[exec] make: *** [common/x10rt_cuda.o] Error 1
BUILD FAILED
/home/rgomes/developer/sources/x10/x10-trunk/x10.dist/build.xml:214: The
following error occurred while executing this line:
/home/rgomes/developer/sources/x10/x10-trunk/x10.runtime/build.xml:281:
exec returned: 2
Any idea ???
Some more information:
$ which nvcc
/usr/local/cuda/bin/nvcc
$ echo $LD_LIBRARY_PATH
:/usr/local/cuda/lib64:/usr/local/cuda/lib
$ ant -version
Apache Ant version 1.7.1 compiled on June 27 2008
$ mvn -version
Apache Maven 2.2.1 (r801777; 2009-08-06 20:16:01+0100)
Java version: 1.6.0_21
Java home: /opt/JavaIDE/jdk1.6.0_21-linux-x86_64/jre
Default locale: en_GB, platform encoding: UTF-8
OS name: "linux" version: "2.6.32-5-amd64" arch: "amd64" Family: "unix"
Thanks a lot :)
--
Richard Gomes
M: +44(77)9955-6813
http://tinyurl.com/frgomes
twitter: frgomes
JQuantLib is a library for Quantitative Finance written in Java.
http://www.jquantlib.org/
twitter: jquantlib
------------------------------------------------------------------------------
The Next 800 Companies to Lead America's Growth: New Video Whitepaper
David G. Thomson, author of the best-selling book "Blueprint to a
Billion" shares his insights and actions to help propel your
business during the next growth cycle. Listen Now!
http://p.sf.net/sfu/SAP-dev2dev
_______________________________________________
X10-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/x10-users