The PyCUDA binary you use was built with CUDAPP_TRACE_CUDA defined. Try the updated installer.

Christoph


On 4/28/2011 12:09 AM, brett.br...@csiro.au wrote:
Hi

I seem to have successfully installed PyCUDA on a Windows 7 64bit
machine, Python 2.7.1, CUDA 3.2, PyCUDA 2011.1 using instructions from
Christoph Bussler “Windows 7 64-bit with Visual Studio Professional 2008
(Strictly Binary Versions)”. Problem is… it runs very verbosely, and
hence, slowly. Below is an example output from running the demo.py
script from the PyCUDA source.

How can I switch off the verbose output?

Thanks

Regards

Brett

C:\Users\bry074>python
P:\SAF\AICAS\GPU_code_testing\Copy_LinuxGPU\python_scripts\pycuda_demo\demo.py


cuInit

cuDeviceGetCount

cuDeviceGet

cuCtxCreate

cuCtxGetDevice

cuMemAlloc

cuMemcpyHtoD

cuCtxGetDevice

cuDeviceComputeCapability

cuModuleLoadDataEx

cuModuleGetFunction

cuFuncSetBlockShape (doublify)

cuParamSetv (doublify)

cuParamSetSize (doublify)

cuLaunchGrid (doublify)

cuMemcpyDtoH

original array:

[[-1.85515511 -0.61387902 -0.9409495 0.57415462]

[ 2.04725552 -0.43586037 1.19255412 -0.46156505]

[-0.83301312 0.76718497 -0.53578204 0.16794606]

[-1.24282026 -0.25060421 -0.50485265 -1.50926399]]

doubled with kernel:

[[-3.71031022 -1.22775805 -1.881899 1.14830923]

[ 4.09451103 -0.87172073 2.38510823 -0.9231301 ]

[-1.66602623 1.53436995 -1.07156408 0.33589211]

[-2.48564053 -0.50120842 -1.00970531 -3.01852798]]

cuFuncSetBlockShape (doublify)

cuMemAlloc

cuParamSetv (doublify)

cuParamSetSize (doublify)

cuMemcpyHtoD

cuLaunchGrid (doublify)

cuCtxSynchronize

cuMemcpyDtoH

cuMemFree

doubled with InOut:

[[-3.71031022 -1.22775805 -1.881899 1.14830923]

[ 4.09451103 -0.87172073 2.38510823 -0.9231301 ]

[-1.66602623 1.53436995 -1.07156408 0.33589211]

[-2.48564053 -0.50120842 -1.00970531 -3.01852798]]

cuMemAlloc

cuCtxGetDevice

cuDeviceGetAttribute

cuDeviceGetAttribute

cuDeviceComputeCapability

cuDeviceGetAttribute

cuDeviceGetAttribute

cuDeviceComputeCapability

cuDeviceComputeCapability

cuDeviceGetAttribute

cuMemcpyHtoD

cuMemFree

cuMemAlloc

cuCtxGetDevice

cuCtxGetDevice

cuDeviceComputeCapability

cuModuleLoadDataEx

cuModuleGetFunction

cuFuncSetBlockShape (axpb)

cuParamSetSize (axpb)

cuFuncSetBlockShape (axpb)

cuParamSetv (axpb)

cuLaunchGrid (axpb)

cuMemcpyDtoH

cuMemFree

original array:

cuMemcpyDtoH

[[-0.48802722 0.91386962 -0.81973875 0.19833829]

[ 0.51856601 -0.09402688 -0.48339829 1.20150411]

[-1.65700185 2.41741824 0.0643658 0.15416692]

[ 0.3448593 1.04530859 0.03367852 1.69314694]]

doubled with gpuarray:

[[-0.97605443 1.82773924 -1.63947749 0.39667657]

[ 1.03713202 -0.18805376 -0.96679658 2.40300822]

[-3.31400371 4.83483648 0.12873159 0.30833384]

[ 0.6897186 2.09061718 0.06735703 3.38629389]]

cuCtxPopCurrent

cuCtxPushCurrent

cuModuleUnload

cuCtxPopCurrent

cuCtxPushCurrent

cuMemFree

cuCtxPopCurrent

cuCtxPushCurrent

cuModuleUnload

cuCtxPopCurrent

cuCtxPushCurrent

cuCtxDetach



_______________________________________________
PyCUDA mailing list
PyCUDA@tiker.net
http://lists.tiker.net/listinfo/pycuda

_______________________________________________
PyCUDA mailing list
PyCUDA@tiker.net
http://lists.tiker.net/listinfo/pycuda

Reply via email to