7.9. cudaDeviceProp 结构体参考
[CUDA运行时使用的数据类型]
CUDA设备属性
公共变量
- int ECCEnabled
- int accessPolicyMaxWindowSize
- int asyncEngineCount
- int canMapHostMemory
- int canUseHostPointerForRegisteredMem
- int clockRate
- int clusterLaunch
- int computeMode
- int computePreemptionSupported
- int concurrentKernels
- int concurrentManagedAccess
- int cooperativeLaunch
- int cooperativeMultiDeviceLaunch
- int deferredMappingCudaArraySupported
- int deviceOverlap
- int directManagedMemAccessFromHost
- int globalL1CacheSupported
- unsigned int gpuDirectRDMAFlushWritesOptions
- int gpuDirectRDMASupported
- int gpuDirectRDMAWritesOrdering
- int hostNativeAtomicSupported
- int hostRegisterReadOnlySupported
- int hostRegisterSupported
- int integrated
- int ipcEventSupported
- int isMultiGpuBoard
- int kernelExecTimeoutEnabled
- int l2CacheSize
- int localL1CacheSupported
- char luid[8]
- unsigned int luidDeviceNodeMask
- int major
- int managedMemory
- int maxBlocksPerMultiProcessor
- int maxGridSize[3]
- int maxSurface1D
- int maxSurface1DLayered[2]
- int maxSurface2D[2]
- int maxSurface2DLayered[3]
- int maxSurface3D[3]
- int maxSurfaceCubemap
- int maxSurfaceCubemapLayered[2]
- int maxTexture1D
- int maxTexture1DLayered[2]
- int maxTexture1DLinear
- int maxTexture1DMipmap
- int maxTexture2D[2]
- int maxTexture2DGather[2]
- int maxTexture2DLayered[3]
- int maxTexture2DLinear[3]
- int maxTexture2DMipmap[2]
- int maxTexture3D[3]
- int maxTexture3DAlt[3]
- int maxTextureCubemap
- int maxTextureCubemapLayered[2]
- int maxThreadsDim[3]
- int maxThreadsPerBlock
- int maxThreadsPerMultiProcessor
- size_t memPitch
- int memoryBusWidth
- int memoryClockRate
- unsigned int memoryPoolSupportedHandleTypes
- int memoryPoolsSupported
- int minor
- int multiGpuBoardGroupID
- int multiProcessorCount
- char name[256]
- int pageableMemoryAccess
- int pageableMemoryAccessUsesHostPageTables
- int pciBusID
- int pciDeviceID
- int pciDomainID
- int persistingL2CacheMaxSize
- int regsPerBlock
- int regsPerMultiprocessor
- int reserved[63]
- size_t reservedSharedMemPerBlock
- size_t sharedMemPerBlock
- size_t sharedMemPerBlockOptin
- size_t sharedMemPerMultiprocessor
- int singleToDoublePrecisionPerfRatio
- int sparseCudaArraySupported
- int streamPrioritiesSupported
- size_t surfaceAlignment
- int tccDriver
- size_t textureAlignment
- size_t texturePitchAlignment
- int timelineSemaphoreInteropSupported
- size_t totalConstMem
- size_t totalGlobalMem
- int unifiedAddressing
- int unifiedFunctionPointers
- cudaUUID_t uuid
- int warpSize
变量
- int cudaDeviceProp::ECCEnabled [inherited]
-
设备已启用ECC支持
- int cudaDeviceProp::accessPolicyMaxWindowSize [inherited]
- int cudaDeviceProp::asyncEngineCount [inherited]
-
异步引擎数量
- int cudaDeviceProp::canMapHostMemory [inherited]
-
设备可以通过cudaHostAlloc/cudaHostGetDevicePointer映射主机内存
- int cudaDeviceProp::canUseHostPointerForRegisteredMem [inherited]
-
设备可以通过与CPU相同的虚拟地址访问主机注册的内存
- int cudaDeviceProp::clockRate [inherited]
-
已弃用,时钟频率以千赫为单位
- int cudaDeviceProp::clusterLaunch [inherited]
-
表示设备支持集群启动
- int cudaDeviceProp::computeMode [inherited]
-
已弃用,计算模式(参见 cudaComputeMode)
- int cudaDeviceProp::computePreemptionSupported [inherited]
-
设备支持计算抢占
- int cudaDeviceProp::concurrentKernels [inherited]
-
设备可能可以同时执行多个内核
- int cudaDeviceProp::concurrentManagedAccess [inherited]
-
设备可以与CPU同时一致地访问托管内存
- int cudaDeviceProp::cooperativeLaunch [inherited]
-
设备支持通过 cudaLaunchCooperativeKernel 启动协作内核
- int cudaDeviceProp::cooperativeMultiDeviceLaunch [inherited]
-
已弃用,cudaLaunchCooperativeKernelMultiDevice 已被弃用。
- int cudaDeviceProp::deferredMappingCudaArraySupported [inherited]
-
如果设备支持延迟映射CUDA数组和CUDA多级渐远纹理数组,则返回1
- int cudaDeviceProp::deviceOverlap [inherited]
-
设备可以同时复制内存和执行内核。已弃用。请改用asyncEngineCount。
- int cudaDeviceProp::directManagedMemAccessFromHost [inherited]
-
主机可以直接访问设备上的托管内存,无需迁移。
- int cudaDeviceProp::globalL1CacheSupported [inherited]
-
设备支持在L1缓存全局变量
- unsigned int cudaDeviceProp::gpuDirectRDMAFlushWritesOptions [inherited]
-
位掩码,需根据cudaFlushGPUDirectRDMAWritesOptions枚举进行解释
- int cudaDeviceProp::gpuDirectRDMASupported [inherited]
-
如果设备支持GPUDirect RDMA API则返回1,否则返回0
- int cudaDeviceProp::gpuDirectRDMAWritesOrdering [inherited]
-
有关数值值,请参阅 cudaGPUDirectRDMAWritesOrdering 枚举
- int cudaDeviceProp::hostNativeAtomicSupported [inherited]
-
设备与主机之间的链接支持原生原子操作
- int cudaDeviceProp::hostRegisterReadOnlySupported [inherited]
-
设备支持使用cudaHostRegister标志cudaHostRegisterReadOnly来注册必须映射为GPU只读的内存
- int cudaDeviceProp::hostRegisterSupported [inherited]
-
设备支持通过 cudaHostRegister 进行主机内存注册。
- int cudaDeviceProp::integrated [inherited]
-
设备是集成而非独立的
- int cudaDeviceProp::ipcEventSupported [inherited]
-
设备支持IPC事件。
- int cudaDeviceProp::isMultiGpuBoard [inherited]
-
设备位于多GPU主板上
- int cudaDeviceProp::kernelExecTimeoutEnabled [inherited]
-
已弃用,指定内核是否存在运行时限制
- int cudaDeviceProp::l2CacheSize [inherited]
-
L2缓存的大小(以字节为单位)
- int cudaDeviceProp::localL1CacheSupported [inherited]
-
设备支持在L1缓存中缓存局部变量
- char cudaDeviceProp::luid[8] [inherited]
-
8字节本地唯一标识符。在TCC和非Windows平台上该值未定义
- unsigned int cudaDeviceProp::luidDeviceNodeMask [inherited]
-
LUID设备节点掩码。在TCC和非Windows平台上该值未定义
- int cudaDeviceProp::major [inherited]
-
主要计算能力
- int cudaDeviceProp::managedMemory [inherited]
-
设备支持在此系统上分配托管内存
- int cudaDeviceProp::maxBlocksPerMultiProcessor [inherited]
-
每个多处理器上常驻块的最大数量
- int cudaDeviceProp::maxGridSize[3] [inherited]
-
网格每个维度的最大尺寸
- int cudaDeviceProp::maxSurface1D [inherited]
-
最大一维表面尺寸
- int cudaDeviceProp::maxSurface1DLayered[2] [inherited]
-
最大一维分层表面尺寸
- int cudaDeviceProp::maxSurface2D[2] [inherited]
-
最大二维表面尺寸
- int cudaDeviceProp::maxSurface2DLayered[3] [inherited]
-
最大二维分层表面尺寸
- int cudaDeviceProp::maxSurface3D[3] [inherited]
-
最大3D表面尺寸
- int cudaDeviceProp::maxSurfaceCubemap [inherited]
-
立方体贴图表面的最大尺寸
- int cudaDeviceProp::maxSurfaceCubemapLayered[2] [inherited]
-
立方体贴图分层表面的最大尺寸
- int cudaDeviceProp::maxTexture1D [inherited]
-
一维纹理的最大尺寸
- int cudaDeviceProp::maxTexture1DLayered[2] [inherited]
-
一维分层纹理的最大尺寸
- int cudaDeviceProp::maxTexture1DLinear [inherited]
- int cudaDeviceProp::maxTexture1DMipmap [inherited]
-
最大一维mipmapped纹理尺寸
- int cudaDeviceProp::maxTexture2D[2] [inherited]
-
最大2D纹理尺寸
- int cudaDeviceProp::maxTexture2DGather[2] [inherited]
-
执行纹理收集操作时的最大2D纹理尺寸
- int cudaDeviceProp::maxTexture2DLayered[3] [inherited]
-
最大二维分层纹理尺寸
- int cudaDeviceProp::maxTexture2DLinear[3] [inherited]
-
绑定到倾斜内存的2D纹理的最大尺寸(宽度、高度、间距)
- int cudaDeviceProp::maxTexture2DMipmap[2] [inherited]
-
最大2D多级纹理贴图尺寸
- int cudaDeviceProp::maxTexture3D[3] [inherited]
-
最大3D纹理尺寸
- int cudaDeviceProp::maxTexture3DAlt[3] [inherited]
-
最大替代3D纹理尺寸
- int cudaDeviceProp::maxTextureCubemap [inherited]
-
立方体贴图纹理的最大尺寸
- int cudaDeviceProp::maxTextureCubemapLayered[2] [inherited]
-
立方体贴图分层纹理的最大尺寸
- int cudaDeviceProp::maxThreadsDim[3] [inherited]
-
每个块维度的最大尺寸
- int cudaDeviceProp::maxThreadsPerBlock [inherited]
-
每个块的最大线程数
- int cudaDeviceProp::maxThreadsPerMultiProcessor [inherited]
-
每个多处理器上的最大常驻线程数
- size_t cudaDeviceProp::memPitch [inherited]
-
内存拷贝允许的最大间距(以字节为单位)
- int cudaDeviceProp::memoryBusWidth [inherited]
-
全局内存总线宽度(以位为单位)
- int cudaDeviceProp::memoryClockRate [inherited]
-
已弃用,峰值内存时钟频率(单位:千赫兹)
- unsigned int cudaDeviceProp::memoryPoolSupportedHandleTypes [inherited]
-
支持基于内存池的IPC的句柄类型的位掩码
- int cudaDeviceProp::memoryPoolsSupported [inherited]
-
如果设备支持使用cudaMallocAsync和cudaMemPool系列API,则返回1,否则返回0
- int cudaDeviceProp::minor [inherited]
-
次要计算能力
- int cudaDeviceProp::multiGpuBoardGroupID [inherited]
-
同一多GPU板卡上设备组的唯一标识符
- int cudaDeviceProp::multiProcessorCount [inherited]
-
设备上的多处理器数量
- char cudaDeviceProp::name[256] [inherited]
-
标识设备的ASCII字符串
- int cudaDeviceProp::pageableMemoryAccess [inherited]
-
设备支持无需调用cudaHostRegister即可一致访问可分页内存
- int cudaDeviceProp::pageableMemoryAccessUsesHostPageTables [inherited]
-
设备通过主机的页表访问可分页内存
- int cudaDeviceProp::pciBusID [inherited]
-
设备的PCI总线ID
- int cudaDeviceProp::pciDeviceID [inherited]
-
设备的PCI设备ID
- int cudaDeviceProp::pciDomainID [inherited]
-
设备的PCI域ID
- int cudaDeviceProp::persistingL2CacheMaxSize [inherited]
-
设备的L2持久行容量最大设置(以字节为单位)
- int cudaDeviceProp::regsPerBlock [inherited]
-
每个块可用的32位寄存器数量
- int cudaDeviceProp::regsPerMultiprocessor [inherited]
-
每个多处理器可用的32位寄存器数量
- int cudaDeviceProp::reserved[63] [inherited]
-
保留供未来使用
- size_t cudaDeviceProp::reservedSharedMemPerBlock [inherited]
-
CUDA驱动为每个块保留的共享内存大小(字节)
- size_t cudaDeviceProp::sharedMemPerBlock [inherited]
-
每个块可用的共享内存大小(以字节为单位)
- size_t cudaDeviceProp::sharedMemPerBlockOptin [inherited]
-
每个设备每个块可用的最大共享内存(需特别选择启用)
- size_t cudaDeviceProp::sharedMemPerMultiprocessor [inherited]
-
每个多处理器可用的共享内存大小(以字节为单位)
- int cudaDeviceProp::singleToDoublePrecisionPerfRatio [inherited]
-
已弃用,单精度性能(以每秒浮点运算次数计)与双精度性能的比率
- int cudaDeviceProp::sparseCudaArraySupported [inherited]
-
如果设备支持稀疏CUDA数组和稀疏CUDA mipmapped数组,则返回1,否则返回0
- int cudaDeviceProp::streamPrioritiesSupported [inherited]
-
设备支持流优先级
- size_t cudaDeviceProp::surfaceAlignment [inherited]
-
表面对齐要求
- int cudaDeviceProp::tccDriver [inherited]
-
如果设备是使用TCC驱动的Tesla设备则为1,否则为0
- size_t cudaDeviceProp::textureAlignment [inherited]
-
纹理的对齐要求
- size_t cudaDeviceProp::texturePitchAlignment [inherited]
-
绑定到倾斜内存的纹理引用所需的音高对齐要求
- int cudaDeviceProp::timelineSemaphoreInteropSupported [inherited]
-
设备支持外部时间线信号量互操作
- size_t cudaDeviceProp::totalConstMem [inherited]
-
设备上可用的常量内存大小(以字节为单位)
- size_t cudaDeviceProp::totalGlobalMem [inherited]
-
设备上可用的全局内存大小(以字节为单位)
- int cudaDeviceProp::unifiedAddressing [inherited]
-
设备与主机共享统一地址空间
- int cudaDeviceProp::unifiedFunctionPointers [inherited]
-
表示设备支持统一指针
- cudaUUID_t cudaDeviceProp::uuid [inherited]
-
16字节唯一标识符
- int cudaDeviceProp::warpSize [inherited]
-
线程束大小(以线程数为单位)