Class cudaFuncAttributes

    • Constructor Detail

      • cudaFuncAttributes

        public cudaFuncAttributes()
        Default native constructor.
      • cudaFuncAttributes

        public cudaFuncAttributes​(long size)
        Native array allocator. Access with Pointer.position(long).
    • Method Detail

      • sharedSizeBytes

        @Cast("size_t")
        public long sharedSizeBytes()
        The size in bytes of statically-allocated shared memory per block required by this function. This does not include dynamically-allocated shared memory requested by the user at runtime.
      • constSizeBytes

        @Cast("size_t")
        public long constSizeBytes()
        The size in bytes of user-allocated constant memory required by this function.
      • localSizeBytes

        @Cast("size_t")
        public long localSizeBytes()
        The size in bytes of local memory used by each thread of this function.
      • maxThreadsPerBlock

        public int maxThreadsPerBlock()
        The maximum number of threads per block, beyond which a launch of the function would fail. This number depends on both the function and the device on which the function is currently loaded.
      • numRegs

        public int numRegs()
        The number of registers used by each thread of this function.
      • ptxVersion

        public int ptxVersion()
        The PTX virtual architecture version for which the function was compiled. This value is the major PTX version * 10 + the minor PTX version, so a PTX version 1.3 function would return the value 13.
      • binaryVersion

        public int binaryVersion()
        The binary architecture version for which the function was compiled. This value is the major binary version * 10 + the minor binary version, so a binary version 1.3 function would return the value 13.
      • cacheModeCA

        public int cacheModeCA()
        The attribute to indicate whether the function has been compiled with user specified option "-Xptxas --dlcm=ca" set.
      • maxDynamicSharedSizeBytes

        public int maxDynamicSharedSizeBytes()
        The maximum size in bytes of dynamic shared memory per block for this function. Any launch must have a dynamic shared memory size smaller than this value.
      • maxDynamicSharedSizeBytes

        public cudaFuncAttributes maxDynamicSharedSizeBytes​(int setter)
      • preferredShmemCarveout

        public int preferredShmemCarveout()
        On devices where the L1 cache and shared memory use the same hardware resources, this sets the shared memory carveout preference, in percent of the maximum shared memory. Refer to ::cudaDevAttrMaxSharedMemoryPerMultiprocessor. This is only a hint, and the driver can choose a different ratio if required to execute the function. See ::cudaFuncSetAttribute
      • preferredShmemCarveout

        public cudaFuncAttributes preferredShmemCarveout​(int setter)