Class GPUObject
- java.lang.Object
- 
- org.apache.sysds.runtime.instructions.gpu.context.GPUObject
 
- 
 public class GPUObject extends Object Handle to a matrix block on the GPU
- 
- 
Field SummaryFields Modifier and Type Field Description GPUObjectlineageCachedChainHeadNext GPUObject that points to the same lineage cached GPU pointerGPUObjectnextLineageCachedEntryHead of the linked list of GPUObjects that point to the same lineage cached GPU pointer
 - 
Constructor SummaryConstructors Constructor Description GPUObject(GPUContext gCtx, GPUObject that, MatrixObject mat)
 - 
Method SummaryAll Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description booleanacquireDeviceModifyDense()booleanacquireDeviceModifyDense(boolean initialize)booleanacquireDeviceModifySparse()booleanacquireDeviceModifySparse(boolean initialize)booleanacquireDeviceRead(String opcode)booleanacquireHostRead(String instName)if the data is allocated on the GPU and is dirty, it is copied back to the host memoryvoidaddReadLock()voidaddWriteLock()voidallocateAndFillDense(double v)Allocates a dense matrix of size obtained from the attached matrix metadata and fills it up with a single valuevoidallocateSparseAndEmpty()Allocates a sparse and emptyGPUObjectThis is the result of operations that are both non zero matrices.voidclearData(String opcode, boolean eager)Clears the data associated with thisGPUObjectinstancevoidclearDensePointer()Removes the dense pointer and potential soft referencevoidclearGPUObject()Objectclone()static CSRPointercolumnMajorDenseToRowMajorSparse(GPUContext gCtx, jcuda.jcusparse.cusparseHandle cusparseHandle, jcuda.Pointer densePtr, int rows, int cols)Convenience method to convert a CSR matrix to a dense matrix on the GPU Since the allocated matrix is temporary, bookkeeping is not updated.voiddenseColumnMajorToRowMajor()Convenience method.voiddenseRowMajorToColumnMajor()Convenience method.voiddenseToSparse()Converts this GPUObject from dense to sparse format.MatrixBlockevictFromDeviceToHostMB(String instName, boolean eagerDelete)jcuda.PointergetDensePointer()Pointer to dense matrixlonggetDensePointerAddress()CSRPointergetJcudaSparseMatrixPtr()Pointer to sparse matrixMatrixObjectgetMatrixObject()longgetNnz(String instName, boolean recomputeDenseNNZ)Being allocated is a prerequisite for computing nnz.static longgetPointerAddress(jcuda.Pointer p)longgetSizeOnDevice()CSRPointergetSparseMatrixCudaPointer()Convenience method to directly examine the Sparse matrix on GPUbooleanisAllocated()booleanisDensePointerNull()Checks if the dense pointer is nullbooleanisDirty()Whether this block is dirty on the GPUbooleanisLinCached()booleanisLocked()booleanisrmVarPending()booleanisSparse()booleanisSparseAndEmpty()If thisGPUObjectis sparse and empty Being allocated is a prerequisite to being sparse and empty.voidreleaseInput()Releases input allocated on GPUvoidreleaseOutput()releases output allocated on GPUvoidreleaseReadLock()voidreleaseWriteLock()voidresetReadWriteLock()voidsetDensePointer(jcuda.Pointer densePtr)Convenience method to directly set the dense matrix pointer on GPUvoidsetDirty(boolean flag)voidsetIsLinCached(boolean val)voidsetrmVarPending(boolean val)voidsetSparseMatrixCudaPointer(CSRPointer sparseMatrixPtr)Convenience method to directly set the sparse matrix on GPU Needed for operations like cusparseDcsrgemm(cusparseHandle, int, int, int, int, int, cusparseMatDescr, int, Pointer, Pointer, Pointer, cusparseMatDescr, int, Pointer, Pointer, Pointer, cusparseMatDescr, Pointer, Pointer, Pointer)voidsparseToColumnMajorDense()More efficient method to convert sparse to dense but returns dense in column major formatvoidsparseToDense()Convert sparse to dense (Performs transpose, use sparseToColumnMajorDense if the kernel can deal with column major format)voidsparseToDense(String instructionName)Convert sparse to dense (Performs transpose, use sparseToColumnMajorDense if the kernel can deal with column major format) Also records per instruction invokation of sparseToDense.static inttoIntExact(long l)StringtoString()static jcuda.Pointertranspose(GPUContext gCtx, jcuda.Pointer densePtr, int m, int n, int lda, int ldc)Transposes a dense matrix on the GPU by calling the cublasDgeam operation
 
- 
- 
- 
Constructor Detail- 
GPUObjectpublic GPUObject(GPUContext gCtx, GPUObject that, MatrixObject mat) 
 
- 
 - 
Method Detail- 
getDensePointerpublic jcuda.Pointer getDensePointer() Pointer to dense matrix- Returns:
- a pointer to the dense matrix
 
 - 
isDensePointerNullpublic boolean isDensePointerNull() Checks if the dense pointer is null- Returns:
- if the state of dense pointer is null
 
 - 
clearDensePointerpublic void clearDensePointer() Removes the dense pointer and potential soft reference
 - 
getMatrixObjectpublic MatrixObject getMatrixObject() 
 - 
setDensePointerpublic void setDensePointer(jcuda.Pointer densePtr) Convenience method to directly set the dense matrix pointer on GPU- Parameters:
- densePtr- dense pointer
 
 - 
setDirtypublic void setDirty(boolean flag) 
 - 
clonepublic Object clone() 
 - 
transposepublic static jcuda.Pointer transpose(GPUContext gCtx, jcuda.Pointer densePtr, int m, int n, int lda, int ldc) Transposes a dense matrix on the GPU by calling the cublasDgeam operation- Parameters:
- gCtx- a valid- GPUContext
- densePtr- Pointer to dense matrix on the GPU
- m- rows in ouput matrix
- n- columns in output matrix
- lda- rows in input matrix
- ldc- columns in output matrix
- Returns:
- transposed matrix
 
 - 
columnMajorDenseToRowMajorSparsepublic static CSRPointer columnMajorDenseToRowMajorSparse(GPUContext gCtx, jcuda.jcusparse.cusparseHandle cusparseHandle, jcuda.Pointer densePtr, int rows, int cols) Convenience method to convert a CSR matrix to a dense matrix on the GPU Since the allocated matrix is temporary, bookkeeping is not updated. Also note that the input dense matrix is expected to be in COLUMN MAJOR FORMAT Caller is responsible for deallocating memory on GPU.- Parameters:
- gCtx- a valid- GPUContext
- cusparseHandle- handle to cusparse library
- densePtr- [in] dense matrix pointer on the GPU in row major
- rows- number of rows
- cols- number of columns
- Returns:
- CSR (compressed sparse row) pointer
 
 - 
getSparseMatrixCudaPointerpublic CSRPointer getSparseMatrixCudaPointer() Convenience method to directly examine the Sparse matrix on GPU- Returns:
- CSR (compressed sparse row) pointer
 
 - 
setSparseMatrixCudaPointerpublic void setSparseMatrixCudaPointer(CSRPointer sparseMatrixPtr) Convenience method to directly set the sparse matrix on GPU Needed for operations like cusparseDcsrgemm(cusparseHandle, int, int, int, int, int, cusparseMatDescr, int, Pointer, Pointer, Pointer, cusparseMatDescr, int, Pointer, Pointer, Pointer, cusparseMatDescr, Pointer, Pointer, Pointer)- Parameters:
- sparseMatrixPtr- CSR (compressed sparse row) pointer
 
 - 
denseToSparsepublic void denseToSparse() Converts this GPUObject from dense to sparse format.
 - 
denseRowMajorToColumnMajorpublic void denseRowMajorToColumnMajor() Convenience method. Converts Row Major Dense Matrix to Column Major Dense Matrix
 - 
denseColumnMajorToRowMajorpublic void denseColumnMajorToRowMajor() Convenience method. Converts Column Major Dense Matrix to Row Major Dense Matrix
 - 
sparseToDensepublic void sparseToDense() Convert sparse to dense (Performs transpose, use sparseToColumnMajorDense if the kernel can deal with column major format)
 - 
sparseToDensepublic void sparseToDense(String instructionName) Convert sparse to dense (Performs transpose, use sparseToColumnMajorDense if the kernel can deal with column major format) Also records per instruction invokation of sparseToDense.- Parameters:
- instructionName- Name of the instruction for which statistics are recorded in- GPUStatistics
 
 - 
sparseToColumnMajorDensepublic void sparseToColumnMajorDense() More efficient method to convert sparse to dense but returns dense in column major format
 - 
isSparsepublic boolean isSparse() 
 - 
isAllocatedpublic boolean isAllocated() 
 - 
allocateSparseAndEmptypublic void allocateSparseAndEmpty() Allocates a sparse and emptyGPUObjectThis is the result of operations that are both non zero matrices.
 - 
allocateAndFillDensepublic void allocateAndFillDense(double v) Allocates a dense matrix of size obtained from the attached matrix metadata and fills it up with a single value- Parameters:
- v- value to fill up the dense matrix
 
 - 
isSparseAndEmptypublic boolean isSparseAndEmpty() If thisGPUObjectis sparse and empty Being allocated is a prerequisite to being sparse and empty.- Returns:
- true if sparse and empty
 
 - 
getNnzpublic long getNnz(String instName, boolean recomputeDenseNNZ) Being allocated is a prerequisite for computing nnz. Note: if the matrix is in dense format, it explicitly re-computes the number of nonzeros.- Parameters:
- instName- instruction name
- recomputeDenseNNZ- recompute NNZ if dense
- Returns:
- the number of nonzeroes
 
 - 
acquireDeviceReadpublic boolean acquireDeviceRead(String opcode) 
 - 
acquireDeviceModifyDensepublic boolean acquireDeviceModifyDense() 
 - 
acquireDeviceModifyDensepublic boolean acquireDeviceModifyDense(boolean initialize) 
 - 
acquireDeviceModifySparsepublic boolean acquireDeviceModifySparse() 
 - 
acquireDeviceModifySparsepublic boolean acquireDeviceModifySparse(boolean initialize) 
 - 
acquireHostReadpublic boolean acquireHostRead(String instName) if the data is allocated on the GPU and is dirty, it is copied back to the host memory- Parameters:
- instName- name of the instruction
- Returns:
- true if a copy to host happened, false otherwise
 
 - 
isLockedpublic boolean isLocked() 
 - 
addReadLockpublic void addReadLock() 
 - 
addWriteLockpublic void addWriteLock() 
 - 
releaseReadLockpublic void releaseReadLock() 
 - 
releaseWriteLockpublic void releaseWriteLock() 
 - 
resetReadWriteLockpublic void resetReadWriteLock() 
 - 
releaseInputpublic void releaseInput() Releases input allocated on GPU
 - 
releaseOutputpublic void releaseOutput() releases output allocated on GPU
 - 
getSizeOnDevicepublic long getSizeOnDevice() 
 - 
toIntExactpublic static int toIntExact(long l) 
 - 
evictFromDeviceToHostMBpublic MatrixBlock evictFromDeviceToHostMB(String instName, boolean eagerDelete) throws DMLRuntimeException - Throws:
- DMLRuntimeException
 
 - 
clearDatapublic void clearData(String opcode, boolean eager) throws DMLRuntimeException Clears the data associated with thisGPUObjectinstance- Parameters:
- opcode- opcode of the instruction
- eager- whether to be done synchronously or asynchronously
- Throws:
- DMLRuntimeException- if error occurs
 
 - 
clearGPUObjectpublic void clearGPUObject() 
 - 
getJcudaSparseMatrixPtrpublic CSRPointer getJcudaSparseMatrixPtr() Pointer to sparse matrix- Returns:
- ?
 
 - 
isDirtypublic boolean isDirty() Whether this block is dirty on the GPU- Returns:
- ?
 
 - 
setIsLinCachedpublic void setIsLinCached(boolean val) 
 - 
isLinCachedpublic boolean isLinCached() 
 - 
setrmVarPendingpublic void setrmVarPending(boolean val) 
 - 
isrmVarPendingpublic boolean isrmVarPending() 
 - 
getDensePointerAddresspublic long getDensePointerAddress() 
 - 
getPointerAddresspublic static long getPointerAddress(jcuda.Pointer p) 
 
- 
 
-