simba.pp.pca

simba.pp.pca(adata, n_components=50, algorithm='randomized', n_iter=5, random_state=2021, tol=0.0, feature=None, **kwargs)[source]

perform Principal Component Analysis (PCA)

Parameters:

adata (AnnData) – Annotated data matrix.
n_components (int, optional (default: 50)) – Desired dimensionality of output data
algorithm (str, optional (default: ‘randomized’)) – SVD solver to use. Choose from {‘arpack’, ‘randomized’}.
n_iter (int, optional (default: ‘5’)) – Number of iterations for randomized SVD solver. Not used by ARPACK.
tol (float, optional (default: 0)) – Tolerance for ARPACK. 0 means machine precision. Ignored by randomized SVD solver.
feature (str, optional (default: None)) – Feature used to perform PCA. The data type of .var[feature] needs to be bool If None, adata.X will be used.
kwargs – Other keyword arguments are passed down to TruncatedSVD()

Returns:

updates adata with the following fields
`.obsm[‘X_pca’]` (array) – PCA transformed X.
`.uns[‘pca’][‘PCs’]` (array) – Principal components in feature space, representing the directions of maximum variance in the data.
`.uns[‘pca’][‘variance’]` (array) – The variance of the training samples transformed by a projection to each component.
`.uns[‘pca’][‘variance_ratio’]` (array) – Percentage of variance explained by each of the selected components.