hello,can you help me ? #29

Open · wants to merge 94 commits into master
Conversation

1366409175

AttributeError                            Traceback (most recent call last)
~\AppData\Local\Temp\ipykernel_9316\2717546558.py in <module>
----> 1 svg_full, _ = SpatialDE.test(data, omnibus=True)
      2 svg_full["total_counts"] = np.asarray(data.X.sum(axis=0)).squeeze()
      3 svg_full.to_pickle("ST8059048_svg_full.pkl")

AttributeError: module 'SpatialDE' has no attribute 'test'

TODO: port AEH, investigate caching distance matrices for fixed inducers.
Instead of AEH, use a sum of a spectral mixture kernel (Wilson & Adams, 2013)
and a linear kernel; ARD should figure out the rest.
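For reference, a minimal NumPy sketch of this kernel combination: the spectral mixture kernel of Wilson & Adams (2013) summed with an ARD linear kernel. The function names, parameterization, and shapes are illustrative assumptions, not the code in this PR.

```python
import numpy as np

def spectral_mixture_kernel(X1, X2, weights, means, variances):
    # Spectral mixture kernel (Wilson & Adams, 2013) for inputs of shape (n, d).
    # weights: (Q,), means: (Q, d), variances: (Q, d); parameterization is illustrative.
    tau = X1[:, None, :] - X2[None, :, :]  # pairwise differences, shape (n1, n2, d)
    K = np.zeros((X1.shape[0], X2.shape[0]))
    for w, mu, v in zip(weights, means, variances):
        envelope = np.exp(-2.0 * np.pi**2 * np.sum(tau**2 * v, axis=-1))
        phase = np.prod(np.cos(2.0 * np.pi * tau * mu), axis=-1)
        K += w * envelope * phase
    return K

def linear_kernel(X1, X2, ard_variances):
    # Linear kernel with a separate variance per input dimension (ARD).
    return (X1 * ard_variances) @ X2.T

def combined_kernel(X1, X2, sm_params, ard_variances):
    # Sum of the two kernels; ARD should shrink whichever components are not needed.
    return spectral_mixture_kernel(X1, X2, *sm_params) + linear_kernel(X1, X2, ard_variances)
```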

This also changes the default settings to fixed inducers and ard=True.
- implement sparse GPs according to Titsias (2009) (see the sketch below)
- use score test with BH correction
- reorganize code
Note that the results data frame no longer contains null model results.
Previously, a score test was performed for every lengthscale; now, only
the maximum likelihood model is tested.
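For the sparse-GP bullet above, here is a minimal NumPy sketch of the collapsed variational lower bound from Titsias (2009) for a Gaussian likelihood, written so the full n×n covariance is never formed. The function name and argument layout are assumptions, not the implementation in this PR.

```python
import numpy as np

def titsias_elbo(y, K_nn_diag, K_nm, K_mm, noise_var, jitter=1e-6):
    # Collapsed bound of Titsias (2009):
    #   F = log N(y | 0, Q_nn + sigma^2 I) - tr(K_nn - Q_nn) / (2 sigma^2),
    # with Q_nn = K_nm K_mm^{-1} K_mn.
    n, m = K_nm.shape
    L = np.linalg.cholesky(K_mm + jitter * np.eye(m))      # K_mm = L L^T
    A = np.linalg.solve(L, K_nm.T) / np.sqrt(noise_var)    # (m, n)
    B = np.eye(m) + A @ A.T
    LB = np.linalg.cholesky(B)
    c = np.linalg.solve(LB, A @ y) / np.sqrt(noise_var)
    # log N(y | 0, Q_nn + sigma^2 I) via the determinant and Woodbury identities
    log_det = n * np.log(noise_var) + 2.0 * np.sum(np.log(np.diag(LB)))
    quad = (y @ y) / noise_var - c @ c
    log_marginal = -0.5 * (n * np.log(2.0 * np.pi) + log_det + quad)
    # trace correction tr(K_nn - Q_nn) / (2 sigma^2)
    trace_term = 0.5 * (np.sum(K_nn_diag) / noise_var - np.sum(A * A))
    return log_marginal - trace_term
```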
The standard workflow now is run() to detect spatially variable genes,
followed by fit_patterns(), which fits spectral mixture kernels to the
spatially variable genes. Alternatively, one can use run_detailed() to
fit spectral mixture kernels directly.
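Based only on the commit message above, the intended usage looks roughly like the following; the argument lists and return values are assumptions, and note that this differs from the SpatialDE.test(...) call shown in the traceback at the top of this thread.

```python
import SpatialDE

# Hypothetical usage assembled from the commit message; argument names are assumptions.
svg = SpatialDE.run(data)                     # score test: detect spatially variable genes
patterns = SpatialDE.fit_patterns(data, svg)  # fit spectral mixture kernels to the hits

# Alternative single-step route described above:
detailed = SpatialDE.run_detailed(data)       # fit spectral mixture kernels directly
```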
this makes p-values under the null uniformly distributed
this prevents significant hits where a single spot has many reads while the
gene has very few reads overall
the previous strategy may have been generating false positives due to
the score test being performed with the fitted kernel. The current
workflow is to first run the score test on multiple kernels and then
analyze the detected spatially variable genes in more detail with GPs.
Based on extensive benchmarking, the Cauchy combination seems to perform
best in terms of statistical power, so it is the default. If CPU time is
a bottleneck, the omnibus test can be used instead. It may lose some
power, but its runtime is independent of the number of kernels to be
tested. In the default configuration, it is 10x faster than the Cauchy
combination.
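As a reference for the combination step described here, a minimal sketch of the Cauchy combination of per-kernel p-values; the weighting and naming are illustrative, not the exact code in this PR.

```python
import numpy as np

def cauchy_combination(pvals, weights=None):
    # Cauchy combination test:
    #   T = sum_i w_i * tan((0.5 - p_i) * pi),  p_combined = 1/2 - arctan(T) / pi.
    # Under the null, T is approximately standard Cauchy regardless of dependence
    # between the individual p-values.
    pvals = np.asarray(pvals, dtype=float)
    if weights is None:
        weights = np.full(pvals.shape, 1.0 / pvals.size)
    T = np.sum(weights * np.tan((0.5 - pvals) * np.pi))
    return 0.5 - np.arctan(T) / np.pi

# e.g. one p-value per kernel tested for a single gene
p_combined = cauchy_combination([0.01, 0.2, 0.6])
```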

This commit also ports the score test to use TensorFlow, so it can run
on GPUs. This gives a 100x speed-up compared to CPU.
this uses a Hidden Markov Random Field with a Dirichlet process prior to
automatically detect the number of cell types present in the given
sample. It uses a Poisson likelihood with a Gamma prior. If no spatial
information is given, the model reduces to a Dirichlet process Poisson
mixture.
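A minimal sketch of the Gamma-Poisson piece of such a mixture: the Gamma prior is conjugate to the Poisson likelihood, so each cluster's posterior predictive is a closed-form negative binomial, which is what a Dirichlet-process mixture needs to score cluster assignments. The function and hyperparameter names are illustrative assumptions, not this PR's implementation.

```python
import numpy as np
from scipy.special import gammaln

def log_predictive(x, cluster_sum, cluster_n, alpha=1.0, beta=1.0):
    # Poisson likelihood with Gamma(alpha, beta) prior on the rate (rate parameterization).
    # Posterior after cluster_n observations summing to cluster_sum is
    # Gamma(alpha + cluster_sum, beta + cluster_n); integrating out the rate gives
    # a negative-binomial predictive for a new count x.
    r = alpha + cluster_sum
    b = beta + cluster_n
    return (gammaln(x + r) - gammaln(r) - gammaln(x + 1)
            + r * np.log(b / (b + 1.0)) - x * np.log(b + 1.0))

# In a DP mixture, each observation is scored against every existing cluster with
# log_predictive(...) plus the log cluster weight, and against a fresh cluster
# (cluster_sum = cluster_n = 0) weighted by the concentration parameter.
```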
the previous implementation relied on TensorFlow's arithmetic optimizer to
remove redundant Cholesky factorizations. Visualizing the graph in
TensorBoard showed that this was not the case: one superfluous Cholesky
factorization was performed in every iteration.
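The point of that change, sketched in TensorFlow: factor once and reuse the result, rather than relying on the graph optimizer to merge separate cholesky calls. The tf.linalg calls are standard; the surrounding function is illustrative.

```python
import tensorflow as tf

def mvn_quadratic_and_logdet(K, y, jitter=1e-6):
    # Compute y^T K^{-1} y and log|K| from a single Cholesky factorization.
    # Calling tf.linalg.cholesky once per quantity duplicates the O(n^3) work
    # unless the graph optimizer happens to deduplicate the calls.
    n = tf.shape(K)[0]
    L = tf.linalg.cholesky(K + jitter * tf.eye(n, dtype=K.dtype))  # factor once
    alpha = tf.linalg.cholesky_solve(L, y[:, None])                # K^{-1} y
    quadratic = tf.reduce_sum(y[:, None] * alpha)
    logdet = 2.0 * tf.reduce_sum(tf.math.log(tf.linalg.diag_part(L)))
    return quadratic, logdet
```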
a remnant of ScoreTest.dtype was left after commit d2dd0bf
this plays better with ScanPy's plotting functions
this plays better with ScanPy's plotting
parameter fits for the null model are always done in float64, but the test
itself uses default_float
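A small sketch of that dtype split, assuming a gpflow-style default_float(): the null-model fit stays in float64 for numerical stability, and its results are cast to whatever precision the test runs in. The helper functions here are hypothetical.

```python
import numpy as np
import tensorflow as tf
from gpflow.config import default_float  # e.g. float32 when the test runs on GPU

def fit_null_model_float64(counts):
    # Hypothetical null-model fit: always carried out in float64.
    counts = np.asarray(counts, dtype=np.float64)
    return counts.mean(), counts.var()

def null_params_for_test(counts):
    # Cast the float64 fit to the precision used by the score test.
    mu, var = fit_null_model_float64(counts)
    return (tf.constant(mu, dtype=default_float()),
            tf.constant(var, dtype=default_float()))
```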
Disabling caching is useful to reduce the memory footprint when testing many
data points. It would be nice to do this automatically, but TensorFlow
doesn't provide a way to get the total available memory, so...
this removes all the useless clutter introduced by autosummary