metaUSAT is a data-adaptive statistical approach for testing genetic associations of multiple traits from single/multiple studies using univariate GWAS summary statistics. This multivariate meta-analysis method can appropriately account for overlapping samples (if any) and can potentially test binary and/or continuous traits. The R function metausat
implements this association test. For details of this statistical method, please refer/cite:
Ray, D., Boehnke, M. "Methods for meta-analysis of multiple traits using GWAS summary statistics". Genetic Epidemiology, 42(2):134-145, 2018. PMID: 29226385
"The power of multivariate association test depends on a complex interplay of the number of truly associated traits, their correlation structure and the directions of the signals. The underlying association scenario changes from one genetic variant to another, and is not known a priori for any real dataset. There is no uniformly most powerful multivariate test, and a particular choice of association test may not be powerful enough to detect true signals. metaUSAT, being data-adaptive in nature, is less affected by the true (unknown) association scenario, and proves to be a robust yet computationally efficient choice for investigators."
Key Words: Cross-phenotype association; GWAS; Meta-analysis; Multiple traits; Multivariate analysis; Overlapping samples; PheWAS; Pleiotropy; Score test; Summary statistics
R (>= 3.0.1), CompQuadForm, minqa, psych, survey
require(devtools)
source_url("https://github.com/RayDebashree/metaUSAT/blob/master/metaUSAT_v1.17.R?raw=TRUE")
It is recommended to download/copy the stand-alone R program in this repository, save it in your local directory of choice and source()
it from your local directory. When a new version of the software is available, older versions may be removed from this repository, and the above devtools::source_url()
technique may not work.
Version 1.17 - December 11, 2017
First public release of the software.
-
The method metaUSAT and its software is designed to test association of multiple traits (categorical and/or continuous) from a single study, or a single trait from multiple studies, or multiple traits from multiple studies. It only requires individual trait summary statistics for testing joint association of traits with a single genetic variant.
- Caution: metaUSAT is a single-variant association test; so not expected to work well for rare variants (i.e., genetic variants with very low allele-frequencies).
- Caution: metaUSAT does “multivariate meta-analysis”, which is not exactly the same as what we understand about meta-analysis for a single trait. For meta-analysis of a single trait, we have summary statistics from multiple studies and we combine these summary statistics into a single summary measure for the trait. The multivariate meta-analysis implemented by metaUSAT does not combine the summary statistics for the two (or more) traits into a single trait; it merely analyzes them jointly.
-
metaUSAT uses the summary statistics for a given genetic variant and the estimated correlation matrix to test association. If one or more studies have overlapping samples/individuals (which may or may not be known), the estimated correlation matrix reflects this overlap, and metaUSAT can appropriately account for that.
- Caution: the joint analysis framework on which metaUSAT or metaMANOVA depends does not work well when the overlap between studies is large.
-
Since metaUSAT uses only summary statistics, it is assumed that all necessary covariate adjustments were performed when the individual trait summary statistics were obtained.
- Caution: Harmonize the same effect allele across the two or more studies/traits so that Z-scores from all the datasets can be jointly analyzed appropriately using metaUSAT. This is a fairly standard procedure in meta-analysis or other procedures using summary data from two or more studies/traits.
-
metaUSAT does not require independence of samples. When samples are related (e.g., in family-based GWAS), metaUSAT can use the summary statistics from EMMAX (or other univariate mixed model framework or an appropriate family-based association test) to appropriately test for genetic associations.
-
metaUSAT does not assume homogeneity of trait effects across studies, hence powerful in detecting association if trait effects are heterogeneous across studies. However, if the studies are nearly independent and the trait effects are believed to be homogeneous across studies, you may use meta-analyzed summary statistics for each trait (e.g., Z-statistic output from METAL) to perform joint meta-analysis of multiple traits. This may increase the power of the test by reducing the degrees of freedom of the test.
-
If you receive an error message like
the integral is probably divergent
, try reducing the absolute tolerance parameterAbsTol
. -
A user kindly pointed out a warning message displayed when
weights
are provided with version 1.17:
In if (weights != 1) { :
the condition has length > 1 and only the first element will be used
You can ignore this message (as long as the first weight is not 1, which would mean your sample size is 1); the weights you provided will be appropriately used inside the function. If the sample sizes are different across traits/studies, provide square root of sample sizes asweights
, else the default isweights=1
.