JAMPred

This function runs JAMPred by analysing many SNP blocks independently in parallel.

JAMPred(
  marginal.betas = NULL,
  n.training = NULL,
  marginal.logor.ses = NULL,
  p.cases.training = NULL,
  ref.geno = NULL,
  total.snps.genome.wide = NULL,
  n.cores = NULL,
  n.mil.iter = 0.2,
  beta.binom.b.lambda = 1,
  beta.binom.a = 1,
  snps.blocks = NULL,
  initial.block.size = 100,
  initial.snps.blocks = NULL,
  seed = NULL,
  thinning.factor = round(1000/n.mil.iter),
  effect.sd.uniform.prior = c(0.05, 2),
  residual.var.invgamma.prior = c(0.01, 0.01),
  save.path = NULL,
  debug = FALSE
)

Arguments

marginal.betas: Vector of named one-at-a-time SNP effects. NB: This must be a named vector; it is where. the SNP names are derived from. These could be from linear regressions, or log-Odds Ratios. NB: If providing log-ORs, you must also provide marginal.logor.ses and p.cases.training so that a linear transformaiton can be applied.
n.training: The sample size in which the marginal.betas were calculated.
marginal.logor.ses: IF marginal log-ORs were provided, please provide a named vector of standard errors here. These are necessary to convert them to a linear scale. Do not set if providing linear regression estimates.
p.cases.training: IF marginal log-ORs were provided, please provide the proportion of cases in the sample in which they were calculated
ref.geno: Reference genotype matrix which will be used to calculate SNP-SNP correlations. Individual's genotype must be coded as a numeric risk allele count 0/1/2. Non-integer values reflecting imputation uncertaimnty may be given. NB: The risk allele coding MUST correspond to that used in marginal.betas and column names must match the names of marginal.betas elements.
total.snps.genome.wide: Total number of SNPs being analysed across the genome. Used to determine the level of prior sparsity.
n.cores: Number of CPU cores to parralelise the SNP blocks over. If unspecified the number available will be detected. NB: On Windows systems this will be forced to 1, since mclapply does not yet support forking.
n.mil.iter: Number of million iterations to run. In the paper we found 0.2 was sufficient to reach adequeate convergence.
beta.binom.b.lambda: Spartisty tuning parameter. Suggest a range of values is tried, e.g. 0.01, 0.1, 1, 10. See the paper for more detail. Default is 1.
beta.binom.a: First hyper-parameter of the beta-binomial prior on model sparsity. See the paper for more detail. Default 1 can usually be used.
snps.blocks: OPTIONAL: A partitioning of the SNPs into blocks, created using the function R2BGLiMS-JAMPred_SplitIntoPositiveDefiniteBlocks. This is automatically generated if not provided.
initial.block.size: Block size for the initial partitioning. Default is 100. This is ignored when initial.snps.blocks is given.
initial.snps.blocks: A user specified initial partitioning of the SNPs. This must be a list, each element of which is a vector of SNP names that correspond to the columns of ref.geno. This could be defined, for example, according to LD information.
seed: An integer specifying the RJMCMC seed. If not set a random number will be used every time this is run.
thinning.factor: Determines every ith iteration to use from the RJMCMC sample. The more thinning that is applied the quicker the analysis will be, for the same number of iterations. Leaving at the default should be fine.
effect.sd.uniform.prior: Upper and lower uniform hyper-parameters for the prior on the standard deviation of SNP effects. Default is what we used in the paper, a Uniform(0.05, 2) distribution.
residual.var.invgamma.prior: Hyper-parameters for the inversegamma prior on the residual variance. Default is what we used in the paper, a Inverse-Gamma(0.01, 0.01) distribution.
save.path: Optional path for JAM to store and save its temporary files in. Can help with debugging (default NULL).
debug: An option passed to JAM requesting more verbose output (default FALSE).

Value

A JAMPred results object, which is a list including as elements step1.posterior.mean.snp.weights (which do not adjust for long range LD) and step2.posterior.mean.snp.weights (which do adjust for long range LD). These SNP weights can be used to generate predictions from individual level genotype data.

Author

Paul Newcombe

Arguments

Value

See also

Author