當前位置：首頁 > 编程资源 > 综合教程 >内容正文

综合教程

DESeq2包

發布時間：2024/8/26 综合教程 42 生活家

生活随笔收集整理的這篇文章主要介紹了 DESeq2包小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

1）簡介：

DESeq2-package： for differential analysis of count data（對count data 做差異分析）

2）安裝

if("DESeq2" %in% rownames(installed.packages()) == FALSE) {source("http://bioconductor.org/biocLite.R");biocLite("DESeq2")}
suppressMessages(library(DESeq2))
ls('package:DESeq2')

3）對象的使用說明

3.1）coef（Extract a matrix of model coef?cients/standard errors，高級用戶檢驗模型系數）

語法：coef(object, SE = FALSE, ...)

參數解釋：

object：a DESeqDataSet returned by DESeq, nbinomWaldTest, or nbinomLRT.

例子：

dds <- makeExampleDESeqDataSet(m=4)
dds <- DESeq(dds)
coef(dds)[1,]
coef(dds, SE=TRUE)[1,]

3.2） collapseReplicates：Collapse technical replicates in a RangedSummarizedExperiment or DESeqDataSet(用于消除技術重復)
用法：collapseReplicates(object, groupby, run, renameCols = TRUE)
參數：

object：A RangedSummarizedExperiment or DESeqDataSet
groupby：a grouping factor, as long as the columns of object，分組因子
run：optional, the names of each unique column in object. if provided, a new column runsCollapsed will be added to the colData which pastes together the names of run （測序run）
renameCols：whether to rename the columns of the returned object using the levels of the grouping factor

例子：

dds <- makeExampleDESeqDataSet(m=12)
str(dds)
dds$sample <- factor(sample(paste0("sample",rep(1:9, c(2,1,1,2,1,1,2,1,1))))) （#共9個樣品：其中 3個樣品有2個技術重重）
dds$run <- paste0("run",1:12) #12個run道
ddsColl <- collapseReplicates(dds, dds$sample, dds$run)
# examine the colData and column names of the collapsed data
colData(ddsColl)
colnames(ddsColl)
# check that the sum of the counts for "sample1" is the same
# as the counts in the "sample1" column in ddsColl
matchFirstLevel <- dds$sample == levels(dds$sample)[1]
stopifnot(all(rowSums(counts(dds[,matchFirstLevel])) == counts(ddsColl[,1])))

3.3）counts：Accessors for the ’counts’ slot of a DESeqDataSet object（對表達矩陣進行統計，）

one row for each observational unit (gene or the like), and one column for each sample(行代表觀察值(例如基因)，列代表樣本(例如肝、脾、腎等))

語法:counts(object, normalized = FALSE,replaced = FALSE)

參數:

object:a DESeqDataSet object(表達矩陣).
normalized：logical indicating whether or not to divide the counts by the size factors or normalization factors before returning (normalization factors always preempt size factors)，(即不同量級的數據要不要歸一化)
replaced：返回極端值

dds <- makeExampleDESeqDataSet(m=4)  ##構建一個表達矩陣
head(counts(dds))
dds <- estimateSizeFactors(dds) # run this or DESeq() first  
head(counts(dds, normalized=TRUE))

3.4）DESeq：Differential expression analysis based on the Negative Binomial (a.k.a.Gamma-Poisson) distribution（基于負二項分布進行差異分析）

語法：

DESeq(object, test = c("Wald", "LRT"), fitType = c("parametric", "local","mean"), sfType = c("ratio", "poscounts", "iterate"), betaPrior,full = design(object), reduced, quiet = FALSE,minReplicatesForReplace = 7, modelMatrixType, useT = FALSE, minmu = 0.5,
parallel = FALSE, BPPARAM = bpparam())

參數：

object：a DESeqDataSet object（表達矩陣對象）
test：Wald" or "LRT"檢驗
fitType：either "parametric", "local", or "mean"
sfType：either "ratio", "poscounts", or "iterate" for teh type of size factor estimation.
betaPrior：whether or not to put a zero-mean normal prior on the non-intercept coef?cients
reduced：for test="LRT", a reduced formula to compare against
quiet：whether to print messages at each step
minReplicatesForReplace：the minimum number of replicates required
modelMatrixType：either "standard" or "expanded", which describe how the model matrix, X of the GLM formula is formed.
useT：logical, passed to nbinomWaldTest, default is FALSE
minmu：lower bound on the estimated count for ?tting gene-wise dispersion
parallel：if FALSE, no parallelization. if TRUE, parallel execution using BiocParallel,
BPPARAM：an optional parameter object passed internally to bplapply when parallel=TRUE.
例子：

# count tables from RNA-Seq data
cnts <- matrix(rnbinom(n=1000, mu=100, size=1/0.5), ncol=10)
cond <- factor(rep(1:2, each=5))

# object construction
dds <- DESeqDataSetFromMatrix(cnts, DataFrame(cond), ~ cond)

# standard analysis
dds <- DESeq(dds)
res <- results(dds)

# moderated log2 fold changes
resultsNames(dds)
resLFC <- lfcShrink(dds, coef=2, type="apeglm")

# an alternate analysis: likelihood ratio test
ddsLRT <- DESeq(dds, test="LRT", reduced= ~ 1)
resLRT <- results(ddsLRT)

3.5）DESeqDataSet-class（DESeqDataSet object and constructors）

語法：

DESeqDataSet(se, design, ignoreRank = FALSE)
DESeqDataSetFromMatrix(countData, colData, design, tidy = FALSE,ignoreRank = FALSE, ...)
DESeqDataSetFromHTSeqCount(sampleTable, directory = ".", design,ignoreRank = FALSE, ...)
DESeqDataSetFromTximport(txi, colData, design, ...)

例子：

countData <- matrix(1:100,ncol=4)
condition <- factor(c("A","A","B","B"))
dds <- DESeqDataSetFromMatrix(countData, DataFrame(condition), ~ condition)

3.6）DESeqResults-class：DESeqResults object and constructor

語法：DESeqResults(DataFrame, priorInfo = list())

參數：

DataFrame：a DataFrame of results, standard column names are: baseMean, log2FoldChange,lfcSE, stat, pvalue, padj.
priorInfo：a list giving information on the log fold change prior

3.7）DESeqTransform-class（DESeqTransform object and constructor）

語法：DESeqTransform(SummarizedExperiment)

參數：SummarizedExperiment a RangedSummarizedExperiment

3.8）rlog Apply a ’regularized log’ transformation

用法：
rlog(object, blind = TRUE, intercept, betaPriorVar, fitType = "parametric")
rlogTransformation(object, blind = TRUE, intercept, betaPriorVar,fitType = "parametric")

dds <- makeExampleDESeqDataSet(m=6,betaSD=1)
rld <- rlog(dds)
dists <- dist(t(assay(rld)))
plot(hclust(dists))

3.9）plotPCA（Sample PCA plot for transformed data）

用法：plotPCA(object, intgroup = "condition",ntop = 500, returnData = FALSE)

參數：

object：a DESeqTransform object, with data in assay(x), produced for example by either rlog or varianceStabilizingTransformation.
intgroup： interesting groups: a character vector of names in colData(x) to use for grouping
ntop：number of top genes to use for principal components, selected by highest row variance
returnData：should the function only return the data.frame of PC1 and PC2 with intgroup covariates for custom plotting

# using rlog transformed data:
dds <- makeExampleDESeqDataSet(betaSD=1)
rld <- rlog(dds)
plotPCA(rld)

# also possible to perform custom transformation:
dds <- estimateSizeFactors(dds)
# shifted log of normalized counts
se <- SummarizedExperiment(log2(counts(dds, normalized=TRUE) + 1),
colData=colData(dds))
# the call to DESeqTransform() is needed to
# trigger our plotPCA method.
plotPCA( DESeqTransform( se ) )

3.10）

總結

以上是生活随笔為你收集整理的DESeq2包的全部內容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯，歡迎將生活随笔推薦給好友。

日韩av黄I国产麻豆传媒I国产91av视频在线观看I日韩一区二区三区在线看I美女国产在线I麻豆视频国产在线观看I成人黄色短片

综合教程

DESeq2包

1）簡介：

總結