Processing Oncomine data with ROncomine

  1. Introduction

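Oncomine is an excellent cancer database: it is powerful and offers interactive visualization. Its biggest drawbacks are that the underlying data are hard to export and that the built-in figures are ugly (and available only as PNG). There are many Oncomine tutorials; a classic one is from 解螺旋, which covers most of the commonly used features.

Fortunately, yikeshu0611 quietly did the hard work for us and developed an R package called ROncomine, which makes it easy to retrieve the data and redraw the figures. The package has recently been updated and renamed oncomineR; I have not studied the new version yet, so this post sticks to the older ROncomine.

The package used to be hosted on GitHub (which, for well-known reasons, is hard to reach from mainland China), but that repository has since been deleted. Luckily I had already imported it into my Gitee (码云) account, so it can still be installed quickly.

There also used to be a tutorial titled "破解oncomine无法免费下载数据,学员开发了一款R语言包" (roughly, "Getting around Oncomine's lack of free data downloads: a student developed an R package"), but it has been taken down as well, presumably because of copyright concerns. So I am quietly writing this tutorial for my own reference.

First install devtools, then call its install_git function to install the package directly; after that everything is straightforward.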

install.packages('devtools')

devtools::install_git('https://gitee.com/swcyo/oncomineR')

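In fact, the data behind every Oncomine figure can be obtained through the browser. Log in with Chrome, right-click anywhere on the page and choose Inspect to open the developer tools, then press Ctrl+F and search for <map. Everything we need is inside that <map element: right-click it, copy it, and save it in a new plain-text file.

oncomineR simply post-processes the <map markup copied from the inspector, which saves a lot of manual work, and then calls plotly to draw the figures. You can of course redraw them with ggplot2 instead.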
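
To make the idea of post-processing a <map block concrete, here is a minimal base-R sketch. It is not the package's actual implementation: the toy HTML, the `title="sample=...; value=..."` layout, and the helper `get_field` are all assumptions made up for illustration, and the real Oncomine markup may look quite different.

```r
# Toy image-map markup. The attribute layout is hypothetical and is NOT
# copied from Oncomine; it only illustrates the parsing idea.
html <- paste0(
  '<map name="demo">',
  '<area shape="rect" coords="0,0,10,10" title="sample=A; value=1.5">',
  '<area shape="rect" coords="10,0,20,10" title="sample=B; value=2.25">',
  '</map>'
)

# Extract every title="..." attribute, then strip the wrapper text.
titles <- regmatches(html, gregexpr('title="[^"]*"', html))[[1]]
titles <- sub('^title="', '', sub('"$', '', titles))

# Split each "key=value; key=value" string into its fields.
fields <- strsplit(titles, ";\\s*")

# Hypothetical helper: return the value stored under `key` in one record.
get_field <- function(x, key) {
  prefix <- paste0(key, "=")
  sub(paste0("^", prefix), "", x[startsWith(x, prefix)])
}

parsed <- data.frame(
  sample = vapply(fields, get_field, character(1), key = "sample"),
  value  = as.numeric(vapply(fields, get_field, character(1), key = "value")),
  stringsAsFactors = FALSE
)
print(parsed)
```

Once the values sit in a data frame like `parsed`, any plotting library can be used on them, which is the point of saving the <map block in the first place.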

  2. Summary of a single gene across diseases (Gene Summary)

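Following the 解螺旋 tutorial, search for the CXCL8 gene (listed in Oncomine under the name IL8). A heat map like the following appears on the right:

Copy the <map block and save it as heatmap.txt; it can then be processed with Oncomine_heatmap_DiseaseSummaryfor_SomeGene.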

library(ROncomine)
heat<-Oncomine_heatmap_DiseaseSummaryfor_SomeGene('/Users/mac/Documents/GitHub/myblog/content/post/2021-08-19-oncomine/heatmap.txt') # store the result as heat
knitr::kable(heat) ## only renders the table for this tutorial; you do not need to run it
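|    | Cancer Type          | Analysis Type     | meat threshold | meat intotal | expression |
|:---|:---------------------|:------------------|---------------:|-------------:|:-----------|
| X1 | Bladder Cancer | Cancer Histology | 3 | 18 | over |
| X2 | Bladder Cancer | Cancer Histology | 3 | 18 | under |
| X3 | Bladder Cancer | Multi-cancer | 1 | 9 | over |
| X4 | Bladder Cancer | Outlier | 4 | 12 | over |
| X5 | Bladder Cancer | Outlier | 2 | 12 | under |
| X6 | Brain and CNS Cancer | Cancer vs. Normal | 1 | 36 | over |
| X7 | Brain and CNS Cancer | Cancer Histology | 3 | 66 | over |
| X8 | Brain and CNS Cancer | Cancer Histology | 2 | 66 | under |
| X9 | Brain and CNS Cancer | Outlier | 16 | 37 | over |
| X10 | Brain and CNS Cancer | Outlier | 7 | 37 | under |
| X11 | Breast Cancer | Multi-cancer | 2 | 23 | under |
| X12 | Breast Cancer | Outlier | 29 | 68 | over |
| X13 | Breast Cancer | Outlier | 5 | 68 | under |
| X14 | Cervical Cancer | Cancer vs. Normal | 4 | 10 | over |
| X15 | Cervical Cancer | Outlier | 2 | 8 | over |
| X16 | Colorectal Cancer | Cancer vs. Normal | 18 | 35 | over |
| X17 | Colorectal Cancer | Cancer Histology | 1 | 32 | over |
| X18 | Colorectal Cancer | Cancer Histology | 1 | 32 | under |
| X19 | Colorectal Cancer | Multi-cancer | 4 | 24 | over |
| X20 | Colorectal Cancer | Outlier | 3 | 35 | over |
| X21 | Colorectal Cancer | Outlier | 4 | 35 | under |
| X22 | Esophageal Cancer | Cancer vs. Normal | 3 | 11 | over |
| X23 | Esophageal Cancer | Cancer Histology | 1 | 8 | over |
| X24 | Esophageal Cancer | Cancer Histology | 1 | 8 | under |
| X25 | Esophageal Cancer | Multi-cancer | 1 | 8 | over |
| X26 | Esophageal Cancer | Outlier | 3 | 12 | over |
| X27 | Esophageal Cancer | Outlier | 1 | 12 | under |
| X28 | Gastric Cancer | Cancer vs. Normal | 2 | 23 | over |
| X29 | Gastric Cancer | Cancer Histology | 1 | 29 | over |
| X30 | Gastric Cancer | Cancer Histology | 1 | 29 | under |
| X31 | Gastric Cancer | Outlier | 1 | 15 | over |
| X32 | Gastric Cancer | Outlier | 3 | 15 | under |
| X33 | Head and Neck Cancer | Cancer vs. Normal | 4 | 32 | over |
| X34 | Head and Neck Cancer | Cancer Histology | 1 | 14 | over |
| X35 | Head and Neck Cancer | Multi-cancer | 2 | 10 | over |
| X36 | Head and Neck Cancer | Multi-cancer | 1 | 10 | under |
| X37 | Head and Neck Cancer | Outlier | 4 | 22 | over |
| X38 | Head and Neck Cancer | Outlier | 6 | 22 | under |
| X39 | Kidney Cancer | Cancer Histology | 1 | 44 | over |
| X40 | Kidney Cancer | Cancer Histology | 1 | 44 | under |
| X41 | Kidney Cancer | Multi-cancer | 1 | 18 | over |
| X42 | Kidney Cancer | Multi-cancer | 1 | 18 | under |
| X43 | Kidney Cancer | Outlier | 10 | 18 | over |
| X44 | Kidney Cancer | Outlier | 2 | 18 | under |
| X45 | Leukemia | Cancer vs. Normal | 1 | 39 | over |
| X46 | Leukemia | Cancer vs. Normal | 3 | 39 | under |
| X47 | Leukemia | Cancer Histology | 6 | 117 | over |
| X48 | Leukemia | Cancer Histology | 4 | 117 | under |
| X49 | Leukemia | Outlier | 14 | 56 | over |
| X50 | Leukemia | Outlier | 12 | 56 | under |
| X51 | Liver Cancer | Cancer vs. Normal | 2 | 13 | over |
| X52 | Liver Cancer | Outlier | 2 | 15 | over |
| X53 | Liver Cancer | Outlier | 7 | 15 | under |
| X54 | Lung Cancer | Multi-cancer | 2 | 22 | over |
| X55 | Lung Cancer | Outlier | 10 | 36 | over |
| X56 | Lung Cancer | Outlier | 4 | 36 | under |
| X57 | Lymphoma | Cancer vs. Normal | 2 | 36 | under |
| X58 | Lymphoma | Multi-cancer | 5 | 17 | under |
| X59 | Lymphoma | Outlier | 18 | 34 | over |
| X60 | Lymphoma | Outlier | 2 | 34 | under |
| X61 | Melanoma | Cancer Histology | 1 | 4 | over |
| X62 | Melanoma | Cancer Histology | 1 | 4 | under |
| X63 | Melanoma | Multi-cancer | 1 | 18 | over |
| X64 | Melanoma | Outlier | 7 | 22 | over |
| X65 | Melanoma | Outlier | 4 | 22 | under |
| X66 | Myeloma | Cancer Histology | 1 | 13 | over |
| X67 | Myeloma | Cancer Histology | 1 | 13 | under |
| X68 | Myeloma | Multi-cancer | 2 | 6 | under |
| X69 | Myeloma | Outlier | 2 | 16 | over |
| X70 | Myeloma | Outlier | 3 | 16 | under |
| X71 | Other Cancer | Cancer vs. Normal | 3 | 32 | over |
| X72 | Other Cancer | Cancer Histology | 1 | 32 | under |
| X73 | Other Cancer | Outlier | 12 | 36 | over |
| X74 | Other Cancer | Outlier | 2 | 36 | under |
| X75 | Ovarian Cancer | Outlier | 5 | 19 | over |
| X76 | Ovarian Cancer | Outlier | 2 | 19 | under |
| X77 | Pancreatic Cancer | Cancer vs. Normal | 3 | 12 | over |
| X78 | Pancreatic Cancer | Outlier | 1 | 14 | over |
| X79 | Pancreatic Cancer | Outlier | 1 | 14 | under |
| X80 | Prostate Cancer | Outlier | 7 | 30 | over |
| X81 | Prostate Cancer | Outlier | 1 | 30 | under |
| X82 | Sarcoma | Cancer Histology | 2 | 101 | over |
| X83 | Sarcoma | Cancer Histology | 1 | 101 | under |
| X84 | Sarcoma | Multi-cancer | 2 | 13 | under |
| X85 | Sarcoma | Outlier | 9 | 25 | over |
| X86 | Sarcoma | Outlier | 2 | 25 | under |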

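This quickly extracts all the data we need. To keep only the Cancer vs. Normal rows, for example, you can filter in R, edit interactively with DataEditR, or export the table and work on it in Excel.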

# cn<-heat[heat$`Analysis Type` == 'Cancer vs. Normal',]
# DataEditR::data_edit(heat) # interactive editing is another option
cn<-read.csv('/Users/mac/Documents/GitHub/myblog/content/post/2021-08-19-oncomine/cn.csv')
knitr::kable(cn)
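| X   | Cancer.Type          | Analysis.Type.    | meat.threshold | meat.intotal | expression |
|:----|:---------------------|:------------------|---------------:|-------------:|:-----------|
| X6 | Brain and CNS Cancer | Cancer vs. Normal | 1 | 36 | over |
| X14 | Cervical Cancer | Cancer vs. Normal | 4 | 10 | over |
| X16 | Colorectal Cancer | Cancer vs. Normal | 18 | 35 | over |
| X22 | Esophageal Cancer | Cancer vs. Normal | 3 | 11 | over |
| X28 | Gastric Cancer | Cancer vs. Normal | 2 | 23 | over |
| X33 | Head and Neck Cancer | Cancer vs. Normal | 4 | 32 | over |
| X45 | Leukemia | Cancer vs. Normal | 1 | 39 | over |
| X46 | Leukemia | Cancer vs. Normal | 3 | 39 | under |
| X51 | Liver Cancer | Cancer vs. Normal | 2 | 13 | over |
| X57 | Lymphoma | Cancer vs. Normal | 2 | 36 | under |
| X71 | Other Cancer | Cancer vs. Normal | 3 | 32 | over |
| X77 | Pancreatic Cancer | Cancer vs. Normal | 3 | 12 | over |
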
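
The filtering step can also be done entirely in base R. The sketch below uses a small hand-typed data frame, `heat_demo`, holding three rows copied from the output above, so it runs without the package. It assumes that the analysis-type column name carries a trailing space; this is only a guess based on the `Analysis.Type.` name that read.csv produced, and it has not been checked against the package itself. Trimming the names first makes the filter work either way.

```r
# Three rows copied from the heat table above; heat_demo is a stand-in,
# not the real object returned by the package.
heat_demo <- data.frame(
  `Cancer Type`    = c("Bladder Cancer", "Brain and CNS Cancer", "Cervical Cancer"),
  `Analysis Type ` = c("Cancer Histology", "Cancer vs. Normal", "Cancer vs. Normal"),
  expression       = c("over", "over", "over"),
  check.names = FALSE,
  stringsAsFactors = FALSE
)

# Assumption: a column name may contain stray whitespace, so trim all names.
names(heat_demo) <- trimws(names(heat_demo))

# Keep only the Cancer vs. Normal comparisons.
cn_demo <- heat_demo[heat_demo[["Analysis Type"]] == "Cancer vs. Normal", ]
print(cn_demo)
```

The same two lines (trim the names, then subset) should apply to the full `heat` object if its column names behave as assumed here.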
  3. Comparing normal and cancer tissue (Differential Analysis)

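Continuing with the 解螺旋 example, use Oncomine to look up the datasets in which CXCL8 is over-expressed and under-expressed in colorectal cancer tissue (versus normal). Filter settings: P-value: 1E-4; Fold Change: 3; Gene Rank: Top 10%.

Note that a sub-analysis is selected by default. In Kaiser Colon, for example, the cursor sits on Colon Mucinous Adenocarcinoma vs. Normal, which compares only that tumor subtype with normal tissue, and the GROUP BY control under Differential Analysis is greyed out. To see everything, click Kaiser Colon itself; the GROUP BY drop-down then becomes available, and we choose Cancer and Normal Type. The GROUP list now shows many groups, that is, many more data points, and all of them can be extracted.

The default view is a bar plot; hovering over a bar shows its value, which is exactly the number we need.

There is also a box-plot icon; click it to switch to a box plot, and hover over a box to see its statistics.

In the same way, copy the <map block and save it as a text file, named box.txt here. The bar and box views expose the same underlying data, so we copy from the bar view and then use the Oncomine_bar function.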

box<-Oncomine_bar('/Users/mac/Documents/GitHub/myblog/content/post/2021-08-19-oncomine/box.txt')
knitr::kable(box)
| Expression value | Cancer Type | Sample Name | Normal Tissue Type | Legend Value |
|-----------------:|:------------|:------------|:-------------------|:-------------|
| 3.509 | Colon Small Cell Carcinoma | T4360A3 | No value | No value |
| 4.205 | Colon Signet Ring Cell Adenocarcinoma | T5245A1 | No value | No value |
| 4.407 | Rectosigmoid Mucinous Adenocarcinoma | T940A | No value | No value |
| 4.575 | Colon Small Cell Carcinoma | T4360A2 | No value | No value |
| 5.109 | Rectal Signet Ring Cell Adenocarcinoma | T924C | No value | No value |
| 5.867 | Colon Signet Ring Cell Adenocarcinoma | T5002A1 | No value | No value |
| 6.110 | Rectosigmoid Mucinous Adenocarcinoma | T4980A1 | No value | No value |
| 0.188 | Cancer | N552G | Colon | Colon |
| 0.315 | Cancer | N1369A | Colon | Colon |
| 0.533 | Cancer | N773A1 | Colon | Colon |
| 1.020 | Cancer | N1102A | Colon | Colon |
| 1.160 | Cancer | N2367A | Colon | Colon |
| 2.732 | Cancer | T4452A1 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 2.889 | Cancer | T4550B | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 2.944 | Cancer | T902A | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 3.397 | Cancer | T4834A1 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 3.453 | Cancer | T4354H | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 4.437 | Cancer | T4544A1 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 4.492 | Cancer | T4926A1 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 4.597 | Cancer | T5376A1 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 4.623 | Cancer | T4452A2 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 4.724 | Cancer | T4452A | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 4.801 | Cancer | T4452A3 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 5.183 | Cancer | T5133A | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 5.315 | Cancer | T4452A4 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 5.407 | Cancer | T5024A1 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 5.897 | Cancer | T4573A1 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 5.981 | Cancer | T4984A1 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 6.215 | Cancer | T5565A1 | Cecum Adenocarcinoma | Cecum Adenocarcinoma |
| 1.644 | Cancer | T5811A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 2.586 | Cancer | T4174A | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 2.747 | Cancer | T4701A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 3.043 | Cancer | T519A | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 3.377 | Cancer | T6190A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 3.553 | Cancer | T5287A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 3.634 | Cancer | T4660A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 3.748 | Cancer | T573A | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 3.838 | Cancer | T740A | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 3.877 | Cancer | T4373B1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 3.927 | Cancer | T5573A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 3.931 | Cancer | T826A | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 3.982 | Cancer | T4448A | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.052 | Cancer | T4612A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.061 | Cancer | T5266A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.161 | Cancer | T433A | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.168 | Cancer | T4376A | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.173 | Cancer | T5589B1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.188 | Cancer | T4475A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.294 | Cancer | T5164A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.395 | Cancer | T4956A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.441 | Cancer | T4975A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.478 | Cancer | T773A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.566 | Cancer | T4373B2 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.761 | Cancer | T949B | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.810 | Cancer | T4750A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.857 | Cancer | T4257A | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 4.956 | Cancer | T801A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.044 | Cancer | T5162A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.077 | Cancer | T5389A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.172 | Cancer | T4695A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.241 | Cancer | T4667A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.243 | Cancer | T4508A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.286 | Cancer | T4920A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.330 | Cancer | T5107B1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.614 | Cancer | T4373B3 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.638 | Cancer | T4541A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.674 | Cancer | T4489A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.819 | Cancer | T4555A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.852 | Cancer | T5063A | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 5.959 | Cancer | T5102A1 | Colon Adenocarcinoma | Colon Adenocarcinoma |
| 3.001 | Cancer | T5513A1 | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 3.836 | Cancer | T4948A2 | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 4.035 | Cancer | T4611B1 | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 4.225 | Cancer | T4799A1 | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 4.389 | Cancer | T932B1 | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 4.776 | Cancer | T5536C4 | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 4.838 | Cancer | T451A | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 4.872 | Cancer | T5261B1 | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 5.087 | Cancer | T4491A1 | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 5.304 | Cancer | T694C | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 5.676 | Cancer | T4644B1 | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 5.973 | Cancer | T625A | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 6.231 | Cancer | T4491A2 | Colon Mucinous Adenocarcinoma | Colon Mucinous Adenocarcinoma |
| 2.231 | Cancer | T551C | Rectal Adenocarcinoma | Rectal Adenocarcinoma |
| 2.638 | Cancer | T645A | Rectal Adenocarcinoma | Rectal Adenocarcinoma |
| 3.811 | Cancer | T521B | Rectal Adenocarcinoma | Rectal Adenocarcinoma |
| 4.441 | Cancer | T789A | Rectal Adenocarcinoma | Rectal Adenocarcinoma |
| 4.584 | Cancer | T74A | Rectal Adenocarcinoma | Rectal Adenocarcinoma |
| 4.669 | Cancer | T565A | Rectal Adenocarcinoma | Rectal Adenocarcinoma |
| 5.346 | Cancer | T462A | Rectal Adenocarcinoma | Rectal Adenocarcinoma |
| 5.547 | Cancer | T420A | Rectal Adenocarcinoma | Rectal Adenocarcinoma |
| 3.798 | Cancer | T4449E4 | Rectal Mucinous Adenocarcinoma | Rectal Mucinous Adenocarcinoma |
| 4.233 | Cancer | T4449E3 | Rectal Mucinous Adenocarcinoma | Rectal Mucinous Adenocarcinoma |
| 4.360 | Cancer | T4449E1 | Rectal Mucinous Adenocarcinoma | Rectal Mucinous Adenocarcinoma |
| 4.784 | Cancer | T4449E2 | Rectal Mucinous Adenocarcinoma | Rectal Mucinous Adenocarcinoma |
| 2.439 | Cancer | T5139A1 | Rectosigmoid Adenocarcinoma | Rectosigmoid Adenocarcinoma |
| 2.922 | Cancer | T4919A1 | Rectosigmoid Adenocarcinoma | Rectosigmoid Adenocarcinoma |
| 2.983 | Cancer | T656A | Rectosigmoid Adenocarcinoma | Rectosigmoid Adenocarcinoma |
| 3.072 | Cancer | T771A | Rectosigmoid Adenocarcinoma | Rectosigmoid Adenocarcinoma |
| 3.404 | Cancer | T866D | Rectosigmoid Adenocarcinoma | Rectosigmoid Adenocarcinoma |
| 3.846 | Cancer | T5430A1 | Rectosigmoid Adenocarcinoma | Rectosigmoid Adenocarcinoma |
| 4.396 | Cancer | T552G | Rectosigmoid Adenocarcinoma | Rectosigmoid Adenocarcinoma |
| 4.789 | Cancer | T981A | Rectosigmoid Adenocarcinoma | Rectosigmoid Adenocarcinoma |
| 5.055 | Cancer | T4603A1 | Rectosigmoid Adenocarcinoma | Rectosigmoid Adenocarcinoma |
| 5.815 | Cancer | T672A | Rectosigmoid Adenocarcinoma | Rectosigmoid Adenocarcinoma |
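
Once the per-sample values are in a data frame, group-level summaries take one line of base R. The sketch below uses a hand-typed subset of the values above (the five Colon samples and the first four Rectal Adenocarcinoma samples) in a stand-in data frame, `box_demo`, with simplified column names `value` and `group`. These names are chosen for illustration and are not the package's own; the grouping follows the Legend Value column shown in the output.

```r
# Hand-typed subset of the table above; box_demo and its column names
# are illustrative stand-ins, not objects produced by the package.
box_demo <- data.frame(
  value = c(0.188, 0.315, 0.533, 1.020, 1.160,   # Colon (normal) samples
            2.231, 2.638, 3.811, 4.441),         # Rectal Adenocarcinoma samples
  group = c(rep("Colon", 5), rep("Rectal Adenocarcinoma", 4)),
  stringsAsFactors = FALSE
)

# Median expression per group.
med <- aggregate(value ~ group, data = box_demo, FUN = median)
print(med)
```

On the real `box` object the same call should work after substituting the actual column names, quoted with backticks if they contain spaces.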

We can draw the figures with the package's own functions. There are two: Oncomine_bar_plot and Oncomine_box_plot.

Oncomine_bar_plot(box)
## Loading required package: ggplot2
## 
## Attaching package: 'plotly'
## The following object is masked from 'package:ggplot2':
## 
##     last_plot
## The following object is masked from 'package:stats':
## 
##     filter
## The following object is masked from 'package:graphics':
## 
##     layout
Oncomine_box_plot(box)

Of course, we can also plot with ggplot2.


```r
library(ggplot2)
ggplot(box,aes(`Cancer Type`,`Expression value`,color=`Cancer Type`))+
  geom_boxplot()+
  theme_bw(base_size = 12)+
  theme(axis.text.x = element_text(angle=90, hjust=1, vjust=.5))
```

欧阳松
Attending physician, lecturer