Abstract

The cell-specific information of transcriptional regulation on microRNAs (miRNAs) is crucial to the precise understanding of gene regulations in various physiological and pathological processes existed in different tissues and cell types. The database, mirTrans, provides comprehensive information about cell-specific transcription of miRNAs including the transcriptional start sites (TSSs) of miRNAs, transcription factor (TF) to miRNA regulations and miRNA promoter sequences. mirTrans also maps the experimental H3K4me3 and DHS (DNase-I hypersensitive site) marks within miRNA promoters and expressed sequence tags (ESTs) within transcribed regions. The current version of database covers 35 259 TSSs and over 2.3 million TF-miRNA regulations for 1513 miRNAs in a total of 54 human cell lines. These cell lines span most of the biological systems, including circulatory system, digestive system and nervous system. Information for both the intragenic miRNAs and intergenic miRNAs is offered. Particularly, the quality of miRNA TSSs and TF-miRNA regulations is evaluated by literature curation. 23 447 TSS records and 2148 TF-miRNA regulations are supported by special experiments as a result of literature curation. EST coverage is also used to evaluate the accuracy of miRNA TSSs. Interface of mirTrans is friendly designed and convenient to make downloads (http://mcube.nju.edu.cn/jwang/lab/soft/mirtrans/ or http://120.27.239.192/mirtrans/).

INTRODUCTION

The microRNA (miRNA) is a subset of non-coding RNAs regarded as ‘small RNAs of big roles’ (1). It prevents or suppresses protein translation by binding to the 3’UTR of mRNA. The target mRNA of miRNA is usually in vast numbers, which enables miRNA to participate in almost all the biological processes. The essential roles of miRNA in gene regulation attract huge research interests, which in turn drives the rapid development of miRNA study. In recent years, miRNAs are also widely recognized as the potential biomarker for clinical diagnoses, treatments and prognoses of various kinds of diseases, especially cancers (2,3). Till 2014, about twenty years after the first miRNA was found (4), 28645 hairpin precursor miRNAs (pre-miRNAs) and 35828 mature miRNAs from 223 species have been recorded in the most comprehensive repository of miRNAs, miRBase (version 21) (5). For human, 1881 pre-miRNAs and 2588 mature miRNAs were revealed.

A full understanding of miRNA function depends on the knowledge of miRNA regulation network, including the upstream transcriptional regulation and their downstream targets. Compared to the abundance of miRNA expression databases and target databases, the resource for regulating miRNA expression is far behind. The main reason is that the 5’ end of primary miRNA transcript is rapidly sliced by Drosha after transcription so that it is very hard to capture the full-length transcript of miRNA (6,7). This eventually lead to the difficulties in the gain of accurate miRNA transcriptional start sites (TSSs), miRNA promoters and the regulations of transcription factors (TFs) on the miRNA transcription (i.e. TF-miRNA regulations). Several approaches have been conducted by combining high throughput data with appropriate algorithms or literature mining. A few databases about miRNA transcription were built to provide information of either TF-miRNA regulation or miRNA TSSs. ChIPBase (8) provides abundant resources of TF-miRNA regulations and other TF-target regulations by parsing ChIP data. CircuitsDB (9) discovered TF-miRNA regulation circuits for human and mouse. TransmiR (10) offers highly reliable yet somewhat limited number (735 in total) of TF-miRNA regulations with literature curation. However, without accurate annotation of miRNA TSSs and promoters, it's hard to tell whether the parsed TF truly functions in the transcription of the miRNA that is downstream to the binding site of the TF (i.e. TFBS). While miRT (11) provides miRNA TSSs that are collected from a couple of experiments, no TF-miRNA regulations are offered. All of these databases developed earlier did not tackle the problem of cell-specificity of miRNA transcription.

Recent studies have shown that miRNAs expression is distinctly tissue-specific or cell-type specific, which are fundamental to tissue development and function maintenance (12). Increasing number of tissue specific miRNAs are reported to be disease associated (1316). These miRNAs could be of clinical potentials as biomarkers for the diagnoses, treatments and prognoses of diseases (2,3). So, the knowledge about cell-specific transcription of miRNAs is eagerly demanded to facilitate the investigation of miRNAs in fundamental biological aspects as well as in clinical applications. TSmiR (17) offers the regulations of TFs on tissue-specific miRNAs for 12 tissues based on ChIP-seq data. DIANA-miRGen (18) provides information of both miRNA TSSs and TF-miRNA regulations for 428 intergenic miRNAs in nine cell lines including human and mouse.

Here, we introduce mirTrans database to provide comprehensive information about cell-specific transcription of miRNA. mirTrans offers abundant cell-specific data which includes both miRNA TSSs and TF-miRNA regulations, together with some supportive data derived from other public resources [e.g. H3K4me3 peaks and DHS peaks]. Up to now, it contains 35 259 TSSs and 2 340 406 TF-miRNA regulations for 1513 miRNAs in 54 human cell lines. The 54 cell lines belong to 17 tissues which cover most of biological systems, including circulatory system, digestive system, nervous system, reproductive system and respiratory system. So, the database could be applied in quite a variety of biological or medical fields. Information of both intragenic miRNAs and intergenic miRNAs is reported. Specifically, quality evaluation on every miRNA TSS and TF-miRNA regulation is presented by literature evidence and EST evidence. Literature evidence gives the documentary support to miRNA TSS and TF-miRNA regulations, and EST evidence evaluates the accuracy of miRNA TSS by EST coverage. mirTrans also provides a convenient and friendly interface for customers. Flexible query is allowed by either one of the three types of keywords (i.e. miRNA, cell line and TF) or any combinations among them. Genomic distribution graph illustrates the intuitive view of miRNA cell-specific transcription. All the data in mirTrans are offered for free downloads.

RESULTS

Database content

mirTrans generally covers two types of data, i.e. the core data and the supportive data. The core data were produced on our own, which mainly include cell-specific miRNA TSSs and TF-miRNA regulations; the supportive data are derived from other resources, which are the primary materials for either the production or the quality evaluation of core data. Figure 1 illustrates how these data were produced or applied.

Figure 1.

Workflow for the production of core data.

Core data

Cell-specific miRNA TSSs

TSS data are produced through a python implementation that is developed on the bases of a method previously published by the authors (19). This implementation allows the input in the form of ‘.bed’, where the TSS-related segments are recorded. The ‘.bed’ file could be obtained when peak-calling tools [e.g. MACS2 (20) and Homer (21)] are used to catch the peak areas in TSS-related sequencing data (e.g. H3K4me3, Pol II and GRO/PRO-seq). With the help of the implementation, it is convenient to integrate multiple types of high-throughput data for the expansion of TSS data. In the future, TSS data in mirTrans will be updated once a certain amount of additional TSS-related data are collected. Up to now, the TSSs in mirTrans are predicted based on H3K4me3 and DHSs (19).

Usually, predictions on miRNA TSSs are focused on intergenic miRNAs, because intragenic miRNAs are considered to be co-transcribed with its host gene. However, more and more evidences have shown that intragenic miRNA is also capable of transcription independent to its host gene (22). Our implementation has a special module to tell whether the transcription is dependent or independent on its host gene, and also returned the right TSSs for every intragenic miRNA in certain cell lines.

In particular, mirTrans provides quality evaluations for all the TSSs, which is quite helpful for customers to make choice. The quality of TSSs is annotated in two ways. One is so called ‘ESTs (expressed sequence tags) evidence’, which evaluates TSS quality with the significance of EST coverage difference around TSS. EST is the expressed segment of transcripts. Theoretically, significant differences of EST coverage between the upstream and the downstream of TSS should be observed. The less the P value of EST evidence is, the higher the TSS quality is. The additional evaluation is termed as ‘literature evidence’, which annotates TSS quality by literature curation. ‘Strong evidence’ means the supports from literatures of traditional molecular experiments, while ‘evidence’ refers to the supports from GRO-Seq, CAGE and other high-throughput transcriptional signals (2325).

Cell-specific TF-miRNA regulations

TF-miRNA regulation is produced based on the data of miRNA TSS. In general, the promoter that is 1kb upstream to the cell-specific miRNA TSSs is scanned with geneXplain platform (http://platform.genexplain.com) in order to obtain the cell-specific transcription factor binding sites (TFBSs). In particular, for the cell lines with DHSs data, TFBSs are scanned on the DHS segments within the promoter, since DHSs data gives the marks for TFBS regions (26). TF-miRNA regulations are obtained when the related TFs are retrieved from TRANSFAC (27) by TFBSs. Since ChIP-seq data for TF binding is able to provide the experiment-proved TFBSs for certain TFs, it will be employed for the refinement of cell-specific TF-miRNA regulations in the future, as the updating of mirTrans.

Two parameters i.e. the affinity score and the conservation scores, are additionally offered for TFBSs. The affinity score which scales the affinity of TF binding to TFBS is calculated with MATCH (28) provided by TRANSFAC, while the conservation score which measures the conservation of TFBS sequence across species is calculated based on data of UCSC track ‘phastCons100way’ (29,30). Till now, >2.3 million cell specific TF-miRNA regulations are available in mirTrans.

The quality of cell-specific TF-miRNA regulations is also verified with literature evidences. The literature pool was constructed on the basis of TransmiR database (http://www.cuilab.cn/transmir) combined with new papers from PubMed during the year 2013–2017. The TF-miRNA regulations of ‘strong evidence’ are supported by promoter-related experiments, e.g. ChIP and luciferase report assays detecting the activity of promoters (31); while the regulations of ‘evidence’ are supported by gene expression experiments or the computational analysis based on expression data (32). The other TF-miRNA regulations recorded in this database were only based on DHS evidence.

Supportive data

So far, H3K4me3 peaks, DHSs peaks, ESTs and pre-miRNAs are the supportive data in mirTrans. H3K4me3 and DHSs data were collected from ENCODE (33), ESTs data were derived from UCSC (34), and pre-miRNAs were derived from miRBase (miRBase version 21). H3K4me3, DHSs, and pre-miRNA data are the source data for the production of cell-specific miRNA TSSs and cell-specific TF-miRNA regulations. ESTs are used to evaluate the quality of TSSs.

Most of the supportive data are graphically illustrated together with miRNA TSSs and TFBSs when miRNA TSSs are retrieved. Theoretically, H3K4me3 peak represents the location of miRNA TSS, DHSs peak represent the region of TFBSs, and EST the position of transcripts. Therefore, the genomic distributions of all these data gives the intuitive illustration of cell-specific transcription of miRNAs.

Database statistics

Up to now, 35 259 TSSs and 2 340 406 TF-miRNA regulations for 1513 pre-miRNAs (546 intergenic miRNA and 967 intragenic miRNA) in 54 human cell lines are recorded in mirTrans. Comparing to the previous comparable databases, mirTrans offers the most abundant repository for the cell-specific transcription of miRNAs (see Table 1). Two recent researches reported a large amount of miRNA transcriptional regulation data by using GRO/PRO-seq and CAGE techniques respectively (24,25). The statistics on the results of these two papers are also shown in Table 1. In addition, literature evidence is provided to indicate the data quality. So far, there are all together 23 447 TSS records and 2148 TF-miRNA regulations supported by the specified experimental evidences as described above and which are regarded as the most reliable data. Note that literature evidence will be increasingly accumulated as the update of mirTrans.

Summary of the contents in mirTrans and other comparable databases/data sets

Table 1.
Summary of the contents in mirTrans and other comparable databases/data sets
DatabaseCell linesmiRNAsmiRNA TSSsmiRNA TSSs with literature evidenceTF-miRNA regulationsTF-miRNA regulations with literature evidence
mirTrans541513 (546 intergenic / 967 intragenic)352592344723404062148
Liu Q et al. [24] (2017)27480 (intergenic)5839N/A18939
de Rie D et al. [25] (2017)841357 (371 intergenic / 986 intragenic)1118N/AN/AN/A
ChIPBase (2016)N/AunknownN/AN/A273761None
DIANA-miRGen (2016)9428 (intergenic)276NoneunknownNone
TSmiR (2014)12 tissues116N/AN/A2347None
CircuitsDB (2013)N/A180N/AN/A115 TFs to 180 miRNAsNone
miRT (2012)N/A588 (206 intergenic / 382 intragenic)670NoneN/AN/A
TransmiR (2009)N/A100N/AN/A735735
DatabaseCell linesmiRNAsmiRNA TSSsmiRNA TSSs with literature evidenceTF-miRNA regulationsTF-miRNA regulations with literature evidence
mirTrans541513 (546 intergenic / 967 intragenic)352592344723404062148
Liu Q et al. [24] (2017)27480 (intergenic)5839N/A18939
de Rie D et al. [25] (2017)841357 (371 intergenic / 986 intragenic)1118N/AN/AN/A
ChIPBase (2016)N/AunknownN/AN/A273761None
DIANA-miRGen (2016)9428 (intergenic)276NoneunknownNone
TSmiR (2014)12 tissues116N/AN/A2347None
CircuitsDB (2013)N/A180N/AN/A115 TFs to 180 miRNAsNone
miRT (2012)N/A588 (206 intergenic / 382 intragenic)670NoneN/AN/A
TransmiR (2009)N/A100N/AN/A735735
Table 1.
Summary of the contents in mirTrans and other comparable databases/data sets
DatabaseCell linesmiRNAsmiRNA TSSsmiRNA TSSs with literature evidenceTF-miRNA regulationsTF-miRNA regulations with literature evidence
mirTrans541513 (546 intergenic / 967 intragenic)352592344723404062148
Liu Q et al. [24] (2017)27480 (intergenic)5839N/A18939
de Rie D et al. [25] (2017)841357 (371 intergenic / 986 intragenic)1118N/AN/AN/A
ChIPBase (2016)N/AunknownN/AN/A273761None
DIANA-miRGen (2016)9428 (intergenic)276NoneunknownNone
TSmiR (2014)12 tissues116N/AN/A2347None
CircuitsDB (2013)N/A180N/AN/A115 TFs to 180 miRNAsNone
miRT (2012)N/A588 (206 intergenic / 382 intragenic)670NoneN/AN/A
TransmiR (2009)N/A100N/AN/A735735
DatabaseCell linesmiRNAsmiRNA TSSsmiRNA TSSs with literature evidenceTF-miRNA regulationsTF-miRNA regulations with literature evidence
mirTrans541513 (546 intergenic / 967 intragenic)352592344723404062148
Liu Q et al. [24] (2017)27480 (intergenic)5839N/A18939
de Rie D et al. [25] (2017)841357 (371 intergenic / 986 intragenic)1118N/AN/AN/A
ChIPBase (2016)N/AunknownN/AN/A273761None
DIANA-miRGen (2016)9428 (intergenic)276NoneunknownNone
TSmiR (2014)12 tissues116N/AN/A2347None
CircuitsDB (2013)N/A180N/AN/A115 TFs to 180 miRNAsNone
miRT (2012)N/A588 (206 intergenic / 382 intragenic)670NoneN/AN/A
TransmiR (2009)N/A100N/AN/A735735

The 54 cell lines in mirTrans come from 17 tissues, which cover most of the biological systems including circulatory system, digestive system, nervous system, reproductive system and respiratory system (see Figure 2). About 30% of cell lines are cancer cells. This enables mirTrans a broad range of application in cancer study.

Figure 2.

Statistics on the data in mirTrans. (A) Distribution of cell lines in 17 tissues; (B) distribution of miRNA TSSs, TF-miRNA regulations, miRNA TSSs with literature evidence, TF-miRNA regulations with literature evidence in 17 tissues.

The segment that starts from miRNA TSS to the end of pre-miRNA is annotated as the confident transcript of miRNA. Statistics on EST distribution show that 96.5% (10627/11016) of the confident transcripts are covered by at least one EST. Moreover, the coverage difference of EST between the upstream and the downstream of miRNA TSS is calculated for quality evaluation of TSSs (i.e. EST evidence mentioned above). 78.4% (8640/11016) of TSSs are with significant difference of EST coverage (P < 0.05, by t-test), which indicates the high-quality of miRNA TSSs in mirTrans.

Database implementation

System facility

All the data in mirTrans are well organized with MySQL. The website is implemented with PHP, JavaScript and HTML. Genomic graphs are illustrated using ‘Gviz’ package of Bioconductor.

Data query

Data retrieving starts from data query. mirTrans offers two ways of queries, i.e. simple search and advanced search (Figure 3A). Simple search allows the query with a single keyword, while advanced search allows the query with multiple keywords. Fuzzy words are supported in both simple and advanced search.

Figure 3.

Interface for data query and download. (AD) Data query; (E) data download. Red square in (B–D) marks the button for customized download or filter option.

For simple search, there are three entries which respectively relate to three types of keywords, i.e. miRNA, cell line and TF. Different entries go to different forms of return.

  • Simple search by miRNA returns a graph followed by a table. The graph shows the genomic distribution of pre-miRNA, miRNA TSS miRNA TFBSs, H3K4me3 peak, DHS peak and ESTs (Figure 3B). The table lists the following information that is associated with the input pre-miRNA: genomic location, TSS location, distance from TSS to pre-miRNA, cell line where the TSS are found, TFs that bind to the region of 1kb upstream to the TSS (i.e. the miRNA promoter), EST evidence and literature evidence for the TSS (Figure 3B).

  • Simple search by cell line returns a table where all the pre-miRNAs of which TSSs are found in this cell line, are listed. Additional information in the table includes miRNA location, TSS location of the miRNA, host gene of the miRNA and the location of the host gene (Figure 3C). More details can be retrieved when a miRNA or TSS in table is clicked (Figure 3C).

  • Simple search by TF returns a table where all the TF-miRNA regulations in which the TF participates are listed. Associated information includes the miRNA that the TF regulates, the cell line where the TF-miRNA regulation happens, the TFBS where the TF binds, affinity score that describes the binding strength of the TF to the TFBS, conservation score of the TFBS sequence, and literature evidence (Figure 3D). Detail information about the literature that supports the TF-miRNA regulation can be referred when the links in ‘Details’ are followed (Figure 3D). In particular, a filter option is offered, with which customers are free to retrieve either the whole results or only the literature-supported ones (see the bottom-left button in Figure 3D). Download of the return table is associated with the filter option (see the bottom-right button in Figure 3D).

Advanced search enables a flexible query with any combinations of the three keywords used in simple search (Figure 3A). The results of the advanced search will be returned in the way similar to that of simple search.

Data download

mirTrans provides two ways of data download, i.e. customized download and the complete download. The customized download is associated with the query that the customer submits. Every result of the query can be downloaded by clicking the ‘download’ button at the bottom of the return page (Figure 3BD). The complete download is specifically offered in the download page (Figure 3E). Customers are allowed to separately download the complete data in the form of the returned table when miRNA/cell line/TF is inquired. A collective download packing all the forms of data is also provided.

Database maintenance

A new version of mirTrans is released every 6 months in regular. The update includes the refresh of supportive data (e.g. H3K4me3, DHS, EST and pre-miRNA), core data (e.g. cell-specific miRNA TSSs and TF-miRNA regulation), EST evidence and literature evidence. The whole updating process is developed in a semi-automatic way. The supportive data are automatically collected from the related database, the core data and EST evidence are automatically produced through the workflow of our own, and the literature curation are manually made after related literatures are collected from PUBMED (https://www.ncbi.nlm.nih.gov/pubmed).

CONCLUSION

The wide participations in gene regulation of biological processes and clinical application potentials of miRNAs intrigue a great demand on the knowledge of cell-specific regulations of miRNAs. mirTrans offers abundant and high-quality information about cell-specific transcriptions of human miRNAs. Compared with the current databases reported so far, mirTrans specially advances in that, a significant amount of data is provided with sufficient evidence from high throughput data and literature reports manually curated. Firstly, the information for a total of 54 cell lines covering most of the key biological systems ensures the database applicable to a large variety of biological and medical investigations in different tissues or processes; Secondly, nearly all the human miRNAs known so far residing in either intergenic genomic region or intragenic region are included in mirTrans, while many of the other databases only provide intergenic miRNAs which generally account for <50% of the total miRNAs encoded by human genome; Finally, quality evaluation on either miRNA TSSs or TF-miRNA regulations were made by EST data and literature reports. Concerned with these aspects, mirTrans is very helpful for both the experimental biologists and the system biologists, who are interested in gene regulations, pathways and gene networks where miRNAs are involved.

Recent years have shown an ever rapid increase in the genomic and epigenomic data for diverse samples of tissues and cells. Accordingly, mirTrans will be updated regularly by employing new experimental data and information from literature mining. It will also incorporate cell-specific miRNA transcriptional networks by integrating all the regulations between miRNAs and TFs in the future.

ACKNOWLEDGEMENTS

The authors would acknowledge the Center of High Performance Computation of Nanjing University for the support of computational resources.

FUNDING

China Scholarship Council, DZHK (German Center for Cardiovascular Research); National Science Foundation of China [81250044, 31500674]; NJU-Yangzhou Institute of Optoelectronics. Funding for open access charge: National Science Foundation of China [81250044, 31500674].

Conflict of interest statement. None declared.

REFERENCES

1.

He
L.
,
Hannon
G.J.
MicroRNAs: small RNAs with a big role in gene regulation
.
Nat. Rev. Genet.
2004
;
5
:
522
531
.

2.

van Rooij
E.
,
Olson
E.N.
MicroRNA therapeutics for cardiovascular disease: opportunities and obstacles
.
Nat. Rev. Drug Discov.
2012
;
11
:
860
872
.

3.

Rupaimoole
R.
,
Slack
F.J.
MicroRNA therapeutics: towards a new era for the management of cancer and other diseases
.
Nat. Rev. Drug Discov.
2017
;
16
:
203
222
.

4.

Lee
R.C.
,
Feinbaum
R.L.
,
Ambros
V.
The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14
.
Cell
.
1993
;
75
:
843
854
.

5.

Kozomara
A.
,
Griffiths-Jones
S.
miRBase: annotating high confidence microRNAs using deep sequencing data
.
Nucleic Acids Res.
2014
;
42
:
D68
D73
.

6.

Houseley
J.
,
LaCava
J.
,
Tollervey
D.
RNA-quality control by the exosome
.
Nat. Rev. Mol. Cell Biol.
2006
;
7
:
529
539
.

7.

Chen
C.Y.
,
Shyu
A.B.
Mechanisms of deadenylation-dependent decay
.
Wiley Interdiscipl. Rev. RNA
.
2011
;
2
:
167
183
.

8.

Zhou
K.R.
,
Liu
S.
,
Sun
W.J.
,
Zheng
L.L.
,
Zhou
H.
,
Yang
J.H.
,
Qu
L.H.
ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data
.
Nucleic Acids Res.
2017
;
45
:
D43
D50
.

9.

Friard
O.
,
Re
A.
,
Taverna
D.
,
De Bortoli
M.
,
Cora
D.
CircuitsDB: a database of mixed microRNA/transcription factor feed-forward regulatory circuits in human and mouse
.
BMC Bioinformatics
.
2010
;
11
:
435
.

10.

Wang
J.
,
Lu
M.
,
Qiu
C.
,
Cui
Q.
TransmiR: a transcription factor-microRNA regulation database
.
Nucleic Acids Res.
2010
;
38
:
D119
D122
.

11.

Bhattacharyya
M.
,
Das
M.
,
Bandyopadhyay
S.
miRT: a database of validated transcription start sites of human microRNAs
.
Genomics Proteomics Bioinformatics
.
2012
;
10
:
310
316
.

12.

Roux
J.
,
Gonzalez-Porta
M.
,
Robinson-Rechavi
M.
Comparative analysis of human and mouse expression data illuminates tissue-specific evolutionary patterns of miRNAs
.
Nucleic Acids Res.
2012
;
40
:
5890
5900
.

13.

Esteller
M.
Non-coding RNAs in human disease
.
Nat. Rev. Genet.
2011
;
12
:
861
874
.

14.

Gama-Carvalho
M.
,
Andrade
J.
,
Bras-Rosario
L.
Regulation of cardiac cell fate by microRNAs: implications for heart regeneration
.
Cells
.
2014
;
3
:
996
1026
.

15.

Trionfini
P.
,
Benigni
A.
,
Remuzzi
G.
MicroRNAs in kidney physiology and disease
.
Nat. Rev. Nephrol.
2015
;
11
:
23
33
.

16.

Issler
O.
,
Chen
A.
Determining the role of microRNAs in psychiatric disorders
.
Nat. Rev. Neurosci.
2015
;
16
:
201
212
.

17.

Guo
Z.
,
Maki
M.
,
Ding
R.
,
Yang
Y.
,
Zhang
B.
,
Xiong
L.
Genome-wide survey of tissue-specific microRNA and transcription factor regulatory networks in 12 tissues
.
Scientific Rep.
2014
;
4
:
5150
.

18.

Georgakilas
G.
,
Vlachos
I.S.
,
Zagganas
K.
,
Vergoulis
T.
,
Paraskevopoulou
M.D.
,
Kanellos
I.
,
Tsanakas
P.
,
Dellis
D.
,
Fevgas
A.
,
Dalamagas
T.
et al. 
DIANA-miRGen v3.0: accurate characterization of microRNA promoters and their regulators
.
Nucleic Acids Res.
2016
;
44
:
D190
D195
.

19.

Hua
X.
,
Chen
L.
,
Wang
J.
,
Li
J.
,
Wingender
E.
Identifying cell-specific microRNA transcriptional start sites
.
Bioinformatics
.
2016
;
32
:
2403
2410
.

20.

Zhang
Y.
,
Liu
T.
,
Meyer
C.A.
,
Eeckhoute
J.
,
Johnson
D.S.
,
Bernstein
B.E.
,
Nusbaum
C.
,
Myers
R.M.
,
Brown
M.
,
Li
W.
et al. 
Model-based analysis of ChIP-Seq (MACS)
.
Genome Biol.
2008
;
9
:
R137
.

21.

Heinz
S.
,
Benner
C.
,
Spann
N.
,
Bertolino
E.
,
Lin
Y.C.
,
Laslo
P.
,
Cheng
J.X.
,
Murre
C.
,
Singh
H.
,
Glass
C.K.
Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities
.
Mol. Cell
.
2010
;
38
:
576
589
.

22.

Ribas
J.
,
Ni
X.
,
Castanares
M.
,
Liu
M.M.
,
Esopi
D.
,
Yegnasubramanian
S.
,
Rodriguez
R.
,
Mendell
J.T.
,
Lupold
S.E.
A novel source for miR-21 expression through the alternative polyadenylation of VMP1 gene transcripts
.
Nucleic Acids Res.
2012
;
40
:
6821
6833
.

23.

Georgakilas
G.
,
Vlachos
I.S.
,
Paraskevopoulou
M.D.
,
Yang
P.
,
Zhang
Y.
,
Economides
A.N.
,
Hatzigeorgiou
A.G.
microTSS: accurate microRNA transcription start site identification reveals a significant number of divergent pri-miRNAs
.
Nat. Commun.
2014
;
5
:
5700
.

24.

Liu
Q.
,
Wang
J.
,
Zhao
Y.
,
Li
C.I.
,
Stengel
K.R.
,
Acharya
P.
,
Johnston
G.
,
Hiebert
S.W.
,
Shyr
Y.
Identification of active miRNA promoters from nuclear run-on RNA sequencing
.
Nucleic Acids Res.
2017
;
45
:
e121
.

25.

de Rie
D.
,
Abugessaisa
I.
,
Alam
T.
,
Arner
E.
,
Arner
P.
,
Ashoor
H.
,
Astrom
G.
,
Babina
M.
,
Bertin
N.
,
Burroughs
A.M.
et al. 
An integrated expression atlas of miRNAs and their promoters in human and mouse
.
Nat. Biotechnol.
2017
;
35
:
872
878
.

26.

Thurman
R.E.
,
Rynes
E.
,
Humbert
R.
,
Vierstra
J.
,
Maurano
M.T.
,
Haugen
E.
,
Sheffield
N.C.
,
Stergachis
A.B.
,
Wang
H.
,
Vernot
B.
et al. 
The accessible chromatin landscape of the human genome
.
Nature
.
2012
;
489
:
75
82
.

27.

Matys
V.
,
Fricke
E.
,
Geffers
R.
,
Gossling
E.
,
Haubrock
M.
,
Hehl
R.
,
Hornischer
K.
,
Karas
D.
,
Kel
A.E.
,
Kel-Margoulis
O.V.
et al. 
TRANSFAC: transcriptional regulation, from patterns to profiles
.
Nucleic Acids Res.
2003
;
31
:
374
378
.

28.

Kel
A.E.
,
Gossling
E.
,
Reuter
I.
,
Cheremushkin
E.
,
Kel-Margoulis
O.V.
,
Wingender
E.
MATCH: A tool for searching transcription factor binding sites in DNA sequences
.
Nucleic Acids Res.
2003
;
31
:
3576
3579
.

29.

Pollard
K.S.
,
Hubisz
M.J.
,
Rosenbloom
K.R.
,
Siepel
A.
Detection of nonneutral substitution rates on mammalian phylogenies
.
Genome Res.
2010
;
20
:
110
121
.

30.

Siepel
A.
,
Bejerano
G.
,
Pedersen
J.S.
,
Hinrichs
A.S.
,
Hou
M.
,
Rosenbloom
K.
,
Clawson
H.
,
Spieth
J.
,
Hillier
L.W.
,
Richards
S.
et al. 
Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes
.
Genome Res.
2005
;
15
:
1034
1050
.

31.

Zhang
Y.C.
,
Ye
H.
,
Zeng
Z.
,
Chin
Y.E.
,
Huang
Y.N.
,
Fu
G.H.
The NF-kappaB p65/miR-23a-27a-24 cluster is a target for leukemia treatment
.
Oncotarget
.
2015
;
6
:
33554
33567
.

32.

Li
P.
,
Hu
Y.
,
Yi
J.
,
Li
J.
,
Yang
J.
,
Wang
J.
Identification of potential biomarkers to differentially diagnose solid pseudopapillary tumors and pancreatic malignancies via a gene regulatory network
.
J Transl Med.
2015
;
13
:
361
.

33.

Kavanagh
D.H.
,
Dwyer
S.
,
O’Donovan
M.C.
,
Owen
M.J.
The ENCODE project: implications for psychiatric genetics
.
Mol. Psychiatry
.
2013
;
18
:
540
542
.

34.

Karolchik
D.
,
Hinrichs
A.S.
,
Furey
T.S.
,
Roskin
K.M.
,
Sugnet
C.W.
,
Haussler
D.
,
Kent
W.J.
The UCSC Table Browser data retrieval tool
.
Nucleic Acids Res.
2004
;
32
:
D493
D496
.

Author notes

These authors contributed equally to this work as first authors.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

Comments

0 Comments
Submit a comment
You have entered an invalid code
Thank you for submitting a comment on this article. Your comment will be reviewed and published at the journal's discretion. Please check for further notifications by email.