Prediction of siRNA knockdown efficiency using artificial neural network models

https://doi.org/10.1016/j.bbrc.2005.08.147Get rights and content

Abstract

Selective knockdown of gene expression by short interference RNAs (siRNAs) has allowed rapid validation of gene functions and made possible a high throughput, genome scale approach to interrogate gene function. However, randomly designed siRNAs display different knockdown efficiencies of target genes. Hence, various prediction algorithms based on siRNA functionality have recently been constructed to increase the likelihood of selecting effective siRNAs, thereby reducing the experimental cost. Toward this end, we have trained three Back-propagation and Bayesian neural network models, previously not used in this context, to predict the knockdown efficiencies of 180 experimentally verified siRNAs on their corresponding target genes. Using our input coding based primarily on RNA structure thermodynamic parameters and cross-validation method, we showed that our neural network models outperformed most other methods and are comparable to the best predicting algorithm thus far published. Furthermore, our neural network models correctly classified 74% of all siRNAs into different efficiency categories; with a correlation coefficient of 0.43 and receiver operating characteristic curve score of 0.78, thus highlighting the potential utility of this method to complement other existing siRNA classification and prediction schemes.

Section snippets

Materials and methods

Neural network models. Artificial neural network is built on a set of interconnected neural units and consists of one input and one output layer that takes the input values and outputs the final output result individually. Some of them have one or more hidden layers which perform nonlinear modeling (Fig. 1).

There are many different types of neural networks. Each differs from the others in network topology and/or learning algorithm. In this study, we introduce the back-propagation, general

Network parameter optimization

Several important parameters affect neural network structure configuration and performance. In back-propagation neural network, these parameters include training time, the number of units in hidden layer, learning rate, and momentum. Training time is measured in epoch. One epoch is equivalent to presenting all patterns to the network once. Long training time increases the possibility of over-fitting the training set: the error of training set will get lower as the training time gets longer and

Discussion

Two main encoding methods are used in previous studies to facilitate the selection of effective siRNA: energy-based [11] and sequence-feature based [9], [10], [25] methods. Some studies combined both features [5], [6], [7]. Most of these studies attempted to discover the correlation between the functionality of siRNAs and their specific sequence motif or base preference. Due to differences in experimental setting such as target transcript sequence and the relatively small dataset, different

Acknowledgment

G.W.W. is supported by the NIH NRSA postdoctorate fellowship (5F32DK 067835-02).

References (26)

  • K. Berns et al.

    A large-scale RNAi screen in human cells identifies new components of P53 pathway

    Nature

    (2004)
  • A. Reynolds et al.

    Rational siRNA design for RNA interence

    Nat. Biotechnol.

    (2004)
  • K. Ui-Tei et al.

    Guidelines for the selection of highly effective siRNA sequences for mammalian and chick RNA interference

    Nucleic Acids Res.

    (2004)
  • Cited by (18)

    • Development of in silico methodology for siRNA lipid nanoparticle formulations

      2022, Chemical Engineering Journal
      Citation Excerpt :

      The binary classification method was used in this research. The classification standard was 50% knockdown efficiency, which is a common acceptable value [22,23]. There are 46 effective formulations and 83 ineffective formulations for the in vitro model, while there are 113 effective formulations and 188 ineffective for the in vivo model on this assignment.

    • Selecting highly effective siRNAs by their modified entropies with mini-clusters

      2012, Theoretical and Applied Fracture Mechanics
      Citation Excerpt :

      The results show that type-1 (or type-2) entropy with mini-clusters is reliable for identifying effective siRNAs. To compare our results to existing siRNA-based design tools, three different algorithms [17–19] are applied to the data set of siRNAs. In fact, exceed 10% of effective siRNAs were not identified by any algorithm.

    View all citing articles on Scopus
    View full text