Acessibilidade / Reportar erro

PRELIMINARY MODELING OF AN INDUSTRIAL RECOMBINANT HUMAN ERYTHROPOIETIN PURIFICATION PROCESS BY ARTIFICIAL NEURAL NETWORKS

Abstract

In the present study a preliminary neural network modelling to improve our understanding of Recombinant Human Erythropoietin purification process in a plant was explored. A three layer feed-forward back propagation neural network was constructed for predicting the efficiency of the purification section comprising four chromatographic steps as a function of eleven operational variables. The neural network model performed very well in the training and validation phases. Using the connection weight method the predictor variables were ranked based on their estimated explanatory importance in the neural network and five input variables were found to be predominant over the others. These results provided useful information showing that the first chromatographic step and the third chromatographic step are decisive to achieve high efficiencies in the purification section, thus enriching the control strategy of the plant.

Keywords:
Neural network; Erythropoietin; Chromatographic purification; Modeling

INTRODUCTION

Industrial biopharmaceutical production has gradually become dependent on large-scale processes using sensitive mammalian cell cultures (Glassey et al., 2011Glassey, J., Gernaey, K. V., Clemens, C., Schulz, T. W., Oliveira, R. Striedner, G. and Mandenius C. F., Process analytical technology (PAT) for biopharmaceuticals, Biotechnol. J., 6, 369-377 (2011).). The biopharmaceutical market has witnessed the arrival of many new complex therapeutic proteins produced in mammalian cell lines (Teixeira et al., 2009Teixeira, A. P., Oliveira, R., Alves, P. M. and Car-rondo, M. J. T., Advances in on-line monitoring and control of mammalian cell cultures: Supporting the PAT initiative. Biotechnology Advances, 27, 726-732 (2009).). Mammalian hosts have an innate capacity to perform post-translational modifications; in particular, glycosylation is of special interest as it influences the functionality, serum half-life and immunogenicity of such therapeutics. Several glycoproteins and monoclonal antibodies with relatively high production needs are being identified as new drug candidates (Walsh, 2006Walsh, G., Biopharmaceutical benchmarks 2006. Nat. Biotechnol., 24,769-76 (2006).). The Chinese Hamster Ovary (CHO) cell line is one of the most used for obtaining biological therapeutics (Merten, 2006Merten, O. W., Introduction to animal cell culture technology-past, present and future. Cytotechnology, 50, 1-7 (2006).).

Recombinant Human Erythropoietin (rhEPO) is a glycoprotein hormone produced by means of CHO cell culture. It has proven to be a highly effective therapeutic agent with no evidence of an adverse physiological effect due to the structural differences in terminal sialylation (Butler, 2005Butler, M., Animal cell cultures: Recent achievements and perspectives in the production of biopharmaceuticals. Appl. Microbiol. Biotechnol., 68, 283-291 (2005).). Since EPO increases the number of peripheral red blood cells, there has been considerable interest in the therapeutic use of EPO for the treatment of severe anemia. However, EPO is also massively misused for doping purposes. The current pharmaceutical market value is over 12 billion US dollars a year (Hoeksema, 2011Hoeksema, F., Selection markers and expression enhancing elements: Novel strategies to create mammalian production cell lines. Faculty of Science. University of Amsterdam. Thesis Dissertation (2011). Available at: <http://dare.uva.nl/document/482571>
http://dare.uva.nl/document/482571...
).

Once a biopharmaceutical has been approved based on a given process, any significant deviation from the production protocol may require new clinical trials to test the safety of the resulting product ("process defines the product"). Since clinical trials are very expensive, process improvements are made under very hard constraints. Therefore, production processes are normally run far below their potential maximum performance (Sommerfeld and Strube, 2005Sommerfeld, S. and Strube, J., Challenges in biotechnology production - generic processes and process optimization for monoclonal antibodies. Chem. Eng. Process, 44, 1123-37 (2005).; Teixeira et al., 2009Teixeira, A. P., Oliveira, R., Alves, P. M. and Car-rondo, M. J. T., Advances in on-line monitoring and control of mammalian cell cultures: Supporting the PAT initiative. Biotechnology Advances, 27, 726-732 (2009).).

Production of rhEPO has potential for improvement and is still being investigated (Hoeksema, 2011Hoeksema, F., Selection markers and expression enhancing elements: Novel strategies to create mammalian production cell lines. Faculty of Science. University of Amsterdam. Thesis Dissertation (2011). Available at: <http://dare.uva.nl/document/482571>
http://dare.uva.nl/document/482571...
; Surabattula et al., 2011Surabattula, R., Sambasiva, K. R. S. and Polavarapu, R., An optimized process for expression, scale-up and purification of recombinant erythropoietin produced in Chinese hamster ovary cell culture. Research in Biotechnology, 2(3), 58-74 (2011).). Its industrial process production has two main sections, fermentation and purification in chromatographic columns. In the case studied, fermentation takes place in stirred tank bioreactors with perfusion-based process, which can create relatively high product concentrations and volumetric throughput (Acuna et al., 2011Acuna, J., Hewitt, M., Johnston, R., Kirkland, D., Shikibu, T. and Zhang, D., Modeling perfusion processes in biopharmaceutical production. Bio-Process Int., 9, 52-59 (2011).). The purification section is based on 4 chromatographic steps which use affinity chromatography, ion exchange chromatography and molecular exclusion chromatography.

In the plant of the case studied there has been a great instability in the purification section efficiencies. In spite of the fact that there is a great amount of registered information, there is a lack of knowledge concerning the way the controlled variables impact purification efficiencies.

Multivariate data analysis has a great potential for improving biopharmaceutical production. It can interact with several other methods and techniques that analyze the biopharmaceutical manufacturing process, leading to better understanding of it and exerting control of its quality. Biopharmaceutical processes typically lead to the generation of large multivariate data sets which are highly heterogeneous, correlated, non-linear in nature and with high levels of redundancy and noise. The usefulness of multivariate data analysis techniques has been proven in the area of biopharmaceuticals. Their ability to reduce dimensionality by removing the redundancy and noise leads to the identification of salient features in the data. These features can subsequently be used in bioprocess monitoring, fault detection and process optimization, as has been described extensively in the literature over the years (Teixeira et al., 2009Teixeira, A. P., Oliveira, R., Alves, P. M. and Car-rondo, M. J. T., Advances in on-line monitoring and control of mammalian cell cultures: Supporting the PAT initiative. Biotechnology Advances, 27, 726-732 (2009).; Glassey et al., 2011Glassey, J., Gernaey, K. V., Clemens, C., Schulz, T. W., Oliveira, R. Striedner, G. and Mandenius C. F., Process analytical technology (PAT) for biopharmaceuticals, Biotechnol. J., 6, 369-377 (2011).). The inner workings of the majority of batch pharmaceutical processes are not well understood for a fundamental or knowledge-driven model to be developed. Because such knowledge-driven models provide a much more detailed and insightful view of the process, their development should be pursued and is indeed pursued, for selective critical parts of the process. For the majority of pharmaceutical processes or their processing steps, one needs to rely substantially on the development of data-driven models. The availability on an ever-increasing set of off-line and on-line process measurements (spectroscopic or otherwise) provides the engineer with substantial data as the starting point for developing a data-driven model and, through it, attaining a certain understanding of the process (Troup and Georgakis, 2013Troup, G. M. and Georgakis, C., Process systems engineering tools in the pharmaceutical industry. Computers & Chemical Engineering, 51, 157-171 (2013).) .

Artificial Neural Networks (ANN) are a powerful tool in multivariate data analysis, applicable to a wide range of problems, especially in the domain of process modeling. Since ANN have the ability to extract from experimental data the highly non-linear and complex relationships between the variables of the problem without any detailed knowledge of the system (Si-Moussa et al., 2008Si-Moussa, C., Hanini, S., Derriche, R., Bouhedda, M. and Bouzidi, A., Prediciton of high-pressure vapor liquid equilibrium of six binary systems, carbon dioxide with six esters, using an artificial neural network model. Braz. J. Chem. Eng., 25, 183-199 (2008).; Khaouane et al., 2013Khaouane, L., Benkortbi, O., Hanini, S. and Si-Moussa, C., Modeling of an industrial process of pleuromutilin fermentation using feed-forward neural networks. Braz. J. Chem. Eng., 30(1), 1005-116 (2013).) and, given the great amount of available experimental data in the study, it was decided to apply this approach. In this way, it was possible to extract useful information for making decisions without the need to have a theoretical model of process behaviour.

Chromatography has developed from an analytical technology to a well-established separation process in industry. Large-scale industrial applications have been reported from different industrial fields, e.g. in the petrochemical, pharmaceutical, bio-chemical, and also in the food industry (Engell and Toumi, 2005Engell, S. and Toumi, A., Optimisation and control of chromatography. Computers and Chemical Engineering, 29 1243-1252 (2005).). ANN have been widely used in liquid chromatography modeling (Korany et al., 2012Korany, M. A., Mahgoub, H., Fahmy, O. T. and Maher, H. M., Application of artificial neural networks for response surface modeling in HPLC method development. Journal of Advanced Research, 3, 53-63 (2012).; Webb et al., 2009Webb, R., Doble, P. and Dawson, M., Optimisation of HPLC gradient separations using Artificial Neural Networks (ANNs): Application to benzodiazepines in post-mortem samples. J. Chromatogr., B, 877(7), 615-20 (2009).; Tran et al, 2007Tran, A. T. K., Hyne, R. V., Pablo, F., Day, W. R. and Doble, P., Optimisation of the separation of herbicides by linear gradient high performance liquid chromatography utilising artificial neural networks. Talanta, 71(3), 1268-75 (2007).; Novotna et al., 2005Novotna, K., Havlis, J. and Havel, J., Optimisation of high performance liquid chromatography separation of neuroprotective Peptides: Fractional experimental designs combined with artificial neural networks. J. Chromatogr., A, 1096(1-2), 50-7 (2005).; Loukas, 2000Loukas, Y. L., Artificial neural networks in liquid chromatography: Efficient and improved quantitative structure-retention relationship models. Journal of Chromatography, A, 904, 119-129 (2000). Available at: <http://144.206.159.178/FT/553/30150/ 521375.pdf>
http://144.206.159.178/FT/553/30150/5213...
; Gao and Engell, 2004Gao, W. and Engell, S., Neural network-based identification of nonlinear adsorption isotherms. 7th International Symposium on Dynamics and Control of Process Systems DYCOPS 7, Cambridge, USA, July 5-7 (2004). Available at: <http://www.nt.ntnu.no/users/skoge/prost/proceedings/dycops04/pdffiles/ papers/132.pdf>
http://www.nt.ntnu.no/users/skoge/prost/...
; Satlawa et al., 2013Satlawa, T., Grabska-Chrzastowska, J. and Korohoda, P., Application of multidimensional data analysis to chromatography. Image Processing & Communication, 18(2-3), 101-108 (2013).), but in all papers reported, the modeling of the chromatographic purification processes is limited to one column operation level, hence not covering the overall efficiency of a purification section with several chromatographic steps. On the other hand, ANN have been applied in the rhEPO production process, but only in the fermentation section (Didier, 2008Didier, C., Optimización de la producción de eritropoyetina humana recombinante (rhEPO) mediante el cultivo de células animales. Tesis de maestría en Bioquímica, Universidad Nacional del Litoral, Facultad de Bioquímica y Ciencias Biológicas (2008). Available at: <http://bibliotecavirtual.unl. edu.ar:8180/tesis/bitstream/1/173/1/tesis1.pdf> (In Spanish).
http://bibliotecavirtual.unl.edu.ar:8180...
; Didier et al., 2009Didier, C., Forno, G., Etcheverrigaray, M., Kratje, R. and Goicoechea, H., Novel chemometric strategy based on the application of artificial neural networks to crossed mixture design for the improvement of recombinant protein production in continuous culture. Anal Chim. Acta, 650, 167-74 (2009).; Hoeksema, 2011Hoeksema, F., Selection markers and expression enhancing elements: Novel strategies to create mammalian production cell lines. Faculty of Science. University of Amsterdam. Thesis Dissertation (2011). Available at: <http://dare.uva.nl/document/482571>
http://dare.uva.nl/document/482571...
; Forno et al., 2009Forno, G., Didier, C., Etcheverrigaray, M., Goicoechea, H. and Kratje, R., Crossed mixture design and artificial neural networks: An efficient approach to cell culture medium optimization. Proceedings of the 21st Annual Meeting of the European Society for Animal Cell Technology (ESACT), Dublin, Ireland, June 7-10 (2009).). No work has been reported concerning application of ANN to the modelling of the industrial process of EPO purification.

The objective of this preliminary study is to explore the possibilities of using neural network modelling to gain knowledge about the process, specifically how variables that are registered in this stage affect the purification section efficiency in the rhEPO production process; a method for quantifying the explanatory contributions of the predictor variables in the network was applied and identified the variables with the biggest contribution to the efficiency. This would be very interesting for further applications in monitoring and control of the process, and could be a great help to identify key parameters and to define control spaces, thus increasing the rhEPO productivity.

MATERIALS AND METHODS

Data Set

The data used in this study were provided by the Center of Molecular Immunology in Havana, whose main research objective is the development of new products for the diagnosis and treatment of cancer and other diseases related to the immune system. Data were collected from the operation of the purification section in the rhEPO plant during one campaign, resulting in 204 records.

Eleven input variables were studied: four related to the first chromatographic step (affinity chromatography), two related to the second chromatographic step (affinity chromatography) and five related to the third chromatographic step (ion exchange chromatography). The output variable of interest was rhEPO yield in bulbs (at the elution of the fourth chromatographic step - molecular exclusion chromatography) per liter of supernatant. The selection of these variables was based on the domain knowledge and was made by taking into account criteria from the experts of the plant who considered these inputs relevant.

Two variables related to the supernatant applied in the first chromatographic step were considered:

1. Cell specific perfusion rate, which means the medium consumption per cell per day (cspr, nL/cell.day), calculated as the dilution rate divided by the live cell concentration in the bioreactor.

2. rhEPO mass in the supernatant applied (masasnhplc, g), calculated with rhEPO concentration measured via HPLC-RP (High Performance Liquid Chromatography in Reverse Phase).

Variables related to the first chromatographic step, comprising two columns, were:

3. Optical density at 280 nm at the elution of the first column (dO280b, AU), measured via spectrophotometry.

4. rhEPO concentration at the elution of the second column (concg25b, mg/mL), measured by optical density.

Variables related to the second chromatographic step were:

5. Relation of the rhEPO mass applied divided by the packed gel volume in the first column (masa_vgch, g/L).

6. Yield of the second chromatographic step with regard to the first chromatographic step (rendch_b, adim.), calculated as the mass of elution of the second step divided by the mass of elution of the first step.

Variables related to the third chromatographic step were:

7. Relation of the rhEPO mass applied divided by the packed gel volume in the column (masa_vgq, g/L).

8. Relation of the optical density at 280 nm divided by the optical density at 260 nm of eluted stream in the column (releluq , adim.).

9. Recovery of rhEPO in the elution of the column (recelucq, percent), calculated as the relation of the eluted mass divided by the total eluted mass in the third step.

10. Relation of the acid rhEPO mass divided by the total rhEPO at the chromatographic step (eluq/ eluq+lavq, adim.).

11. Yield of the third chromatographic step with regard to the second chromatographic step (rendq/ch, adim), calculated as the relation of the mass of elution of the third step divided by the mass of elution of the second step.

Variables related to the fourth chromatographic step were:

12. rhEPO yield in bulbs per liter of supernatant (rendbb/lsn, bulbs/L). This is the target variable of the study and is calculated as the relation of the amount of vials obtained divided by the volume of supernatant applied.

In Table 1 the minimum, maximum, mean and standard deviation values of the variables are presented.

Table 1
A summary of the data set.

Figure 1 is a synthetic presentation of records for rhEPO yield in bulbs per liter of supernatant before the modeling.

Figure 1
Synthetic presentation of records for rhEPO yield in bulbs per liter of supernatant.

Neural Network

An artificial neural network is a nonlinear statistical data modeling tool that simulates the structure and functional aspects of biological neural networks. Although many differing types of ANN exist, they do possess some common features. They are generally composed of numerous process elements, termed nodes, which are arranged together to form a network. The choice of the architecture of the network depends on the task to be performed and the architecture of the model is specified by the node characteristics, network topology and learning algorithm. In standard architecture, neurons are grouped into different layers like input, output and hidden layers. The use of hidden layers confers on ANN the ability to describe nonlinear systems. Good descriptions of ANN are given in the literature (Bose and Liang, 1996Bose, N. K. and Liang, P., Neural Networks Fundamentals with Graphs, Algorithms, and Applications, in Electrical and Computer Engineering. McGraw-Hill Series, McGraw-Hill Companies, Inc., USA (1996).; Himmelblau, 2000Himmelblau, D. M., Applications of artificial neural networks in chemical engineering. Korean J. Chem. Eng., 17(4), 373-392 (2000).).

One of the most popular neural network paradigms applied to the modeling of a wide range of nonlinear systems, especially chemical and biological engineering processes, is the feed-forward back propagation neural network (Silva et al., 2000Silva, R. G., Cruz, A. J. G., Hokka, C. O., Giordano, R. L. C. and Giordano, R. C., A hybrid feedforward neural network model for the cephalosporin C production process. Braz. J. Chem. Eng., 17, 587-598 (2000).; Khaouane et al., 2013Khaouane, L., Benkortbi, O., Hanini, S. and Si-Moussa, C., Modeling of an industrial process of pleuromutilin fermentation using feed-forward neural networks. Braz. J. Chem. Eng., 30(1), 1005-116 (2013).), which has been used in this paper with one hidden layer.

In the present study, 11 variables are used as input parameters in developing of an ANN for rhEPO yield in bulbs per liter of supernatant prediction. These parameters are: cell specific perfusion rate (cspr), rhEPO mass applied in the supernatant (masasnhplc), optical density at 280 nm at the elution of the first column (dO280b), rhEPO concentration at the elution of the second column (concg25b), relation of the rhEPO mass applied divided by the packed gel volume in the column (masa_vgch), yield of the second chromatographic step with regard to the first chromatographic step (rendch_b), relation between the rhEPO mass applied and packed gel volume in the column (masa_vgq), relation of the the optical density at 280 nm divided by the optical density at 260 nm of eluted stream in the column (releluq), recovery of rhEPO in the elution of the column (recelucq), relation of the acid rhEPO mass divided by the total rhEPO in the chromatographic step (eluq/eluq+lavq) and yield of the third chromatographic step with regard to the second chromatographic step (rendq/ch). All inputs were normalized.

The network was trained and validated by using 70% and 30% of 204 data sets, respectively. The network training was carried out by the Levenberg- Marquardt back propagation algorithm. A trial-anderror approach was used to minimize the error in order to determine the optimal number of hidden neurons. The number of hidden neurons in the hidden layer is a crucial parameter. If the number is too small, the ANN may not reflect the complex relationship between input and output value. On the contrary, a large number may create such a complex network that it might lead to a very large output error caused by over fitting of the training sample set. In the present study, the number of neurons in the hidden layer was optimized using trial and error methods during the training stage. Neurons in the input layer have no transfer function. A tan-sigmoid (tansig) transfer function was used in the hidden layer, while a pure linear (purelin) transfer function was used in the output layer.

To judge the performance of the network, the Mean Square Error function (MSE) and correlation coefficient (R-value) from post-training regression analysis were used.

The development procedure of the ANN described above was carried out by elaborating a MATLAB program under MATLAB Neural Network Toolbox Release 14.

Connection Weight Method

Complex supervised learning Data Mining methods, such as Neural Networks, Support Vector Machines and ensembles, are capable of high quality prediction results and thus are useful to support decision making. However, the obtained models are often treated as black boxes, since they are difficult to understand by humans. Improving model interpretability increases the acceptance of the Data Mining results by the domain users and this is an important issue in critical applications, such as control or medicine (Cortez and Embrechts, 2011Cortez, P. and Embrechts, M. J., Opening black box data mining models using sensitivity analysis. Symposium on Computational Intelligence and Data Mining (CIDM), Paris, 11-15 April (2011). Available at: <http://repositorium.sdum.uminho.pt/bitstream/1822/14836/1/blackbox.pdf>
http://repositorium.sdum.uminho.pt/bitst...
).

One of the main limitations of artificial neural networks is the high inability to know, in an explicit way, the relations established between explanatory variables (input) and dependent variables (output). This is a major reason why they are usually called "black boxes" (De Oña and Garrido, 2014De Oña, J. and Garrido, C., Extracting the contribution of independent variables in neural network models: A new approach to handle instability. Neural Computing and Applications, March (2014).). This arises from the fact that the internal characteristic of a trained network is a set of numbers that are very difficult to relate back to the application in a meaningful fashion (Paliwal and Kumar, 2011Paliwal, M. and Kumar, U. A., Assessing the contribution of variables in feed forward neural network. Appl Soft Comput., 11, 3690-3696 (2011).). In the last few years, several methods have been proposed to assess the relative importance of each explanatory variable (Olden and Jackson, 2002Olden, J. D. and Jackson, D. A., Illuminating the "black box": a randomization approach for understanding variable contributions in artificial neural networks. Ecological Modelling, 154, 135-150 (2002).; Olden et al., 2004Olden, J. D., Joy, M. K. and Death, R. G., An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data. Ecological Modelling, 178, 389-397 (2004).; Papadokonstantakis et al., 2006Papadokonstantakis, S., Lygeros, A. and Jacobsson, S. V., Comparison of recent methods for inference of variable influence in neural networks. Neural Networks, 19, 500-513 (2006).; Watts and Worner, 2008Watts, M. J. and Worner, S. P., Using artificial neural networks to determine the relative contribution of abiotic factors influencing the establishment of insect pestpecies. Ecological Informatics, 3, 64-74 (2008).; Paliwal and Kumar, 2011Paliwal, M. and Kumar, U. A., Assessing the contribution of variables in feed forward neural network. Appl Soft Comput., 11, 3690-3696 (2011).; Cortez and Embrechts, 2013Cortez, P. and Embrechts, M. J., Using sensitivity analysis and visualization techniques to open black box data mining models. Information Sciences, 225, 1-17 (2013).; De Oña and Garrido, 2014De Oña, J. and Garrido, C., Extracting the contribution of independent variables in neural network models: A new approach to handle instability. Neural Computing and Applications, March (2014).). Nevertheless, it has not been possible to reach a consensus on which is the best-performing method. Despite the variety of methods studied, there is no general consensus on which model is the best for determining the contribution of variables (De Oña and Garrido, 2014De Oña, J. and Garrido, C., Extracting the contribution of independent variables in neural network models: A new approach to handle instability. Neural Computing and Applications, March (2014).).

The connection weight method is a well-known method existing in the literature which has been used with good results (Olden and Jackson, 2002Olden, J. D. and Jackson, D. A., Illuminating the "black box": a randomization approach for understanding variable contributions in artificial neural networks. Ecological Modelling, 154, 135-150 (2002).; Olden et al., 2004Olden, J. D., Joy, M. K. and Death, R. G., An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data. Ecological Modelling, 178, 389-397 (2004).; Papadokonstantakis et al., 2006Papadokonstantakis, S., Lygeros, A. and Jacobsson, S. V., Comparison of recent methods for inference of variable influence in neural networks. Neural Networks, 19, 500-513 (2006).; Watts and Worner, 2008Watts, M. J. and Worner, S. P., Using artificial neural networks to determine the relative contribution of abiotic factors influencing the establishment of insect pestpecies. Ecological Informatics, 3, 64-74 (2008).; Paliwal and Kumar, 2011Paliwal, M. and Kumar, U. A., Assessing the contribution of variables in feed forward neural network. Appl Soft Comput., 11, 3690-3696 (2011).). This method calculates the sum of products of raw weights of the connection from the input node to the hidden nodes with the connection from hidden node to output nodes for all input nodes. The larger the sum for a given input node, the more the importance of the corresponding input variable. The relative importance of a given input variable can be defined as

where RI is the relative importance of the input variable I, h is the total number of hidden nodes, WI_H is the weight of the connection between input node I and hidden node H, and WH_O is the weight of the connection between hidden node H and output node.

The connection weight method has been used in this study in order to rank the importance of independent variables in predicting the output variable for the neural network.

RESULTS AND DISCUSSION

The properties of the training data determine the number of input and output neurons. In this study, the number of factors (cspr, masasnhplc, do280b, concg25b, masa_vgch, rendch_b, masa_vgq, releluq, recelucq, eluq/eluq+lavq, rendch_b) forced the number of input neurons to be eleven. The number of responses (rendbb/lsn) forced the number of output neurons to be one.

Only one hidden layer was tried and it was sufficient to predict with enough accuracy the rhEPO yield in bulbs per liter of supernatant. This is in line with the fact that the one-hidden-layer, feed-forward neural network trained by the back-propagation algorithm is the type of neural network most frequently used, including applications in biological systems (Olden and Jackson, 2001Olden, J. D. and Jackson, D. A., Fish-habitat relationships in lakes: Gaining predictive and explanatory insight by using artificial neural networks. Transactions of the American Fisheries Society 130, 878-897 (2001).). The number of neurons in the hidden layer is related to the convergence of the error term during the training process. An increase in the number of hidden neurons up to a point usually results in a better learning performance. However, too many hidden neurons may allow too much freedom for the weights to be adjusted and result in learning the noise present in the database used in training. Too few hidden neurons limit the ability of the neural network to model the process. Therefore, in this study, the number of the hidden neurons was varied in the range of 10-11. Simplicity of the architecture and accuracy of the results were the requirements imposed to find the optimum architecture. After several trials, it was found that 11 neurons in the hidden layer gave the best performance with an acceptable compromise between network estimation accuracy and network complexity. Thus, the architecture of the proposed ANN model had three layers: 11 neurons in the input layer, 11 neurons in the hidden layer, and 1 neuron in the output layer, (11,11,1), as shown in Figure 2.

Figure 2
Structure of the ANN.

204 data points were selected and divided into two data sets, i.e., a training data set and a validation data set. In this study, the input data were divided into a training data set consisting of 70% of the input data and a validation data set consisting of 30% of the input data. The training and validation data sets were selected randomly, and the ANN model was trained using the training data set. The generalized performance of the network was estimated periodically by computing the validation error from the independent validation data set. The error decreased with increasing number of training cycles to a certain point, but again increased with increasing number of training cycles. The training was stopped after 22 iterations as shown in Figure 3 to avoid over fitting of the network.

Figure 3
Training performance of ANN model.

The default performance function for feedforward networks is the Mean Square Error (MSE) - the average squared error between the network outputs and the target outputs.The stop criteria are based on the Mean Square Error for the validation set instead of that for the training set to ensure model generalization. The average MSE was observed to have reached the error of 0.0042.

The training result of the proposed feed forward network is displayed in Figure 4.

Figure 4
Neural network training results for rhEPO yield in bulbs per liter of supernatant.

As shown, there is a very good agreement between the experimental data and the trained ones. The correlation coefficient (R) is 0.958, close to 1, suggesting the accuracy of the network training. Similar values of the correlation coefficient ranging from 0.95 have been obtained in successful applications of neural network modeling to cell culture medium optimization in the context of a bio-pharmaceutical production process (Didier, 2008Didier, C., Optimización de la producción de eritropoyetina humana recombinante (rhEPO) mediante el cultivo de células animales. Tesis de maestría en Bioquímica, Universidad Nacional del Litoral, Facultad de Bioquímica y Ciencias Biológicas (2008). Available at: <http://bibliotecavirtual.unl. edu.ar:8180/tesis/bitstream/1/173/1/tesis1.pdf> (In Spanish).
http://bibliotecavirtual.unl.edu.ar:8180...
; Didier et al., 2009Didier, C., Forno, G., Etcheverrigaray, M., Kratje, R. and Goicoechea, H., Novel chemometric strategy based on the application of artificial neural networks to crossed mixture design for the improvement of recombinant protein production in continuous culture. Anal Chim. Acta, 650, 167-74 (2009).; Forno et al., 2009Forno, G., Didier, C., Etcheverrigaray, M., Goicoechea, H. and Kratje, R., Crossed mixture design and artificial neural networks: An efficient approach to cell culture medium optimization. Proceedings of the 21st Annual Meeting of the European Society for Animal Cell Technology (ESACT), Dublin, Ireland, June 7-10 (2009).).

These results are promising for further improvement of the ANN and are acceptable for quantifying variable importance. The optimum calculated values of network parameters to be used in the connection weight method are given in Tables 2 and 3.

Table 2
Input-Hidden Connection Weights.
Table 3
Hidden-Output Connection Weights.

Connection Weight Method

In order to assess the relative contributions of the input variables in prediction of the output, the connection weight method was applied. Tables 4 and 5 show the Connection Weights Products for the neural network and relative importance of the input variables obtained with their rank, respectively.

Table 4
Connection Weights Products.
Table 5
Connection Weights method results.

The relative contributions of the independent variables to the predictive output of the neural network depend primarily on the magnitude and direction of the connection weights. Input variables with larger connection weights represent greater intensities of signal transfer, and therefore are more important in the prediction process compared to variables with smaller weights. Negative connection weights represent inhibitory effects on neurons (reducing the intensity of the incoming signal) and decrease the value of the predicted response, whereas positive connection weights represent excitatory effects on neurons (increasing the intensity of the incoming signal) and increase the value of the predicted response (Olden and Jackson, 2002Olden, J. D. and Jackson, D. A., Illuminating the "black box": a randomization approach for understanding variable contributions in artificial neural networks. Ecological Modelling, 154, 135-150 (2002).). Since the Connection Weight Approach uses raw connection weights, it accounts for the direction of the input-hidden-output relationship and allows accounting for the contrasting influences thus resulting in the correct identification of the variable contribution. This approach can thus successfully identify the true importance of all the variables in the neural network, including variables that exhibit both strong and weak correlations with the response variable (Olden et al., 2004Olden, J. D., Joy, M. K. and Death, R. G., An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data. Ecological Modelling, 178, 389-397 (2004).).

Table 5 shows that the relation of the applied rhEPO mass divided by the packed gel volume in the column of the third chromatographic step (masa_vgq) is ranked the first, which matchs with results obtained in the operation of the plant; it has been verified before from lab scale experiments that this column has a high reserve in terms of adsorption capacity.

The rhEPO mass applied in the supernatant (masasnhplc) and the cell specific perfusion rate (cspr) ranked the second and the third, respectively, which is a very important result taking into account that these variables can be manipulated from fermentation.

Recovery of rhEPO in the elution of the column (recelucq), yield of the third chromatographic step with regard to the second chromatographic step (rendq/ch) and relation between acid rhEPO mass and total rhEPO at the chromatographic step (eluq/ eluq+lavq) ranked the fourth, the fifth and the sixth, respectively.

Given these results it is clear that the first chromatographic step and the third chromatographic step are decisive to get high rhEPO efficiencies in bulbs per liter of supernatant. Therefore, it is necessary to improve the control of these steps. It was decided to redefine three operational parameters: the in fermentation area cspr variable was defined as critical parameter and the in purification area masa_vgch and masa_vgq variables were defined as key parameters.

The following variables show negative values in Table 5: concg25b (rhEPO concentration at the elution of the second column of the first chromatographic step), releluq (relation of the optical density at 280 nm divided by the optical density at 260 nm of eluted stream in the column of the third chromatographic step), masa_vgch (relation of the rhEPO mass applied divided by the packed gel volume in the first column of the second chromatographic step), rendch_b (yield of the second chromatographic step with regard to the first chromatographic step) and dO280b (optical density at 280 nm at the outlet of the first column). This result may be influenced by the great variability of these variables during the studied campaign.

Although the connection weight method suffers from some drawbacks, particularly when a high degree of interaction between predictor variables can be expected, it could be used in the model-building stage to help reduce the complexity of the model by suggesting the elimination of variables which either do not significantly impact the output variable or whose impact could be assessed by an input variable already in the model. In this regard the results obtained in this preliminary study have given useful information for further research. In order to improve the ANN performance, input variables are being preprocessed, using the Principal Components Analysis technique, before they are fed to the backpropagated ANN. More data is being gathered as well, so as to get an independent test set which was not used in both the training and validation process. In this way, the generalization capabilities of the developed neural network may be investigated for further simulation purposes.

CONCLUSION

In the present study, the possibilities of neural network modelling to gain knowledge about the purification process in rhEPO production were explored with satisfactory results. An ANN model was developed to predict the rhEPO yield in bulbs per liter of supernatant for the purification section as a function of eleven operational variables. The ANN model performed very well in the training and validation phases, and was very effective to capture most of the important patterns related to four chromatographic purification steps. The connection weight method was used for quantifying the importance of the variables and its application showed five input variables to be predominant over the others. These results provided useful information showing that the first chromatographic step and the third chromatographic step are decisive to get high rhEPO yields in bulbs per liter of supernatant, thus enriching the control strategy of the plant. The fairly good results obtained indicate that ANNs are promising tools not only in modelling of the EPO purification process, but also in providing insight from the learned relationship, which assist the modeller in understanding the process under investigation, as well as in evaluation of the model. Given the usefulness of the results obtained with this preliminary neural network modelling, improvement of the model will be the focus of future research.

NOMENCLATURE


REFERENCES

  • Acuna, J., Hewitt, M., Johnston, R., Kirkland, D., Shikibu, T. and Zhang, D., Modeling perfusion processes in biopharmaceutical production. Bio-Process Int., 9, 52-59 (2011).
  • Bose, N. K. and Liang, P., Neural Networks Fundamentals with Graphs, Algorithms, and Applications, in Electrical and Computer Engineering. McGraw-Hill Series, McGraw-Hill Companies, Inc., USA (1996).
  • Butler, M., Animal cell cultures: Recent achievements and perspectives in the production of biopharmaceuticals. Appl. Microbiol. Biotechnol., 68, 283-291 (2005).
  • Cortez, P. and Embrechts, M. J., Opening black box data mining models using sensitivity analysis. Symposium on Computational Intelligence and Data Mining (CIDM), Paris, 11-15 April (2011). Available at: <http://repositorium.sdum.uminho.pt/bitstream/1822/14836/1/blackbox.pdf>
    » http://repositorium.sdum.uminho.pt/bitstream/1822/14836/1/blackbox.pdf
  • Cortez, P. and Embrechts, M. J., Using sensitivity analysis and visualization techniques to open black box data mining models. Information Sciences, 225, 1-17 (2013).
  • De Oña, J. and Garrido, C., Extracting the contribution of independent variables in neural network models: A new approach to handle instability. Neural Computing and Applications, March (2014).
  • Didier, C., Optimización de la producción de eritropoyetina humana recombinante (rhEPO) mediante el cultivo de células animales. Tesis de maestría en Bioquímica, Universidad Nacional del Litoral, Facultad de Bioquímica y Ciencias Biológicas (2008). Available at: <http://bibliotecavirtual.unl. edu.ar:8180/tesis/bitstream/1/173/1/tesis1.pdf> (In Spanish).
    » http://bibliotecavirtual.unl.edu.ar:8180/tesis/bitstream/1/173/1/tesis1.pdf
  • Didier, C., Forno, G., Etcheverrigaray, M., Kratje, R. and Goicoechea, H., Novel chemometric strategy based on the application of artificial neural networks to crossed mixture design for the improvement of recombinant protein production in continuous culture. Anal Chim. Acta, 650, 167-74 (2009).
  • Engell, S. and Toumi, A., Optimisation and control of chromatography. Computers and Chemical Engineering, 29 1243-1252 (2005).
  • Forno, G., Didier, C., Etcheverrigaray, M., Goicoechea, H. and Kratje, R., Crossed mixture design and artificial neural networks: An efficient approach to cell culture medium optimization. Proceedings of the 21st Annual Meeting of the European Society for Animal Cell Technology (ESACT), Dublin, Ireland, June 7-10 (2009).
  • Gao, W. and Engell, S., Neural network-based identification of nonlinear adsorption isotherms. 7th International Symposium on Dynamics and Control of Process Systems DYCOPS 7, Cambridge, USA, July 5-7 (2004). Available at: <http://www.nt.ntnu.no/users/skoge/prost/proceedings/dycops04/pdffiles/ papers/132.pdf>
    » http://www.nt.ntnu.no/users/skoge/prost/proceedings/dycops04/pdffiles/papers/132.pdf
  • Glassey, J., Gernaey, K. V., Clemens, C., Schulz, T. W., Oliveira, R. Striedner, G. and Mandenius C. F., Process analytical technology (PAT) for biopharmaceuticals, Biotechnol. J., 6, 369-377 (2011).
  • Himmelblau, D. M., Applications of artificial neural networks in chemical engineering. Korean J. Chem. Eng., 17(4), 373-392 (2000).
  • Hoeksema, F., Selection markers and expression enhancing elements: Novel strategies to create mammalian production cell lines. Faculty of Science. University of Amsterdam. Thesis Dissertation (2011). Available at: <http://dare.uva.nl/document/482571>
    » http://dare.uva.nl/document/482571
  • Khaouane, L., Benkortbi, O., Hanini, S. and Si-Moussa, C., Modeling of an industrial process of pleuromutilin fermentation using feed-forward neural networks. Braz. J. Chem. Eng., 30(1), 1005-116 (2013).
  • Korany, M. A., Mahgoub, H., Fahmy, O. T. and Maher, H. M., Application of artificial neural networks for response surface modeling in HPLC method development. Journal of Advanced Research, 3, 53-63 (2012).
  • Loukas, Y. L., Artificial neural networks in liquid chromatography: Efficient and improved quantitative structure-retention relationship models. Journal of Chromatography, A, 904, 119-129 (2000). Available at: <http://144.206.159.178/FT/553/30150/ 521375.pdf>
    » http://144.206.159.178/FT/553/30150/521375.pdf
  • Merten, O. W., Introduction to animal cell culture technology-past, present and future. Cytotechnology, 50, 1-7 (2006).
  • Novotna, K., Havlis, J. and Havel, J., Optimisation of high performance liquid chromatography separation of neuroprotective Peptides: Fractional experimental designs combined with artificial neural networks. J. Chromatogr., A, 1096(1-2), 50-7 (2005).
  • Olden, J. D. and Jackson, D. A., Fish-habitat relationships in lakes: Gaining predictive and explanatory insight by using artificial neural networks. Transactions of the American Fisheries Society 130, 878-897 (2001).
  • Olden, J. D. and Jackson, D. A., Illuminating the "black box": a randomization approach for understanding variable contributions in artificial neural networks. Ecological Modelling, 154, 135-150 (2002).
  • Olden, J. D., Joy, M. K. and Death, R. G., An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data. Ecological Modelling, 178, 389-397 (2004).
  • Paliwal, M. and Kumar, U. A., Assessing the contribution of variables in feed forward neural network. Appl Soft Comput., 11, 3690-3696 (2011).
  • Papadokonstantakis, S., Lygeros, A. and Jacobsson, S. V., Comparison of recent methods for inference of variable influence in neural networks. Neural Networks, 19, 500-513 (2006).
  • Satlawa, T., Grabska-Chrzastowska, J. and Korohoda, P., Application of multidimensional data analysis to chromatography. Image Processing & Communication, 18(2-3), 101-108 (2013).
  • Silva, R. G., Cruz, A. J. G., Hokka, C. O., Giordano, R. L. C. and Giordano, R. C., A hybrid feedforward neural network model for the cephalosporin C production process. Braz. J. Chem. Eng., 17, 587-598 (2000).
  • Si-Moussa, C., Hanini, S., Derriche, R., Bouhedda, M. and Bouzidi, A., Prediciton of high-pressure vapor liquid equilibrium of six binary systems, carbon dioxide with six esters, using an artificial neural network model. Braz. J. Chem. Eng., 25, 183-199 (2008).
  • Sommerfeld, S. and Strube, J., Challenges in biotechnology production - generic processes and process optimization for monoclonal antibodies. Chem. Eng. Process, 44, 1123-37 (2005).
  • Surabattula, R., Sambasiva, K. R. S. and Polavarapu, R., An optimized process for expression, scale-up and purification of recombinant erythropoietin produced in Chinese hamster ovary cell culture. Research in Biotechnology, 2(3), 58-74 (2011).
  • Teixeira, A. P., Oliveira, R., Alves, P. M. and Car-rondo, M. J. T., Advances in on-line monitoring and control of mammalian cell cultures: Supporting the PAT initiative. Biotechnology Advances, 27, 726-732 (2009).
  • Tran, A. T. K., Hyne, R. V., Pablo, F., Day, W. R. and Doble, P., Optimisation of the separation of herbicides by linear gradient high performance liquid chromatography utilising artificial neural networks. Talanta, 71(3), 1268-75 (2007).
  • Troup, G. M. and Georgakis, C., Process systems engineering tools in the pharmaceutical industry. Computers & Chemical Engineering, 51, 157-171 (2013).
  • Walsh, G., Biopharmaceutical benchmarks 2006. Nat. Biotechnol., 24,769-76 (2006).
  • Watts, M. J. and Worner, S. P., Using artificial neural networks to determine the relative contribution of abiotic factors influencing the establishment of insect pestpecies. Ecological Informatics, 3, 64-74 (2008).
  • Webb, R., Doble, P. and Dawson, M., Optimisation of HPLC gradient separations using Artificial Neural Networks (ANNs): Application to benzodiazepines in post-mortem samples. J. Chromatogr., B, 877(7), 615-20 (2009).

Publication Dates

  • Publication in this collection
    Sept 2015

History

  • Received
    26 May 2014
  • Reviewed
    14 Oct 2014
  • Accepted
    02 Dec 2014
Brazilian Society of Chemical Engineering Rua Líbero Badaró, 152 , 11. and., 01008-903 São Paulo SP Brazil, Tel.: +55 11 3107-8747, Fax.: +55 11 3104-4649, Fax: +55 11 3104-4649 - São Paulo - SP - Brazil
E-mail: rgiudici@usp.br