Introduction

Geotechnical engineers facing complicated multivariate problems, which include several interacting factors. Generally, the relationship among these factors is not exactly known. The extraction of information to develop a relationship between these factors required advanced techniques for modeling beside a human experience. Braced excavation is popular in construction such as high multistoried buildings with basements, underground structures for transportation stations, and parking.

A safe excavation can be achieved using a satisfactorily braced wall; the braced wall also reduces deformations in the surrounding ground due to excessive wall deflections, which are most important to avoid costly damages to neighboring structures and buildings. In deep excavation projects, observational methods are involved to ensure safe construction, as well as used a finite element to forecast deflections of the wall in deep excavations [1, 15, 16].

Some design charts were established for predicting the maximum lateral deflection of the wall based on depth of excavation, support system, and wall length [7, 9] see Fig. 1.

Fig. 1
figure 1

design chart obtained by Hashash and Whittle [7]

Artificial neural networks (ANNs) form a category of frameworks, which are mimicked neural networks in living creatures. An important merit of ANNs is learning, learning models, (ANNs) can deal with incomplete, qualitative, and uncertain data effectively. Therefore, for modeling sophisticated problems in which the equations that govern the problems are difficult to set, ANN is a highly promising tool. ANNs can be performed in mapping, transitory and optimization problems, besides modeling dynamic processes [3]. Generally, the number of input parameters used should be small in artificial neural networks to predict the maximum lateral deflection of the wall in clays with soft to medium consistency. The database used to develop the ANN is generated from presumptive cases employing the method of finite element (FEM) [11] and case histories of deep excavations for training, testing, and verification [8]. The objective of this work is to employ (ANNs) to build a model to forecast maximum lateral wall deflection and its position in homogenous clays.

Methodology

Artificial neural networks

Artificial neural networks (ANNs) are simulated the nerve cells in the brain, they are considered as computational devices. Despite the complexity of the brain that cannot be approached by ANNs, there are two kinds of similarities. First, the computational units of networks are simple and interconnected to a high degree. The second one is that the connections among neurons set the function and feature of the network [5].

By adjusting weights among neurons, ANNs may be trained to perform a selected function. Neural networks are trained to perform complicated functions in extraordinary fields of application such as identification, pattern recognition, speech, vision, classification, and control systems.

Various tools have been used by engineers to perform informal modeling (mapping from cause to effect for estimation and prediction) and inverse mapping (mapping from effects to causes) which include knowledge-based systems, optimization, regression, statistics, probability, and others. Thus, ANN can be considered as another tool for engineers to perform either the modeling or inverse mapping [10]. Neural networks are applied in different fields of geotechnical engineering including parameter assessment, underground openings, foundations, site investigation, liquefaction, retaining structures, slopes, and ground movement [17].

Back-propagations

One of the neural networks is back-propagation which is vastly used in solving problems that need modeling nonlinear relationships. By modifying the weights so the error among the actual target and the desired target is minimized, as a result, the simulation of the networks will produce the output units, and the forecasting of the neural networks to the outputs is highly more correct.

When the internal parameters of the network have been corrected, the mechanism of correction begins with the output layers then the error back-propagates backward through each internal (hidden) layer. Hence, for this kind of network, the backpropagation is used [2].

Levenberg Marquardt algorithm.

The algorithm is based on the newton method variation in which the functions which are the sum of squares of other nonlinear functions are minimized, it is appropriate when MSE (Mean Square Error) is the index of performance.

The basic step of the Levenberg–Marquardt algorithm is:

$${X}_{K+1}={X}_{K}-{[{J}^{T}J+\mu I]}^{-1}{J}^{T}e$$
(1)

where (e) is a vector of network errors, (J) is the Jacobian matrix, (I) is a unit matrix, and (μ) is a positive scalar. The algorithm becomes gradient descent with a small step size when μ is large. For some input/output patterns, the Levenberg–Marquardt algorithm is very fast and effective compared to the standard gradient algorithms. Thus, in this study, the training algorithm used is the Levenberg–Marquardt algorithm with the mean square of errors as a performance function.

Prediction of maximum wall deflection and it’s position by ANN

Development of ANN Models

The database includes a total of (169) cases recorded from site observations and finite element analysis for homogeneous clayey soil types. The six parameters used as the ANN model inputs have the most considerable effect on the deep excavations in homogeneous clayey soil. Parameters include: Excavation depth (H) m, Ratio of Length of wall/width of excavation (L/B), ratio of EI of the wall / EI of the brace, the vertical distance of bracing (hs) m, undrained shear strength of soil (Su, kN/m2), and the earth pressure coefficient of at rest (Ko). Table 1 shows the dataset used and its sources.

Table 1 Data used in building the neural networks

In this work, a computer program MATLAB (2015) Neural Network Toolbox is used for modeling the neural network. A summary of the basic characteristics of the ANN model utilized in this study is as follows: Type of network: Feed-forward backpropagation, Algorithm of training: (TRAINLM) Levenberg–Marquardt algorithm, the function of Learning: (LEARNGDM) Gradient descent with momentum, the function of performance: (MSE) Mean square error and the coefficient of correlation (R), the first layer transfer function: Sigmoid, the second transfer function: tangent-sigmoid.

The functions of the Sigmoid (logsig) and the hyperbolic tangent-sigmoid (tansig) can be expressed as in Eqs. 2 and 3 respectively:

$$logsig\left(n\right)=1/(1+\mathrm{exp}\left(-n\right))$$
(2)
$$Tansig\left(n\right)=[2/(1+\mathrm{exp}\left(-2n\right))]-1$$
(3)

where: n is any value between (− ∞) and (+ ∞). Figure 2 shows the details of (ANN).

Fig. 2
figure 2

Flow chart algorithm and structure of ANN for predicting max. Wall deflection

Data division and pre-processing

The data set was randomly divided into three sets through the nntools box using the default percentages. The percentage of the training, testing, and validation are, 70%, 15% and 15% respectively. Before building the neural network, the data Pre-processing is essential to put them into proportional form, which is carried out by scaling (normalizing) them so that they fall into a standard range − 1 to 1. In the output layer, the variables have to be normalized to be proportional to the transfer function limits that used in the output layer [5].

Scaling the data is important to assure that during the training, all the variables will receive equal attention. The scaled value \({X}_{n}\) is calculated as follows:

$${X}_{n}=\frac{2\times (X-{X}_{min})}{({X}_{max}-{X}_{min})}-1$$
(4)

Results and discussion

Two models were built one for maximum wall deflection and the other to the position of the maximum deflection, each model has six nodes that represent the inputs used and one node for the output required.

ANNs with 1 to 12 hidden layer nodes are trained, to determine the optimum network geometry.

Training of data

Table 2 summarizes statistical parameters of variables used to construct and validate the network and their histograms are shown in Fig. 3. The retaining structures are formed of either stiff or flexible walls with Excavation depth up to 20 m and maximum width up to 80 m was considered.

Table 2 Statistics parameters of Input and Output data used in (ANN)
Fig. 3
figure 3

Histograms of the data used

Figures 4 and 5 indicate the variation of the average of Mean Squared Error (MSE) with the number of hidden neurons of the networks for the maximum deflection and its position respectively. The difference in (MSE) is relatively small after hidden neuron no. 4 and hidden neuron no. 3 for the maximum deflection and its position respectively. Figure 4 shows that the networks for max. deflection and its position with 4 and 3 hidden layer node respectively. It is considered optimal, as its prediction error (MSE) is not far from the network with 12 hidden layer nodes coupled with smaller number of connection weights and with high coefficient of correlation as shown in Fig. 5

Fig. 4
figure 4

Variation of Mean squared error with different hidden layer nodes

Fig. 5
figure 5

Variation of Coefficient of correlation with different hidden nodes

Figures 6 and 7 show a comparison between the measured and predicted value of the maximum deflection and its position respectively. It shows a good agreement between the measured and predicted values.

Fig. 6
figure 6

Comparison between predicted and Observed max. deflection δH for All data set, Training data Set, Testing Data Set, and Validation Data Set

Fig. 7
figure 7

Comparison between Predicted and Observed position of max. deflection δH for All data set, Training data Set, Testing Data Set, and Validation Data Set

Sensitivity analysis of the ANN model inputs

Garson [4] proposed an innovative and simple technique that is used to explain the relative significance of the input variables by examining the connection weights of the trained network. For a network with 4 hidden nodes, the technique involves a process of partitioning the hidden output connection weights into components associated with each input node.

The results indicate that the ratio of the EI wall/EI of brace has the most considerable effect on the predicted maximum deflection followed by Ko, L/B, H, Su, and hs respectively. The results are presented in Fig. 8a. Similarly, the results indicate that Su and Ko have the most significant effect on the predicted position of maximum deflection followed by L/B, H, hs, and EI/Ks, the results are presented in Fig. 8b.

Fig. 8
figure 8

The relative importance of input variable for the value of a max. deflection, b Position of max. deflection

Parametric study

Parametric studies were carried out to estimate whether the model has convenient trends, to confirm the validation of the trained network and the generalization ability of the model. the first five input variables (i.e. (L/B), EI of wall/stiffness of braced, Vertical distance of bracing, Undrained shear strength of soil (Su), coefficient of Earth pressure at rest (Ko)), except one, are fixed to their mean values used for training. The increasing values of artificial ranging (10–20%) of the total range between the minimum and maximum values. Figure 9a–d reveal that the general trend of the maximum wall deflection is to increase with an increase in excavation depth, increasing (L/B), (EI wall/EI of bracing) ratios and vertical distance of bracing respectively as expected. Figure 9e shows that the changes in max. wall deflections were less significant for Su greater than 75 MPA, and increase with the increasing coefficient of lateral earth pressure as shown in Fig. 9f.

Fig. 9
figure 9

Result of a parametric study of (ANN) model for the max. excavation under different, a excavation depth, (H), b length to width ratio, (L/B), c ratio of EI wall/EI of bracing, d vertical distance of bracing between spacing, e undrained shear strength, Su, f coefficient of lateral earth pressure at rest Ko

The general trend for the position of max. Wall deflection was revealed in Fig. 10a–f, it increases with the increase in excavation depth, while decreases with (L/B) ratios, (EI/stiffness of bracing) ratios, and the vertical distance of bracing (hs), on the other hand the position of max. deflection increases with the increase in shear strength of the soil (Su) up to 60 kN/m2 then decreases, and the same trend observed with the coefficient of lateral earth pressure (Ko). Consequently, the trained neural network model was approved to give reasonable and appropriate relationships between the diverse input variables. The developed ANN has shown satisfactory performance.

Fig. 10
figure 10

Result of a parametric study of (ANN) model for the position of max. excavation under different, a excavation depth, (H), b length to width ratio, (L/B), c ratio of EI wall/EI of bracing, d vertical distance of bracing between spacing, e undrained shear strength, Su, f coefficient of lateral earth pressure at rest Ko

A set of design charts are developed. It`s carried out by using synthetic input values within the ranges of data shown in Table 2. Figure 11 is an illustrative sample of the design charts obtained for L/B = 1, Ko = 0.53, hs = 2.5 m, the increasing trend of varying with excavation depth (H) for all cases, maximum deflection increases as the excavation depth increases and as expected.

Fig. 11
figure 11

Maximum wall deflections predicted by the ANN versus variation excavation depth, for L/B = 1, Ko = 0.53, hs = 2.5 m

On the other hand, the robustness of the (ANN) for the position of max. deflection can be determined by examining with input values within the ranges of data shown in Table 2, the sample of design chart obtained for L/B = 1, Ko = 0.53, hs = 2.5 m, EI wall/EI brace = 7, It was found that the trends of the design charts are in good agreement with what one would expect as shown in Fig. 12

Fig. 12
figure 12

Position of maximum wall deflections predicted by the ANN versus variation excavation depth, for L/B = 1, Ko = 0.53, hs = 2.5 m, EI wall/EI brace = 7

Summary and conclusions

In this study, a neural network-based procedure to predict maximum wall excavation during excavation has been proposed. Based on a large number of case studies generated by observation techniques and FEM, the ANN model is developed to predict the max. wall deflection induced by excavation. Six input variables are adopted to build the ANN model; they are excavation depth, system stiffness, the vertical distance between bracing, excavation width/wall length ratio, shear strength, and the lateral earth pressure coefficient. The effect of using a various number of a hidden layer on the ANN model results was investigated. The relative importance of the factors that affect maximum lateral deflection and its position was achieved by sensitivity analysis. The typical behavior of braced excavation in soft to medium clay is confirmed by the results of the extensive assessment of the ANN model developed.

The results showed that the sensitivity analysis indicates that the system stiffness has the most significant effect on the maximum wall deflection. The soil properties have the most significant effects on the position, alternately the system stiffness and the vertical distance between bracing were found to have minor effects compared to the soil properties depending on the position of max. wall deflection.

The ANN developed has shown to be able to predict the wall deflections satisfactorily as showed by the results of the validation. Degree of accuracy was acceptable (R = 0.922, MSE = 0.4%, and the number of optimum hidden layer nodes was found to be four) for predicted maximum wall deflection, and (R = 0.978, MSE = 0.22%, and the number hidden layer nodes was found to be three hidden layer nodes) for predicted position of maximum wall deflection. The developed ANN trend to improved results based on the mean of case histories.