Abstract

Using high-precision sensors to monitor and predict the deformation trend of supertall buildings is a hot research topic for a long time. And in terms of deformation trend prediction, the main way to realized deformation trend prediction is the deep learning algorithm, but the accuracy of prediction result needs to be improved. To solve the problem described above, firstly, based on the conditional deep belief network (CDBN) model, the levenberg-marquardt (LM) was used to optimize the CDBN model; the LM-CDBN model has been constructed. Then taking CITIC tower, the tallest building in Beijing as the research object, the real-time monitoring data of the shape acceleration array (SAA) as an example, we used LM-CDBN model to analyse and predict the building deformation. Finally, to verify the accuracy and robustness of LM-CDBN model, the prediction results of the LM-CDBN model are compared with the prediction results of the CDBN model, the extreme learning machine (ELM) model, and the unscented Kalman filter-support vector regression (UKF-SVR) model, and we evaluated the result from three aspects: training error, fitness, and stability of prediction results. The results show that the LM-CDBN model has higher precision and fitting degree in the prediction of deformation trend of supertall buildings. And the MRE, MAE, and RMSE of the LM-CDBN model prediction results are only 0.0060, 0.0023mm, and 0.0031mm, and the prediction result was more in line with the actual deformation trend.

1. Introduction

Affected by its own structural characteristics and external changes, supertall buildings will continue to produce complex deformations such as differential settlement, compression, inclination, deflection, and vibration during the construction process. It is necessary to implement high-precision deformation monitoring and prediction to ensure its construction safety. With the continuous advancement of sensor technology, a lot of progress has been made in obtaining deformation data by installing high-precision sensors on supertall buildings; for example, Su, J.Z. et al. designed a supertall building precision structural performance monitoring system consisting of more than 400 sensors, applied to the structural health monitoring of Shanghai Tower [1]; Chen, W.H. et al. designed a health monitoring system which consists of anemometer, strain gauge, GPS, and other sensors and put it on the Guangzhou TV Tower during the typhoon, aiming at monitoring building health [2]; Ni, Y.Q. et al. used wireless sensors to monitor environmental vibration of the Guangzhou TV Tower during construction [3]; Gu, M. et al. proposed a new method for optimal sensor placement based on the simplified multidegree-of-freedom system for calculating the weak axis modal matrix based on the equivalent stiffness parameter identification method and using the numerical calculation verified the feasibility of the method [4]; Yi, T.H. et al. proposed a modified monkey algorithm for sensor array optimization design of structural health monitoring systems and proved its effectiveness by implementing calculation cases of super high-rise buildings [5].

Using deformation data to predict the deformation of supertall buildings is one of the current research hotspots. Deformation of supertall buildings has strong temporal and spatial linkage and obvious time-varying characteristics. On the concept of time, the monitoring data has a strong dependence on the concept of time slip; and during the different deformation periods, the structural integrity deformation are both random and time-variation, as well as having continuity and periodicity in time; it requires the model to have higher time-varying information extraction capabilities. In the spatial range, the deformation of supertall buildings has a close spatial correlation with the complexity of the environmental factors, spatial characteristics, and change trends. The different changes of environmental factors in different time periods and different fields have different effects on the deformation of supertall buildings. How to deeply mine and extract the feature attributes of environmental factors has always been one of the research difficulties.

Recently, many advances have been made in the prediction of deformation of buildings using neural network technology. In aspect of shallow neural networks [6, 7] such as backpropagation (BP), extreme learning machine (ELM), and support vector machine (SVM), Kang F et al. used the ELM to predict the deformation of a dam and used the prediction accuracy and prediction stability as evaluation indicators to evaluate the prediction results and obtained good experimental results [8]; Xin, J. et al. used the ARIMA-GARCH model to predict the deformation of the bridge and achieved good prediction results [9]; Wang, X. et al. used a multiple population genetic algorithm to improve the BP neural network, optimized the network’s weight and parameter selection mechanism, and applied it to the deformation prediction of dams [10]; Cao, Y.B. et al. used a new method which integrated the combining genetic algorithm with the artificial neural network to predict the deformation of landslide in some reservoir area of the Three Gorges [11]; Zhang, H. et al. proposed a multiscale deformation prediction model which integrated the genetic algorithm support vector machine (GA-SVM) with the empirical mode decomposition (EMD) and used it to predict the deformation of the dam. By multiscale dam deformation prediction model, BP neural network prediction results were compared with the predicted results demonstrating high accuracy [12].

The deep learning algorithm [1315] adopts interlayer network training and batch sample grading training methods to solve problems such as overfitting of data analysis and local minima in shallow networks, which improves the training speed of the network. Deep belief networks (DBN) model [1618] is one of the classic models for deep learning; it has the advantages of fast network training, easy parameter selection, high-efficiency feature extraction capability, and ease of regression analysis. Conditional deep belief network (CDBN) model [1923] is a variant of DBN model; it inherits many excellent features of the DBN model and adopts the normal distribution to process the resampling of the deformation data of the supertall building. But CDBN model uses the gradient descent method to search the optimal solution during the process of determining weight, and that causes the difference between the predicted value and the actual value in the deformation prediction to be obviously different. The prediction oscillation is more obvious and has a greater impact on the prediction accuracy and stability. So the LM algorithm [24, 25] was adopted to replace the gradient descent algorithm to optimize the weighting mechanism of the CDBN model, the information extraction stability of the model and the generalization ability of the nonlinear problem of the time-varying system are improved, the speed of the model convergence is accelerated, and the prediction accuracy and stability of the model are improved. Considering the complexity of the deformation of supertall buildings, the complexity of deformation factors, the deformation characteristics, and the advantages and disadvantages of the model, the LM-CDBN model was applied to the deformation prediction of the CITIC tower, and the prediction accuracy and stability of the proposed model are verified by model comparison experiments and predictive analysis.

2. Brief of CDBN Model

CDBN model is a variant of the traditional DBN model. It inherits many excellent features of the DBN model and resamples the distortion data of the supertall building through normal distribution. The deformation data of supertall building has a high degree of four-dimensional spatial characteristics, of which the temporal and spatial characteristics are particularly prominent. In the process of extracting deformation information from a supertall building, the CDBN model uses an automatic regression mechanism to dynamically mine, extract, and feedback the temporal and spatial characteristics of the deformation data. The model can be used to excavate the dynamic change characteristics of deformation trend from historical data. These features complement and guide the in-depth digging of the current deformation trend. In addition, the automatic regression (AR) adjustment capability also provides convenient conditions for analysis of supertall deformation trend, deformation extrapolation, and trend fitting.

As shown in Figure 1, suppose that there are visible neurons obeying a Gaussian distribution in space and observable Bernoulli distributions of hidden neurons , and is the No. hidden node and is the No. visible node; is the weight matrix between the historical data and the target data; is the weight matrix between the historical data and the hidden node. The system energy function of the CDBN network structure was shown in where is the No. i visible node; is the No. j hidden node; is the weight matrix between and ; is the threshold of ; is the threshold of ; is the noise of ; in order to facilitate model calculation and expansion, normally sets to 1.

The energy function of CDBN is obtained by (1), and then the conditional probability distribution of the model is derived using where is conditional probability distribution of ; is conditional probability distribution of ; represents an activation function; N () represents a Gaussian probability distribution.

The CDBN network model uses the gradient descent algorithm to search for optimal solution. The gradient descent algorithm is the most commonly used optimization algorithm for neural network model training. For the function , is the gradient of the function. Its iteration equation is shown in where satisfies the minimum value of ; represents the gradient in the descending direction; and represents the search step in the gradient direction, also meaning learning rate in deep learning.

Finally, training and learning is performed by comparing the divergence sampling method [2628] to update and set the model parameters, as shown in where is the update value of ; is the weight matrix between at time t-q and at current time t; is the weight matrix between at time t-q and at current time t; is the status value of at time t-q; is the Status value of at time t-q; is the expected output for original data; is the expected output data calculated by model; is the initial expectation; is the stable expectation; is learning rate.

3. Deformation Prediction Approach

Deformation of supertall buildings is more complex, and the characteristics of spatiotemporal linkage are more obvious. Therefore, the deformation prediction of it needs higher prediction accuracy and prediction stability. The CDBN model has a high ability to extract deformation tendency. But due to the use of a gradient descent algorithm to find the optimal solution during the process of weight determination, the predicted output and the actual deformation output are significantly different, and the predicted oscillation is more obvious. In order to solve this problem, we used the L-M algorithm to model the weight, and the Gauss-Newton algorithm was used to update the weight and threshold of the model to speed up the convergence of the algorithm. The combination of powerful deformation information extraction capability and stable nonlinear optimization ability can help improve the prediction accuracy and stability of supertall buildings.

3.1. LM-CDBN Model Weighting Principle

Assume is the matrix vector composed of all the weights and thresholds after the LM-CDBN model iteration k times; then the matrix vector is composed of the weights and thresholds updated after k+1 iteration is , shown as the

The mean squared error (MSE) of the model training is defined as the minimum reference standard.where represents the sample dimension; represents the mathematical expectation function; and represent the No. i actual prediction value and model value of the prediction label, respectively. Solve the second derivative’s extreme value according to the principle of least squares and correct by Gauss-Newton algorithm; then iswhere is a proportional coefficient; e is the network error vector; is a unit matrix; is a Jacobi matrix, shown as

In the initial stage of model training, the value of μ is large, the model will seek the minimum value following the gradient descent method, and each iteration will make μ decrease continuously. Then the Gauss-Newton algorithm is used to find the expected value of the target. The second derivative of the algorithm is used. The principle of seeking extreme improves the speed of model training and the ability of nonlinear generalization.

3.2. Flow of Algorithm

The network training and learning process of the LM-CDBN model is constructed as follows and the flow of algorithm was shown in Figure 2.

Step 1 (data preparation phase). The original data is preprocessed (denoised, filtered, normalized, and batched) and the topology of the network is determined.

Step 2. Enter the first batch of data and prepare for network training.

Step 3. Use (9) to update the status of the hidden node of the first layer network.where N(0,1) represents a Gaussian distribution; is the weight matrix connecting the visible layer and the hidden layer; is a state value of visible node .

Step 4. Use (10) to update the status value of the visible node .

Step 5. According to the visible layer node state value obtained in Step 4, use (11) to update the state of the hidden layer node again.

Step 6. Dynamically update the offsets of the visible and implicit nodes thresholds.where represents the dynamic offset of the implicit node j; represents the dynamic offset of the visible node ; is the state value of the visible node on time t-q; is the weighted value of directed connections between in time t-q and on current time ; is the weighted value of directed connections between in time t-q and on current time t.

Step 7. Use the state of the hidden layer node of the first layer network as the initial input of the layer 2 network. Repeat Steps 36 to complete the pretraining of the layer 2 network until the network is completed layer by layer.

Step 8. Establish a matrix vector of the network weights and thresholds. Use (5) and (7) to update the network weights and thresholds.

Step 9. Enter the second batch of data and go to Step 3 to complete the next round of training and so on until all data processing is completed.

Step 10. Use the function softmax to output the predicted value, denormalize data, and evaluate the network prediction results.

3.3. Evaluation Mechanism

To objectively evaluate the prediction results, the prediction model needs to be evaluated based on full consideration of the prediction error and the accuracy of the prediction value fitting degree. The evaluation mechanism is mainly composed of three aspects: model training error evaluation, fitting degree evaluation, and prediction accuracy.

Taking the three aspects of root mean square error (RMSE), mean absolute error (MAE), mean relative error (MRE) as evaluation indices, as shown in (13). The smaller number of three indicators means the stronger the ability to extract model information, the higher the prediction accuracy.where and represent the No. actual prediction value and model value of the prediction label, respectively; N represents the input sample dimension.

R represents the degree of fit between the actual observed value and the predicted output value. If the value of R is very large, it means that the predicted value and the actual observed value are compliant; else it means that the correlation between the two is poor. The equation for R is shown in

Put the prediction results of LM-CDBN model to compare with CDBN model, extreme learning machine (ELM), and UKF-SVR model to evaluate the training error and predictive performance of the LM-CDBN model.

4. Case Study

4.1. Description of the CITIC Tower

The supertall building CITIC tower is in the core area of CBD, Chaoyang District, Beijing, China. The external shape is the overall shape of the “bottle” of Chinese ancient wine containers (Figure 3). The CITIC tower has a total construction area of 350,000 square meters, with a total height of 528 meters, 108 floors above the ground, 7 basement floors, and 5 underground floors in the tower area. The base of the CITIC tower has a square base. From the base to the upper part of the base, its plane size is gradually tightened inward. From the narrowest part of the waistline to the top part, the plane size gradually enlarges. CITIC tower adopts the core-tube megaframe outrigger conversion truss structure, which has features such as high altitude, structural heterogeneity, and large changes in the curvature of the construction curve. The construction uses BIM technology to preassemble the structure and reduce rework and errors in construction.

4.2. Monitoring Sensor Layout

With the continuous development of global navigation satellite system (GNSS) [29, 30] technology, deformation data such as settlement, vertical compression, and horizontal displacement can be obtained with real-time kinematic (RTK) technology performing dynamic deformation observation of supertall buildings. In the study, two GNSS receivers were placed as reference stations outside the walls on the north and east sides of the building, and eight GNSS receivers were installed as monitoring stations on the core barrel and the frame. The GNSS antenna is fixed on the core barrel and the four corners of the frame by special brackets. Each GNSS host is connected to the cable through the cable and transmits the monitoring data to the data center through the data transfer unit (DTU). The specific deployment of base stations and monitoring stations is shown in Figures 4 and 5.

The shape acceleration array (SAA) system [31, 32] is a high-precision sensor based on MEMS accelerometer and consists of several rigid test sections connected by flexible joints. Each test section has a length of 200~500mm and has an internal triaxial accelerometer and thermometer. There is a special section between every 8 test sections, equipped with a microprocessor and a digital temperature sensor. For the SAA laid in the vertical direction, there is a fixed section at the front end of each test sensor for connecting a wired or wireless signal transmission device, and there is a fixed device at the end for fixing the entire SAA, as shown in Figure 6.

Assuming is the length of the test section and is the angle between two test sections calculated by the accelerometer, the deformation value in the direction of the standard section can be obtained. Adding to each section is the total amount of deformation, shown as

According to the CITIC tower’s architectural characteristics, two SAAs are placed in PVC sleeves and embedded along the vertical axis of the outside of the core tube. The ends are fixed on the floor of the structure and the front end is connected to a wireless serial modem (WSM) to achieve long-distance communication.

4.3. Data Processing

As shown in Table 1, from the time of October 10, 2017, the displacement monitoring data of 70 consecutive phases acquired at a sampling frequency of one hour are taken as samples, of which the first 55 training sample sets are used as a priori samples for the network pretraining, and the last 15 samples for the deformation analysis and prediction.

During the construction phase, the core barrel is subject to environmental cross-wind loads, temperature differences between the inside and outside of the shell structure, and changes in light intensity, which are easily subject to dynamic deformation. The training data set consists of core barrel displacement data, temperature, wind speed, light intensity, and time series. During the training of the model, the influence factors such as temperature, wind speed, and light intensity are taken as the characteristic values of the network input layer, and the displacement data of the core cylinder is used as the output feature vector.

In the process of sample priming, gross errors and noise elimination are first performed. Select the first 55 periods of data as a priori samples for network pretraining, and the last 15 samples for deformation analysis and prediction.

Due to the large range of deformation fluctuations and the large magnitude difference between the input factors of each group, the logarithmic interpolation algorithm (as shown in (16) was used to normalize the displacement deformation value .where and represent the maximum and minimum values of the predicted output deformation; and represent the normalized and original deformation information.

4.4. Parameter Determination

In the network topology determination process, the precise determination of network depth, the number of hidden layer nodes, and various training parameters are the key to accurate prediction.

In the process of model building, the network depth means the number of network layers of the model. In the process of layer-by-layer network training, reconstruction error (RE) is generated, which is an important indicator to measure network stability. In the case of a certain input layer data pattern, the network reconstruction error is calculated to effectively determine the network depth. As shown in Figure 7, the model gradually increases the network depth during the application process.

As shown in Figure 7(a), when the network depth is 1, the reconstruction error oscillates violently and tends to fall rapidly. The error range is mainly concentrated on 15~28mm, which reflects the lack of network depth. In Figure 7(b), in the early period, the reconstruction error changed dramatically, and it begins to show a slowing trend in the later period. The error range is mainly concentrated on 0.5~1.7mm, and the overall training effect of the network is the best. In Figures 7(c) and 7(d), the network reconstruction error is concentrated on 0.8~3.4mm and 0.7~3.9mm, and the network shows irregular fluctuations and the reconstruction error value has accumulated. Therefore, the network depth is determined to be 2 layers.

According to (17), the number of hidden layer nodes is determined by a comparison test.where is the empirical constant and the range is ; is the number of input layer nodes; is the number of output layer nodes; l is the number of hidden layer nodes.

In this prediction of deformation for supertall buildings, take the number of input layer nodes m=4, the number of output layer nodes n=1, and the range of hidden layer nodes is . As shown in Figure 8 and Table 2, the RMSE, MAE, MRE, and R/% are the evaluation criteria for network training, and the number of optimal hidden layer nodes is obtained through statistical analysis.

When the hidden layer node is 7, the RMSE, MAE, and MRE have the minimum value and the fitting R also has the largest value. Currently, the model has the best deformation prediction ability and nonlinear generalization ability.

4.5. Analysis of Forecast Results

As shown in Table 3, LM-CDBN model, CDBN model, ELM model, and UKF-SVR model were used to predict the 15th period of monitoring data and compared with the deformation value Z. RE and ARE of each group of results were calculated.

Compared with CDBN model, ELM, and improved SVR, LM-CDBN model has higher prediction accuracy and stability, and the model's extrapolation ability is better than other forecast models. The average relative error of the 15 forecast results is 3.12%. At the same time, the relative error of the LM-CDBN model prediction is even and stable. The prediction results of LM-CDBN model are less volatile than UKF-SVR. In addition, compared with the CDBN model, the L-M algorithm is used for optimization and improvement, the generalization ability of the CDBN model is enhanced, and the stability and prediction accuracy of the prediction are improved.

Similarly, as shown in Table 4, through the numerical analysis of several evaluation indicators we can see that the prediction error of LM-CDBN is smaller and the prediction accuracy is much higher than that of shallow neural network. Compared with other models, the deep network has higher feature extraction capabilities and nonlinear regression analysis capabilities.

As shown in Figure 9(a), although gross errors and noise were excluded from the experimental data, due to external influences, the collected data still had large fluctuations, which caused some interference to the actual prediction work. In addition, under the influence of external factors, significant displacement changes have occurred during the continuous monitoring of the overall structure of the Chinese dignity, and the tendency of migration has gradually increased. Compared with other models, the LM-CDBN model has a high degree of fitting ability and deformation extraction ability for the displacement change trend. As shown in Figure 9(b), the LM-CDBN model has better extrapolation capability of deformation and is more in line with the actual law of displacement change.

5. Results

A new deformation prediction approach for supertall building was proposed in the paper. The LM algorithm was used to optimize the weighting method of CDBN model in this approach. Then use this model to predict the deformation of the supertall building CITIC tower and value the perdition results using several different methods. In terms of error, the MAE value of the LM-CDBN model is 0.0023 mm, while the MAE of the CDBN model, the ELM model, and the UKF-SVR model was 0.0141 mm, 010262 mm, and 0.0155 mm. The RMSE of LM-CDBN model was 0.0031 mm, while the RMSE of CDBN model, ELM model, and UKF-SVR model was 0.0212 mm, 0.0385 mm, and 0.0223 mm. In terms of fitness, the fitting performance of the LM-CDBN model increased by 64%, 80%, and 64%, compared with the CDBN model, the ELM model, and the UKF-SVR model. By comparing experiments and data analysis, we can find that the LM-CDBN model has higher prediction accuracy than three other models, and the variation law of the prediction data is more consistent with the actual variation law. Hence, we can conclude that the LM-CDBN model is suitable for the variable prediction of supertall buildings and also has better robustness and deformation prediction ability.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

There is no conflict of interest regarding the publication of this paper.

Acknowledgments

This research was supported mainly by the National Key Research and Development Program of China [no. 2017YFB0503700], the Fundamental Research Funds for Beijing University of Civil Engineering and Architecture [no. FZ02], and Beijing University of Civil Engineering and Architecture Postgraduate Innovation Project [no. PG2018054, no. PG2018062].