Adaptive neural network tracking control for underactuated systems with matched and mismatched disturbances

Liu, Pengcheng; Yu, Hongnian; Cang, Shuang

doi:10.1007/s11071-019-05170-8

Adaptive neural network tracking control for underactuated systems with matched and mismatched disturbances

Original paper
Open access
Published: 08 October 2019

Volume 98, pages 1447–1464, (2019)
Cite this article

Download PDF

You have full access to this open access article

Nonlinear Dynamics Aims and scope Submit manuscript

Adaptive neural network tracking control for underactuated systems with matched and mismatched disturbances

Download PDF

3721 Accesses
94 Citations
Explore all metrics

Abstract

This paper studies neural network-based tracking control of underactuated systems with unknown parameters and with matched and mismatched disturbances. Novel adaptive control schemes are proposed with the utilization of multi-layer neural networks, adaptive control and variable structure strategies to cope with the uncertainties containing approximation errors, unknown base parameters and time-varying matched and mismatched external disturbances. Novel auxiliary control variables are designed to establish the controllability of the non-collocated subset of the underactuated systems. The approximation errors and the matched and mismatched external disturbances are efficiently counteracted by appropriate design of robust compensators. Stability and convergence of the time-varying reference trajectory are shown in the sense of Lyapunov. The parameter updating laws for the designed control schemes are derived using the projection approach to reduce the tracking error as small as desired. Unknown dynamics of the non-collocated subset is approximated through neural networks within a local region. Finally, simulation studies on an underactuated manipulator and an underactuated vibro-driven system are conducted to verify the effectiveness of the proposed control schemes.

Adaptive Neural Network Control for Uncertain Robotic Manipulators with Output Constraint Using Integral-Barrier Lyapunov Functions

Global adaptive tracking control of robot manipulators using neural networks with finite-time learning convergence

Article 20 July 2017

Chenguang Yang, Tao Teng, … Chun-Yi Su

Fixed-time prescribed performance tracking control for manipulators against input saturation

Article 27 May 2023

Yizhuo Sun, Jiyuan Kuang, … Ligang Wu

1 Introduction

Underactuated mechanical systems (UMSs) are rapidly growing research fields that combine control and robotics societies [1,2,3,4,5,6,7,8]. They have extensive applications such as UAVs, underground vehicles, spacecraft, humanoid robots and vibro-driven robots [9,10,11,12]. UMSs have more degrees-of-freedom (DOF) n than independent control inputs m; thus, (n-m) DOF are not directly controllable, which characterize the nature of underactuation. This nature is made possible for the UMSs to undertake complicated tasks with a reduced number of actuators that in turn implies the reduction in weight and energy consumption. Challenge that faced by control of underactuated systems is the existence of underactuation and some undesirable properties such as being in a non-minimum phase and/or possessing an undetermined relative degree, which makes conventional approaches not directly applicable, particularly for the issue of trajectory tracking control. Design of the control schemes for UMSs is intractable because of the internal dynamics and non-holonomic property, and they are not feedback linearizable [13]. Besides, the uncertainties in system model, as well as the matched and mismatched disturbances, make control of UMSs more challenging.

The complexity of control problem related to UMSs can be reduced when the objective is merely to stabilize a subset of the UMSs’ DOF. In the literature, a great number of existing control system designs for UMSs explore the concept of linearization through partial feedback [14,15,16,17,18]. Linear systems can be utilized to capture the underactuated dynamics within a local range; however, global stabilization of the underactuated dynamics is still unavailable under this approach. Other prevailing techniques such as inverse dynamics [19, 20], sliding mode/variable structure [14, 21, 22], energy/passivity-based approaches [17, 23, 24] have been extensively exploited. Furthermore, practical requirements are raised from the current applications, in which the adaptability of UMSs is extremely crucial when facing environments with uncertainties. For instance, microrobotic systems work across vulnerable media in restricted space for minimally invasive sensing and risk intervention in pipeline inspection, endoscopic assistance, underwater exploration, etc. However, an exact dynamic model is intractable to obtain due to the presence of frictions, unknown disturbances, time-varying parameters, etc. As a result, adaptive control schemes for generic UMSs have attracted great attentions. Considering uncertainties and ocean disturbances, a control system using leader–follower formation was studied in [25] for underactuated autonomous surface vehicles. Dynamic surface control technique and neural networks (NNs) were used to construct the control scheme. A hierarchical sliding mode control system with adaptive and fuzzy inclusions was studied for uncertain UMSs in [14], where different layers of sliding surface are constructed to cope with the uncertainties and disturbances and fuzzy models are designed to approximate the nonlinearities.

It is evident that description of dynamic couplings between the actuated and passive subsystems of UMSs is typically highly nonlinear. Therefore, it is plausible to consider the employment of approximation approaches to map the coupling between the torques applied at the actuated subsystem and the resulting accelerations of the passive subsystem, with the intent of achieving control globally. As such, in this paper, nonlinear control approach is investigated by employing multi-layer NNs. NNs have versatile features such as learning capability mapping and parallel processing. An attractive feature of NNs is that their synaptic weights are online updated without any offline learning phases. NNs have the property of robustness; thus, they have been widely applied in various robotic systems to address the stabilization problem [26,27,28]. The issue of tracking control of UMSs based on NNs has attracted extensive attentions. Optimal motion control using NNs and stochastic adaptive concepts was studied towards the Pendubot in [29] and a WIP system in [30]. For UMSs with full-state constraints containing a Moore–Penrose inverse term, an adaptive NN control system was proposed in [31]. In [31], the authors developed two decentralized output feedback control systems based on adaptive NN to tackle with immeasurable states and unknown time delays in UMSs. Towards a wheeled mobile robot that is non-holonomic with unknown parameters and uncertain dynamics, an adaptive tracking control scheme was presented in [32] to tune the kinematic controller gain online and minimize the tracking error in velocity. A bio-inspired tracking control scheme based on NNs was developed in [33] for an underactuated surface vessel with unknown system dynamics. A cart–pendulum system with unknown dynamics was studied in [34], and a trajectory tracking control scheme of the pendulum subsystem based on adaptive NN was designed instead of considering the position of the cart. In [35], an output feedback control system based on NNs was proposed for tracking control of a spherical inverted pendulum. A combined PID and neural network compensation approach was proposed in [36] to control a wheel-driven mobile pendulum system, and the results were experimentally analysed. From the literature, it is noted that relatively few studies have addressed the issue of tracking control for UMSs, particularly when the disturbance exists in the non-collocated subsystem, which is mismatched with the control actions. Also, it is noted that very few reported studies towards this topic have presented rigorous analysis of trajectories of the closed-loop system for UMSs. Therefore, trajectory tracking control for UMSs with uncertainties and disturbances is still an open problem and requires in-depth investigations.

Through the utilization of the unique physical properties of the UMSs, the overall underactuated system breaks down into two subsystems, i.e. a fully actuated subsystem and an unactuated (passive) subsystem. Radial basis neural network (RBFNN) has simple structure and fast convergence rate, and it can overcome the local minimum problem; therefore, it is utilized as a nonlinear function approximator of uncertain dynamics of the unactuated (passive) subsystem of the UMSs. The NN control has the ability of universal approximation, and it has been thoroughly studied on discrete-time system [37,38,39,40] and continuous-time systems [41,42,43,44]. There are very limited studies using NN to approximate the system dynamics of UMSs. In this paper, we develop NN-based adaptive tracking control schemes to cope with the internal uncertain dynamics and external disturbances, and auxiliary control variables are explicitly designed to close the unactuated feedback loops. RBFNN is adopted to approximate the mismatched system uncertainties, and the adaptive control algorithm is constructed to estimate the NNs approximation error and the bounded mismatched disturbance. The combination of NN approximation, variable structure control and adaptive approach makes the constructed new controller more robust, and as such, errors resulting from trajectory tracking, parameter uncertainties, mismatched external disturbances and NN approximation are counteracted. Theoretical background of these methods is presented with rigorous analysis and developed in detail for some examples. The schemes promote the utilization of linear filters in the control input such that the system robustness is improved. Stability of the system dynamics and convergence of the time-varying reference trajectories are demonstrated using Lyapunov analysis. In addition, adaptation laws for the NNs weights of the proposed control systems are derived from the above procedure. The main contributions of this paper are summarized as follows:

1.
Stabilization for fully actuated systems is well established in terms of the time-varying trajectories through adaptive control. However, its application and extension to the UMSs are not straightforward. This paper proposes the adaptive control schemes to encompass the conventional approaches and stabilize the UMSs’ state space through design of auxiliary control variables that contain NN approximator and robust compensator.
2.
The parametric uncertainties and the matched and mismatched disturbances are considered in the design of the adaptive control schemes, which feature a generic model for the studies on underactuated systems. It is noted that the mismatched disturbances have been neglected in most of the existing approaches for the tracking control of UMSs.
3.
Employing the adaptive control approach, combined with variable structure and NNs, exact values of the system base parameters are not required to be known a priori.
4.
Designing robust compensators to counteract the matched and mismatched disturbances, and function approximation error of NNs and nonlinear frictions can reduce the tracking error as small as desired in finite time through selecting appropriate parameters for the controller.

The rest of this paper is organized as follows. Notations, assumptions, system dynamic model for UMSs and preliminaries are presented in Sect. 2. Section 3 gives the main theoretical results concerning the adaptive NN tracking control systems design for a UMSs. Validations of the effectiveness of the proposed approaches are presented in Sect. 4 through simulation studies on an underactuated manipulator and a vibro-driven mobile system. Finally, concluding remarks and perspectives are given in Sect. 5.

2 Preliminaries and problem description

2.1 Notations

Let $\Vert .\Vert $ denote any suitable vector Euclidean norm. Specifically, $\Vert .\Vert _{p}$ represents the p-norm of a given vector. The Frobenius norm of the given matrix $ H=[h_{ij}]\in {\mathcal {R}}^{n\times m}$ is defined as $\Vert H \Vert _{F}^{2}=Tr( H^{T}H )=Tr( HH^{T} )=\sum _{i,j} h_{ij}^{2} $ with Tr(.) denoting the trace operator. The Frobenius norm is associated with the 2-norm in a manner that $\Vert Hx \Vert _{2}\le \Vert H \Vert _{F}\Vert x \Vert _{2}$ with $H\in {\mathcal {R}}^{n\times m}$ and $x\in {\mathcal {R}}^{m}$. The trace operator has the property of ${\mathrm{A}}^{T}B=Tr(AB^{T})$ with $\forall A, B\in {\mathcal {R}}^{n}$. $\lambda _{\min }(.)$ and $\lambda _{\max }(.)$ are, respectively, the minimum and maximum eigenvalue of the given matrix. $I_{n}$ represents the identity matrix of dimension $n\times n$.

2.2 Dynamic model and properties

The dynamics of n-DOF UMSs can be expressed in the generalized coordinates via the Euler–Lagrangian’s approach, given by

$$\begin{aligned}&D\left( q,\alpha \right) {\ddot{q}}+C\left( q,{\dot{q}},\alpha \right) {\dot{q}}+G\left( q,\alpha \right) +F_{v}\left( \alpha \right) {\dot{q}}\nonumber \\&\quad +F_{c}\left( q,{\dot{q}},\alpha \right) +\tau _{d}=B\left( q \right) \tau \end{aligned}$$

(1)

where $q={[q_{1},\,...\, , q_{n}]}^\mathrm{T}\in {\mathcal {R}}^{n}$ describes the vector of generalized configurations, $\alpha \in {\mathcal {R}}^{p}$ is the vector of unknown parameters of the underactuated system, mainly including the initial parameters and possible loading parameters (p indicates the number of uncertain parameters), $D\left( q,\alpha \right) \in {\mathcal {R}}^{n\times n}$ is the inertial matrix, $C\left( q,{\dot{q}},\alpha \right) \in {\mathcal {R}}^{n\times n}$ represents the centripetal and Coriolis matrix, $G\left( q,\alpha \right) \in {\mathcal {R}}^{n}$ denotes the gravitational torque/force, $F_{v}\left( \alpha \right) \in {\mathcal {R}}^{n\times n}$ is the viscous friction coefficients, $F_{c}(q,{\dot{q}},\alpha )\in {\mathcal {R}}^{n}$ models the nonlinear friction torques, $\tau _{d}$ denotes the unknown disturbances and unmodelled dynamics which are bounded, $B\left( q \right) \in {\mathcal {R}}^{n\times (n-m)}$ represents the input transformation matrix and $\tau \in {\mathcal {R}}^{n-m}$ is the vector of control inputs to be constructed to obtain specific control objectives.

The Lagrangian dynamic model of the UMSs described by (1) has the following beneficial properties [6, 45, 46] that are employed in the design and analysis of the control schemes in this paper:

Property 1

The inertia matrix $D\left( q,\alpha \right) $ is symmetric and positive definite, i.e. $D\left( q,\alpha \right) =D^{T}\left( q,\alpha \right) $; it is uniformly positive definite and has upper and lower boundaries, which implies

$$\begin{aligned}&0<\lambda _{\min }\left( \alpha \right) \left\| x \right\| ^{2}\le x^{T}D\left( q,\alpha \right) x\le \lambda _{\max }\left( \alpha \right) \left\| x \right\| ^{2}\nonumber \\&\quad <+\,\infty , \forall x\in {\mathcal {R}}^{n-m} \end{aligned}$$

(2)

Property 2

The centripetal and Coriolis term $C\left( q,{\dot{q}},\alpha \right) {\dot{q}}$ is quadratic in the generalized velocity ${\dot{q}}$ and satisfies

$$\begin{aligned} \left\| C\left( q,{\dot{q}},\alpha \right) {\dot{q}} \right\| \le \lambda _{3}\left( \alpha \right) \left\| {\dot{q}} \right\| ^{2} \end{aligned}$$

(3)

where $\lambda _{3}\left( \alpha \right) $ is a bounded scalar constant.

Property 3

The above matrixes $D\left( q,\alpha \right) $ and $C\left( q,{\dot{q}},\alpha \right) $ have the following skew-symmetric interconnection

$$\begin{aligned} x^{T}\left[ {\dot{D}}\left( q,\alpha \right) -2C\left( q,{\dot{q}},\alpha \right) \right] x=0, \forall x\in {\mathcal {R}}^{n-m} \end{aligned}$$

(4)

under an appropriate definition of $C\left( q,{\dot{q}},\alpha \right) $. This property is a matrix version of energy conservation.

Property 4

The gravitational torque/force $G\left( q,\alpha \right) $ is bounded and satisfies

$$\begin{aligned} \left\| G\left( q,\alpha \right) \right\| \le \lambda _{4}\left( \alpha \right) \end{aligned}$$

(5)

where $\lambda _{4}\left( \alpha \right) $ is a bounded constant.

Property 5

The dynamic model (1) can be rewritten in a linear form with respect to an appropriate selection of the system’s initial parameters and load parameters $\alpha $. Furthermore, there exist a regressor matrix $Y(q,{\dot{q}},{\ddot{q}})$ and a vector $Y_{0}(q,{\dot{q}},{\ddot{q}})$ containing known functions, given as follows:

$$\begin{aligned}&D\left( q,\alpha \right) {\ddot{q}}+C\left( q,{\dot{q}},\alpha \right) {\dot{q}}+G\left( q,\alpha \right) +F_{v}\left( \alpha \right) {\dot{q}}\nonumber \\&\quad +F_{c}\left( q,{\dot{q}},\alpha \right) =Y\left( q,{\dot{q}},{\ddot{q}} \right) \alpha +Y_{0}(q,{\dot{q}},{\ddot{q}}) \end{aligned}$$

(6)

where $Y(.)\in R^{(n-m)\times p}$ is the regressor matrix containing known functions.

Remark 1

Based on Property 5, we introduce ${\hat{\alpha }} $ be the time-varying estimation of $\alpha $, and define ${\hat{D}}$, ${\hat{C}}$, ${\hat{G}}$, ${\hat{F}}_{v} $ and ${\hat{F}}_{c} $ be the corresponding affine matrices, respectively, estimated from D, C, G, $F_{v} $ and $F_{c} $ through substitution $\hat{\alpha }$ for the real $\alpha $. Then, the linear parametrizability is given by

$$\begin{aligned}&{\tilde{D}}\left( q,\alpha \right) {\ddot{q}}+{\tilde{C}}\left( q,{\dot{q}},\alpha \right) \varrho +{\tilde{G}}\left( q,\alpha \right) \nonumber \\&\qquad +{\tilde{F_{v}}}\left( \alpha \right) \varrho +{\tilde{F_{c}}}\left( q,{\dot{q}},\alpha \right) \nonumber \\&\quad =Y\left( q,{\dot{q}},\varrho ,{\ddot{q}} \right) \tilde{\alpha }+Y_{0}(q,{\dot{q}},\varrho ,{\ddot{q}}) \end{aligned}$$

(7)

where ${\tilde{\alpha }}(t)={\hat{\alpha }}(t)-\alpha $ is the parameter estimation error, $\varrho \in {\mathcal {R}}^{n} $ is an arbitrary vector and ${\tilde{D}}$, ${\tilde{C}}$, ${\tilde{G}}$, ${\tilde{F}}_{v}$, ${\tilde{F}}_{c}$ represent the corresponding affine matrices of estimation errors in the presence of the parameter estimation error ${\tilde{\alpha }}$.

Remark 2

Concretely, the unmodelled friction torque/force F in (1) can be partitioned into two aspects as

$$\begin{aligned} F=F_{v}\left( \alpha \right) {\dot{q}}+F_{c}(q,{\dot{q}},\alpha ) \end{aligned}$$

(8)

where $F_{v}\left( \alpha \right) {\dot{q}}=[F_{v1}\left( \alpha \right) {\dot{q}}_{1},F_{v2}\left( \alpha \right) {\dot{q}}_{2}, ... \,,F_{vn}\left( \alpha \right) {\dot{q}}_{n}]^{T}$ is the viscous friction torque describing the linear part and $F_{c}\left( q,{\dot{q}},\alpha \right) =[F_{c1}\left( q_{1},{\dot{q}}_{1},\alpha \right) ,F_{c2}\left( q_{2},{\dot{q}}_{2},\alpha \right) , ... \, ,F_{cn}\left( q_{n},{\dot{q}}_{n},\alpha \right) ]^{T}$ denotes the nonlinear friction torques.

Definition 1

[47] UMSs’ DOF contains two subsets, including the collocated subset whose cardinality equals the number of control inputs and encompasses the actuated DOF, and the non-collocated subset contains the rest of the DOF which are passive.

Assumption 1

It is assumed in this paper that the matched and mismatched external disturbances are bounded.

Assumption 2

It is assumed that each subsystem is equipped with encoder and tachometer for the position and velocity measurement.

2.3 RBFNN approximation

The structure of RBFNN is presented in Fig. 1. The universal approximation capability of RBFNN towards any continuous nonlinear function ${\upchi }\left( z \right) :{\mathcal {R}}^{n}\rightarrow {\mathcal {R}}$ over a compact set ${\Omega }_{z}$ has been well established, which can be expressed as

$$\begin{aligned} {\upchi }\left( z \right)= & {} W^{*T}\phi \left( z \right) +\varepsilon \left( z \right) \, \forall z\in {\Omega }_{z}\subset {\mathcal {R}}^{n},\nonumber \\&\left\| \varepsilon \left( z \right) \right\| \le \varepsilon _{N} \end{aligned}$$

(9)

where $z\in {\Omega }_{z}\subset {\mathcal {R}}^{n}$ denotes the input vector of dimension n, ${\upchi }\left( z \right) $ is the unknown function to be approximated, $W^{*}={[W_{1}^{*},W_{2}^{*}, ... \, ,W_{k}^{*}]}^{T}\in {\mathcal {R}}^{k}$ is the bounded ideal synaptic weight vector with dimension (or the NN node number) $k>1$ (i.e.$ \forall $ positive constant $W_{N}$ such that $\left\| W_{k}^{*} \right\| \le W_{N}$ and $tr\left\{ W_{k}^{{*}^{T}}W_{k}^{*} \right\} \le W_{N})$, $\varepsilon \left( z \right) \in {\mathcal {R}}$ is a bounded approximation error over the compact set, $\varepsilon _{N}$ is an upper bound (positive constant) of the approximation error which satisfies $\varepsilon _{N}=sup\left\| {\hat{\upchi }}\left( z,W^{*} \right) - {\upchi }\left( z \right) \right\| $ and $\phi \left( z \right) ={[\phi _{1}\left( z \right) ,\phi _{2}\left( z \right) , ... \, ,\phi _{k}\left( z \right) ]}^{T}$ is the NN basis function which is conventionally chosen as Gaussian functions as

$$\begin{aligned} \phi _{i}\left( z \right) =\exp \left( -\frac{\left\| z-C_{i} \right\| ^{2}}{2b_{i}^{2}} \right) , i=1,2, ... \, ,k \end{aligned}$$

(10)

where vector $C_{i}$ and $b_{i}$ represent the centre and the width of the i-th receptive field.

The Gaussian function is chosen as NN basis function; it is well known that given a sufficient number of NNs nodes and properly adopted centres and the widths of the node, RBFNN is able to approximate any unknown nonlinearities to arbitrarily close to a compact set with any desired accuracy. Note that the approximation error $\varepsilon \left( z \right) $ decreases along with the increase in the number of NN node k.

It is noted that the bounded ideal weight matrix $W^{*}$ is merely a quantity utilized for analysis purposes, whilst in practical control applications, the estimate ${\hat{W}}$ of $W^{*}$ is utilized for practical approximation of unknown nonlinear function ${\upchi }\left( z \right) $. As such, the estimation of ${\upchi }\left( z \right) $ is represented by

$$\begin{aligned} {\hat{\upchi }}\left( z \right) ={\hat{W}}^{T}\phi \left( z \right) \end{aligned}$$

(11)

Based on the NN defined by (11), approximation error of the nonlinear function can be described as

$$\begin{aligned} {\upchi }\left( z \right) -{\hat{\upchi }}\left( z \right) ={\tilde{W}}^{T}\phi \left( z \right) +\varepsilon \left( z \right) \end{aligned}$$

(12)

where ${\tilde{W}}=W^{*}-{\hat{W}}$.

Assumption 3

${\hat{\upchi }}\left( z,W^{*} \right) $ is the output of the NNs and continuous; there exists a sufficient small positive constant such that

$$\begin{aligned} max\left\| {\hat{\upchi }}\left( z,W^{*} \right) -{\upchi }\left( z \right) \right\| \le \varepsilon _{0} \end{aligned}$$

(13)

where $W^{*}$ is typically defined as the optimal value of W such that the approximation error $\varepsilon \left( z \right) $ could be minimized for all $z\in {\Omega }_{z}$ as

$$\begin{aligned} {W^{{*}}} := \arg {\min }_{W \in {{\mathcal {R}}^k}} \left\{ {{\sup }_{{{~}}z \in {{{\Omega }}_z}}\Vert {{\upchi }}\left( z \right) - {W^{{{*}}T}}\phi \left( z \right) }\Vert \right\} \end{aligned}$$

(14)

3 Control system design and stability analysis

It is assumed that for system (1), there are only m control inputs that are equipped with actuators; then, the generalized coordinate vector q can be partitioned into collocated and non-collocated vectors as

$$\begin{aligned} q{:}{=}{[q_{c}\, q_{n}]}^{T} \end{aligned}$$

(15)

where $q_{c}\in {\mathcal {R}}^{m}$ and $q_{n}\in {\mathcal {R}}^{n-m}$ denote the actuated and unactuated coordinate vector, respectively. The subscripts “c” and “n,” respectively, indicate collocated and non-collocated subsets.

Without loss of generality, the underactuated system (1) can be rewritten into a partitioned form as

$$\begin{aligned} \left\{ {\begin{array}{l} D_{cc}\left( q,\alpha \right) {\ddot{q}}_{c}+D_{cn}\left( q,\alpha \right) {\ddot{q}}_{n}+C_{cc}\left( q,{\dot{q}},\alpha \right) {\dot{q}}_{c}\\ \quad +\,C_{cn}\left( q,{\dot{q}},\alpha \right) {\dot{q}}_{n} +G_{c}\left( q,\alpha \right) +F_{vc}\left( \alpha \right) {\dot{q}}_{c}\\ \quad +\,F_{cc}\left( q,{\dot{q}},\alpha \right) +\tau _\mathrm{dc}=\tau \\ D_{nc}\left( q,\alpha \right) {\ddot{q}}_{c}+D_{nn}\left( q,\alpha \right) {\ddot{q}}_{n}+C_{nc}\left( q,{\dot{q}},\alpha \right) {\dot{q}}_{c}\\ \quad +\,C_{nn}\left( q,{\dot{q}},\alpha \right) {\dot{q}}_{n} +G_{n}\left( q,\alpha \right) \\ \quad +\,F_{vn}\left( \alpha \right) {\dot{q}}_{n}+F_{cn}\left( q,{\dot{q}},\alpha \right) +\tau _{dn}=0 \\ \end{array}} \right. \end{aligned}$$

(16)

where $\tau _\mathrm{dc}$ and $\tau _{dn}$ denote the bounded unknown disturbances and unmodelled dynamics to the collocated and non-collocated subsets, respectively.

Let the reference trajectories for the collocated and non-collocated subsets be descried by the vector-valued functions $\left\| q_{cd} \right\| _{\infty }\le \vartheta _{1}$ and $\left\| q_{nd} \right\| _{\infty }\le \vartheta _{2}$, respectively, and assume that these functions are bounded in norm and uniformly continuous on ${{\mathcal {R}}}^{+}$, and homogenously on the same set, its first- and second-order derivatives are bounded, well defined and uniformly continuous. Introduce the trajectory tracking error as

$$\begin{aligned} {\tilde{q}}_{c}=q_{c}-{q}_{cd}, {\tilde{q}}_{n}=q_{n}-{q}_{nd} \end{aligned}$$

(17)

which is to be stabilized to zero without the knowledge of the system parameters $\alpha $. $\vartheta _{1}$ and $\vartheta _{2}$ are positive upper bounds of the desired reference trajectories. Noting that the design of $\vartheta _{1}$ and $\vartheta _{2}$ has to satisfy the zero dynamics based on the non-holonomic dynamics, we have

$$\begin{aligned}&D_{nc}\left( q,\alpha \right) {\ddot{\vartheta }}_{1}+D_{nn}\left( q,\alpha \right) {\ddot{\vartheta }}_{2}+C_{nc}\left( q,{\dot{q}},\alpha \right) {\dot{\vartheta }}_{1}\nonumber \\&\quad +\,C_{nn}\left( q,{\dot{q}},\alpha \right) {\dot{\vartheta }}_{2}+G_{n}\left( q,\alpha \right) \nonumber \\&\quad +\,F_{vn}\left( \alpha \right) {\dot{\vartheta }}_{2}+F_{cn}\left( q,{\dot{q}},\alpha \right) +\tau _{dn}=0 \end{aligned}$$

(18)

In the following, auxiliary kinematic vector variables $\varrho = {[\varrho _{c} \,\varrho _{n}]}^{T}$ and $\delta = {[\delta _{c}\, \delta _{n}]}^{T}$ are defined as

$$\begin{aligned} \varrho _{c}= & {} {\dot{q}}_{cd}-{\Lambda }_{c}{\tilde{q}}_{c},\nonumber \\ \varrho _{n}= & {} {\dot{q}}_{nd}-{\Lambda }_{n}{\tilde{q}}_{n} \end{aligned}$$

(19)

$$\begin{aligned} \delta _{c}= & {} {\dot{q}}_{c}-\varrho _{c}={\dot{{\tilde{q}}}}_{c}+{\Lambda }_{c}{\tilde{q}}_{c}, \nonumber \\ \delta _{n}= & {} {\dot{q}}_{n}-\varrho _{n}={\dot{{\tilde{q}}}}_{n}+{\Lambda }_{n}{\tilde{q}}_{n} \end{aligned}$$

(20)

where $\varrho _{c},\delta _{c}\in {\mathcal {R}}^{m}$ and $\varrho _{n},\delta _{n}\in {\mathcal {R}}^{n-m}$. $\delta $ denotes the filtered error signal and describes the measure of tracking accuracy, $\varrho $ is referred to as vector of the reference trajectory, ${\Lambda =\hbox {diag}[}{\Lambda }_{c}I_{m\times m}{, }\,{\Lambda }_{n}I_{(n-m)\times (n-m)}{]}$ with ${\Lambda }_{c}$ and ${\Lambda }_{n}$ be positive constants selected by designers. $I_{i\times i}$ denotes $i\times i$ identity matrix. It is noted that the error dynamics of the underactuated systems can be obtained by firstly introducing the tracking error from collocated and non-collocated loops and then filtering out the error signals. In this regard, we can encompass the conventional adaptive control approaches and stabilize the state space of underactuated systems. The choice of ${\Lambda }_{c}>0$ and ${\Lambda }_{n}{>}0$ guarantees that (20) is an exponentially stable system for q. Therefore, the trajectory q converges to an adjacent of $q_{d}$ exponentially fast as long as the control system drives $\delta $ to an adjacent of zero.

Applying the defined variables in the system dynamics (17), we have

$$\begin{aligned} \left\{ {\begin{array}{l} D_{cc}\left( q,\alpha \right) \left( {\dot{\delta }}_{c}+\dot{\varrho }_{c} \right) +D_{cn}\left( q,\alpha \right) \left( \dot{\delta }_{n}+{\dot{\varrho }}_{n} \right) \\ \quad +\,C_{cc}\left( q,{\dot{q}},\alpha \right) \left( \delta _{c}+\varrho _{c} \right) +C_{cn}\left( q,{\dot{q}},\alpha \right) (\delta _{n}\\ \quad +\,\varrho _{n})+G_{c}\left( q,\alpha \right) +F_{vc}\left( \alpha \right) {\dot{q}}_{c}+F_{cc}\left( q,{\dot{q}},\alpha \right) \\ \quad +\,\tau _\mathrm{dc}=\tau \\ D_{nc}\left( q,\alpha \right) \left( {\dot{\delta }}_{c}+\dot{\varrho }_{c} \right) +D_{nn}\left( q,\alpha \right) \left( \dot{\delta }_{n}+{\dot{\varrho }}_{n} \right) \\ \quad +\,C_{nc}\left( q,{\dot{q}},\alpha \right) \left( \delta _{c}\!+\!\varrho _{c} \right) \!+\! C_{nn}\left( q,{\dot{q}},\alpha \right) (\delta _{n}\!+\!\varrho _{n})\\ \quad +\,G_{n}\left( q,\alpha \right) \!+\!F_{vn}\left( \alpha \right) {\dot{q}}_{n}\!+\!F_{cn}\left( q,{\dot{q}},\alpha \right) \!+\!\tau _{dn}=0 \\ \end{array}} \right. \end{aligned}$$

(21)

The corresponding lumped error equation can be yielded as

$$\begin{aligned} \left\{ {\begin{array}{l} D_{cc}\left( q,\alpha \right) {\dot{\delta }}_{c}+D_{cn}\left( q,\alpha \right) {\dot{\delta }}_{n}+C_{cc}\left( q,{\dot{q}},\alpha \right) \delta _{c} \\ \quad +\,C_{cn}\left( q,{\dot{q}},\alpha \right) \delta _{n}+\tau _\mathrm{dc}=\tau \\ \quad -\,Y_{c}(q,{\dot{q}},\dot{\varrho }_{c},{\dot{\varrho }}_{n},\varrho _{c},\varrho _{n})\alpha _{c} \\ D_{nc}\left( q,\alpha \right) {\dot{\delta }}_{c}+D_{nn}\left( q,\alpha \right) {\dot{\delta }}_{n}+C_{nc}\left( q,{\dot{q}},\alpha \right) \delta _{c} \\ \quad +\,C_{nn}\left( q,{\dot{q}},\alpha \right) \delta _{n}+\tau _{dn}=-{\upchi }\left( \mathrm {z} \right) \\ \end{array}} \right. \end{aligned}$$

(22)

where $Y_{c}\left( q,{\dot{q}},{\dot{\varrho }}_{c},\dot{\varrho }_{n},\varrho _{c},\varrho _{n} \right) \alpha _{c}=D_{cc}\left( q,\alpha \right) {\dot{\varrho }}_{c}+D_{cn}\left( q,\alpha \right) {\dot{\varrho }}_{n}+C_{cc}\left( q,{\dot{q}},\alpha \right) \varrho _{c}+C_{cn}\left( q,{\dot{q}},\alpha \right) \varrho _{n}+G_{c}\left( q,\alpha \right) +F_{vc}\left( \alpha \right) {\dot{q}}_{c}+F_{cc}(q,{\dot{q}},\alpha )$, ${\upchi }\left( \mathrm {z} \right) =D_{nc}\left( q,\alpha \right) \dot{\varrho }_{c}+D_{nn}\left( q,\alpha \right) {\dot{\varrho }}_{n}+C_{nc}\left( q,{\dot{q}},\alpha \right) \varrho _{c}+C_{nn}\left( q,{\dot{q}},\alpha \right) \varrho _{n}+G_{n}\left( q,\alpha \right) +F_{vn}\left( \alpha \right) {\dot{q}}_{n}+F_{cn}(q,{\dot{q}},\alpha )$, and $\alpha _{c}={\hat{\alpha }}_{c}-{\tilde{\alpha }}_{c}$, $\alpha _{n}=\hat{\alpha }_{n}-{\tilde{\alpha }}_{n}$. The input ${\upchi }\left( \mathrm {z} \right) $ is adopted as $\mathrm {z=[}{\tilde{q}}^{T}, {\dot{{\tilde{q}}}}^{T},\, q_{d}^{T}, \,{\dot{q}}_{d}^{T}, \,{\ddot{q}}_{d}^{T}{]}$.

The estimation of nonlinear function ${\upchi }\left( \mathrm {z} \right) = -Y_{n}\left( q,{\dot{q}},{\dot{\varrho }}_{c},\dot{\varrho }_{n},\varrho _{c},\varrho _{n} \right) \alpha _{n}$ is expressed as

$$\begin{aligned} {\hat{\upchi }}\left( z \right) ={\hat{W}}^{T}\phi \left( z \right) \end{aligned}$$

(23)

where ${\hat{W}}$ is the NN adaptation law, $\phi \left( z \right) $ is the basis function.

Accordingly, (22) evolves to the following form

$$\begin{aligned} \left\{ {\begin{array}{l} D_{cc}\left( q,\alpha \right) {\dot{\delta }}_{c}+D_{cn}\left( q,\alpha \right) {\dot{\delta }}_{n}+C_{cc}\left( q,{\dot{q}},\alpha \right) \delta _{c} \\ +\,C_{cn}\left( q,{\dot{q}},\alpha \right) \delta _{n}+\tau _\mathrm{dc}=\tau \\ \quad -\,Y_{c}(q,{\dot{q}},\dot{\varrho }_{c},{\dot{\varrho }}_{n},\varrho _{c},\varrho _{n})\alpha _{c} \\ D_{nc}\left( q,\alpha \right) {\dot{\delta }}_{c}+D_{nn}\left( q,\alpha \right) {\dot{\delta }}_{n}+C_{nc}\left( q,{\dot{q}},\alpha \right) \delta _{c} \\ +\,C_{nn}\left( q,{\dot{q}},\alpha \right) \delta _{n}+\tau _{dn}={\hat{W}}^{T}\phi +{\tilde{W}}^{T}\phi +\varepsilon \\ \end{array}} \right. \end{aligned}$$

(24)

where ${\tilde{W}}=W^{*}-{\hat{W}}$.

Concretely, with these derivations, the adaptive control problem for underactuated systems can be formulated as: given the reference trajectories $q_{d}\in {\mathcal {R}}^{n}$, finding a nonlinear control law for $\tau $ such that for any $q(0)\in {\mathcal {R}}^{n}$ subjecting to parameter uncertainty and external matched and mismatched disturbances, the tracking error ${\tilde{q}}$ and its derivative converge to zero in finite time as $t\rightarrow \infty $.

The following theorem presents NNs-based control schemes that ensure the convergence of the closed-loop signals.

Theorem 1

Consider the dynamic properties, assumptions and definitions, and apply the following control laws to the uncertain underactuated system (24)

$$\begin{aligned}&\tau =\tau _{c}+\tau _{n} \end{aligned}$$

(25a)

$$\begin{aligned}&\tau _{c}=Y_{c}{\hat{\alpha }}_{c}-K_{1}\delta _{c}-\xi ,\nonumber \\&\tau _{n}=-\,\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| \left| \eta \right| -K_{2}\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| \end{aligned}$$

(25b)

where the Adaptation Algorithm 1 for the collocated subsystem is designed as

$$\begin{aligned} {\dot{{\hat{\alpha }}}}_{c}= -\Gamma Y_{c}\delta _{c} \end{aligned}$$

(25c)

and the auxiliary input $\eta $ in (25b) is constructed as

$$\begin{aligned} {\dot{\eta }}=\eta ^{\frac{1}{2n+1}}\left( -K_{3}\left\| \delta _{n} \right\| ^{2}-\left\| \delta _{n} \right\| {\hat{W}}^{T}\phi +\delta _{n}^{T}\zeta \right) \end{aligned}$$

(25d)

with robust compensator $\zeta $ for the non-collocated subsystem designed as

$$\begin{aligned} \zeta =-\frac{\delta _{n}}{\left\| \delta _{n} \right\| +\mu }\kappa \end{aligned}$$

(25e)

and its adaptation law

$$\begin{aligned} {\dot{\kappa }}=\frac{\left\| \delta _{n} \right\| ^{2}}{\left\| \delta _{n} \right\| +\mu } \end{aligned}$$

(25f)

where $K_{1}\in {\mathcal {R}}^{m\times m}$ , $K_{2}, K_{3}\in {\mathcal {R}}^{\left( n-m \right) \times (n-m)}$ are diagonal, constant positive definite matrixes and $\Gamma \in {\mathcal {R}}^{p\times p}$ are positive definite matrixes. $\xi $ and $\zeta $ are auxiliary robust compensators designed later for convenience of stability analysis of the closed-loop system, and they are designed to compensate for matched and mismatched disturbances, and function approximation error of NNs and nonlinear frictions. $\mu >0$ is selected in a manner that $\int _0^\infty \mu \,\mathrm{d}t<\infty $. Then, the following conclusions hold:

(1)
$tr\left\{ {\hat{W}}^{T}{\hat{W}} \right\} \le W_{N}$ holds.
(2)
The control objective of global asymptotically stabilization can be achieved;
(3)
All signals within the closed-loop system are bounded, and the trajectory tracking errors ${\tilde{q}}$ and ${\dot{{\tilde{q}}}}$ will converge to zero asymptotically.

Proof

Consider a candidate Lyapunov function as follows

$$\begin{aligned} V&=\frac{1}{2}\delta ^{T}D\delta +\frac{1}{2}\tilde{\alpha }_{c}^{T}{\Gamma }^{-1}{\tilde{\alpha }}_{c}+\frac{1}{2}tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}{\tilde{W}} \right\} \nonumber \\&\quad +\frac{2n+1}{2n}\eta ^{\frac{2n}{2n+1}}+\frac{1}{2}{(\kappa -\varepsilon _{T})}^{2} \end{aligned}$$

(26)

where $\varepsilon _{T}\ge \left\| \varepsilon -\tau _{dn} \right\| $ denotes the upper bound of the mismatched disturbance and approximation error.

Differentiating both sides of (26) and applying the control laws (25) yield

$$\begin{aligned} {\dot{V}}&=\delta ^{T}\left( \left[ {\begin{array}{l} \tau -Y_{c}\alpha _{c} \\ W^{T}\phi +\varepsilon \\ \end{array}} \right] -\tau _{d} \right) +\dot{\hat{\alpha }}_{c}^{T}{\Gamma }^{-1}\tilde{\alpha }_{c}\nonumber \\&+tr\{{\tilde{W}}^{T}{\Upsilon }^{-1}{\dot{{\tilde{W}}}}\}+(\kappa -\varepsilon _{T}){\dot{\kappa }}+\eta ^{\frac{-1}{2n+1}}{\dot{\eta }}\nonumber \\&=\left[ \delta _{c}^{T} \delta _{n}^{T} \right] \nonumber \\&\left[ \begin{array}{c} -Y_{c}{\tilde{\alpha }}_{c}-K_{1}\delta _{c}-\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| \left| \eta \right| -\xi -K_{2}\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| \\ W^{T}\phi +\varepsilon \end{array}\right] \nonumber \\&-\delta ^{T}\tau _{d}+\dot{\hat{\alpha }}_{c}^{T}{\Gamma }^{-1}{\tilde{\alpha }}_{c}+tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}{\dot{{\tilde{W}}}} \right\} \nonumber \\&+(\kappa -\varepsilon _{T}){\dot{\kappa }}+\eta ^{\frac{-1}{2n+1}}{\dot{\eta }}\nonumber \\&=-\delta _{c}^{T}K_{1}\delta _{c}-\delta _{c}^{T}K_{2}\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| -\delta _{c}^{T}\xi \nonumber \\&-\delta _{c}^{T}\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| \left| \eta \right| -\delta ^{T}\tau _{d}+\delta _{n}^{T}(W^{T}\phi +\varepsilon )\nonumber \\&+tr\{{\tilde{W}}^{T}{\Upsilon }^{-1}{\dot{{\tilde{W}}}}\}+(\kappa -\varepsilon _{T}){\dot{\kappa }}+\eta ^{\frac{-1}{2n+1}}{\dot{\eta }}\nonumber \\&=-\delta _{c}^{T}K_{1}\delta _{c}-K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| -\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| \nonumber \\&+\delta _{n}^{T}\varepsilon +\delta _{n}^{T}W^{T}\phi -\delta _{c}^{T}\xi -\delta ^{T}\tau _{d}+tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}{\dot{{\tilde{W}}}} \right\} \nonumber \\&+(\kappa -\varepsilon _{T}){\dot{\kappa }}+\eta ^{\frac{-1}{2n+1}}{\dot{\eta }}\nonumber \\&=-\delta _{c}^{T}K_{1}\delta _{c}-K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| -\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| \nonumber \\&-\delta _{c}^{T}\xi -\delta ^{T}\tau _{d}+\delta _{n}^{T}\varepsilon +\delta _{n}^{T}W^{T}\phi +tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}{\dot{{\tilde{W}}}} \right\} \nonumber \\&-K_{3}\left\| \delta _{n} \right\| ^{2}+(\kappa -\varepsilon _{T})\dot{\kappa }-\left\| \delta _{n} \right\| {\hat{W}}^{T}\phi -\delta _{n}^{T}\zeta \nonumber \\&=-\delta _{c}^{T}K_{1}\delta _{c} -K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| -\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| \nonumber \\&+\delta _{n}^{T}(\varepsilon -\tau _{dn})-\delta _{c}^{T}\xi -\delta _{c}^{T}\tau _\mathrm{dc}-K_{3}\left\| \delta _{n} \right\| ^{2}\nonumber \\&-\frac{\left\| \delta _{n} \right\| ^{2}}{\left\| \delta _{n} \right\| +\mu }\kappa +(\kappa -\varepsilon _{T}){\dot{\kappa }}\nonumber \\&+tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}({\dot{{\tilde{W}}}}+{\Upsilon }\delta _{n}^{T}\phi ) \right\} \nonumber \\&{\le }-\delta _{c}^{T}K_{1}\delta _{c}-K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| -\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| \nonumber \\&+\left\| \delta _{n} \right\| \varepsilon _{T}-\delta _{c}^{T}\xi -\delta _{c}^{T}\tau _\mathrm{dc}-K_{3}\left\| \delta _{n} \right\| ^{2}-\frac{\left\| \delta _{n} \right\| ^{2}}{\left\| \delta _{n} \right\| +\mu }\kappa \\&+(\kappa -\varepsilon _{T}){\dot{\kappa }}+tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}({\dot{{\tilde{W}}}}+{\Upsilon }\delta _{n}^{T}\phi ) \right\} \nonumber \end{aligned}$$

(27)

$\square $

Towards the parameter drifting problem, the neural weight adaptation law for ${\hat{W}}$ is constructed based on the projection algorithm, given by

$$\begin{aligned} {\dot{{\hat{W}}}}=-{\dot{{\tilde{W}}}}=\left\{ {\begin{array}{l} {\Upsilon }\phi \delta _{n}^{T}-\frac{\delta _{n}^{T}{\hat{W}}^{T}{\Upsilon }\phi {\hat{W}}}{W_{N}},\quad if \, tr\left\{ {\hat{W}}^{T}{\hat{W}} \right\} \\ \quad =W_{N}\, \,and\, \delta _{n}^{T}{\hat{W}}^{T}\phi \, {\le }\,0; \\ {\Upsilon }\phi \delta _{n}^{T}, if\, tr\left\{ {\hat{W}}^{T}{\hat{W}} \right\} <W_{N} \,or\\ \quad \,if \, tr\left\{ {\hat{W}}^{T}{\hat{W}} \right\} =W_{N} \,and \, \\ \quad \delta _{n}^{T}{\hat{W}}^{T}\phi >0. \\ \end{array}} \right. \end{aligned}$$

(28)

Corollary 1

Let $V_{tr1}\triangleq tr\left\{ {\hat{W}}^{T}{\hat{W}} \right\} $ and $V_{tr2}\triangleq tr\left\{ {\tilde{W}}^{T}\varUpsilon ^{-1}({\dot{{\tilde{W}}}}+\varUpsilon \delta _{n}^{T}\phi ) \right\} $ and apply weight adaptation law (28), then the following results hold for the boundedness of ${\hat{W}}$

$$\begin{aligned} (1)\, V_{tr1}\le & {} W_{N} \end{aligned}$$

(29)

$$\begin{aligned} (2)\, V_{tr2}\le & {} 0 \end{aligned}$$

(30)

Proof

(1) Recalling (28), it is evident that

(a)
If $V_{tr1}=W_{N} \text { and } \delta _{n}^{T}{\hat{W}}^{T}\phi >0$, ${\dot{V}}_{tr1}= 2tr\left\{ {\hat{W}}^{T}{\dot{{\hat{W}}}} \right\} = {2}tr\left\{ {\hat{W}}^{T}{\Upsilon }\phi \delta _{n}^{T} \right\} -2\delta _{n}^{T}{\hat{W}}^{T}{\Upsilon }\phi =0$.
(b)
If $V_{tr1}=W_{N} \text { and } \delta _{n}^{T}{\hat{W}}^{T}\phi \le 0$, ${\dot{V}}_{tr1}={2}tr\left\{ {\hat{W}}^{T}{\Upsilon }\phi \delta _{n}^{T} \right\} <0$.
(c)
If $V_{tr1}<W_{N}$, the result 1) holds by itself. (2) Adopting ${\dot{{\hat{W}}}}$ in (28), it is apparent that
1. (a)
  If $V_{tr1}=W_{N} \text { and } \delta _{n}^{T}{\hat{W}}^{T}{\Upsilon }\phi >0$,
  $$\begin{aligned} V_{tr2}&=\frac{\delta _{n}^{T}{\hat{W}}^{T}{\Upsilon }\phi }{W_{N}}tr\left\{ {\hat{W}}^{T}{\hat{W}} \right\} \\&\quad \le \frac{\delta _{n}^{T}{\hat{W}}^{T}{\Upsilon }\phi }{W_{N}}\left( \frac{1}{2}tr\left\{ W^{{*}^{T}}W^{*} \right\} -\frac{1}{2}W_{N}\right) \\&\quad \le 0 \end{aligned}$$
2. (b)
  If ${\dot{{\hat{W}}}}={\Upsilon }\phi \delta _{n}^{T}$, we have $V_{tr2}=0$. This completes the proof of Corollary 1. $\square $

Substituting (30) into (27), the time derivative of Lyapunov candidate function becomes

$$\begin{aligned} {\dot{V}}&\le -\delta _{c}^{T}K_{1}\delta _{c}-\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| -K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| -\delta _{c}^{T}\xi \nonumber \\&\quad -\,\delta _{c}^{T}\tau _\mathrm{dc}+\left\| \delta _{n} \right\| \varepsilon _{T}\nonumber \\&\quad -\,\frac{\left\| \delta _{n} \right\| ^{2}}{\left\| \delta _{n} \right\| +\mu }\kappa -K_{3}\left\| \delta _{n} \right\| ^{2}+(\kappa -\varepsilon _{T}){\dot{\kappa }}\nonumber \\&=-\,\delta _{c}^{T}K_{1}\delta _{c}-\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| -K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \nonumber \\&\quad -\,\delta _{c}^{T}\xi -K_{3}\left\| \delta _{n} \right\| ^{2}-\delta _{c}^{T}\tau _\mathrm{dc}+\left\| \delta _{n} \right\| \varepsilon _{T}\nonumber \\&\quad +\,\left( \kappa -\varepsilon _{T} \right) \left( {\dot{\kappa }}-\frac{\left\| \delta _{n} \right\| ^{2}}{\left\| \delta _{n} \right\| +\mu } \right) -\frac{\left\| \delta _{n} \right\| ^{2}}{\left\| \delta _{n} \right\| +\mu }\varepsilon _{T}\nonumber \\&\le -\,\delta _{c}^{T}K_{1}\delta _{c}-\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| -K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \nonumber \\&\quad -\,\delta _{c}^{T}\xi -K_{3}\left\| \delta _{n} \right\| ^{2}-\delta _{c}^{T}\tau _\mathrm{dc}\nonumber \\&\quad +\,\left\| \delta _{n} \right\| \varepsilon _{T}-\frac{\left\| \delta _{n} \right\| ^{2}}{\left\| \delta _{n} \right\| +\mu }\varepsilon _{T}\nonumber \\&=-\,\delta _{c}^{T}K_{1}\delta _{c}-\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| -K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \nonumber \\&\quad -\delta _{c}^{T}\xi -K_{3}\left\| \delta _{n} \right\| ^{2}-\delta _{c}^{T}\tau _\mathrm{dc}+\frac{\left\| \delta _{n} \right\| \mu \varepsilon _{T}}{\left\| \delta _{n} \right\| +\mu }\nonumber \\&\le -\delta _{c}^{T}K_{1}\delta _{c}-\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| -K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \nonumber \\&\quad -\delta _{c}^{T}\xi -K_{3}\left\| \delta _{n} \right\| ^{2}-\delta _{c}^{T}\tau _\mathrm{dc}+\mu \varepsilon _{T}\nonumber \\&\le K_{1}\left\| \delta _{c} \right\| ^{2}-K_{3}\left\| \delta _{n} \right\| ^{2}-\delta _{c}^{T}\xi -\delta _{c}^{T}\tau _\mathrm{dc}+\mu \varepsilon _{T}\nonumber \\&=-\left\| \delta \right\| ^{T}K\left\| \delta \right\| -\delta _{c}^{T}\xi -\delta _{c}^{T}\tau _\mathrm{dc}+\mu \varepsilon _{T} \end{aligned}$$

(31)

where $K=\left[ {\begin{array}{cc} K_{1} &{} 0\\ 0 &{} K_{3}\\ \end{array} } \right] $.

When no disturbance exerts on the collocated subsystem ($\tau _\mathrm{dc}=0)$, i.e. the system is only subject to mismatched disturbances, we design the collocated robust compensator as $\xi =0$ and integrate both sides of (31) from $t=0$ to $t=T$ as

$$\begin{aligned} V\left( T \right) -V\left( 0 \right) \le -\int _{0}^{T} {\left\| \delta \right\| ^{T}K\left\| \delta \right\| } \,\mathrm{d}t+\varepsilon _{T}\int _{0}^{T} \mu \,\mathrm{d}t \end{aligned}$$

(32)

Considering that $V\left( T \right) \ge 0$ and $\int _0^\infty \mu \mathrm{d}t<\infty $, we have

$$\begin{aligned}&{\lim }_{T \rightarrow \infty } sup\frac{1}{T} \int _0^T \Vert {\delta \Vert ^{2}}\,\mathrm{d}t \le \frac{1}{K}\left( {V\left( 0 \right) + {\varepsilon _T} \int _{0}^{T} \mu \, \mathrm{d}t} \right) \nonumber \\&\quad {\lim }_{T \rightarrow \infty } \frac{1}{T} \end{aligned}$$

(33)

From the definition of the Lyapunov function V in (26) and ${\dot{V}}$ derived from (31–33), the global uniform boundedness of the filtered tracking error $\delta _{c}$ for collocated subsystem and $\delta _{n}$ for non-collocated subsystem, the parameter estimation error ${\tilde{W}}$ is guaranteed. From the definition and assumption 1 of filtered tracking error $\delta $, it is evident that $\delta $ is bounded. The boundedness of control input is obvious from (25). It can be concluded that since $\delta ={[\delta _{c} \,\delta _{n}]}^{T}\in L_{2}^{n}\cap L_{\infty }^{n}$, $\delta _{c}$ and $\delta _{n}$ are continuous and $\delta _{c}\rightarrow 0, \delta _{n}\rightarrow 0$ as $t\rightarrow \infty $, and $\eta \in L_{\infty }$. From (25c), it can be shown that ${\tilde{\alpha }}_{c}\in L_{\infty }^{p}$. This in turn implies, based on property 1 and (25c), that ${\dot{\delta }}\in L_{\infty }^{n}$, ${\ddot{q}}={[{\ddot{q}}_{c}\, {\ddot{q}}_{n}]}^{T}\in L_{\infty }^{n}$ and ${\tilde{q}}={[{\tilde{q}}_{c} \, {\tilde{q}}_{n}]}^{T}\in L_{\infty }^{2n}$. Therefore, ${\tilde{q}}_{c}$ and ${\tilde{q}}_{n}$ are uniformly continuous and ${\tilde{q}}={[{\tilde{q}}_{c}\, {\tilde{q}}_{n}]}^{T}\in L_{\infty }^{2n}$, and it is evident that ${\tilde{q}}\rightarrow 0$ as $t\rightarrow \infty $.

Remark 3

The NNs are adopted to approximate the mismatched system uncertainties, and the adaptive control algorithm is constructed to estimate the NN approximation error and the bounded mismatched disturbance. The combination of variable structure control, NN approximation and adaptive approach makes the constructed new controller more robust, and such errors resulting from trajectory tracking, parameter uncertainties, mismatched external disturbances and NN approximation are compensated.

For the case $\tau _\mathrm{dc}\ne 0$ and $\left\| \tau _\mathrm{dc} \right\| <\beta _{m}$, i.e. the system is subject to both matched and mismatched disturbances, one can only conclude that $\delta $ is bounded from (26) and (31), but ${\tilde{\alpha }}_{c}$ and ${\tilde{W}}$ may become unbounded as (31) merely contains a negative definite component of $\left\| \delta \right\| ^{2}$ and no negative terms of ${\tilde{\alpha }}_{c}$ and ${\tilde{W}}$ are apparently included. As a result, the system may tend to be unstable. To improve the robustness of Theorem 1, the following adaptation algorithm is therefore proposed.

Adaptation Algorithm 2. Consider the following adaptation law

$$\begin{aligned} {\dot{{\hat{\alpha }}}}_{c}{=-\Gamma '}{\tilde{\alpha }}_{c}{-\Gamma }Y_{c}\delta _{c} \end{aligned}$$

(34)

Corollary 2

Consider the error equation (22) with the sliding surface designed in (20) under the adaptive NNs-based robust control law in (25), the following corollary holds: If adaptation algorithm 2 is adopted, the system error signals ${\tilde{q}}$, ${\dot{{\tilde{q}}}}$ and ${\tilde{\alpha }}$ converge to zero asymptotically. If $\tau _\mathrm{dc}\ne 0$ and $\left\| \tau _\mathrm{dc} \right\| <\beta _{m}$, then the system becomes globally uniformly ultimately stable and the boundedness depends on $\tau _\mathrm{dc}$.

Proof

Adopting Adaptation Algorithm 2 in function (27), we have

$$\begin{aligned} {\dot{V}}&=\left[ \delta _{c}^{T} \delta _{n}^{T} \right] \nonumber \\&\left[ {\begin{array}{c} -K_{1}\delta _{c}-Y_{c}{\tilde{\alpha }}_{c}-\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| \left| \eta \right| -K_{2}\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| -\xi \\ W^{T}\phi +\varepsilon \\ \end{array}} \right] \nonumber \\&-\delta ^{T}\tau _{d}+\dot{\hat{\alpha }}_{c}^{T}{\Gamma }^{-1}{\tilde{\alpha }}_{c}+tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}{\dot{{\tilde{W}}}} \right\} \nonumber \\&+(\kappa -\varepsilon _{T}){\dot{\kappa }}+\eta ^{\frac{-1}{2n+1}}{\dot{\eta }}\nonumber \\&=-\delta _{c}^{T}K_{1}\delta _{c}-\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| -\delta _{c}^{T}\xi +\delta _{n}^{T}\left( \varepsilon -\tau _{dn} \right) \nonumber \\&-K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| -\delta _{c}^{T}\tau _\mathrm{dc}-\frac{\left\| \delta _{n} \right\| ^{2}}{\left\| \delta _{n} \right\| +\mu }\kappa -K_{3}\left\| \delta _{n} \right\| ^{2}\nonumber \\&+tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}\left( {\dot{{\tilde{W}}}}+{\Upsilon }\delta _{n}^{T}\phi \right) \right\} \nonumber \\&+\left( \kappa -\varepsilon _{T} \right) \dot{\kappa }-{\tilde{\alpha }}_{c}^{T}{\Gamma '}{\Gamma }^{-1}\tilde{\alpha }_{c}\nonumber \\&{\le }-\delta _{c}^{T}K_{1}\delta _{c}-\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| -\delta _{c}^{T}\xi +\left\| \delta _{n} \right\| \varepsilon _{T}\nonumber \\&-K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| -\delta _{c}^{T}\tau _\mathrm{dc}-\frac{\left\| \delta _{n} \right\| ^{2}}{\left\| \delta _{n} \right\| +\mu }\kappa \nonumber \\&-K_{3}\left\| \delta _{n} \right\| ^{2}+(\kappa -\varepsilon _{T}){\dot{\kappa }}+tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}({\dot{{\tilde{W}}}}+{\Upsilon }\delta _{n}^{T}\phi ) \right\} \nonumber \\&-{\tilde{\alpha }}_{c}^{T}{\Gamma '}{\Gamma }^{-1}{\tilde{\alpha }}_{c}\nonumber \\&\le -\delta _{c}^{T}K_{1}\delta _{c}-\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| -\delta _{c}^{T}\xi -\delta _{c}^{T}\tau _\mathrm{dc}\nonumber \\&-K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| -K_{3}\left\| \delta _{n} \right\| ^{2}+\mu \varepsilon _{T}-\tilde{\alpha }_{c}^{T}{\Gamma '}{\Gamma }^{-1}{\tilde{\alpha }}_{c} \end{aligned}$$

(35)

$$\begin{aligned} {\dot{V}}&\le -K_{1}\left\| \delta _{c} \right\| ^{2}-K_{3}\left\| \delta _{n} \right\| ^{2}-{\Gamma }^{{'}{\Gamma }^{-1}\left\| {\tilde{\alpha }}_{c} \right\| ^{2}}\nonumber \\&-\delta _{c}^{T}\xi -\delta _{c}^{T}\tau _\mathrm{dc}+\mu \varepsilon _{T}\nonumber \\&=-\left\| \delta ' \right\| ^{T}K'\left\| \delta ' \right\| -\delta _{c}^{T}\xi -\delta _{c}^{T}\tau _\mathrm{dc}+\mu \varepsilon _{T} \end{aligned}$$

(36)

where $K'=diag[K_{1}, K_{3},{\Gamma '}{\Gamma }^{-1}]$ and $\delta ^{'}=[\delta _{c},\delta _{n},{\tilde{\alpha }}_{c}]^{T}$. $\square $

Considering that ${\Gamma '}$ and ${\Gamma }^{-1}$ are positive definite diagonal matrix, thus ${\Gamma '}{\Gamma }^{-1}$ is a positive definite diagonal matrix.

Case 1. For the case when $\tau _\mathrm{dc}=0$, design the collocated robust compensator as $\xi = 0$ and integrate both sides of (36) from $t=0$ to $t=T$ as

$$\begin{aligned} V\left( T \right) -V\left( 0 \right) \le -\int _0^T {\left\| \delta ' \right\| ^{T}K'\left\| \delta ' \right\| } \mathrm{d}t+\varepsilon _{T}\int _{0}^{T} \mu \, \mathrm{d}t \end{aligned}$$

(37)

Considering that $V\left( T \right) \ge 0$ and $\int _0^\infty \mu \, \mathrm{d}t<\infty $, we have

$$\begin{aligned}&{\lim }_{T \rightarrow \infty } sup\frac{1}{T} \int _{0}^{T} \Vert \delta {'^2}\Vert \mathrm{d}t \nonumber \\&\quad \le \frac{1}{{K'}}\left( {V\left( 0 \right) + {\varepsilon _T}\int _{0}^{T} \mu \, \mathrm{d}t} \right) {\lim }_{T \rightarrow \infty } \frac{1}{T} \end{aligned}$$

(38)

Case 2. For the case when $\tau _\mathrm{dc}\ne 0$ and $\left\| \tau _\mathrm{dc} \right\| <\beta _{m}$, the collocated robust compensator $\xi $ is designed to satisfy the following conditions

$$\begin{aligned} \left\{ {\begin{array}{l} \delta _{c}^{T}\xi \ge 0 \\ {\beta _{m}\left\| \delta _{c} \right\| -\delta }_{c}^{T}\xi \le \rho \\ \end{array}} \right. \end{aligned}$$

(39)

where $\beta _{m}$ is the upper bound of $\tau _\mathrm{dc}$ and $\rho $ is a positive design scalar.

Theorem 2

Consider following control laws to the uncertain underactuated system

$$\begin{aligned} \tau&=\tau _{c}+\tau _{n} \end{aligned}$$

(40a)

$$\begin{aligned} \tau _{c}&=Y_{c}{\hat{\alpha }}_{c}-K_{1}\delta _{c}-\xi , \tau _{n}=-\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| \left| \eta \right| \nonumber \\&\quad -K_{2}\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| \end{aligned}$$

(40b)

with the Adaptation Algorithm 2 designed in (34), and the collocated robust compensator $\xi $ designed using hyperbolic tangent function as

$$\begin{aligned} \xi =\beta _{m}tanh\left( \frac{n\eta _{r}\beta _{m}\delta _{c}}{\rho }\right) \end{aligned}$$

(40c)

with $\eta _{r}$ is a gain constant chosen as $\eta _{r}=0.2785$ here, and the auxiliary input $\eta $ in (40b) is constructed as

$$\begin{aligned} {\dot{\eta }}= & {} \eta ^{\frac{1}{2n+1}}\left( -K_{3}\left\| \delta _{n} \right\| ^{2}-\left\| \delta _{n} \right\| {\hat{W}}^{T}\phi \right) \end{aligned}$$

(40d)

with the adaptation law for ${\hat{W}}$ based on the projection algorithm, given by

$$\begin{aligned} {\dot{{\hat{W}}}}= & {} -{\dot{{\tilde{W}}}}\nonumber \\= & {} \left\{ {\begin{array}{l} {\Upsilon }\phi \delta _{n}^{T}-\beta {\Upsilon }\left\| \delta _{n} \right\| {\hat{W}}-\frac{\delta _{n}^{T}{\hat{W}}^{T}{\Upsilon }\phi {\hat{W}}}{W_{N}}, \quad \\ \quad \quad if \, tr\left\{ {\hat{W}}^{T}{\hat{W}} \right\} =W_{N} \,and\, \delta _{n}^{T}{\hat{W}}^{T}\phi \,{\le }\,0; \\ {\Upsilon }\phi \delta _{n}^{T}-\beta {\Upsilon }\left\| \delta _{n} \right\| {\hat{W}}, \quad if tr\left\{ {\hat{W}}^{T}{\hat{W}} \right\} \\ \quad <W_{N} \,or\, \, if\, \,tr\,\left\{ {\hat{W}}^{T}{\hat{W}} \right\} \\ \quad =W_{N} \, and \, \delta _{n}^{T}{\hat{W}}^{T}\phi >0. \\ \end{array}} \right. \end{aligned}$$

(40e)

Then, it follows:

(1)
$tr\left\{ {\hat{W}}^{T}{\hat{W}} \right\} \le W_{N}$ holds.
(2)
All signals in the collocated and non-collocated systems are UUB.

Proof

Consider the Lyapunov function as follows

$$\begin{aligned} V= & {} \frac{1}{2}\delta ^{T}D\delta +\frac{1}{2}\tilde{\alpha }_{c}^{T}{\Gamma }^{-1}{\tilde{\alpha }}_{c}\nonumber \\&+\frac{1}{2}tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}{\tilde{W}} \right\} +\frac{2n+1}{2n}\eta ^{\frac{2n}{2n+1}} \end{aligned}$$

(41)

The derivative of Lyapunov candidate function is yielded as

$$\begin{aligned} {\dot{V}}= & {} \left[ \delta _{c}^{T} \delta _{n}^{T} \right] \nonumber \\&\left[ {\begin{array}{c} -K_{1}\delta _{c}-Y_{c}{\tilde{\alpha }}_{c}-\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| \left| \eta \right| -K_{2}\mathrm{sgn}\left( \delta _{c} \right) \left\| \delta _{n} \right\| -\xi \\ W^{T}\phi +\varepsilon \\ \end{array}} \right] \nonumber \\&-\delta ^{T}\tau _{d}+\dot{\hat{\alpha }}_{c}^{T}{\Gamma }^{-1}{\tilde{\alpha }}_{c}+tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}{\dot{{\tilde{W}}}} \right\} +\eta ^{\frac{-1}{2n+1}}{\dot{\eta }}\nonumber \\= & {} -\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| -\delta _{c}^{T}K_{1}\delta _{c}-K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \nonumber \\&-\delta _{c}^{T}\xi -\delta _{c}^{T}\tau _\mathrm{dc}-K_{3}\left\| \delta _{n} \right\| ^{2}\nonumber \\&+tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}\left( {\dot{{\tilde{W}}}}+{\Upsilon }\delta _{n}^{T}\phi \right) \right\} \nonumber \\&+\delta _{n}^{T}\left( \varepsilon -\tau _{dn} \right) -{\tilde{\alpha }}_{c}^{T}{\Gamma '}{\Gamma }^{-1}{\tilde{\alpha }}_{c}\nonumber \\&{\le }-\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| \left| \eta \right| -K_{2}\left\| \delta _{c} \right\| \left\| \delta _{n} \right\| -\delta _{c}^{T}K_{1}\delta _{c}-\delta _{c}^{T}\xi \nonumber \\&+-K_{3}\left\| \delta _{n} \right\| ^{2}\nonumber \\&+tr\left\{ {\tilde{W}}^{T}{\Upsilon }^{-1}(-{\Upsilon }\phi \delta _{n}^{T}+\beta {\Upsilon }\left\| \delta _{n} \right\| {\hat{W}}+{\Upsilon }\delta _{n}^{T}\phi ) \right\} \nonumber \\&\left\| \delta _{n} \right\| \varepsilon _{T}-\delta _{c}^{T}\tau _\mathrm{dc}-{\tilde{\alpha }}_{c}^{T}{\Gamma '}{\Gamma }^{-1}{\tilde{\alpha }}_{c}\nonumber \\ {\dot{V}}\le & {} -\delta _{c}^{T}K_{1}\delta _{c}-{\Gamma }^{{'}}{\Gamma }^{-1}\left\| {\tilde{\alpha }}_{c} \right\| ^{2}-K_{3}\left\| \delta _{n} \right\| ^{2}-\delta _{c}^{T}\xi \nonumber \\&-\delta _{c}^{T}\tau _\mathrm{dc}+\beta \left\| \delta _{n} \right\| tr\left\{ {\tilde{W}}^{T}\left( W-{\tilde{W}} \right) \right\} +\delta _{n}^{T}\varepsilon _{T} \end{aligned}$$

(42)

$\square $

Let us decompose (42) into the following two parts

$$\begin{aligned} {\dot{V}}_{1}&=-K_{3}\left\| \delta _{n} \right\| ^{2}+\beta \left\| \delta _{n} \right\| tr\left\{ {\tilde{W}}^{T}(W-{\tilde{W}}) \right\} +\delta _{n}^{T}\varepsilon _{T} \end{aligned}$$

(43a)

$$\begin{aligned} {\dot{V}}_{2}&=-{\Gamma '}{\Gamma }^{-1}\left\| \tilde{\alpha }_{c} \right\| ^{2}-\delta _{c}^{T}K_{1}\delta _{c}-\delta _{c}^{T}\tau _\mathrm{dc}-\delta _{c}^{T}\xi \end{aligned}$$

(43b)

We have

(1) For ${\dot{V}}_{1}$, considering that

$$\begin{aligned} \mathrm{tr}\left\{ {\tilde{W}}^{T}(W-{\tilde{W}}) \right\}&={({\tilde{W}},W)}_{F}-\left\| {\tilde{W}} \right\| _{F}^{2}\nonumber \\&\le \left\| {\tilde{W}} \right\| _{F}\left\| W \right\| _{F}-\left\| {\tilde{W}} \right\| _{F}^{2} \end{aligned}$$

(44)

Substituting (44) into (43a), we have

$$\begin{aligned} {\dot{V}}_{1}\le & {} -\,\lambda _{\min }(K_{3})\left\| \delta _{n} \right\| ^{2}\\&+\,\beta \left\| \delta _{n} \right\| \left\| {\tilde{W}} \right\| _{F}\left( W_{\max }-\left\| {\tilde{W}} \right\| _{F} \right) +\varepsilon _{T}\left\| \delta _{n} \right\| \\= & {} -\,\left\| \delta _{n} \right\| (\lambda _{\min }(K_{3})\left\| \delta _{n} \right\| \\&+\,\beta \left\| {\tilde{W}} \right\| _{F}\left( \left\| {\tilde{W}} \right\| _{F}-W_{\max } \right) -\varepsilon _{T}) \end{aligned}$$

Since

$$\begin{aligned}&\lambda _{\min }(K_{3})\left\| \delta _{n} \right\| +\beta \left\| {\tilde{W}} \right\| _{F}\left( \left\| {\tilde{W}} \right\| _{F}-W_{\max } \right) -\varepsilon _{T}\nonumber \\&\quad =\beta \left( \left\| {\tilde{W}} \right\| _{F}-\frac{W_{\max }}{2} \right) ^{2}-\beta \frac{W_{\max }^{2}}{4}\nonumber \\&\qquad +\lambda _{\min }(K_{3})\left\| \delta _{n} \right\| -\varepsilon _{T} \end{aligned}$$

(45)

To guarantee ${\dot{V}}_{1}\le 0$, the following inequality needs to be satisfied

$$\begin{aligned} \left\| \delta _{n} \right\|> & {} \frac{\beta W_{\max }^{2}+4\varepsilon _{T}}{4\lambda _{\min }(K_{3})} \hbox { or } \left\| {\tilde{W}} \right\| _{F}\nonumber \\> & {} \frac{W_{\max }}{2}+\sqrt{\frac{W_{\max }^{2}}{4}+\frac{\varepsilon _{T}}{\beta }} \end{aligned}$$

(46)

Therefore, ${\dot{V}}_{1}$ is negative outside a compact set. Based on the standard Lyapunov theorem extension, the UUB of both $\delta _{n}$ and $\left\| {\tilde{W}} \right\| _{F}$ is demonstrated.

Through (43b), the time derivative of $V_{2}$ can be given by

$$\begin{aligned} {\dot{V}}_{2}\le -{\Gamma '}{\Gamma }^{-1}\left\| {\tilde{\alpha }}_{c} \right\| ^{2}-\delta _{c}^{T}K_{1}\delta _{c}+\beta _{m}\left\| \delta _{c} \right\| -\delta _{c}^{T}\xi \end{aligned}$$

(47)

Based on the above knowledge of the design requirement (39), the definition of V and ${\dot{V}}_{2}$, as well as the assumption of boundedness of neural network weight, we substitute the collocated robust compensator (40c) into (43b) and yield

$$\begin{aligned} {\dot{V}}_{2}\le & {} -{\Gamma '}{\Gamma }^{-1}\left\| {\tilde{\alpha }}_{c} \right\| ^{2}-\delta _{c}^{T}K_{1}\delta _{c}+\rho \nonumber \\= & {} -\vartheta ^{T}K_{4}\vartheta +\rho \nonumber \\\le & {} -\lambda _{\min }(K_{4})\left\| \vartheta \right\| ^{2}+\rho \end{aligned}$$

(48)

where $K_{4}=diag\{\Gamma '{\Gamma }^{-1}, K_{1}\}$ and $\lambda _{\min }(K_{4})$ is the minimum eigenvalue of the matrix $K_{4}$. As a result, ${\dot{V}}_{2}$ is strictly negative outside the following compact set ${\Sigma }_{\vartheta }$:

$$\begin{aligned} {\Sigma }_{\vartheta }=\left\{ \vartheta (t)\left| 0\le \left\| \vartheta \right\| \le \sqrt{\frac{\rho }{\lambda _{\min }(K_{4})}} \right. \right\} \end{aligned}$$

(49)

Therefore, it is concluded that the filtered tracking error $\delta _{c}$ for collocated subsystem and $\delta _{n}$ for non-collocated subsystem, and the estimation error ${\tilde{W}}$ of the parameters are uniformly ultimately bounded. The tracking error of collocated subsystem decreases whenever $\vartheta $ is outside the compact set ${\Sigma }_{\vartheta }$, and thus $\left\| \vartheta \right\| $ is UUB. Considering that all the signals included in the control system (40) are UUB, it is therefore concluded that the control system (40) is uniformly ultimately bounded.

4 Simulations

In this section, simulation studies are conducted to demonstrate the effectiveness of the designed control schemes from the examples of an 2-DOF underactuated manipulator and 2-DOF vibro-driven system [5]. The consideration behind is to investigate the effectiveness towards both underactuated manipulation and locomotion systems.

4.1 2-DOF underactuated manipulator

The two-link planar manipulator as shown in Fig. 2 has its first link actuated and the second link unactuated. Two revolute joints are used to connect link 1 and link 2, and in the horizontal plane, link 1 is able to rotate 360 degrees. The denotations are introduced as follows: for link $i, i=1, 2$, $q_{i}$ are the generalized coordinate and the joint angle of each link, $m_{i}$ and $l_{i}$ are the mass and length, respectively. $l_{ci}$ represents the length from the previous joint to the COM of link i, and $I_{i}$ is the moment of inertia about the axis coming out of the page and coming through the COM of link i.

The equations of motion of the manipulator can be derived using the Lagrange’s approach as follows:

$$\begin{aligned} D\left( q \right) {\ddot{q}}+C\left( q,{\dot{q}} \right) {\dot{q}}+G\left( q \right) +F_{v}{\dot{q}}+F_{c}\left( q,{\dot{q}} \right) +\tau _{d}=\tau \end{aligned}$$

(50)

where

$$\begin{aligned}&D\left( q \right) =\left[ {\begin{array}{cc} m_{2}\left( l_{1}^{2}+l_{c2}^{2}+2l_{1}l_{c2}\cos q_{2} \right) +m_{1}l_{c1}^{2}+I_{1}+I_{2} &{} m_{2}\left( l_{1}l_{c2}\mathrm{cos}q_{2}+l_{c2}^{2} \right) +I_{2}\\ m_{2}\left( l_{1}l_{c2}\mathrm{cos}q_{2}+l_{c2}^{2} \right) +I_{2} &{} I_{2}+m_{2}l_{c2}^{2}\\ \end{array} } \right] ,\\&C\left( q,{\dot{q}} \right) =\left[ {\begin{array}{*{20}c} -m_{2}l_{c2}l_{1}\mathrm{sin}q_{2}{\dot{q}}_{2} &{} -m_{2}l_{c2}l_{1}\mathrm{sin}q_{2}({\dot{q}}_{1}+{\dot{q}}_{2})\\ m_{2}l_{c2}l_{1}\mathrm{sin}q_{2}{\dot{q}}_{1} &{} 0\\ \end{array} } \right] ,\\&G\left( q \right) =\left[ {\begin{array}{c} \left( m_{2}l_{1}+m_{1}l_{c1} \right) g\mathrm{cos}q_{1}+m_{2}l_{c2}gcos(q_{1}+q_{2})\\ m_{2}l_{c2}gcos(q_{1}+q_{2}) \\ \end{array}} \right] ,\\&F_{v}{\dot{q}}+F_{c}\left( q,{\dot{q}} \right) =\left[ {\begin{array}{l} f_{v1}{\dot{q}}_{1}+c_{1}\mathrm{\mathrm{sgn}}({\dot{q}}_{1}) \\ f_{v2}{\dot{q}}_{2}+c_{2}\mathrm{\mathrm{sgn}}({\dot{q}}_{2}) \\ \end{array}} \right] , \tau _{d}=\left[ {\begin{array}{l} a_{1}sin(t) \\ a_{2}sin(t) \\ \end{array}} \right] , \tau =\left[ {\begin{array}{l} \tau _{1} \\ 0 \\ \end{array}} \right] . \end{aligned}$$

It is assumed that the moments of inertia are calculated in the form of $I_{i}=\frac{m_{i} \,l_{i}^{2}}{12}$. The unknown parameters are chosen as $\alpha _{1}=m_{2}\left( l_{1}^{2}+l_{c2}^{2} \right) +m_{1}l_{c1}^{2}+I_{1}+I_{2}$, $\alpha _{2}=m_{2}l_{c2}l_{1}$, $\alpha _{3}=I_{2}+m_{2}l_{c2}^{2}$, $\alpha _{4}=m_{2}l_{1}+m_{1}l_{c1}$, $\alpha _{5}=m_{2}l_{c2}$, and then the uncertain parameter is $\alpha =\left[ \alpha _{1}\, \alpha _{2}\, \alpha _{3} \, \alpha _{4}\, \alpha _{5} \right] ^{T}\in R^{5}$. Based on the auxiliary kinematic vector variables defined in (20), the collocated regressor $Y_{c}$ is therefore obtained as $Y_{c}=\left[ -{\dot{\varrho }}_{1} Y_{c2} -\dot{\varrho }_{2}-g\mathrm{cos}q_{1}-gcos(q_{1}+q_{2}) \right] $ with $Y_{c2}=-(2\mathrm{cos}q_{2}{\dot{\varrho }}_{1}+\mathrm{cos}q_{2}\dot{\varrho }_{2}-{\dot{q}}_{2}\varrho _{1}\mathrm{sin}q_{2}-\mathrm{sin}q_{2}({\dot{q}}_{1}+{\dot{q}}_{2})\varrho _{2})$.

Generically, the adaptive NN-based tracking control scheme in (40) is evaluated with matched and mismatched uncertainties. The rationality of system parameter values selection of the manipulator in this section is configured from studies in the literature as reported in [45] as follows: $m_{1}=m_{2}=2 \,\mathrm{Kg}$, $I_{1}=I_{2}=0.2528\,\mathrm{Kgm}^{2}$, $l_{c1}=l_{c2}=0.75\,\mathrm{m}$, $l_{1}=l_{2}=1.5\,\mathrm{m}$. The initial conditions are set as $q\left( 0 \right) =\left[ q_{1}\left( 0 \right) \, q_{2}\left( 0 \right) \right] ^{\mathrm{T}}=\left[ 0.09-0.09 \right] ^{\mathrm{T}}$, ${\dot{q}}\left( 0 \right) =\left[ {\dot{q}}_{1}\left( 0 \right) \, {\dot{q}}_{2}\left( 0 \right) \right] ^{\mathrm{T}}=\left[ 0 \,0 \right] ^{\mathrm{T}}$, and the reference trajectory is given as $q_{1d}\left( t \right) = 0.5\pi (1+\hbox {sin}\,(0.1t))$ [45]. It is noted that when the desired trajectory for $q_{1}$ is chosen, the prior knowledge of the desired trajectory for $q_{2}$ can be achieved through convenient computation, and it should satisfy the following constraint equation

$$\begin{aligned}&D_{21}{\ddot{q}}_{1}+D_{22}{\ddot{q}}_{2}+C_{21}{\dot{q}}_{1} +C_{22}{\dot{q}}_{2}\nonumber \\&\quad +G_{2}+F_{v1}{\dot{q}}+F_{c1}+\tau _{d1}=0 \end{aligned}$$

(51)

The parameter values of friction and disturbance are chosen as $c_{1}=c_{2}=0.02$, $a_{1}=a_{2}=0.2$. The bandwidth of the first-order filter is set as $\Lambda =[\Lambda _{1}\, {\Lambda }_{2}]^{\mathrm{T}}={{[12 \, 30]}}^{\mathrm{T}}$. In the simulation, parameters of the control schemes are chosen to be $K_{1}=2I$, $K_{2}=5I$ and $K_{3}=20I$. The adaptation gains are chosen as ${\Gamma '}=8I$ and ${\Gamma }=4I$. Parameter values for the collocated robust compensator are set as $\beta _{m}=20$, $\rho =0.5$. In addition, the weight tuning parameter of the designed control schemes is chosen as ${\Upsilon =0.005}$ and ${\upbeta =0.1}$. The rationality of these selections is configured using iterative simulations.

Simulation results of the trajectory tracking performance of the adaptive NN-based control system (40) are presented in Fig. 3 with time-varying matched and mismatched disturbances. The reference trajectory (red solid line), the tracking trajectory (blue dashed line) in Fig. 3, the trajectory tracking error in Fig. 4, the control torque in Fig. 5 and the NN approximation performance in Figs. 6 and 7 are portrayed. We can see that the proposed scheme demonstrates good performance under the model uncertainties, frictions and time-varying external disturbances. It can be observed from Fig. 4 that the system tracks the reference trajectory accurately and the tracking error converges to a small compact set after about 4s. The bounded control input torque by using the designed control scheme is shown in Fig. 5 with an upper bound of 50Nm and a lower bound of $-20 \hbox { Nm}$. The RBFNN approximates the nonlinear uncertainties ${\upchi }\left( \mathrm {z} \right) $ effectively from Fig. 7. From the simulation studies, we can draw a conclusion that the developed control system is able to adapt the model uncertainties and is robust against the matched and mismatched external disturbances.

4.2 2-DOF underactuated vibro-driven system

The simulation study in Sect. 4.1 considers an underactuated manipulator with its base mounted on the working surface under uncertain dynamics and external disturbances. The passive dynamics of the second link and the actuated dynamics of the first link are coupled; thus, the unmodelled dynamics of the second link may contribute additional time-varying inertia and nonlinearity to the manipulator dynamics. In this subsection, the context of an underactuated mobile robotic model is considered as shown in Fig. 8. This underactuated vibro-driven robotic system was proposed in [3] for which the actuated and unactuated dynamics are strongly coupled.

In the presence of matched and mismatched external disturbances, the underactuated dynamics of the vibro-driven system are given as

$$\begin{aligned} D\left( q \right) {\ddot{q}}+C\left( q,{\dot{q}} \right) {\dot{q}}+K\left( q \right) q+G\left( q \right) +F+\tau _{d}=B\tau \end{aligned}$$

(52)

where $D\left( q \right) =\left[ {\begin{array}{cc} ml^{{2}} &{} -mlc_{\theta }\\ -mlc_{\theta } &{} \left( M+m \right) \\ \end{array} } \right] $ denotes the inertia matrix, $C\left( q,{\dot{q}} \right) =\left[ {\begin{array}{cc} 0 &{} {0}\\ mls_{\theta }{\dot{\theta }} &{} {0}\\ \end{array} } \right] $ is the Centripetal and Coriolis matrix, $K\left( q \right) =\left[ {\begin{array}{cc} k &{} {0}\\ 0 &{} {0}\\ \end{array} } \right] $ represents the generalized stiffness matrix, $G\left( q \right) ={[-mgls_{\theta } \, 0]}^{T}$ represents the gravitational torques, $F={[c{\dot{\theta }}\, f]}^{T}$ is the friction forces, $\tau _{d}={[\tau _\mathrm{dc} \, \tau _{dn}]}^{T}=\left[ {\begin{array}{l} a_{1}sin(t) \\ a_{2}sin(t) \\ \end{array}} \right] $ denotes the external matched and mismatched disturbances, $B={[1 \, 0]}^{T}$ is the input force matrix and $\tau \in {\mathcal {R}}^{1}$ denotes the control input applied to the system.

In the simulation, the rationality of the parameter values selection in this section is specified as follows: the system parameter values are configured from the studies in literature as reported in [48, 49] as $M=0.5 \, kg$, $m=0.138 \, kg$, $l=0.3 \, m$, $g=9.81\, m/s^{2}$, $\mu =0.01 \, N/ms$. Initial conditions of the system are set as ${\theta \left( 0 \right) =\theta }_{0}=\pi {/3}$, ${\dot{\theta }}\left( 0 \right) = 0$, $x\left( 0 \right) =0$ and ${\dot{x}}\left( 0 \right) =0$. The simulation is conducted in 6.6s which is one full motion cycle. The parameter values for the matched and mismatched external disturbances are chosen as $a_{1}=a_{2}=0.2$. The bandwidth of the first-order filter is set as $\Lambda =[{\Lambda }_{1}\, {\Lambda }_{2}]^{T}={{[15 \, 20]}}^{T}$. In the simulation, the controller parameters are chosen to be $K_{1}=10I$, $K_{2}=20I$ and $K_{3}=50I$. The adaptation gains are chosen as ${\Gamma '}=10I$ and ${\Gamma }=6I$. Parameter values for the collocated robust compensator are set as $\beta _{m}=20$, $\rho =0.5$. In addition, the weight tuning parameter of the proposed control system is selected as ${\Upsilon =0.005}$ and ${\upbeta =0.1}$. The rationality of these selections is configured using iterative simulations.

The trajectory tracking performance of the actuated subsystem is presented in Fig. 9. It is observed from the figure that although the response of the proposed control scheme is slightly slower, the controlled pendulum trajectory tracks the reference trajectory accurately. The reference trajectory for the actuated subsystem is chosen as shown in Fig. 9. The proposed control system has a learning process that makes the estimated parameters adapt to appropriate values.

The tracking error is shown in Fig. 10, from which the tracking error converges to an adjacent and bounded compact set near zero in finite time. The trajectory of the vibro-driven system is presented in Fig. 11 showing that the cart travels at the speed about 7cm within 6.6s. The control torque is shown in Fig. 12 that demonstrates the boundedness of the torque input. As demonstrated in the system performance, the developed NN adaptive control scheme is capable of guaranteeing accurate trajectory tracking of the actuated subsystem, and meanwhile, the passive subsystem can maintain a forward locomotion at some desired velocity. Therefore, it is concluded that the designed control system is efficient in the presence of unknown nonlinear dynamic systems and environmental disturbances.

5 Conclusions

In this paper, novel NNs-based adaptive tracking control schemes for underactuated systems with matched and mismatched disturbances have been presented. The parametric uncertainties and matched and mismatched external disturbances have been considered in the controller design, which feature a generic model in the research of underactuated systems. The mismatched disturbances have been omitted in most of the existing approaches for the tracking control of UMSs. Auxiliary control variables have been designed to establish the controllability of non-collocated subset of underactuated systems by using a universal approximation of RBFNN, and approximation errors and external disturbances can be efficiently counteracted through design of robust compensators. Employing the adaptive control approach, combined with variable structure and NNs, the exact values of the parameters of the underactuated systems are not required to be known a priori. The stability of the overall system has been proved by Lyapunov analysis, and it is shown that the tracking error can be reduced as small as desired in finite time by choosing appropriate controller parameters. The simulation studies on an underactuated manipulator and an underactuated vibro-driven system have shown the effectiveness of the proposed adaptive control systems.

References

Azimi, M.M., Koofigar, H.R.: Adaptive fuzzy backstepping controller design for uncertain underactuated robotic systems. Nonlinear Dyn. 79, 1457–1468 (2015). https://doi.org/10.1007/s11071-014-1753-y
Article MATH Google Scholar
Seifried, R.: Integrated mechanical and control design of underactuated multibody systems. Nonlinear Dyn. 67, 1539–1557 (2012). https://doi.org/10.1007/s11071-011-0087-2
Article MathSciNet Google Scholar
Liu, P., Yu, H., Cang, S.: Geometric analysis-based trajectory planning and control for underactuated capsule systems with viscoelastic property. Trans. Inst. Meas. Control. (2017). https://doi.org/10.1177/0142331217708833
Article Google Scholar
Zhang, X., Fang, Y., Sun, N.: Minimum-time trajectory planning for underactuated overhead crane systems with state and control constraints. IEEE Trans. Ind. Electron. 61, 6915–6925 (2014). https://doi.org/10.1109/TIE.2014.2320231
Article Google Scholar
Liu, P., Yu, H., Cang, S.: Modelling and dynamic analysis of underactuated capsule systems with friction-induced hysteresis. In: 2016 IEEE/RSJ international conference on intelligent robots and systems (IROS), pp. 549–554. IEEE (2016)
Fang, Y., Ma, B., Wang, P., Zhang, X.: A motion planning-based adaptive control method for an underactuated crane system. Control Syst. Technol. IEEE Trans. 20, 241–248 (2012)
Google Scholar
Nguyen, K.-D., Dankowicz, H.: Adaptive control of underactuated robots with unmodeled dynamics. Robot. Auton. Syst. 64, 84–99 (2015). https://doi.org/10.1016/j.robot.2014.10.009
Article Google Scholar
Liu, P., Yu, H., Cang, S.: On periodically pendulum-driven systems for underactuated locomotion: a viscoelastic jointed model. In: 2015 21st International Conference on Automation and Computing (ICAC). pp. 1–6 (2015)
Liu, P., Yu, H., Cang, S.: Modelling and analysis of dynamic frictional interactions of vibro-driven capsule systems with viscoelastic property. Eur. J. Mech.-A Solids. 74, 16–25 (2019)
Article MathSciNet Google Scholar
Liu, P., Huda, M.N., Tang, Z., Sun, L.: A self-propelled robotic system with a visco-elastic joint: dynamics and motion analysis. Eng. Comput. (2019). https://doi.org/10.1007/s00366-019-00722-3
Liu, P., Yu, H., Cang, S.: Trajectory synthesis and optimization of an underactuated microrobotic system with dynamic constraints and couplings. Int. J. Control Autom. Syst. 16, 2373–2383 (2018)
Article Google Scholar
Liu, P., Yu, H., Cang, S.: Optimized adaptive tracking control for an underactuated vibro-driven capsule system. Nonlinear Dyn. 94, 1803–1817 (2018)
Article Google Scholar
Brockett, R.W.: others: Asymptotic stability and feedback stabilization. Differ. Geom. Control Theory. 27, 181–191 (1983)
Google Scholar
Hwang, C.-L., Chiang, C.-C., Yeh, Y.-W.: Adaptive fuzzy hierarchical sliding-mode control for the trajectory tracking of uncertain underactuated nonlinear dynamic systems. IEEE Trans. Fuzzy Syst. 22, 286–299 (2014)
Article Google Scholar
Liu, P., Neumann, G., Fu, Q., Pearson, S., Yu, H.: Energy-efficient design and control of a vibro-driven robot. In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1464–1469. IEEE (2018)
Liu, P., Yu, H., Cang, S.: Modelling and control of an elastically joint-actuated cart-pole underactuated system. In: 2014 20th International Conference on Automation and Computing (ICAC) , pp. 26–31. IEEE (2014)
Valentinis, F., Donaire, A., Perez, T.: Energy-based motion control of a slender hull unmanned underwater vehicle. Ocean Eng. 104, 604–616 (2015)
Article Google Scholar
Liu, P.: Bio-inspired robotic control in underactuation: principles for energy efficacy, dynamic compliance interactions and adaptability (2018). https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.732064
Mistry, M., Buchli, J., Schaal, S.: Inverse dynamics control of floating base systems using orthogonal decomposition. In: 2010 IEEE International Conference on Robotics and Automation (ICRA), pp. 3406–3412. IEEE (2010)
Blajer, W., Dziewiecki, K., Kołodziejczyk, K., Mazur, Z.: Inverse dynamics of underactuated mechanical systems: a simple case study and experimental verification. Commun. Nonlinear Sci. Numer. Simul. 16, 2265–2272 (2011)
Article MathSciNet Google Scholar
Yue, M., An, C., Du, Y., Sun, J.: Indirect adaptive fuzzy control for a nonholonomic/underactuated wheeled inverted pendulum vehicle based on a data-driven trajectory planner. Fuzzy Sets Syst. 290, 158–177 (2016). https://doi.org/10.1016/j.fss.2015.08.013
Article MathSciNet MATH Google Scholar
Xu, J.-X., Guo, Z.-Q., Lee, T.H.: Design and implementation of integral sliding-mode control on an underactuated two-wheeled mobile robot. IEEE Trans. Ind. Electron. 61, 3671–3681 (2014)
Article Google Scholar
Xin, X., Tanaka, S., She, J., Yamasaki, T.: New analytical results of energy-based swing-up control for the Pendubot. Int. J. Non-Linear Mech. 52, 110–118 (2013). https://doi.org/10.1016/j.ijnonlinmec.2013.02.003
Article Google Scholar
Cornejo, C., Alvarez-Icaza, L.: Passivity based control of under-actuated mechanical systems with nonlinear dynamic friction. J. Vib. Control. (2011). https://doi.org/10.1177/1077546311408469
Article Google Scholar
Peng, Z., Wang, D., Chen, Z., Hu, X., Lan, W.: Adaptive dynamic surface control for formations of autonomous surface vehicles with uncertain dynamics. IEEE Trans. Control Syst. Technol. 21, 513–520 (2013). https://doi.org/10.1109/TCST.2011.2181513
Article Google Scholar
Cong, S., Liang, Y.: PID-like neural network nonlinear adaptive control for uncertain multivariable motion control systems. IEEE Trans. Ind. Electron. 56, 3872–3879 (2009)
Article Google Scholar
Sazonov, E.S., Klinkhachorn, P., Klein, R.L.: Hybrid LQG-neural controller for inverted pendulum system. In: Proceedings of the 35th Southeastern Symposium on System Theory, 2003, pp. 206–210. IEEE (2003)
Sprangers, O., Babuška, R., Nageshrao, S.P., Lopes, G.A.: Reinforcement learning for port-Hamiltonian systems. IEEE Trans. Cybern. 45, 1017–1027 (2015)
Article Google Scholar
Li, J., Guo, X., Li, Z., Chen, W.: Stochastic adaptive optimal control of under-actuated robots using neural networks. Neurocomputing. 142, 190–200 (2014). https://doi.org/10.1016/j.neucom.2014.04.049
Article Google Scholar
Yang, C., Li, Z., Cui, R., Xu, B.: Neural network-based motion control of an underactuated wheeled inverted pendulum model. IEEE Trans. Neural Netw. Learn. Syst. 25, 2004–2016 (2014). https://doi.org/10.1109/TNNLS.2014.2302475
Article Google Scholar
Tong, S.C., Li, Y.M., Zhang, H.G.: Adaptive neural network decentralized backstepping output-feedback control for nonlinear large-scale systems with time delays. IEEE Trans. Neural Netw. 22, 1073–1086 (2011). https://doi.org/10.1109/TNN.2011.2146274
Article Google Scholar
Mohareri, O., Dhaouadi, R., Rad, A.B.: Indirect adaptive tracking control of a nonholonomic mobile robot via neural networks. Neurocomputing. 88, 54–66 (2012). https://doi.org/10.1016/j.neucom.2011.06.035
Article Google Scholar
A biologically inspired approach to tracking control of underactuated surface vessels subject to unknown dynamics—ScienceDirect. http://www.sciencedirect.com/science/article/pii/S0957417414005958
Hsu, C.-F.: Adaptive backstepping Elman-based neural control for unknown nonlinear systems. Neurocomputing. 136, 170–179 (2014)
Article Google Scholar
Ping, Z.: Tracking problems of a spherical inverted pendulum via neural network enhanced design. Neurocomputing. 106, 137–147 (2013)
Article Google Scholar
Jung, S., Kim, S.S.: Control experiment of a wheel-driven mobile inverted pendulum using neural network. IEEE Trans. Control Syst. Technol. 16, 297–303 (2008). https://doi.org/10.1109/TCST.2007.903396
Article Google Scholar
Liu, D., Wang, D., Zhao, D., Wei, Q., Jin, N.: Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming. IEEE Trans. Autom. Sci. Eng. 9, 628–634 (2012)
Article Google Scholar
Liu, Y.-J., Chen, C.P., Wen, G.-X., Tong, S.: Adaptive neural output feedback tracking control for a class of uncertain discrete-time nonlinear systems. IEEE Trans. Neural Netw. 22, 1162–1167 (2011)
Article Google Scholar
Xu, B., Sun, F., Yang, C., Gao, D., Ren, J.: Adaptive discrete-time controller design with neural network for hypersonic flight vehicle via back-stepping. Int. J. Control 84, 1543–1552 (2011)
Article MathSciNet Google Scholar
Zhang, H., Qin, C., Luo, Y.: Neural-network-based constrained optimal control scheme for discrete-time switched nonlinear system using dual heuristic programming. IEEE Trans. Autom. Sci. Eng. 11, 839–849 (2014)
Article Google Scholar
Wang, T., Gao, H., Qiu, J.: A combined adaptive neural network and nonlinear model predictive control for multirate networked industrial process control. IEEE Trans. Neural Netw. Learn. Syst. 27, 416–425 (2016)
Article MathSciNet Google Scholar
Zhang, H., Wang, Z., Liu, D.: A comprehensive review of stability analysis of continuous-time recurrent neural networks. IEEE Trans. Neural Netw. Learn. Syst. 25, 1229–1262 (2014)
Article Google Scholar
Zhang, H., Cui, L., Luo, Y.: Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network ADP. IEEE Trans. Cybern. 43, 206–216 (2013)
Article Google Scholar
Zou, A.-M., Kumar, K.D., Hou, Z.-G., Liu, X.: Finite-time attitude tracking control for spacecraft using terminal sliding mode and Chebyshev neural network. IEEE Trans. Syst. Man Cybern. Part B Cybern. 41, 950–963 (2011)
Article Google Scholar
Pucci, D., Romano, F., Nori, F.: Collocated adaptive control of underactuated mechanical systems. IEEE Trans. Robot. 31, 1527–1536 (2015)
Article Google Scholar
Yang, C., Li, Z., Li, J.: Trajectory planning and optimized adaptive control for a class of wheeled inverted pendulum vehicle models. Cybern. IEEE Trans. 43, 24–36 (2013)
Article Google Scholar
Spong, M.W.: Underactuated mechanical systems. In: Siciliano, B., Valavanis, K.P. (eds.) Control Problems in Robotics and Automation. Lecture Notes in Control and Information Sciences, vol. 230, pp. 135–150. Springer, Berlin, Heidelberg (1998)
Li, H., Furuta, K., Chernousko, F.L.: Motion generation of the capsubot using internal force and static friction. In: Proceedings of the 45th IEEE Conference on Decision and Control, pp. 6575–6580. IEEE (2006)
Yu, H., Liu, Y., Yang, T.: Closed-loop tracking control of a pendulum-driven cart-pole underactuated system. Proc. Inst. Mech. Eng. Part J. Syst. Control Eng. 222, 109–125 (2008)
Article Google Scholar

Download references

Acknowledgements

This work was partially supported by the National Natural Science Foundation of China (61803396), European Commission Marie Skłodowska-Curie SMOOTH (smart robots for fire-fighting) project (H2020-MSCA-RISE-2016-734875) and Royal Society International Exchanges Scheme (Adaptive Learning Control of a Cardiovascular Robot using Expert Surgeon Techniques) project (IE151224). FP7 People: Marie-Curie Actions (Grant Number PIRSES-GA-2012-318902).

Author information

Authors and Affiliations

Cardiff School of Technologies, Cardiff Metropolitan University, Cardiff, CF5 2YB, UK
Pengcheng Liu
School of Engineering and the Built Environment, Edinburgh Napier University, 10 Colinton Road, Edinburgh, EH10 5DT, UK
Hongnian Yu
School of Economics and Management, Yanshan University, Qinhuangdao, 066004, China
Shuang Cang

Authors

Pengcheng Liu
View author publications
You can also search for this author in PubMed Google Scholar
Hongnian Yu
View author publications
You can also search for this author in PubMed Google Scholar
Shuang Cang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hongnian Yu.

Ethics declarations

Conflicts of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Liu, P., Yu, H. & Cang, S. Adaptive neural network tracking control for underactuated systems with matched and mismatched disturbances. Nonlinear Dyn 98, 1447–1464 (2019). https://doi.org/10.1007/s11071-019-05170-8

Download citation

Received: 27 April 2018
Accepted: 27 July 2019
Published: 08 October 2019
Issue Date: October 2019
DOI: https://doi.org/10.1007/s11071-019-05170-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Adaptive neural network tracking control for underactuated systems with matched and mismatched disturbances

Abstract

Similar content being viewed by others

Adaptive Neural Network Control for Uncertain Robotic Manipulators with Output Constraint Using Integral-Barrier Lyapunov Functions

Global adaptive tracking control of robot manipulators using neural networks with finite-time learning convergence

Fixed-time prescribed performance tracking control for manipulators against input saturation

1 Introduction

2 Preliminaries and problem description

2.1 Notations

2.2 Dynamic model and properties

Property 1

Property 2

Property 3

Property 4

Property 5

Remark 1

Remark 2

Definition 1

Assumption 1

Assumption 2

2.3 RBFNN approximation

Assumption 3

3 Control system design and stability analysis

Theorem 1

Proof

Corollary 1

Proof

Remark 3

Corollary 2

Proof

Theorem 2

Proof

4 Simulations

4.1 2-DOF underactuated manipulator

4.2 2-DOF underactuated vibro-driven system

5 Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation