Elsevier

Knowledge-Based Systems

Volume 39, February 2013, Pages 159-172
Knowledge-Based Systems

Structure learning for belief rule base expert system: A comparative study

https://doi.org/10.1016/j.knosys.2012.10.016Get rights and content

Abstract

The Belief Rule Base (BRB) is an expert system which can handle both qualitative and quantitative information. One of the applications of the BRB is the Rule-base Inference Methodology using the Evidential Reasoning approach (RIMER). Using the BRB, RIMER can handle different types of information under uncertainty. However, there is a combinatorial explosion problem when there are too many attributes and/or too many alternatives for each attribute in the BRB. Most current approaches are designed to reduce the number of the alternatives for each attribute, where the rules are derived from physical systems and redundant in numbers. However, these approaches are not applicable when the rules are given by experts and the BRB should not be oversized. A structure learning approach is proposed using Grey Target (GT), Multidimensional Scaling (MDS), Isomap and Principle Component Analysis (PCA) respectively, named as GT–RIMER, MDS–RIMER, Isomap–RIMER and PCA–RIMER. A case is studied to evaluate the overall capability of an Armored System of Systems. The efficiency of the proposed approach is validated by the case study results: the BRB is downsized using any of the four techniques, and PCA–RIMER has shown excellent performance. Furthermore, the robustness of PCA–RIMER is further verified under different conditions with varied number of attributes.

Introduction

The Belief Rule Base (BRB) system is developed from the traditional IF–THEN rule base to represent different types of knowledge under uncertainty [1], [2], [3], which is indispensable when analyzing problems involving human decisions [4]. One of the most important applications of the BRB is the RIMER approach [5], [6], [7], which is short for the Rule-base Inference Methodology using the Evidential Reasoning. Using a BRB, RIMER is able to handle different kinds of information under uncertainty, qualitative and/or quantitative, linguistic and/or numerical, complete and/or incomplete. The applications of RIMER include the Multiple Attribute Decision Analysis (MADA) problem [8], group decision making [9], [10], risk analysis [11], [12], trade-off analysis [13], pipeline leak detection [14], military capability evaluation [15], etc.

However, when the BRB is applied in RIMER, it is required to cover all the possible combinations of each alternative for each attribute. This is, in fact, a combinatorial explosion problem [16]: the size of the BRB would grow exponentially along with the increase of the number of the attributes and/or the alternatives for each attribute. Therefore, there is a pressing demand to downsize the BRB.

From the perspective of its origination, the BRB is classified as physical-model-based and expert-knowledge-based. For the physical-model-based BRB, the number of the attributes is fixed while the alternatives for each attribute are changeable; for the expert-knowledge-based BRB, only the attributes are changeable.

There are four parameters concerning the structure of a BRB: the number of the attributes, the alternatives for each attribute, the number of the scales and the alternatives of each scale in the assessment result [17]. The first two parameters determine the size of a BRB, which leads to two means to downsize a BRB. To downsize a BRB by means of identifying either parameter is called the structure learning of a BRB. Also, to identify the alternatives of each attribute is called parameter learning of a BRB as well. Since the four parameters are the most important feature of a BRB concerning its structure, to perform structure and/or parameter learning is to perform feature selection on a BRB.

Different techniques are applicable for feature selection in a BRB. Zhou [18], [19] used hidden Markov chain to model the correlation between environmental variables and observable variables, based on which a parameter learning approach was proposed; Jiang [16] proposed a parameter model of Evidential Network (EN) in which the Conditional Belief Function (CBF) and the BRB model were combined, furthermore Jiang conducted the parameter learning study within the framework of EN. Tsai [20], [21] proposed a rule mining algorithm in the presence of concept drift, and the idea of feature extraction was demonstrated in a patient diagnose case study context when he managed to accurately discover the underlying governing rules [21]. Rule induction [22], [23] mainly focused on evaluating rules (of a BRB) so that the feature of the original rule set could be completely extracted and preserved while the number is considerably reduced. In fact there were over 40 measures in related literatures [24], [25]. Suzuki [26] summarized that there was no single universal measure for all rule set analysis. Back to the framework of RIMER, Zhou [18], [19] proposed the concept of “statistical utility”, which resembled the evaluation measure in the previously introduced rule induction. Zhou used “statistical utility” (1) to rate a rule, (2) to determine whether a rule should be kept by setting a valve value for the “statistical utility”, and (3) to learn what the alternatives for each attribute really were.

There has been extensive research on the structure learning for the physical-model-based BRB systems. Yang [27] proposed the first generic BRB learning framework with an optimization model for training the BRB systems. Xu [14] also proposed a training methodology for the BRB systems and applied it in a pipeline leak detection case study. Later more studies were carried out on learning and training the BRB either in a theoretical context and/or a practical context [17], [28]. Zhou argued that those optimization models were all offline and locally optimal, which were not suitable for a dynamic fashion [29], [30]. Therefore, Zhou [31], [32] proposed an online updating BRB approach, which did not require to collect a complete set of data before training the BRB.

However, for the expert-knowledge-based BRB systems, these approaches are no longer applicable. Because the alternatives for each attribute are pre-set and fixed while the number of the attributes is changeable. Therefore, another way is taken to solve this problem: to reduce the number of the attributes. Numerous approaches and/or techniques can help to meet this challenge, of which four techniques are applied in this study: the Grey Target (GT) theory, the Multidimensional Scaling (MDS), the Isomap and the Principal Component Analysis (PCA). MDS, Isomap and PCA are all dimensionality reduction techniques. Along with GT, they are all typical means for feature selection [33], [34], [35], [36].

GT, as a non-statistical technique, is part of the Grey System Method (GM) which was first introduced by Deng in early 1980s [33]. Since then GT had become more and more popular in dealing with problems that have partially unknown factors, or “grey” information [34], [35]. Successful applications of GT include pattern recognition, feature selection, technical assessment, etc. [37], [38]. Compared with traditional statistical approaches, GT requires much less information to estimate the behavior of the unknown systems. The “approaching degree” [33] in GT is used to select key attributes in this study.

MDS [39], [40] and Isomap [41] are two nonlinear techniques for dimensionality reduction. MDS [42] uses a “stress function” to measure the geometric distances between the original high-dimensional space and the transformed low-dimensional space, which represents the quality of the mapping. Isomap [41] takes consideration of the distribution of the neighboring datapoints by preserving the pairwise geodesic distance between datapoints, which is the distance between two points measured over the manifold. The two techniques have become more and more poplar so that there have been several computer-based packages [43]. It should be noted that there are many other nonlinear dimensionality reduction techniques, such as LLE [44] and kernel PCA [45].

PCA is the most representative linear dimensionality reduction technique. PCA was commented as a “multivariable technique that analyzes a data table in which observations are described by several inter-correlated quantitative dependent variables” [46]. So far, PCA has become the most widely used and popular linear dimensionality reduction technique [36], which is traditionally used to extract condensed and representative information from massive information [47], [48]. The input of PCA can be either statistical data or standardized experts’ knowledge. PCA requires certain dependency on the attributes that forms the “data table”: the more dependent (less independent) the attributes (as input) are, the better representative the principal components (as output) are.

The dimensionality reduction approaches (including PCA as the representative linear approach) are interrelated and even equivalent in certain cases. For example, the earliest MDS used PCA to analyze correlated data by treating them in the Euclidean distances [49].

The remainder of this study is organized as follows. The RIMER approach and the demand to downsize a BRB are introduced in Section 2. The structure learning approach is proposed in Sections 3 Grey target, multidimensional scaling, Isomap and principal component analysis, 4 Structure learning for the belief rule base using GT, MDS, Isomap and PCA respectively. In Section 5, a case to evaluate the overall capability of an Armored System of Systems is studied to validate the efficiency of the proposed approach. This study is concluded in Section 6.

Section snippets

Problem demonstration

RIMER [5], [6], [7] was proposed by Yang based on the evidence theory [50], [51]. RIMER consists of two parts, first building the BRB, and then integrating the activated rules from the BRB using the ER algorithm, as briefly introduced in Sections 2.1 Belief rule base, 2.2 Rules integration using the ER algorithm. When a BRB is constructed, it is required to cover all the possible combinations of each alternative for each attribute. When there are too many attributes and/or too many alternatives

Grey target, multidimensional scaling, Isomap and principal component analysis

GT, MDS, Isomap and PCA are applied in this study as structure learning techniques to select key attributes.

Structure learning for the belief rule base

Based on the GT, MDS, Isomap and PCA technique introduced in the Section 3, a BRB structure learning approach is introduced, as shown in Fig. 1.

Steps are given as following:

  • Step 1: Determine the four parameters concerning the structure of the BRB, including the number of the attributes, the alternatives for each attribute, the number of the scales and the alternatives for each scale;

  • Step 2: Invite the experts to link the attributes those have strong connections. The experts are only required to

Case study

In this section, first the details of the case study are introduced as in Sections 5.1 Background, 5.2 Input transformation preparation, including the background, the input and the four variables used to test the efficiency of the proposed approach. Following is the structure learning approach using GT, MDS, Isomap and PCA respectively, named as GT–RIMER, MDS–RIMER, Isomap–RIMER and PCA–RIMER as in Section 5.3. Finally, some conclusions are drawn regarding on the efficiency and robustness of

Conclusion

This study is motivated by the combinatorial explosion problem when a BRB is constructed and is targeted to downsize the BRB while still preserving the consistency with the original assessment results.

There are two means to downsize the BRB since there are two decisive parameters: the number of the attributes and the number of the alternatives for each attribute. The first means is fit for physical systems while the second means is fit for expert systems, which is the case of this study.

To

Acknowledgments

This research is supported by the National Science Foundation of China under Contract Nos. 70901074, 71001104 and 71201168. We would like to thank Dr. W.W. Wang for his support at the very starting of this study. We would like very much to thank the two anonymous reviewers since this study could not come this far without their advices and suggestions.

References (62)

  • Z.J. Zhou et al.

    A model for real-time failure prognosis based on hidden Markov model and belief rule base

    European Journal of Operational Research

    (2010)
  • Z.J. Zhou et al.

    A sequential learning algorithm for online constructing belief-rule-based systems

    Expert Systems with Applications

    (2010)
  • J.J. Zhu et al.

    Multiple stages grey target decision making method with incomplete weight based on multi-granularity linguistic label

    Information Sciences

    (2012)
  • C.H. Cheng et al.

    Evaluating the best main battle tank using fuzzy decision theory with linguistic criteria evaluation

    European Journal of Operational Research

    (2002)
  • R.O. Duda et al.

    Subjective Bayesian Methods for Rule-Based Inference Systems

    (1976)
  • M. Barla et al.

    Rule-based user characteristics acquisition from logs with semantics for personalized web-based systems

    Computing and Informatics

    (2009)
  • A. Rotolo

    Rule-based agents, compliance, and intention reconsideration in defeasible logic

  • J.B. Yang et al.

    An evidential reasoning approach for multiple-attribute decision making with uncertainty

    IEEE Transactions on Systems, Man, and Cybernetics – Part A: Systems and Humans

    (1994)
  • J.B. Yang et al.

    Belief rule-base inference methodology using the evidential reasoning approach-RIMER

    IEEE Transactions on Systems, Man, and Cybernetics

    (2006)
  • J.B. Yang et al.

    Nonlinear information aggregation via evidential reasoning in multiattribute decision analysis under uncertainty

    IEEE Transactions on Systems, Man, and Cybernetics

    (2002)
  • C. Fu et al.

    Group consensus based on evidential reasoning approach using interval-valued belief structures

    Knowledge-Based Systems

    (2012)
  • D.L. Xu et al.

    Integrated efficiency and trade-off analysis using DEA-oriented interactive minimax reference point approach

    Computers & Operational Research

    (2012)
  • J. Jiang

    Modeling, Reasoning and Learning Approach to Evidential Network

    (2011)
  • Z.J. Zhou

    Online Modeling Methods of Belief Rule Base and Failure Prognosis

    (2010)
  • Z.J. Zhou et al.

    Belief Rule Base Expert System and Complex System Modeling

    (2011)
  • C.I. Lee, C.J. Tsai, J.H. Wu, W.P. Yang, A decision tree-based approach to mining the rules of concept drift, in:...
  • B. Kavsek et al.

    APRIORI-SD: adapting association rule learning to subgroup discovering

    Lecture Notes in Computer Science

    (2003)
  • J. Li et al.

    A method of discovering important measures for knowledge discovery

    The Knowledge Engineering Review

    (2010)
  • H. Abe et al.

    Analyzing behavior of objective rule evaluation indices based on Pearson product-moment correlation coefficient

    Lecture Notes in Artificial Intelligence

    (2008)
  • S. Tsumoto et al.

    Visualization of similarities and dissimilarities in rules using multidimensional scaling

    Lecture Notes in Computer Science

    (2005)
  • E. Suzuki

    Pitfalls for categorizations of objective interestingness measures for rule discovery

  • Cited by (114)

    • Interpretable hierarchical belief rule base expert system for complex system modeling

      2024, Measurement: Journal of the International Measurement Confederation
    • A data-driven emergency plan evaluation method based on improved RIMER

      2023, Reliability Engineering and System Safety
    • An interpretable method for inertial platform fault diagnosis based on combination belief rule base

      2023, Measurement: Journal of the International Measurement Confederation
    View all citing articles on Scopus
    View full text