Elsevier

Knowledge-Based Systems

Volume 27, March 2012, Pages 443-450
Knowledge-Based Systems

Uncertainty measurement for interval-valued decision systems based on extended conditional entropy

https://doi.org/10.1016/j.knosys.2011.10.013Get rights and content

Abstract

Uncertainty measures can supply new points of view for analyzing data and help us to disclose the substantive characteristics of data sets. Some uncertainty measures for single-valued information systems or single-valued decision systems have been developed. However, there are few studies on the uncertainty measurement for interval-valued information systems or interval-valued decision systems. This paper addresses the uncertainty measurement problem in interval-valued decision systems. An extended conditional entropy is proposed in interval-valued decision systems based on possible degree between interval values. Consequently, a concept called rough decision entropy is introduced to evaluate the uncertainty of an interval-valued decision system. Besides, the original approximation accuracy measure proposed by Pawlak is extended to deal with interval-valued decision systems and the concept of interval approximation roughness is presented. Experimental results demonstrate that the rough decision entropy measure and the interval approximation roughness measure are effective and valid for evaluating the uncertainty measurement of interval-valued decision systems. Experimental results also indicate that the rough decision entropy measure outperforms the interval approximation roughness measure.

Introduction

Rough set theory, originally proposed by Pawlak and discussed in greater detail in [1], [2], has become a popular approach for the joint management of uncertainty and vagueness and has been applied in many fields [3], [4], [5], [6], [7].

Pawlak [2] proposed two numerical measures accuracy and roughness to evaluate uncertainty of a rough set in information systems, as well as approximation accuracy of a rough classification in decision systems. Some efforts were attracted to extend the Pawlak’s uncertainty model. Based on granulation, a measurement of uncertainty of a set in an information system and approximation accuracy of a rough classification in a decision table was proposed in [8].

Information entropy, proposed by Shannon [9] in information theory, has been an effective and powerful mechanism for characterizing the information content in diverse models. The measurement of uncertain information by entropy has been deployed in a wide range of fields. The extension of entropy and its variants were adapted for rough set in [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21]. For example, Düntsch and Gediga defined the information entropy and three kinds of conditional entropies in rough sets for predicting a decision attribute [12]. Beaubouef et al. [13] proposed a method measuring uncertainty of rough sets and rough relation databases based on rough entropy. Wierman [11] presented the measures of uncertainty and granularity in rough set theory, along with an axiomatic derivation. Yao et al. [14] studied several kinds of information-theoretical measures for attribute importance in rough set theory. Liang et al. [16] proposed a new method for evaluating both uncertainty and fuzziness. Qian and Liang [19] proposed a combination entropy for evaluating uncertainty of a knowledge from an information system. However, the methods mentioned above are based on single-valued information systems.

Interval-valued information systems (or Interval information systems) are an important type of data tables, and generalized models of single-valued information systems [22]. Several authors have studied about interval-valued information systems and interval-valued decision systems [22], [23], [24], [25], [26]. Yao et al. [23], [24] presented a model for the interval set by using the lower and upper approximations in interval-valued information systems, as well as introduced the generalized decision logic. Leung et al. [26] investigated a rough set approach to discover classification rules through a process of knowledge induction which selects decision rules with a minimal set of features in interval-valued information systems. Qian et al. [22] proposed a dominance relation to interval information systems. Yang et al. [25] presented a dominance relation and generated the optimal decision rules in incomplete interval-valued information system. Wu and Liu [27] introduced the real formal concept analysis about grey-rough set theory by using grey numbers, and proposed a grey-rough set approach to Galois lattices reductions. So far, however, there are few studies on the uncertainty measurement issue for interval-valued information systems (corresponding to unsupervised learning) or interval-valued decision systems (corresponding to supervised learning). In this paper, we address the uncertainty measurement issue in interval-valued decision systems and intend to construct effective uncertainty measures for interval-valued decision system. A similarity relation based on possible degree between two interval numbers is given, under which the concept of extended conditional entropy is proposed. Based on the proposed concept of conditional entropy, a measure of uncertainty for interval-valued decision systems called rough decision entropy is presented. Besides, the original approximation accuracy measure proposed by Pawlak is extended to deal with interval-valued decision systems and the concept of interval approximation roughness is presented. Experimental results demonstrate that the rough decision entropy measure and the interval approximation roughness measure are effective and valid for evaluating the uncertainty measurement of interval-valued decision systems. Experimental results also indicate that the rough decision entropy measure outperforms the interval approximation roughness measure.

This rest of the paper is organized as follows. Some basic concepts and notations in rough set theory are introduced in Section 2. In Section 3, serval key concepts of our method in interval-valued decision systems are illustrated in detail, including similarity degree, θ-conditional entropy, as well as θ-rough decision entropy. Some illustrative examples are also given. Simulation experiments are conducted to test and verify the effectiveness of the proposed measure in Section 4. Section 5 concludes the paper.

Section snippets

Preliminary knowledge

At first, some basic concepts in rough set theory are reviewed, including decision system, indiscernibility relation and approximation regions.

Similarity relation between two intervals

Ranking interval values are quite different from ranking real values [28], [29], [30]. In this section, a similarity measure based on possible degree is constructed to estimate two interval values.

Definition 1

[28]

Let A = [a, a+] and B = [b, b+] be two interval values. The possible degree of interval valued A relative to interval value B is defined as:P(AB)=min1,maxa+-b-(a+-a-)+(b+-b-),0P(AB) can be viewed as the possible degree of interval valued A greater than interval value B.

It is worth noting that P(AB)  P(B

Experiments

To test and verify the effectiveness and the validity of the proposed uncertainty measure, some experiments are conducted on different interval-valued decision systems. These interval-valued decision systems are shown in Table 2, Table 3, Table 4 respectively.

There are 10 objects and 6 condition attributes in the interval-valued decision system shown in Table 2. There are 15 objects and 8 condition attributes in the interval-valued decision system shown in Table 3. There are 20 objects and 9

Conclusions

In this paper, we have studied the uncertainty measurement problem in interval-valued decision systems. Extended conditional entropy is proposed in interval-valued decision systems based on possible degree between interval values. Consequently, a concept called rough decision entropy is introduced to measure the uncertainty of an interval-valued decision system. Besides, the original approximation accuracy measure proposed by Pawlak is extended to deal with interval-valued decision systems and

Acknowledgements

The work is supported by the National Natural Science Foundation of China (Nos. 61070074 and 60703038) and the Excellent Young Teachers Program of Zhejiang University.

References (30)

  • X. Yang et al.

    Dominance-based rough set approach to incomplete interval-valued information system

    Data and Knowledge Engineering

    (2009)
  • Y. Leung et al.

    A rough set approach for the discovery of classification rules in interval-valued information systems

    International Journal of Approximate Reasoning

    (2008)
  • Q. Wu et al.

    Real formal concept analysis based on grey-rough set theory

    Knowledge-Based Systems

    (2009)
  • Y. Nakahara et al.

    On the linear programming problems with interval coefficients

    Computers and Industrial Engineering

    (1992)
  • Z. Pawlak

    Rough sets

    International Journal of Computer Information Science

    (1982)
  • Cited by (143)

    • Information-theoretic measures of uncertainty for interval-set decision tables

      2021, Information Sciences
      Citation Excerpt :

      Considering that the set-valued attributes are the special cases of interval-set-valued attributes, we only investigate the interval-set-valued attributes in this paper. Referring to Fig. 1, there are two types of interval decision tables, namely, interval-valued decision tables [6,30] and interval-set decision tables [16,39]. At present, various research problems in interval-valued decision tables have been widely studied, such as uncertainty measurements [5,28], attribute reduction [6], incremental updating approximations [47] and so on.

    View all citing articles on Scopus
    View full text