Skip to main content
Top

2018 | OriginalPaper | Chapter

Group Outlying Aspects Mining

Authors : Shaoni Wang, Haiyang Xia, Gang Li, Jianlong Tan

Published in: Knowledge Science, Engineering and Management

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Existing works on outlying aspects mining have been focused on detecting the outlying aspects of a single query object, rather than the outlying aspects of a group of objects. While in many application scenarios, methods that can effectively mine the outlying aspects of a query group are needed. To fill this research gap, this paper extends the outlying aspects mining to the group level, and formalizes the problem of group outlying aspect mining. The Earth Move Distance based algorithm GOAM is then proposed to automatically identify the outlying aspects of the query group. The experiment result shows the capability of the proposed algorithm in identifying the group outlying aspects effectively.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Aggarwal, C.C., Yu, P.S.: Outlier detection for high dimensional data. ACM SIGMOD Record, vol. 30, no. 2, pp. 37–46 (2001) Aggarwal, C.C., Yu, P.S.: Outlier detection for high dimensional data. ACM SIGMOD Record, vol. 30, no. 2, pp. 37–46 (2001)
2.
go back to reference Dang, X.H., Assent, I., Ng, R.T., Zimek, A., Schubert, E.: Discriminative features for identifying and interpreting outliers. In: IEEE International Conference on Data Engineering, pp. 88–99 (2014) Dang, X.H., Assent, I., Ng, R.T., Zimek, A., Schubert, E.: Discriminative features for identifying and interpreting outliers. In: IEEE International Conference on Data Engineering, pp. 88–99 (2014)
4.
go back to reference Duan, L., Tang, G., Pei, J., Campbell, A., Campbell, A., Tang, C.: Mining outlying aspects on numeric data. Data Min. Knowl. Discov. 29(5), 1116–1151 (2015)MathSciNetCrossRef Duan, L., Tang, G., Pei, J., Campbell, A., Campbell, A., Tang, C.: Mining outlying aspects on numeric data. Data Min. Knowl. Discov. 29(5), 1116–1151 (2015)MathSciNetCrossRef
5.
go back to reference Kriegel, H.P., Hubert, M.S., Zimek, A.: Angle-based outlier detection in high-dimensional data. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 444–452 (2008) Kriegel, H.P., Hubert, M.S., Zimek, A.: Angle-based outlier detection in high-dimensional data. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 444–452 (2008)
6.
go back to reference Li, Q., Li, G., Niu, W., Cao, Y., Chang, L., Tan, J., Guo, L.: Boosting imbalanced data learning with wiener process oversampling. Front. Comput. Sci. 11(5), 836–851 (2017)CrossRef Li, Q., Li, G., Niu, W., Cao, Y., Chang, L., Tan, J., Guo, L.: Boosting imbalanced data learning with wiener process oversampling. Front. Comput. Sci. 11(5), 836–851 (2017)CrossRef
7.
go back to reference Li, Q., Niu, W., Li, G., Cao, Y., Tan, J., Guo, L.: Lingo: linearized grassmannian optimization for nuclear norm minimization. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 801–809. ACM (2015) Li, Q., Niu, W., Li, G., Cao, Y., Tan, J., Guo, L.: Lingo: linearized grassmannian optimization for nuclear norm minimization. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 801–809. ACM (2015)
8.
go back to reference Micenkova, B., Dang, X.H., Assent, I., Ng, R.T.: Explaining outliers by subspace separability. In: IEEE International Conference on Data Mining, pp. 518–527 (2013) Micenkova, B., Dang, X.H., Assent, I., Ng, R.T.: Explaining outliers by subspace separability. In: IEEE International Conference on Data Mining, pp. 518–527 (2013)
9.
go back to reference Nguyen, H.V., Muller, E., Vreeken, J., Keller, F., Bohm, K.: CMI: an information-theoretic contrast measure for enhancing subspace cluster and outlier detection, pp. 198–206 (2013) Nguyen, H.V., Muller, E., Vreeken, J., Keller, F., Bohm, K.: CMI: an information-theoretic contrast measure for enhancing subspace cluster and outlier detection, pp. 198–206 (2013)
10.
go back to reference Vinh, N.X., Chan, J., Bailey, J.: Reconsidering mutual information based feature selection: a statistical significance view. In: IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, pp. 1–10 (2014) Vinh, N.X., Chan, J., Bailey, J.: Reconsidering mutual information based feature selection: a statistical significance view. In: IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, pp. 1–10 (2014)
11.
go back to reference Vinh, N.X., Chan, J., Bailey, J., Leckie, C., Ramamohanarao, K., Pei, J.: Scalable outlying-inlying aspects discovery via feature ranking. In: IEEE International Symposium on Biomedical Imaging, pp. 182–185 (2015) Vinh, N.X., Chan, J., Bailey, J., Leckie, C., Ramamohanarao, K., Pei, J.: Scalable outlying-inlying aspects discovery via feature ranking. In: IEEE International Symposium on Biomedical Imaging, pp. 182–185 (2015)
12.
go back to reference Vinh, N.X., Chan, J., Romano, S., Bailey, J., Leckie, C., Ramamohanarao, K., Pei, J.: Discovering outlying aspects in large datasets. Data Min. Knowl. Discov. 30(6), 1520–1555 (2016)MathSciNetCrossRef Vinh, N.X., Chan, J., Romano, S., Bailey, J., Leckie, C., Ramamohanarao, K., Pei, J.: Discovering outlying aspects in large datasets. Data Min. Knowl. Discov. 30(6), 1520–1555 (2016)MathSciNetCrossRef
13.
go back to reference Xiong, P., Wang, X., Niu, W., Zhu, T., Li, G.: Android malware detection with contrasting permission patterns. China Commun. 11(8), 1–14 (2014)CrossRef Xiong, P., Wang, X., Niu, W., Zhu, T., Li, G.: Android malware detection with contrasting permission patterns. China Commun. 11(8), 1–14 (2014)CrossRef
14.
go back to reference Zhang, J., Lou, M., Ling, T.W., Wang, H.: HOS-miner : a system for detecting outlying subspaces of high-dimensional data. In: Thirtieth International Conference on Very Large Data Bases, pp. 1265–1268 (2004) Zhang, J., Lou, M., Ling, T.W., Wang, H.: HOS-miner : a system for detecting outlying subspaces of high-dimensional data. In: Thirtieth International Conference on Very Large Data Bases, pp. 1265–1268 (2004)
Metadata
Title
Group Outlying Aspects Mining
Authors
Shaoni Wang
Haiyang Xia
Gang Li
Jianlong Tan
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-99365-2_18

Premium Partner