Skip to main content
Top

2017 | OriginalPaper | Chapter

Scalable IQRA_IG Algorithm: An Iterative MapReduce Approach for Reduct Computation

Authors : P. S. V. S. Sai Prasad, H. Bala Subrahmanyam, Praveen Kumar Singh

Published in: Distributed Computing and Internet Technology

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Feature Selection is an important preprocessing step in any machine learning model construction. Rough Set based feature selection (Reduct) methods provide efficient selection of attributes for the model without loss of information. Quick Reduct Algorithm is a key Reduct computation approach in Complete Symbolic Decision Systems. Authors have earlier implemented a scalable approach for Quick Reduct Algorithm as In-place MapReduce based Quick Reduct Algorithm using Twister’s Iterative MapReduce Framework. Improved Quick Reduct Algorithm is a standalone extension to Quick Reduct Algorithm by incorporating Trivial Ambiguity Resolution and Positive Region Removal. This work develops design and implementation of distributed/parallel algorithm for Improved Quick Reduct Algorithm by incorporation of Trivial Ambiguity Resolution and Positive Region Removal in In-place MapReduce based Quick Reduct Algorithm. Experiments conducted on large benchmark decision systems have empirically established the significance of computational gain and scalability of proposed algorithm in comparison to earlier approaches in literature.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Anaraki, J.R., Eftekhari, M.: Rough set based feature selection: a review. In: 2013 5th Conference on Information and Knowledge Technology (IKT), pp. 301–306, May 2013 Anaraki, J.R., Eftekhari, M.: Rough set based feature selection: a review. In: 2013 5th Conference on Information and Knowledge Technology (IKT), pp. 301–306, May 2013
3.
go back to reference Bu, Y., Howe, B., Balazinska, M., Ernst, M.D.: Haloop: efficient iterative data processing on large clusters. Proc. VLDB Endow. 3(1–2), 285–296 (2010)CrossRef Bu, Y., Howe, B., Balazinska, M., Ernst, M.D.: Haloop: efficient iterative data processing on large clusters. Proc. VLDB Endow. 3(1–2), 285–296 (2010)CrossRef
4.
go back to reference Chouchoulas, A., Shen, Q.: Rough set-aided keyword reduction for text categorization. Appl. Artif. Intell. 15(9), 843–873 (2001)CrossRef Chouchoulas, A., Shen, Q.: Rough set-aided keyword reduction for text categorization. Appl. Artif. Intell. 15(9), 843–873 (2001)CrossRef
5.
go back to reference Ekanayake, J., Li, H., Zhang, B., Gunarathne, T., Bae, S.H., Qiu, J., Fox, G.C.: Twister: a runtime for iterative MapReduce. In: Hariri, S., Keahey, K. (eds.) HPDC, pp. 810–818. ACM (2010) Ekanayake, J., Li, H., Zhang, B., Gunarathne, T., Bae, S.H., Qiu, J., Fox, G.C.: Twister: a runtime for iterative MapReduce. In: Hariri, S., Keahey, K. (eds.) HPDC, pp. 810–818. ACM (2010)
6.
go back to reference Hoa, N.S.: Some efficient algorithms for rough set methods. In: Proceedings IPMU 1996 Granada, Spain, pp. 1541–1457 (1996) Hoa, N.S.: Some efficient algorithms for rough set methods. In: Proceedings IPMU 1996 Granada, Spain, pp. 1541–1457 (1996)
7.
go back to reference Jakovits, P., Srirama, S.N.: Evaluating MapReduce frameworks for iterative scientific computing applications. In: 2014 International Conference on High Performance Computing Simulation (HPCS), pp. 226–233 (2014) Jakovits, P., Srirama, S.N.: Evaluating MapReduce frameworks for iterative scientific computing applications. In: 2014 International Conference on High Performance Computing Simulation (HPCS), pp. 226–233 (2014)
8.
go back to reference Komorowski, J., Polkowski, L., Skowron, A.: Rough sets for data mining and knowledge discovery. In: Komorowski, J., Zytkow, J. (eds.) PKDD 1997. LNCS, vol. 1263, p. 393. Springer, Heidelberg (1997). doi:10.1007/3-540-63223-9_139 CrossRef Komorowski, J., Polkowski, L., Skowron, A.: Rough sets for data mining and knowledge discovery. In: Komorowski, J., Zytkow, J. (eds.) PKDD 1997. LNCS, vol. 1263, p. 393. Springer, Heidelberg (1997). doi:10.​1007/​3-540-63223-9_​139 CrossRef
10.
go back to reference Nguyen, H.S., Skowron, A.: Boolean reasoning for feature extraction problems. In: Raś, Z.W., Skowron, A. (eds.) ISMIS 1997. LNCS, vol. 1325, pp. 117–126. Springer, Heidelberg (1997). doi:10.1007/3-540-63614-5_11 CrossRef Nguyen, H.S., Skowron, A.: Boolean reasoning for feature extraction problems. In: Raś, Z.W., Skowron, A. (eds.) ISMIS 1997. LNCS, vol. 1325, pp. 117–126. Springer, Heidelberg (1997). doi:10.​1007/​3-540-63614-5_​11 CrossRef
12.
go back to reference Sai Prasad, P.S.V.S., Raghavendra Rao, C.R.: Extensions to IQuickReduct. In: Sombattheera, C., Agarwal, A., Udgata, S.K., Lavangnananda, K. (eds.) MIWAI 2011. LNCS (LNAI), vol. 7080, pp. 351–362. Springer, Heidelberg (2011). doi:10.1007/978-3-642-25725-4_31 CrossRef Sai Prasad, P.S.V.S., Raghavendra Rao, C.R.: Extensions to IQuickReduct. In: Sombattheera, C., Agarwal, A., Udgata, S.K., Lavangnananda, K. (eds.) MIWAI 2011. LNCS (LNAI), vol. 7080, pp. 351–362. Springer, Heidelberg (2011). doi:10.​1007/​978-3-642-25725-4_​31 CrossRef
13.
go back to reference Qian, J., Miao, D., Zhang, Z., Yue, X.: Parallel attribute reduction algorithms using MapReduce. Inf. Sci. 279, 671–690 (2014)MathSciNetCrossRef Qian, J., Miao, D., Zhang, Z., Yue, X.: Parallel attribute reduction algorithms using MapReduce. Inf. Sci. 279, 671–690 (2014)MathSciNetCrossRef
14.
go back to reference Singh, P.K., Sai Prasad, P.S.V.S.: Scalable quick reduct algorithm: iterative MapReduce approach. In: Marathe, M., Mohania, M.K., Mausam, J.P. (eds.) CODS, pp. 25:1–25:2. ACM (2016) Singh, P.K., Sai Prasad, P.S.V.S.: Scalable quick reduct algorithm: iterative MapReduce approach. In: Marathe, M., Mohania, M.K., Mausam, J.P. (eds.) CODS, pp. 25:1–25:2. ACM (2016)
15.
go back to reference Thangavel, K., Pethalakshmi, A.: Dimensionality reduction based on rough set theory: a review. Appl. Soft Comput. 9(1), 1–12 (2009)CrossRef Thangavel, K., Pethalakshmi, A.: Dimensionality reduction based on rough set theory: a review. Appl. Soft Comput. 9(1), 1–12 (2009)CrossRef
16.
go back to reference Yang, Y., Chen, Z., Liang, Z., Wang, G.: Attribute reduction for massive data based on rough set theory and MapReduce. In: Yu, J., Greco, S., Lingras, P., Wang, G., Skowron, A. (eds.) RSKT 2010. LNCS (LNAI), vol. 6401, pp. 672–678. Springer, Heidelberg (2010). doi:10.1007/978-3-642-16248-0_91 CrossRef Yang, Y., Chen, Z., Liang, Z., Wang, G.: Attribute reduction for massive data based on rough set theory and MapReduce. In: Yu, J., Greco, S., Lingras, P., Wang, G., Skowron, A. (eds.) RSKT 2010. LNCS (LNAI), vol. 6401, pp. 672–678. Springer, Heidelberg (2010). doi:10.​1007/​978-3-642-16248-0_​91 CrossRef
17.
go back to reference Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud 2010, p. 10. USENIX Association, Berkeley (2010) Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud 2010, p. 10. USENIX Association, Berkeley (2010)
Metadata
Title
Scalable IQRA_IG Algorithm: An Iterative MapReduce Approach for Reduct Computation
Authors
P. S. V. S. Sai Prasad
H. Bala Subrahmanyam
Praveen Kumar Singh
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-50472-8_5

Premium Partner