Skip to main content

2017 | OriginalPaper | Buchkapitel

Scalable IQRA_IG Algorithm: An Iterative MapReduce Approach for Reduct Computation

verfasst von : P. S. V. S. Sai Prasad, H. Bala Subrahmanyam, Praveen Kumar Singh

Erschienen in: Distributed Computing and Internet Technology

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Feature Selection is an important preprocessing step in any machine learning model construction. Rough Set based feature selection (Reduct) methods provide efficient selection of attributes for the model without loss of information. Quick Reduct Algorithm is a key Reduct computation approach in Complete Symbolic Decision Systems. Authors have earlier implemented a scalable approach for Quick Reduct Algorithm as In-place MapReduce based Quick Reduct Algorithm using Twister’s Iterative MapReduce Framework. Improved Quick Reduct Algorithm is a standalone extension to Quick Reduct Algorithm by incorporating Trivial Ambiguity Resolution and Positive Region Removal. This work develops design and implementation of distributed/parallel algorithm for Improved Quick Reduct Algorithm by incorporation of Trivial Ambiguity Resolution and Positive Region Removal in In-place MapReduce based Quick Reduct Algorithm. Experiments conducted on large benchmark decision systems have empirically established the significance of computational gain and scalability of proposed algorithm in comparison to earlier approaches in literature.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Anaraki, J.R., Eftekhari, M.: Rough set based feature selection: a review. In: 2013 5th Conference on Information and Knowledge Technology (IKT), pp. 301–306, May 2013 Anaraki, J.R., Eftekhari, M.: Rough set based feature selection: a review. In: 2013 5th Conference on Information and Knowledge Technology (IKT), pp. 301–306, May 2013
3.
Zurück zum Zitat Bu, Y., Howe, B., Balazinska, M., Ernst, M.D.: Haloop: efficient iterative data processing on large clusters. Proc. VLDB Endow. 3(1–2), 285–296 (2010)CrossRef Bu, Y., Howe, B., Balazinska, M., Ernst, M.D.: Haloop: efficient iterative data processing on large clusters. Proc. VLDB Endow. 3(1–2), 285–296 (2010)CrossRef
4.
Zurück zum Zitat Chouchoulas, A., Shen, Q.: Rough set-aided keyword reduction for text categorization. Appl. Artif. Intell. 15(9), 843–873 (2001)CrossRef Chouchoulas, A., Shen, Q.: Rough set-aided keyword reduction for text categorization. Appl. Artif. Intell. 15(9), 843–873 (2001)CrossRef
5.
Zurück zum Zitat Ekanayake, J., Li, H., Zhang, B., Gunarathne, T., Bae, S.H., Qiu, J., Fox, G.C.: Twister: a runtime for iterative MapReduce. In: Hariri, S., Keahey, K. (eds.) HPDC, pp. 810–818. ACM (2010) Ekanayake, J., Li, H., Zhang, B., Gunarathne, T., Bae, S.H., Qiu, J., Fox, G.C.: Twister: a runtime for iterative MapReduce. In: Hariri, S., Keahey, K. (eds.) HPDC, pp. 810–818. ACM (2010)
6.
Zurück zum Zitat Hoa, N.S.: Some efficient algorithms for rough set methods. In: Proceedings IPMU 1996 Granada, Spain, pp. 1541–1457 (1996) Hoa, N.S.: Some efficient algorithms for rough set methods. In: Proceedings IPMU 1996 Granada, Spain, pp. 1541–1457 (1996)
7.
Zurück zum Zitat Jakovits, P., Srirama, S.N.: Evaluating MapReduce frameworks for iterative scientific computing applications. In: 2014 International Conference on High Performance Computing Simulation (HPCS), pp. 226–233 (2014) Jakovits, P., Srirama, S.N.: Evaluating MapReduce frameworks for iterative scientific computing applications. In: 2014 International Conference on High Performance Computing Simulation (HPCS), pp. 226–233 (2014)
8.
Zurück zum Zitat Komorowski, J., Polkowski, L., Skowron, A.: Rough sets for data mining and knowledge discovery. In: Komorowski, J., Zytkow, J. (eds.) PKDD 1997. LNCS, vol. 1263, p. 393. Springer, Heidelberg (1997). doi:10.1007/3-540-63223-9_139 CrossRef Komorowski, J., Polkowski, L., Skowron, A.: Rough sets for data mining and knowledge discovery. In: Komorowski, J., Zytkow, J. (eds.) PKDD 1997. LNCS, vol. 1263, p. 393. Springer, Heidelberg (1997). doi:10.​1007/​3-540-63223-9_​139 CrossRef
10.
Zurück zum Zitat Nguyen, H.S., Skowron, A.: Boolean reasoning for feature extraction problems. In: Raś, Z.W., Skowron, A. (eds.) ISMIS 1997. LNCS, vol. 1325, pp. 117–126. Springer, Heidelberg (1997). doi:10.1007/3-540-63614-5_11 CrossRef Nguyen, H.S., Skowron, A.: Boolean reasoning for feature extraction problems. In: Raś, Z.W., Skowron, A. (eds.) ISMIS 1997. LNCS, vol. 1325, pp. 117–126. Springer, Heidelberg (1997). doi:10.​1007/​3-540-63614-5_​11 CrossRef
12.
Zurück zum Zitat Sai Prasad, P.S.V.S., Raghavendra Rao, C.R.: Extensions to IQuickReduct. In: Sombattheera, C., Agarwal, A., Udgata, S.K., Lavangnananda, K. (eds.) MIWAI 2011. LNCS (LNAI), vol. 7080, pp. 351–362. Springer, Heidelberg (2011). doi:10.1007/978-3-642-25725-4_31 CrossRef Sai Prasad, P.S.V.S., Raghavendra Rao, C.R.: Extensions to IQuickReduct. In: Sombattheera, C., Agarwal, A., Udgata, S.K., Lavangnananda, K. (eds.) MIWAI 2011. LNCS (LNAI), vol. 7080, pp. 351–362. Springer, Heidelberg (2011). doi:10.​1007/​978-3-642-25725-4_​31 CrossRef
13.
Zurück zum Zitat Qian, J., Miao, D., Zhang, Z., Yue, X.: Parallel attribute reduction algorithms using MapReduce. Inf. Sci. 279, 671–690 (2014)MathSciNetCrossRef Qian, J., Miao, D., Zhang, Z., Yue, X.: Parallel attribute reduction algorithms using MapReduce. Inf. Sci. 279, 671–690 (2014)MathSciNetCrossRef
14.
Zurück zum Zitat Singh, P.K., Sai Prasad, P.S.V.S.: Scalable quick reduct algorithm: iterative MapReduce approach. In: Marathe, M., Mohania, M.K., Mausam, J.P. (eds.) CODS, pp. 25:1–25:2. ACM (2016) Singh, P.K., Sai Prasad, P.S.V.S.: Scalable quick reduct algorithm: iterative MapReduce approach. In: Marathe, M., Mohania, M.K., Mausam, J.P. (eds.) CODS, pp. 25:1–25:2. ACM (2016)
15.
Zurück zum Zitat Thangavel, K., Pethalakshmi, A.: Dimensionality reduction based on rough set theory: a review. Appl. Soft Comput. 9(1), 1–12 (2009)CrossRef Thangavel, K., Pethalakshmi, A.: Dimensionality reduction based on rough set theory: a review. Appl. Soft Comput. 9(1), 1–12 (2009)CrossRef
16.
Zurück zum Zitat Yang, Y., Chen, Z., Liang, Z., Wang, G.: Attribute reduction for massive data based on rough set theory and MapReduce. In: Yu, J., Greco, S., Lingras, P., Wang, G., Skowron, A. (eds.) RSKT 2010. LNCS (LNAI), vol. 6401, pp. 672–678. Springer, Heidelberg (2010). doi:10.1007/978-3-642-16248-0_91 CrossRef Yang, Y., Chen, Z., Liang, Z., Wang, G.: Attribute reduction for massive data based on rough set theory and MapReduce. In: Yu, J., Greco, S., Lingras, P., Wang, G., Skowron, A. (eds.) RSKT 2010. LNCS (LNAI), vol. 6401, pp. 672–678. Springer, Heidelberg (2010). doi:10.​1007/​978-3-642-16248-0_​91 CrossRef
17.
Zurück zum Zitat Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud 2010, p. 10. USENIX Association, Berkeley (2010) Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud 2010, p. 10. USENIX Association, Berkeley (2010)
Metadaten
Titel
Scalable IQRA_IG Algorithm: An Iterative MapReduce Approach for Reduct Computation
verfasst von
P. S. V. S. Sai Prasad
H. Bala Subrahmanyam
Praveen Kumar Singh
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-50472-8_5

Premium Partner