2016 | OriginalPaper | Chapter

Automatic Categorization of Email into Folders by Ant Colony Decision Tree and Social Networks

Authors: Urszula Boryczka, Barbara Probierz, Jan Kozak

Published in: Intelligent Decision Technologies 2016

Publisher: Springer International Publishing

Abstract

This paper presents a new approach to the automatic categorization of email messages into mailbox folders. The aim is to create an algorithm that improves the classification of emails into folders by applying solutions used in the Ant Colony Decision Tree (ACDT) algorithm. The algorithm also incorporates elements of Social Network Analysis (SNA). The proposed algorithm was tested on the publicly available Enron E-mail data set, and all experiments were conducted on uncleaned data. For comparison, additional tests were carried out with selected, generally available classifiers. The results confirm that the proposed approach improves the accuracy with which new emails are assigned to particular folders based on an analysis of previous correspondence, even when uncleaned data sets are used.
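To make the task concrete, the following is a minimal sketch of folder assignment from previous correspondence. It is not the ACDT-based algorithm of the chapter: it is a simple sender-frequency baseline, paired with a degree count as one example of a social-network measure that could be derived from the same mail log. The toy history, the folder names, and the function names are all hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical toy history of (sender, recipient, folder) triples.
# Real input would come from a mailbox such as the Enron data set.
HISTORY = [
    ("alice", "me", "projects"),
    ("alice", "me", "projects"),
    ("alice", "bob", "projects"),
    ("bob", "me", "personal"),
    ("carol", "me", "hr"),
    ("carol", "me", "projects"),
    ("carol", "me", "hr"),
]


def degree_centrality(history):
    """Count distinct correspondents per person (undirected degree)."""
    neighbours = defaultdict(set)
    for sender, recipient, _folder in history:
        neighbours[sender].add(recipient)
        neighbours[recipient].add(sender)
    return {person: len(peers) for person, peers in neighbours.items()}


def train(history):
    """Count how often each folder was used, per sender and overall."""
    per_sender = defaultdict(Counter)
    overall = Counter()
    for sender, _recipient, folder in history:
        per_sender[sender][folder] += 1
        overall[folder] += 1
    return per_sender, overall


def predict(sender, per_sender, overall):
    """Return the sender's most frequent folder, or the globally most
    frequent folder when the sender has no history."""
    counts = per_sender.get(sender) or overall
    return counts.most_common(1)[0][0]


if __name__ == "__main__":
    per_sender, overall = train(HISTORY)
    print(predict("carol", per_sender, overall))
    print(predict("dave", per_sender, overall))
    print(degree_centrality(HISTORY))
```

In a fuller pipeline, a measure such as the degree computed here could be supplied as an additional attribute to a decision-tree learner; how the chapter actually combines SNA with ACDT is not specified in the abstract.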

Metadata

Title: Automatic Categorization of Email into Folders by Ant Colony Decision Tree and Social Networks

Authors: Urszula Boryczka, Barbara Probierz, Jan Kozak

Copyright Year: 2016

DOI: https://doi.org/10.1007/978-3-319-39627-9_7
