Skip to main content
Erschienen in: Soft Computing 2/2010

01.01.2010 | Focus

A three-layer back-propagation neural network for spam detection using artificial immune concentration

verfasst von: Guangchen Ruan, Ying Tan

Erschienen in: Soft Computing | Ausgabe 2/2010

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, a three-layer back-propagation neural network (BPNN) is employed for spam detection by using a concentration based feature construction (CFC) approach. In the CFC approach, ‘self’ and ‘non-self’ concentrations are constructed through ‘self’ and ‘non-self’ gene libraries, respectively, to form a two-element concentration vector for expressing the e-mail efficiently. A three-layer BPNN with two-element input is then employed to classify e-mails automatically. Comprehensive experiments are conducted on two public benchmark corpora PU1 and Ling to demonstrate that the proposed CFC approach based BPNN classifier not only has a very much fast speed but also achieves 97 and 99% of classification accuracy on corpora PU1 and Ling by just using a two-element concentration feature vector.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
The PU1 corpus and Ling corpus may be downloaded from http://​www.​cil.​pku.​edu.​cn/​resources/​.
 
Literatur
Zurück zum Zitat Androutsopoulos I, Koutsias J, Chandrinos KV, Spyropoulos CD (2000a) An experimental comparison of Naive Bayesian and keyword-based anti-spam filtering with personal e-mail messages. In: Proceedings of the 23rd ACM SIGIR conference on research and development in information retrieval, pp 160–167 Androutsopoulos I, Koutsias J, Chandrinos KV, Spyropoulos CD (2000a) An experimental comparison of Naive Bayesian and keyword-based anti-spam filtering with personal e-mail messages. In: Proceedings of the 23rd ACM SIGIR conference on research and development in information retrieval, pp 160–167
Zurück zum Zitat Androutsopoulos I, Koutsias J, Chandrinos KV, Paliouras G, Spyropoulos CD (2000b) An evaluation of Naive Bayesian anti-spam filtering. In: Proceedings of European conference on machine learning (ECML 2000) Androutsopoulos I, Koutsias J, Chandrinos KV, Paliouras G, Spyropoulos CD (2000b) An evaluation of Naive Bayesian anti-spam filtering. In: Proceedings of European conference on machine learning (ECML 2000)
Zurück zum Zitat Bhattacharyya M, Schultz M (2002) MET: an experimental system for Malicious email tracking. In: Proceedings of new security paradigms workshop, pp 3–10 Bhattacharyya M, Schultz M (2002) MET: an experimental system for Malicious email tracking. In: Proceedings of new security paradigms workshop, pp 3–10
Zurück zum Zitat Brendel R, Krawczyk H (2007) Detection methods of dynamic spammers’ behavior. In: International conference on dependability of computer systems, pp 145–152 Brendel R, Krawczyk H (2007) Detection methods of dynamic spammers’ behavior. In: International conference on dependability of computer systems, pp 145–152
Zurück zum Zitat Clark J, Koprinska I, Poon J (2003) A neural network based approach to automated e-mail classification. In: Proceedings of IEEE international conference on web intelligence (WI 2003), pp 702–705 Clark J, Koprinska I, Poon J (2003) A neural network based approach to automated e-mail classification. In: Proceedings of IEEE international conference on web intelligence (WI 2003), pp 702–705
Zurück zum Zitat Drucker H, Wu DH, Vapnik VN (1999) Support vector machines for spam categorization. IEEE Trans Neural Netw 10:1048–1054 Drucker H, Wu DH, Vapnik VN (1999) Support vector machines for spam categorization. IEEE Trans Neural Netw 10:1048–1054
Zurück zum Zitat Gunal S, Ergin S, Gulmezoglu MB, Gerek ON (2006) On feature extraction for spam e-mail detection. Lectures Notes on Computer Science. Springer, Berlin, pp 635–642 Gunal S, Ergin S, Gulmezoglu MB, Gerek ON (2006) On feature extraction for spam e-mail detection. Lectures Notes on Computer Science. Springer, Berlin, pp 635–642
Zurück zum Zitat Katakis I, Tsoumakas G, Vlahavas I (2006) Dynamic feature space and incremental feature selection for the classification of textual data streams. In: Proceedings of international workshop on knowledge discovery from data streams, pp 107–116 Katakis I, Tsoumakas G, Vlahavas I (2006) Dynamic feature space and incremental feature selection for the classification of textual data streams. In: Proceedings of international workshop on knowledge discovery from data streams, pp 107–116
Zurück zum Zitat Koprinska I, Poon J, Clark J, Chan J (2007) Learning to classify e-mail. Inf Sci, pp 2167–2187 Koprinska I, Poon J, Clark J, Chan J (2007) Learning to classify e-mail. Inf Sci, pp 2167–2187
Zurück zum Zitat Leiba B, Borenstein N (2004) A multifaceted approach to spam reduction. In: Proceedings of the first conference on email and antispam (CEAS 2004) Leiba B, Borenstein N (2004) A multifaceted approach to spam reduction. In: Proceedings of the first conference on email and antispam (CEAS 2004)
Zurück zum Zitat Li Y, Fang BX, Guo L, Wang S (2006) Research of a novel anti-spam technique based on users’s feedback and improved Naive Bayesian approach. In: Proceedings of IEEE international conference on networking and services (ICNS 2006), pp 86–91 Li Y, Fang BX, Guo L, Wang S (2006) Research of a novel anti-spam technique based on users’s feedback and improved Naive Bayesian approach. In: Proceedings of IEEE international conference on networking and services (ICNS 2006), pp 86–91
Zurück zum Zitat Oda T, White T (2003) Increasing the accuracy of a spam-detecting artificial immune system. In: Proceedings of IEEE congress on evolutionary computation (CEC 2003), pp 390–396 Oda T, White T (2003) Increasing the accuracy of a spam-detecting artificial immune system. In: Proceedings of IEEE congress on evolutionary computation (CEC 2003), pp 390–396
Zurück zum Zitat Oda T, White T (2005) Immunity from spam: an analysis of an artificial immune system for junk email detection. In: International conference on artificial immune systems (ICARIS 2005) Oda T, White T (2005) Immunity from spam: an analysis of an artificial immune system for junk email detection. In: International conference on artificial immune systems (ICARIS 2005)
Zurück zum Zitat Rigoutsos I, Floratos A (1998) Combinatorial pattern discovery in biological sequences: the TEIRESIAS algorithm. Bioinformatics, pp 55–67 Rigoutsos I, Floratos A (1998) Combinatorial pattern discovery in biological sequences: the TEIRESIAS algorithm. Bioinformatics, pp 55–67
Zurück zum Zitat Rigoutsos I, Huynh T (2004) Chung–Kwei: a pattern-discovery-based system for the automatic identification of unsolicited e-mail messages (SPAM). In: Proceedings of the first conference on email and antispam (CEAS 2004) Rigoutsos I, Huynh T (2004) Chung–Kwei: a pattern-discovery-based system for the automatic identification of unsolicited e-mail messages (SPAM). In: Proceedings of the first conference on email and antispam (CEAS 2004)
Zurück zum Zitat Ruan GC, Tan Y (2007) Intelligent detection approaches for spam. In: The third international conference on natural computation (ICNC 2007) vol 3, August 24–27, Haikou, China, pp 672–676 Ruan GC, Tan Y (2007) Intelligent detection approaches for spam. In: The third international conference on natural computation (ICNC 2007) vol 3, August 24–27, Haikou, China, pp 672–676
Zurück zum Zitat Ruan GC, Tan Y (2008) Uninterrupted approaches for spam detection based on SVM and AIS. IEEE Trans Syst Man Cybern B Ruan GC, Tan Y (2008) Uninterrupted approaches for spam detection based on SVM and AIS. IEEE Trans Syst Man Cybern B
Zurück zum Zitat Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors, Nature, pp 533–536 Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors, Nature, pp 533–536
Zurück zum Zitat Secker A, Freitas AA, Timmis J (2003) AISEC: an artificial immune system for email classification. In: Proceedings of IEEE congress on evolutionary computation (CEC 2003), pp 131–139 Secker A, Freitas AA, Timmis J (2003) AISEC: an artificial immune system for email classification. In: Proceedings of IEEE congress on evolutionary computation (CEC 2003), pp 131–139
Zurück zum Zitat Shrestha R, Lin YP (2005) Improved bayesian spam filtering based on co-weighted multi-area information, Lecture Notes in Artificial Intelligence. Springer, Berlin, pp 650–660 Shrestha R, Lin YP (2005) Improved bayesian spam filtering based on co-weighted multi-area information, Lecture Notes in Artificial Intelligence. Springer, Berlin, pp 650–660
Zurück zum Zitat Stolfo S, Hershkop S (2006) Behavior-based modeling and its application to email analysis. ACM Trans Internet Technol 6:187–221CrossRef Stolfo S, Hershkop S (2006) Behavior-based modeling and its application to email analysis. ACM Trans Internet Technol 6:187–221CrossRef
Zurück zum Zitat Stuart I, Cha SH, Tappert C (2004) A neural network classifier for junk e-mail. Lecture Notes on Computer Science. Springer, Berlin, pp 442–450 Stuart I, Cha SH, Tappert C (2004) A neural network classifier for junk e-mail. Lecture Notes on Computer Science. Springer, Berlin, pp 442–450
Zurück zum Zitat Tan Y, Ruan GC (2007) Recognition of electronic junk mails based on artificial immune system. In: The third joint workshop on machine perception and robotics (MPR 2007), Nov 25–27, Ritsumeikan University, Japan Tan Y, Ruan GC (2007) Recognition of electronic junk mails based on artificial immune system. In: The third joint workshop on machine perception and robotics (MPR 2007), Nov 25–27, Ritsumeikan University, Japan
Zurück zum Zitat Tan Y, Wang J (2004) A support vector network with hybrid Kernel and minimal Vapnik–Chervonenkis dimension. IEEE Trans Knowl Data Eng pp 385–395 Tan Y, Wang J (2004) A support vector network with hybrid Kernel and minimal Vapnik–Chervonenkis dimension. IEEE Trans Knowl Data Eng pp 385–395
Zurück zum Zitat Wang R, Youssef AM, Elhakeem AK (2006) On some feature selection strategies for spam filter design. In: Proceedings of Canadian conference on electrical and computer engineering, pp 2186–2189 Wang R, Youssef AM, Elhakeem AK (2006) On some feature selection strategies for spam filter design. In: Proceedings of Canadian conference on electrical and computer engineering, pp 2186–2189
Zurück zum Zitat Wu MW, Huang Y, Lu SK, Chen IY, Kuo SY (2005) A multi-faceted approach towards spam-resistible mail. In: Proceedings of IEEE Pacific Rim international symposium on dependable computing, pp 208–218 Wu MW, Huang Y, Lu SK, Chen IY, Kuo SY (2005) A multi-faceted approach towards spam-resistible mail. In: Proceedings of IEEE Pacific Rim international symposium on dependable computing, pp 208–218
Zurück zum Zitat Yeh CY, Wu CH, Doong SH (2005) Effective spam classification based on meta-heuristics. In: Proceedings of IEEE international conference on systems, man and cybernetics, pp 3872–3877 Yeh CY, Wu CH, Doong SH (2005) Effective spam classification based on meta-heuristics. In: Proceedings of IEEE international conference on systems, man and cybernetics, pp 3872–3877
Metadaten
Titel
A three-layer back-propagation neural network for spam detection using artificial immune concentration
verfasst von
Guangchen Ruan
Ying Tan
Publikationsdatum
01.01.2010
Verlag
Springer-Verlag
Erschienen in
Soft Computing / Ausgabe 2/2010
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-009-0440-2

Weitere Artikel der Ausgabe 2/2010

Soft Computing 2/2010 Zur Ausgabe

Premium Partner