Skip to main content

2018 | OriginalPaper | Buchkapitel

Classification of JPEG Files by Using Extreme Learning Machine

verfasst von : Rabei Raad Ali, Kamaruddin Malik Mohamad, Sapiee Jamel, Shamsul Kamal Ahmad Khalid

Erschienen in: Recent Advances on Soft Computing and Data Mining

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Recovery of data files when their system information missing is a challenging research issue. The recovery process entails methods that analyze the structure and contents of each individual file clusters. A primary and important process of files’ recovery is determining the files’ types including JPEG, DOC or HTML. This paper proposes an Extreme Learning Machine (ELM) algorithm to assign a class label of JPEG or Non-JPEG image for files in a continuous series of data clusters. The algorithm automatically classifies the files based on evaluation measures of three methods Entropy, Byte Frequency Distribution and Rate of Change. The ELM algorithm is applied to RABEI-2017 and DFRWS-2006 datasets. The experimental results show that the ELM algorithm is able to identify JPEG files of fragmented clusters with high accuracy rate. The classification accuracy of the RABEI-2017 dataset is 90.15% and the DFRWS-2006 is 93.46%. The DFRWS-2006 has more classes than the RABEI-2017 which improves the ELM classifier fitting.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Abdullah, N., Ibrahim, R., Mohamad, K.: Cluster size determination using JPEG files. In: 2012 Computational Science and Its Applications–ICCSA, pp. 353–363 (2012) Abdullah, N., Ibrahim, R., Mohamad, K.: Cluster size determination using JPEG files. In: 2012 Computational Science and Its Applications–ICCSA, pp. 353–363 (2012)
2.
Zurück zum Zitat Mohammed, M.A., Gani, M.K. A., Hamed, R.I., Mostafa, S.A., Ahmad, M.S., Ibrahim, D.A.: Solving vehicle routing problem by using improved genetic algorithm for optimal solution. J. Comput. Sci. (2017) Mohammed, M.A., Gani, M.K. A., Hamed, R.I., Mostafa, S.A., Ahmad, M.S., Ibrahim, D.A.: Solving vehicle routing problem by using improved genetic algorithm for optimal solution. J. Comput. Sci. (2017)
3.
Zurück zum Zitat Amirani, M.C., Toorani, M., Mihandoost, S.: Feature-based type identification of file fragments. Secur. Commun. Netw. 6(1), 115–128 (2013)CrossRef Amirani, M.C., Toorani, M., Mihandoost, S.: Feature-based type identification of file fragments. Secur. Commun. Netw. 6(1), 115–128 (2013)CrossRef
4.
Zurück zum Zitat Qiu, W., Zhu, R., Guo, J., Tang, X., Liu, B., Huang, Z.: A new approach to multimedia files carving. In: 2014 IEEE International Conference on Bioinformatics and Bioengineering (BIBE), pp. 105–110. IEEE, Nov 2014 Qiu, W., Zhu, R., Guo, J., Tang, X., Liu, B., Huang, Z.: A new approach to multimedia files carving. In: 2014 IEEE International Conference on Bioinformatics and Bioengineering (BIBE), pp. 105–110. IEEE, Nov 2014
5.
Zurück zum Zitat Veenman, C.J.: Statistical disk cluster classification for file carving. In: 2007 Third International Symposium on Information Assurance and Security, IAS 2007, pp. 393–398. IEEE, Aug 2007 Veenman, C.J.: Statistical disk cluster classification for file carving. In: 2007 Third International Symposium on  Information Assurance and Security, IAS 2007, pp. 393–398. IEEE, Aug 2007
6.
Zurück zum Zitat McDaniel, M., Heydari, M.H.: Content based file type detection algorithms. In: 2003 Proceedings of the 36th Annual Hawaii International Conference on System Sciences, pp. 10-pp. IEEE, Jan 2003 McDaniel, M., Heydari, M.H.: Content based file type detection algorithms. In: 2003 Proceedings of the 36th Annual Hawaii International Conference on System Sciences, pp. 10-pp. IEEE, Jan 2003
7.
Zurück zum Zitat Karresand, M., Shahmehri, N.: Oscar-file type identification of binary data in disk clusters and ram pages. Secur. Priv. Dyn. Environ. 413–424 (2006) Karresand, M., Shahmehri, N.: Oscar-file type identification of binary data in disk clusters and ram pages. Secur. Priv. Dyn. Environ. 413–424 (2006)
8.
Zurück zum Zitat Li, Q., Ong, A., Suganthan, P., Thing, V.: A novel support vector machine approach to high entropy data fragment classification. In: Proceedings of the South African Information Security Multi-Conf (SAISMC), pp. 236–247 (2011) Li, Q., Ong, A., Suganthan, P., Thing, V.: A novel support vector machine approach to high entropy data fragment classification. In: Proceedings of the South African Information Security Multi-Conf (SAISMC), pp. 236–247 (2011)
9.
Zurück zum Zitat Mehra, N., Gupta, S.: Survey on multiclass classification methods. Int. J. Comput. Sci. Inf. Technol. 4(4), 572–576 (2013) Mehra, N., Gupta, S.: Survey on multiclass classification methods. Int. J. Comput. Sci. Inf. Technol. 4(4), 572–576 (2013)
10.
Zurück zum Zitat Zhang, L., Zhang, D., Tian, F.: SVM and ELM: Who Wins? Object recognition with deep convolutional features from ImageNet. In Proceedings of ELM-2015, vol. 1, pp. 249–263. Springer International Publishing (2016) Zhang, L., Zhang, D., Tian, F.: SVM and ELM: Who Wins? Object recognition with deep convolutional features from ImageNet. In Proceedings of ELM-2015, vol. 1, pp. 249–263. Springer International Publishing (2016)
12.
Zurück zum Zitat Shannon, M.: Forensic relative strength scoring: ASCII and entropy scoring. Int. J. Digit. Evid. 2(4), 1–19 (2004) Shannon, M.: Forensic relative strength scoring: ASCII and entropy scoring. Int. J. Digit. Evid. 2(4), 1–19 (2004)
13.
Zurück zum Zitat Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)CrossRef Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)CrossRef
14.
Zurück zum Zitat Mohammed, M.A., Ghani, M. K.A., Hamed, R.I., Mostafa, S.A., Ibrahim, D.A., Jameel, H.K., Alallah, A.H.: Solving vehicle routing problem by using improved K-nearest neighbor algorithm for best solution. J. Comput. Sci. (2017). Mohammed, M.A., Ghani, M. K.A., Hamed, R.I., Mostafa, S.A., Ibrahim, D.A., Jameel, H.K., Alallah, A.H.: Solving vehicle routing problem by using improved K-nearest neighbor algorithm for best solution. J. Comput. Sci. (2017).
15.
Zurück zum Zitat Khaleefah, S.H., Nasrudin, M.F., Mostafa, S.A.: Fingerprinting of deformed paper images acquired by scanners. In: 2015 IEEE Student Conference on Research and Development (SCOReD), pp. 393–397. IEEE, Dec 2015 Khaleefah, S.H., Nasrudin, M.F., Mostafa, S.A.: Fingerprinting of deformed paper images acquired by scanners. In: 2015 IEEE Student Conference on Research and Development (SCOReD), pp. 393–397. IEEE, Dec 2015
Metadaten
Titel
Classification of JPEG Files by Using Extreme Learning Machine
verfasst von
Rabei Raad Ali
Kamaruddin Malik Mohamad
Sapiee Jamel
Shamsul Kamal Ahmad Khalid
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-72550-5_4