2020 | OriginalPaper | Book chapter

The Data Scientist on LinkedIn: Job Advertisement Corpus Processing with NooJ


Abstract

For organizations that use big data, one of the most important elements in reaching tangible results is the effective use of human resources: it is not possible to manage data without using them intelligently. Considering human intervention in relation to big data means calling into question the so-called “data scientist”. Starting from this premise, the main aim of this study is to use the linguistic software environment NooJ to process a large corpus of job advertisements for data scientists in Italy, collected on the business-networking site LinkedIn. By creating specific linguistic resources with NooJ, we are able to identify the skills most required by companies and organizations.
When searching for the ideal candidate to hire, companies pay equal attention to technical skills and soft skills, in particular the capacity to work in a team and to communicate concerns. Finally, our research confirmed that studying the context in which single words occur is a key step in extracting information from texts.
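
The closing point about word context can be illustrated with a small keyword-in-context (KWIC) concordance. The sketch below is plain Python, not NooJ, and is not the authors' method; the function name `kwic`, the window size, and the sample sentence are all hypothetical and chosen only for illustration.

```python
import re

def kwic(text, keyword, window=3):
    """Return (left, match, right) context tuples for each occurrence of keyword.

    Assumption: simple lowercase word tokenization, no lemmatization.
    """
    tokens = re.findall(r"\w+", text.lower())
    target = keyword.lower()
    hits = []
    for i, tok in enumerate(tokens):
        if tok == target:
            left = " ".join(tokens[max(0, i - window):i])
            right = " ".join(tokens[i + 1:i + 1 + window])
            hits.append((left, tok, right))
    return hits

# Invented sample sentence, not taken from the LinkedIn corpus.
ad = "We seek a data scientist able to work in a team and to team up with analysts."

for left, word, right in kwic(ad, "team"):
    print(f"{left:>15} | {word} | {right}")
```

Here the same word form, "team", appears once as a noun ("work in a team") and once as a verb ("team up with"). Only the surrounding words tell the two uses apart, which is why a bare word count is not enough to decide whether an advertisement asks for teamwork as a skill.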

Metadata
Title
The Data Scientist on LinkedIn: Job Advertisement Corpus Processing with NooJ
Authors
Maddalena della Volpe
Francesca Esposito
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-38833-1_7
