Skip to main content
Top

2017 | OriginalPaper | Chapter

Text Analysis of Corpus Linguistics in a Post-concordancer Era

Author : Simon Ho Wang

Published in: Emerging Technologies for Education

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this methodological paper, I review a number of studies in corpus linguistics that rely heavily on off-the-shelf computer programs known as concordancers. While acknowledging the fruitful research findings generated using concordancers, it is argued that natural language processing (NLP) tools such as Stanford parser and SyntaxNet should be used to automate certain analytical procedures that are often performed manually by corpus linguistics researchers using concordancers. More collaboration efforts between NLP researchers and corpus linguists are called for to help advance the field of corpus linguistics into a post-concordancer era.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Banko, M. et al.: Open information extraction from the web. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007) (2007) Banko, M. et al.: Open information extraction from the web. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007) (2007)
2.
go back to reference Bresnan, J.: Lexical-functional syntax. Blackwell, Malden (2001) Bresnan, J.: Lexical-functional syntax. Blackwell, Malden (2001)
4.
go back to reference Chambers, A.: Integrating corpus consultation in language studies. Lang. Learn. Technol. 9(2), 111–125 (2005) Chambers, A.: Integrating corpus consultation in language studies. Lang. Learn. Technol. 9(2), 111–125 (2005)
5.
go back to reference Clegg, A.B.: Computational-linguistic approaches to biological text mining, University of London (2008) Clegg, A.B.: Computational-linguistic approaches to biological text mining, University of London (2008)
6.
go back to reference Davies, M.: Google Scholar and COCA-Academic: two very different approaches to examining academic English. J. Engl. Acad. Purp. 12(3), 155–165 (2013)CrossRef Davies, M.: Google Scholar and COCA-Academic: two very different approaches to examining academic English. J. Engl. Acad. Purp. 12(3), 155–165 (2013)CrossRef
7.
go back to reference De Marneffe, M.-C., Manning, C.D.: The Stanford typed dependencies representation. In: Coling 2008: Proceedings of the Workshop on Cross-Framework and Cross-Domain Parser Evaluation, pp. 1–8. Association for Computational Linguistics (2008) De Marneffe, M.-C., Manning, C.D.: The Stanford typed dependencies representation. In: Coling 2008: Proceedings of the Workshop on Cross-Framework and Cross-Domain Parser Evaluation, pp. 1–8. Association for Computational Linguistics (2008)
8.
go back to reference Erkan, G. et al.: Semi-supervised classification for extracting protein interaction sentences using dependency parsing. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL) (2007) Erkan, G. et al.: Semi-supervised classification for extracting protein interaction sentences using dependency parsing. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL) (2007)
9.
go back to reference Flowerdew, J.: Concordancing as a tool in course design. System 21(2), 231–244 (1993)CrossRef Flowerdew, J.: Concordancing as a tool in course design. System 21(2), 231–244 (1993)CrossRef
10.
go back to reference Flowerdew, J., Wang, S.H.: Author’s editor revisions to manuscripts published in international journals. J. Second Lang. Writ. 32, 39–52 (2016)CrossRef Flowerdew, J., Wang, S.H.: Author’s editor revisions to manuscripts published in international journals. J. Second Lang. Writ. 32, 39–52 (2016)CrossRef
11.
go back to reference Hyland, K., Tse, P.: Hooking the reader: a corpus study of evaluative <i> that </i> in abstracts. Engl. Specif. Purp. 24(2), 123–139 (2005)CrossRef Hyland, K., Tse, P.: Hooking the reader: a corpus study of evaluative <i> that </i> in abstracts. Engl. Specif. Purp. 24(2), 123–139 (2005)CrossRef
12.
go back to reference King, T.H. et al.: The PARC 700 dependency bank. In: 4th International Workshop on Linguistically Interpreted Corpora (LINC 2003) (2003) King, T.H. et al.: The PARC 700 dependency bank. In: 4th International Workshop on Linguistically Interpreted Corpora (LINC 2003) (2003)
13.
go back to reference Lee, D., Swales, J.: A corpus-based EAP course for NNS doctoral students: moving from available specialized corpora to self-compiled corpora. Engl. Specif. Purp. 25(1), 56–75 (2006)CrossRef Lee, D., Swales, J.: A corpus-based EAP course for NNS doctoral students: moving from available specialized corpora to self-compiled corpora. Engl. Specif. Purp. 25(1), 56–75 (2006)CrossRef
14.
go back to reference Lu, X.: Automatic analysis of syntactic complexity in second language writing. Int. J. Corpus Linguist. 15, 4 (2010)CrossRef Lu, X.: Automatic analysis of syntactic complexity in second language writing. Int. J. Corpus Linguist. 15, 4 (2010)CrossRef
15.
go back to reference Lu, X., Ai, H.: Syntactic complexity in college-level English writing: differences among writers with diverse L1 backgrounds. J. Second Lang. Writ. 29, 16–27 (2015)CrossRef Lu, X., Ai, H.: Syntactic complexity in college-level English writing: differences among writers with diverse L1 backgrounds. J. Second Lang. Writ. 29, 16–27 (2015)CrossRef
16.
go back to reference Meena, A., Prabhakar, T.V.: Sentence level sentiment analysis in the presence of conjuncts using linguistic analysis. In: Amati, G., Carpineto, C., Romano, G. (eds.) ECIR 2007. LNCS, vol. 4425, pp. 573–580. Springer, Heidelberg (2007). doi:10.1007/978-3-540-71496-5_53 CrossRef Meena, A., Prabhakar, T.V.: Sentence level sentiment analysis in the presence of conjuncts using linguistic analysis. In: Amati, G., Carpineto, C., Romano, G. (eds.) ECIR 2007. LNCS, vol. 4425, pp. 573–580. Springer, Heidelberg (2007). doi:10.​1007/​978-3-540-71496-5_​53 CrossRef
17.
go back to reference Ortega, L.: Syntactic complexity in L2 writing: progress and expansion. J. Second Lang. Writ. 29, 82–94 (2015)CrossRef Ortega, L.: Syntactic complexity in L2 writing: progress and expansion. J. Second Lang. Writ. 29, 82–94 (2015)CrossRef
18.
go back to reference Ortega, L.: Syntactic complexity measures and their relationship to L2 proficiency: a research synthesis of college-level L2 writing. Appl. Linguist. 24(4), 492–518 (2003)CrossRef Ortega, L.: Syntactic complexity measures and their relationship to L2 proficiency: a research synthesis of college-level L2 writing. Appl. Linguist. 24(4), 492–518 (2003)CrossRef
19.
go back to reference Thurstun, J., Candlin, C.N.: Concordancing and the teaching of the vocabulary of academic English. Engl. Specif. Purp. 17(3), 267–280 (1998)CrossRef Thurstun, J., Candlin, C.N.: Concordancing and the teaching of the vocabulary of academic English. Engl. Specif. Purp. 17(3), 267–280 (1998)CrossRef
20.
go back to reference Yoon, C.: Concordancing in L2 writing class: an overview of research and issues. J. Engl. Acad. Purp. 10(3), 130–139 (2011)CrossRef Yoon, C.: Concordancing in L2 writing class: an overview of research and issues. J. Engl. Acad. Purp. 10(3), 130–139 (2011)CrossRef
21.
go back to reference Yoon, H.: More than a linguistic reference: the influence of corpus technology on L2 academic writing. Lang. Learn. Technol. 12(2), 31–48 (2008)MathSciNet Yoon, H.: More than a linguistic reference: the influence of corpus technology on L2 academic writing. Lang. Learn. Technol. 12(2), 31–48 (2008)MathSciNet
22.
go back to reference Youn, S.J.: Measuring syntactic complexity in L2 pragmatic production: investigating relationships among pragmatics, grammar, and proficiency. System 42, 270–287 (2014)CrossRef Youn, S.J.: Measuring syntactic complexity in L2 pragmatic production: investigating relationships among pragmatics, grammar, and proficiency. System 42, 270–287 (2014)CrossRef
23.
go back to reference Zareva, A.: Self-mention and the projection of multiple identity roles in TESOL graduate student presentations: the influence of the written academic genres. Engl. Specif. Purp. 32(2), 72–83 (2013)CrossRef Zareva, A.: Self-mention and the projection of multiple identity roles in TESOL graduate student presentations: the influence of the written academic genres. Engl. Specif. Purp. 32(2), 72–83 (2013)CrossRef
24.
go back to reference Zhang, G.: It is suggested that… or it is better to…? Forms and meanings of subject it-extraposition in academic and popular writing. J. Engl. Acad. Purp. 20, 1–13 (2015)CrossRef Zhang, G.: It is suggested that… or it is better to…? Forms and meanings of subject it-extraposition in academic and popular writing. J. Engl. Acad. Purp. 20, 1–13 (2015)CrossRef
25.
go back to reference Zhuang, L. et al.: Movie review mining and summarization. In: Presented at the Proceedings of ACM Conference on Information and Knowledge Management (CIKM) (2006) Zhuang, L. et al.: Movie review mining and summarization. In: Presented at the Proceedings of ACM Conference on Information and Knowledge Management (CIKM) (2006)
Metadata
Title
Text Analysis of Corpus Linguistics in a Post-concordancer Era
Author
Simon Ho Wang
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-52836-6_41

Premium Partner