
2. Evolution of Neural Networks to Large Language Models

  • 2023
  • OriginalPaper
  • Chapter

Abstract

This chapter traces the evolution of language models, from early statistical methods such as n-grams and hidden Markov models to the Transformer architecture and large language models (LLMs). It covers the development of neural networks for natural language processing (NLP), in particular recurrent neural networks (RNNs), long short-term memory (LSTM) networks, and gated recurrent units (GRUs). It then turns to attention-based models, exemplified by the Transformer, which improved both the efficiency and the performance of language models. Finally, it discusses the capabilities and challenges of large language models such as GPT-3, BERT, and T5, and how these advances paved the way for AI systems that understand and generate human language.

Title
Evolution of Neural Networks to Large Language Models
Authors
Akshay Kulkarni
Adarsha Shivananda
Anoosh Kulkarni
Dilip Gudivada
Copyright Year
2023
Publisher
Apress
DOI
https://doi.org/10.1007/978-1-4842-9994-4_2
