Skip to main content
Top

Introducing Pyra: A High-Level Linter for Data Science Software

  • 2026
  • OriginalPaper
  • Chapter
Published in:

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Pyra is a high-level linter designed to identify code smells in data science software, focusing on anti-patterns that can cause issues in data science pipelines. It offers an easily extensible framework to help developers achieve correct results and improve the reliability of their data science projects. The text introduces Pyra's architecture and execution flow, which involves converting a Jupyter Notebook into a Python script and performing static analysis on the Control Flow Graph (CFG) to infer abstract type information. The tool includes 16 different checkers for detecting code smells, grouped into four categories: misleading visualizations, misleading results, challenges for reproducibility, and general issues. The demo showcases Pyra's capabilities by analyzing a simple data science pipeline, highlighting issues such as inappropriate visualization choices, data leakage, and reproducibility challenges. Pyra provides warnings and suggestions for fixing identified issues, making it a valuable addition to the data science ecosystem. The tool can be easily integrated into existing IDEs as a plugin, offering real-time feedback to developers. This makes Pyra particularly useful for both experienced data scientists and beginners or experts from other fields who may not be familiar with data science best practices.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Business + Economics & Engineering + Technology"

Online-Abonnement

Springer Professional "Business + Economics & Engineering + Technology" gives you access to:

  • more than 102.000 books
  • more than 537 journals

from the following subject areas:

  • Automotive
  • Construction + Real Estate
  • Business IT + Informatics
  • Electrical Engineering + Electronics
  • Energy + Sustainability
  • Finance + Banking
  • Management + Leadership
  • Marketing + Sales
  • Mechanical Engineering + Materials
  • Insurance + Risk


Secure your knowledge advantage now!

Springer Professional "Engineering + Technology"

Online-Abonnement

Springer Professional "Engineering + Technology" gives you access to:

  • more than 67.000 books
  • more than 390 journals

from the following specialised fileds:

  • Automotive
  • Business IT + Informatics
  • Construction + Real Estate
  • Electrical Engineering + Electronics
  • Energy + Sustainability
  • Mechanical Engineering + Materials





 

Secure your knowledge advantage now!

Springer Professional "Business + Economics"

Online-Abonnement

Springer Professional "Business + Economics" gives you access to:

  • more than 67.000 books
  • more than 340 journals

from the following specialised fileds:

  • Construction + Real Estate
  • Business IT + Informatics
  • Finance + Banking
  • Management + Leadership
  • Marketing + Sales
  • Insurance + Risk



Secure your knowledge advantage now!

Title
Introducing Pyra: A High-Level Linter for Data Science Software
Authors
Greta Dolcetti
Vincenzo Arceri
Antonella Mensi
Enea Zaffanella
Caterina Urban
Agostino Cortesi
Copyright Year
2026
DOI
https://doi.org/10.1007/978-3-032-06129-4_29
This content is only visible if you are logged in and have the appropriate permissions.
This content is only visible if you are logged in and have the appropriate permissions.

Premium Partner

    Image Credits
    Neuer Inhalt/© ITandMEDIA, Nagarro GmbH/© Nagarro GmbH, AvePoint Deutschland GmbH/© AvePoint Deutschland GmbH, AFB Gemeinnützige GmbH/© AFB Gemeinnützige GmbH, USU GmbH/© USU GmbH