Skip to main content
Top

Transformers only look once with nonlinear combination for real-time object detection

  • 21-05-2022
  • Review
Published in:

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The article introduces TOLO, a novel real-time object detection model that combines the strengths of CNN and vision Transformer networks. TOLO consists of a CNN backbone, a Feature Fusion Neck, and lightweight vision Transformer detection heads, which work together to efficiently build long-distance dependencies among local features and detect objects with high speed and low memory overhead. The model also incorporates a nonlinear combination method to enhance the detection of false negative predicted boxes. Extensive experiments on various datasets demonstrate that TOLO outperforms existing state-of-the-art methods in terms of detection accuracy and inference speed. The article highlights the innovative approach of combining CNN and vision Transformer networks, as well as the effectiveness of the nonlinear combination method in improving object detection performance.

Not a customer yet? Then find out more about our access models now:

Individual Access

Start your personal individual access now. Get instant access to more than 164,000 books and 540 journals – including PDF downloads and new releases.

Starting from 54,00 € per month!    

Get access

Access for Businesses

Utilise Springer Professional in your company and provide your employees with sound specialist knowledge. Request information about corporate access now.

Find out how Springer Professional can uplift your work!

Contact us now
Title
Transformers only look once with nonlinear combination for real-time object detection
Authors
Ruiyang Xia
Guoquan Li
Zhengwen Huang
Yu Pang
Man Qi
Publication date
21-05-2022
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 15/2022
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-022-07333-y
This content is only visible if you are logged in and have the appropriate permissions.

Premium Partner

    Image Credits
    Neuer Inhalt/© ITandMEDIA, Nagarro GmbH/© Nagarro GmbH, AvePoint Deutschland GmbH/© AvePoint Deutschland GmbH, AFB Gemeinnützige GmbH/© AFB Gemeinnützige GmbH, USU GmbH/© USU GmbH, Ferrari electronic AG/© Ferrari electronic AG