Article

An empirical study of the robustness of MacOS applications using random testing

Authors:
Barton P. Miller

University of Wisconsin, Madison, WI

University of Wisconsin, Madison, WI
View Profile

,
Gregory Cooksey

University of Wisconsin, Madison, WI

University of Wisconsin, Madison, WI
View Profile

,
Fredrick Moore

University of Wisconsin, Madison, WI

University of Wisconsin, Madison, WI
View Profile

RT '06: Proceedings of the 1st international workshop on Random testingJuly 2006Pages 46–54https://doi.org/10.1145/1145735.1145743

Published:17 July 2006Publication History

RT '06: Proceedings of the 1st international workshop on Random testing

Pages 46–54

ABSTRACT

We report on the fourth in a series of studies on the reliability of application programs in the face of random input. Over the previous 15 years, we have studied the reliability of UNIX command line and X-Window based (GUI) applications and Windows applications. In this study, we apply our fuzz testing techniques to applications running on the Mac OS X operating system. We continue to use a simple, or even simplistic technique: unstructured black-box random testing, considering a failure to be a crash or hang. As in the previous three studies, the technique is crude but seems to be effective in locating bugs in real programs.We tested the reliability of 135 command-line UNIX utilities and thirty graphical applications on Mac OS X by feeding random input to each. We report on application failures -- crashes (dumps core) or hangs (loops indefinitely) -- and, where source code is available, we identify the causes of these failures and categorize them.Our testing crashed only 7% of the command-line utilities, a considerably lower rate of failure than observed in almost all cases of previous studies. We found the GUI-based applications to be less reliable: of the thirty that we tested, only eight did not crash or hang. Twenty others crashed, and two hung. These GUI results were noticeably worse than either of the previous Windows (Win32) or UNIX (X-Windows) studies.

References

D. Aitel, "The Advantages of Block-Based Protocol Analysis for Security Testing", Immunity Inc., February 2002. http://www.immunitysec.com/downloads/advantages_of_block_based_analysis.htmlGoogle Scholar
Apple Computer, May 2006, http://developer.apple.com/documentation/Cocoa/Conceptual/Coco aFundamentals/WhatIsCocoa/chapter_2_section_6.html.Google Scholar
G. J. Carrette, "CRASHME: Random Input Testing", http://people.delphi.com/gjc/crashme.html, 1996.Google Scholar
J. W. Duran and S. C. Ntafos, "An Evaluation of Random Testing", IEEE Transactions on Software EngineeringSE-10, 4, July 1984, pp. 438--444.Google ScholarDigital Library
J. E. Forrester and B. P. Miller, "An Empirical Study of the Robustness of Windows NT Applications Using Random Testing", 4th USENIX Windows Systems Symposium, Seattle, August 2000. Appears (in German translation) as "Empirische Studie zur Stabilität von NT-Anwendungen", iX, September 2000. Google ScholarDigital Library
S. Garfinkel and G. Spafford, Practical UNIX & Internet Security, O'Reilly & Associates, 1996. Google ScholarDigital Library
A. Ghosh, V. Shah and M. Schmid, "Testing the Robustness of Windows NT Software", 1998 International Symposium on Software Reliability Engineering (ISSRE'98), Paderborn, Germany, November 1998. Google ScholarDigital Library
A. Ghosh, V. Shah and M. Schmid, "An Approach for Analyzing the Robustness of Windows NT Software", 21st National Information Systems Security Conference, Crystal City, VA, October 1998.Google Scholar
A. Hertzfeld, Revolution in the Valley, O'Reilly Media, Inc., Sebastopol, CA, 2004, pp. 184--185. Google ScholarDigital Library
S. Marquis, T. Dean, S. Knight, "SCL: a Language for Security Testing of Network Applications", 2005 Conference of the Centre for Advanced Studies on Collaborative Research, Toronto, October 2005. Google ScholarDigital Library
W. McKeeman, "Differential Testing for Software", Digital Technical Journal, Digital Equipment Corporation 10, 1, December 1998.Google Scholar
G. Myers, The Art of Software Testing, Wiley Publishing, New York, 1979. Google ScholarDigital Library
Microsoft Corporation, "Security and Reliability Strategies", http://www.microsoft.com/whdc/driver/security/, 2006.Google Scholar
B. P. Miller, D. Koski, C. P. Lee, V. Maganty, R. Murthy, A. Natarajan, J. Steidl, "Fuzz Revisited: A Re-examination of the Reliability of UNIX Utilities and Services", University of Wisconsin-Madison, 1995. Appears (in German translation) as "Empirische Studie zur Zuverlasskeit von UNIX-Utilities: Nichts dazu Gerlernt", iX, September 1995. ftp://grilled.cs.wisc.edu/technical_papers/fuzz-revisted.pdf.Google Scholar
B. P. Miller, L. Fredriksen, B. So, "An Empirical Study of the Reliability of UNIX Utilities", Communications of the ACM 33, 12, December 1990, pp. 32--44. Also appears in German translation as "Fatale Fehlerträchtigkeit: Eine Empirische Studie zur Zuverlassigkeit von UNIX-Utilities", iX (March 1991). ftp://grilled.cs.wisc.edu/technical_papers/fuzz.pdf. Google ScholarDigital Library
E. Sirer and B. Bershad, "Using Production Grammars in Software Testing", Symposium on Domain-Specific Languages, Austin, TX, October 1999. Google ScholarDigital Library
D. Wood, G. Gibson, and R. Katz, "Verifying a Multiprocessor Cache Controller Using Random Case Generation", Computer Science Tech report UCB/CSD-89-490, University of California, Berkeley, 1989. Google ScholarDigital Library
S. Xiao, L. Deng, S. Li and X. Wang, "Integrated TCP/IP Protocol Software Testing for Vulnerability Detection", 2003 International Conference on Computer Networks and Mobile Computing, Shanghai, October 2003. Google ScholarDigital Library

Index Terms

An empirical study of the robustness of MacOS applications using random testing
1. Human-centered computing
  1. Human computer interaction (HCI)
    1. Interaction paradigms
      1. Graphical user interfaces
2. Software and its engineering
  1. Software creation and management
    1. Software verification and validation
      1. Software defect analysis
        Software testing and debugging
  2. Software organization and properties
    1. Contextual software domains
      1. Operating systems

Recommendations

An empirical study of the robustness of MacOS applications using random testing

We report on the fourth in a series of studies on the reliability of application programs in the face of random input. Over the previous 15 years, we have studied the reliability of UNIX command line and X-Window based (GUI) applications and Windows ...
Read More
An empirical study of the robustness of Windows NT applications using random testing
WSS'00: Proceedings of the 4th conference on USENIX Windows Systems Symposium - Volume 4

We report on the third in a series of studies on the reliability of application programs in the face of random input. In 1990 and 1995, we studied the reliability of UNIX application programs, both command line and X-Window based (GUI). In this study, ...
Read More
Comparison of adaptive random testing and random testing under various testing and debugging scenarios

Adaptive random testing is an enhancement of random testing. Previous studies on adaptive random testing assumed that once a failure is detected, testing is terminated and debugging is conducted immediately. It has been shown that adaptive random ...
Read More

Reviews

Reviewer: Andrew Brooks

Random testing really works. On Mac OS X, seven percent of 135 command-line utilities and 73 percent of 30 graphical user interface (GUI)-based applications were found to crash or hang under random testing using the freely available tools fuzz , ptyjig , and fuzz-aqua . In command-line utility testing, 24 files of random characters were generated by permuting fuzz tool options: files either did or did not include null characters, either did or did not include nonprintable characters, and were either of size 1,000, 10,000, or 100,000 characters. In GUI-based application testing, typically 100,000 random user-input events were generated using the fuzz-aqua tool. Options with this tool included the setting of the delay between events and the blocking of the sending of input events that might, for example, log out the current user. Among the root causes of failure were, sadly, some familiar culprits: failure to check return values, null-pointer dereferences, and array buffer overflows. Some minor details are unclear in this paper. Was the randomness employed fully controllable__?__ Could the same tests be repeatedly generated and executed with the same results for both command-line utilities and GUI-based applications__?__ Without explanation, no root cause analysis appears to have been undertaken for the utilities zsh and indent , which crashed. There is no disputing the fact that random testing found serious defects, that GUI reliability was found to be getting worse, and that random testing should be part of any arsenal deployed when software reliability is of paramount importance. As such, this paper is strongly recommended to the software engineering community. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
RT '06: Proceedings of the 1st international workshop on Random testing
July 2006
84 pages
ISBN:159593457X
DOI:10.1145/1145735
Program Chairs:
Johannes Mayer
Ulm University, Germany
,
Robert Merkel
Swinburne University of Technology, Australia
Copyright © 2006 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 17 July 2006
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
fuzz
random testing
Qualifiers
- Article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 60
  Total Citations
  View Citations
- 739
  Total Downloads
- Downloads (Last 12 months)60
- Downloads (Last 6 weeks)9
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

An empirical study of the robustness of MacOS applications using random testing

RT '06: Proceedings of the 1st international workshop on Random testing

ABSTRACT

References

Cited By

Index Terms

Recommendations

An empirical study of the robustness of MacOS applications using random testing

An empirical study of the robustness of Windows NT applications using random testing

Comparison of adaptive random testing and random testing under various testing and debugging scenarios

Reviews

Access critical reviews of Computing literature here

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

An empirical study of the robustness of MacOS applications using random testing

RT '06: Proceedings of the 1st international workshop on Random testing

ABSTRACT

References

Cited By

Index Terms

Recommendations

An empirical study of the robustness of MacOS applications using random testing

An empirical study of the robustness of Windows NT applications using random testing

Comparison of adaptive random testing and random testing under various testing and debugging scenarios

Reviews

Access critical reviews of Computing literature here

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media