Abstract
This paper discusses the modeling tool called SHARPE (Symbolic Hierarchical Automated Reliability and Performance Evaluator), a general hierarchical modeling tool that analyzes stochastic models of reliability, availability, performance, and performability. It allows the user to choose the number of levels of models, the type of model at each level, and which results from each model level are to act as which parameters in which higher-level models. SHARPE includes algorithms for analysis of fault trees, reliability block diagrams, acyclic series-parallel graphs, acyclic and cyclic Markov and semi-Markov models, generalized stochastic Petri nets, and closed single- and multi-chain product-form queueing networks. For many of these, the user can choose among alternative algorithms, and can decide whether to get a result in the form of a distribution function (symbolic in the time variable) or as a mean or probability. SHARPE has been useful to students, practicing engineers, and researchers. In this paper we discuss the history of SHARPE, give some examples of its use, and talk about some lessons learned.
- J. Aidemark, P. Folkesson, and J. Karlsson. Framework for node-level fault tolerance in distributed real-time systems. DSN 2005, 2005. Google ScholarDigital Library
- A. Avritzer, A. Bondi, M. Grottke, K. Trivedi, and E. Weyuker. Performance assurance via software rejuvenation: monitoring, statistics and algorithms. In Proc. International Conference on Dependable Systems and Networks, 2006. Google ScholarDigital Library
- J.B. Bowles and J. Gregory Dobbins. Approximate reliability and availability models for high availability and fault-tolerant systems with repair. In Quality and Reliability Engineering International, 2004.Google ScholarCross Ref
- G.A. Chaparro-Baquero, N.G. Santiago, W. Rivera, and J. Fernando Vega-Riveros. Petri net workflow modeling for digital publishing measuring quantitative dependability attributes. In Proc. IEEE Int. Symp. on Dependable Autonomic and Secure Computing, 2006. Google ScholarDigital Library
- D. Chen, S. Dharmaraja, D. Chen, K. Trivedi L. Li, R. Some, and A. Nikora. Reliability and availability analysis for the jpl remote exploration and experimentation system. In Proc. International Conference on Dependable Systems and Networks, 2002. Google ScholarDigital Library
- I. Chen, T. Chen, and C. Lee. Agent-based forwarding strategies for reducing location management cost in mobile networks. Mobile Network Applications, 2001. Google ScholarDigital Library
- G. Ciardo, A. Blakemore, P.F. Chimento, J. Muppala, and K. Trivedi. Automated generation and analysis of markov reward models using stochastic reward nets. In C. Meyer and R. Plemmons, editors, Linear Algebra and Markov Chains and Queueing Models, IMA Volumes in Mathematics and its Applications, volume 48. Springer-Verlag, 1992.Google Scholar
- S. Dharmaraja, K. Trivedi, and D. Logothetis. Performance modelling of wireless networks with generally distributed hand-off interarrival times. Computer Communications Journal, 2003. Google ScholarDigital Library
- J.B. Dugan, K. Trivedi, M. Smotherman, and R. Geist. The Hybrid Automated Reliability Predictor. AIAA J. Guidance, Control and Dynamics, 1986.Google Scholar
- J. Gafsi and E. Biersack. Modeling and performance comparison of reliability strategies for distributed video servers. IEEE Trans. Parallel Distributed System, 2000. Google ScholarDigital Library
- S. Garg, Y. Huang, C. Kintala, K. Trivedi, and S. Yajnik. Performance and reliability evaluation of passive replication schemes in application level fault tolerance. Proc. International Symposium on Fault Tolerant Computing (FTCS), 1999. Google ScholarDigital Library
- A. Goyal, W.C. Carter, E. de Souza e Silva, S. Lavenberg, and K. Trivedi. The system availability estimator. In Proc. IEEE Int. Symp. on Fault-Tolerant Computing, 1986.Google Scholar
- P. Heegaard and K. Trivedi. Survivability quantification of communication services. IEEE Symp. on Dependable Systems and Networks, 2008.Google ScholarCross Ref
- P. Heidelberger, J. Muppala, and K. Trivedi. Accelerating mean time to failure computations. Performance Evaluation, 1996. Google ScholarDigital Library
- C. Hirel, R.A. Sahner, X. Zang, and K. Trivedi. Reliability and performability modeling using sharpe 2000. Computer Performance Evaluation / TOOLS, 2000. Google ScholarDigital Library
- J.R. Horgan and S. London. Atac: A data-flow coverage testing tool for c. In Symp. on Assessment of Quality Software Development Tools, 1992.Google Scholar
- O. Ibe, R. Howe, and K. Trivedi. Approximate availability analysis of vaxcluster systems. IEEE Trans. on Reliability, 1989.Google ScholarCross Ref
- H. Kopetz, H. Kantz, G. Grunsteidl, P. Puschner, and J. Reisinger. Tolerating transient faults in mars. In FTCS-20, 1990.Google ScholarCross Ref
- M. Lanus, L. Yin, and K. Trivedi. Hierarchical composition and aggregation of state-based availability and performability models. IEEE Transactions on Reliability, 2003.Google Scholar
- Y. Liu, V. Mendiratta, and K. Trivedi. Survivability analysis of telephone access network. In Proc. Int. Symp. on Software Reliability Engineering, 2004. Google ScholarDigital Library
- N. Lopez-Benitez and K. Trivedi. Multiprocessor performability analysis. IEEE Trans. on Reliability, 1993.Google Scholar
- T. Luo and K. Trivedi. An improved algorithm for coherent system reliability. IEEE Trans. on Reliability, 1998.Google Scholar
- M. Lyu, X. Chen, and T. Wong. Design and evaluation of a fault-tolerant mobile-agent system. IEEE Intelligent Systems, 2004. Google ScholarDigital Library
- M. Lyu and V. Mendiratta. Software fault tolerance in a clustered architecture: Techniques and reliability modeling. Proc. of IEEE Aerospace Conference, 1999.Google ScholarCross Ref
- Y. Ma, J. Han, and K. Trivedi. Composite performance & availability analysis of wireless communication networks. IEEE Trans. on Vehicular Technology, 2001.Google Scholar
- B. Madan, S. Dharmaraja, and K. Trivedi. Combined guard channel and mobile-assisted handoff for cellular networks. IEEE Trans. on Vehicular Technology, 2008.Google Scholar
- B. Madan, K. Goseva-Popstojanova, K. Vaidyanathan, and K. Trivedi. Modeling and quantification of security attributes of software systems. Performance Evaluation, 2004.Google Scholar
- V. Mainkar. Solutions of Large and Non-Markovian Performance Models. PhD thesis, Duke University, 1994. Google ScholarDigital Library
- M. Malhotra and A. Reibman. Selecting and implementing phase approximations for semi-markov models. Stochastic Models, 1993.Google Scholar
- M. Marsan, G. Conte, and G. Balbo. A class of generalized stochastic petri nets for the performance evaluation of multiprocessor system. ACM Trans. on Computer Systems, 1984. Google ScholarDigital Library
- J. Muppala, M. Malhotra, and K. Trivedi. Stiffness-tolerant methods for transient analysis of stiff markov chains. Microelectronics and Reliability, 1994.Google Scholar
- J. Muppala, K. Trivedi, and S. Woolet. Real-time-systems performance in the presence of failures. IEEE Computer, 1991. Google ScholarDigital Library
- H. Pan. The reconstruction of sharpe. Master's thesis, Duke University, 2001.Google Scholar
- D. Raiteri, M. Iacono, G. Franceschinis, and V. Vittorini. Repairable fault tree for the automatic evaluation of repair policies. In Proc. int. Conf. on Dependable Systems and Networks, 2004. Google ScholarDigital Library
- S. Ramani, K. Trivedi, and B. Dasarathy. Performance analysis of the corba notification service. Proc. IEEE Symp. on Reliable Distributed Systems, 2001. Google ScholarDigital Library
- A. Ramesh, K. Trivedi, A. Somani, D. Twigg, U. Sandadi, and T. Sharma. An integrated reliability modeling environment. Reliability Engineering and System Safety, 1999.Google ScholarCross Ref
- A.V. Ramesh and K. Trivedi. Semi-numerical transient analysis of markov models. ACM Southeast Regional Conference, 1995. Google ScholarDigital Library
- R. Sahner and K. Trivedi. Performance and reliability analysis using directed acyclic graphs. IEEE Trans. on Software Engineering, 1987. Google ScholarDigital Library
- R. Sahner and K. Trivedi. Reliability modeling using sharpe. IEEE Trans. on Reliability, 1987.Google Scholar
- R. Sahner and K. Trivedi. A software tool for learning about stochastic models. IEEE Transactions on Education, 1993.Google Scholar
- R.A. Sahner. A hybrid, combinatorial-Markov method of solving performance and reliability models. PhD thesis, Duke University, 1986. Google ScholarDigital Library
- R.A. Sahner, K. Trivedi, and A. Puliafito. Performance and Reliability Analysis of Computer Systems: An Example-Based Approach Using the SHARPE Software Package. Kluwer Academic Publishers, 1996. Google ScholarDigital Library
- N. Sato, H. Nakamura, and K. Trivedi. Detecting performance and reliability bottlenecks of composite web services. Proc. ICSOC, 2007.Google Scholar
- W. Smith, K. Trivedi, L. Tomek, and J. Ackaret. Availability analysis of blade server systems. IBM Systems Journal, 2008. Google ScholarDigital Library
- L. Tomek and K. Trivedi. Fixed-point iteration in availability modeling. In Proceedings of the 5th International GI/ITG/GMA Conference on Fault-Tolerant Computing Systems, Tests, Diagnosis, Fault Treatment, 1991. Google ScholarDigital Library
- K. Trivedi. Availability analysis of cisco gsr 12000 and juniper m20/m40. Cisco Technical Report, 2000.Google Scholar
- K. Trivedi. Probability and Statistics with Reliability, Queuing and Computer Science Applications. John Wiley, second edition, 2001. Google ScholarDigital Library
- K. Trivedi, R. Vasireddy, D. Trindade, S. Nathan, and R. Castro. Modeling high availability systems. Proc. IEEE Pacific Rim International Symposium on Dependable Computing, 2006. Google ScholarDigital Library
- K. Trivedi, D. Wang, D. Hunt, A. Rindos, W. Smith, and B. Vashaw. Availability modeling of sip protocol on ibm websphere. PRDC, 2008. Google ScholarDigital Library
- K. Vaidyanathan, D. Selvamuthu, and K. Trivedi. Analysis of inspection-based preventive maintenance in operational software systems. Intl. Symp. on Reliable Distributed Systems, 2002. Google ScholarDigital Library
- M. Veeraraghavan and K. Trivedi. An improved algorithm for symbolic reliability analysis. IEEE Transactions on Reliability, 1991.Google Scholar
- D. Wang. Mttf computation for analytical models. Master's thesis, Duke University, 2004.Google Scholar
- D. Wang. Service Reliability: Models, Algorithms and Applications. PhD thesis, Duke University, 2007.Google Scholar
- D. Wang and K. Trivedi. Computing steady-state mean time to failure for non-coherent repairable systems. IEEE Transactions on Reliability, 2005.Google Scholar
- D. Wang, K. Trivedi, T. Sharma, A. Ramesh, D. Twigg, L. Nguyen, and Y. Liu. A new reliability estimation method for large systems. The Boeing Company patent application pending, 2008.Google Scholar
- www.borel.ee.duke.edu/bugzilla.Google Scholar
- W. Xie. Markov regenerative process in sharpe. Master's thesis, Duke University, 1999.Google Scholar
- W. Xie, Y. Cao, H. Sun, and K. Trivedi. Modeling of user perceived webserver availability. IEEE Int. Conf. on Communications, 2003.Google Scholar
- L. Yin, R. Fricks, and K. Trivedi. Application of semi-markov process and ctmc to evaluation of ups system availability. Proc. Reliability and Maintainability Symposium, 2002.Google Scholar
- X. Zang. Dependability Modeling of Computer Systems and Networks. PhD thesis, Duke University, 1999. Google ScholarDigital Library
Index Terms
- SHARPE at the age of twenty two
Recommendations
SHARPE: A Modeler's Toolkit
IPDS '96: Proceedings of the 2nd International Computer Performance and Dependability Symposium (IPDS '96)SHARPE (Symbolic Hierarchical Automated Reliability and Performance Evaluator) is a program that supports the specification and automated solution of reliability and performance models. It contains support for fault trees, reliability block diagrams, ...
Availability Models with Age-Dependent Checkpointing
SRDS '02: Proceedings of the 21st IEEE Symposium on Reliable Distributed SystemsIn this paper, we consider a new stochastic model for a file recovery action with checkpointing when the system failure occurs according to a homogeneous Poisson process.The present checkpoint model strongly depends on the system age and is quite ...
Comments