"Efficient Large-Scale Computer and Network Models Using Optimistic Parallel Simulation" Garrett R. Yaun Ph.D. Thesis, Department of Computer Science, Rensselaer Polytechnic Institute, June 2005.

ABSTRACT


Modeling and simulation is a valuable tool in the analysis of large-scale networks and computer systems. To tackle these complexities, conservative parallel simulation is often employed as an approach to reduce the runtime. Optimistic simulation has previously been viewed out of the performance envelop for such models. However, with the advent of a new technique called reverse computation the memory requirements for benchmark models have been dramatically reduced. In this thesis, we demonstrate the use of reverse computation for allowing large-scale simulation models to achieve greater scalability and performance. The models developed for this thesis consisted of network protocols and distributed computer system applications. Within these models, reverse computation was important in achieving performance gains and dispelling views that optimistic techniques operate outside of the performance envelope. These are the first real-world models to leverage reverse computation and demonstrate its efficiency. Our TCP model executed at 5.5 million packets per second which is 5.14 times greater then PDNS' packet rate of 1.07 million for the same large-scale network scenario. This experiment was performed across a distributed cluster of 32 nodes and executed on one processor per node. Observations made from the creation of these models led to the development of the reverse memory subsystem and the idea of shared event data. The contribution of this subsystem is that it allows for easier implementation of models and allows for the models to use dynamic memory. This subsystem permits for an overall reduction of memory in simulation models as compared to models that work in statically ``pre-allocated'' memory. The idea of shared event data works by decreasing the amount of duplicate information in the event. Our experiments with shared event data show significant memory reductions when there is a high degree of redundant data. These contributions when taken together as a whole enable real-world large-scale models to be efficiently developed and executed in an optimistic parallel simulation framework.

Download PDF



Download Postscript