Instruction-Level Simulation And Tracing

This is $Revision: 1.107 $, last updated $Date: 2004/06/08 17:36:44 $.
For an up-to-date version, please check www.xsim.com/bib.

WARNING: THIS PAGE IS STILL UNDER CONSTRUCTION.

Places that are known to have dubious or absent information are marked with .

There is also a simulators mailing list. To subscribe, write to <majordomo@xsim.com>. For sample messages see here.

Quick Index

Quick overview of simulation and tracing tools.
Terminology.
Glossary of terms used here.
Alphabetical listing of tools.
Bibliography.
Categorizing Tools.
Comments on some of the tools and papers.
Other work not included in this document.
Who is who in simulation and tracing (forever incomplete).
To do: Pending projects.
Selected recent changes.
Acknowledgements.

A Quick Overview of Instruction-Set Simulation and Tracing

The most important thing is what does it do? If you are building or using a simulator you need to be concerned at some level about the implementation. But first you need to figure out what you want it to do.

Why aren't you using the real thing? Do you want an accurate simulation? If yes, use real hardware. If no, make up the numbers. NullSIM. It's not as accurate, but it's cheaper and faster than any other simulation tool. It's the only universal simulator! Tired of configuring your simulator to do exactly what you want? Use NullSIM, with a familiar user interface and predictable results!

Instruction-set simulators can execute programs written or compiled for computers that do not yet exist, which no longer exist, or which are more expensive to purchase than to simulate. Simulators can also provide access to internal state that is invisible on the real hardware, can provide deterministic execution in the face of races, and can be used to ``stress test'' for situations that are hard to produce on real hardware.

Instruction-level tracing can provide detailed information about the behavior of programs; that information drives analyzers that analyze or predict behavior of various system components and which can, in turn, improve the design and implementation of everything from architectures to compilers to applications.

Although simulators and tracing tools appear to perform different tasks, they in practice do much the same work: both manipulate machine-level details, and both use similar implementation techniques.

This web page is a jumping-off point for lots of work related to instruction-level simulation and tracing. Please contribute! Please send comments, contributions, and suggestions to `pardo@xsim.com'. If you'd like to help, edit this page, there is lots that needs to be done; your help is appreciated.

This web also page lists a few OS emulation tools. Although these don't specifically fit the category of tools covered by this page, it's interesting to consider whether you could glue together a processor emulator and an OS emulator and wind up with a whole simulated system. To date, whole simulated systems are built as integrated tools, rather than being assembled modularly.

Terminology

Some terminology:

Simulation is recreating an environment in enough detail that desired effects of a ``real'' system can be observed.
Instruction-Set Simulation is simulating a processor at the instruction-set level. Instruction-set simulation is simulation that is detailed enough to run executable programs intended for the machine being simulated. It is possible to do both a more-detailed simulation, for example timing-accurate or RTL (register transfer level) simulation are even more detailed, and bus architecture or cluster simulation are less detailed.
Emulation is simulation that uses special hardware assistance [RFD 72], [Tucker 65], [Wilkes 69].
The target machine is the one being simulated; the host machine is the one where the simulation runs. This terminology parallels retargetable compiler terminology. However, there is no standard terminology where the simulation framework is produced on yet a third platform. That is, a target simulator which runs on special host hardware often has the simulation software compiled on a general-purpose machine. Some have suggested ``generation host'' or ``ghost'' for the machine where the software is created, that suggests the place where the simulator runs is the ``runtime host'' or ``rhost'' (pronounced "roast").

A Brief Categorization

A list of tools, organized according to various interesting features. See also a listing of tools ordered alphabetically. Interesting things about the tools include:

Purpose of the tool
Supports buggy applications (that is: is the tool robust in the face of application errors?).
Supports dynamic instruction space modification (a.k.a. ``Dynamic Linking'', ``Runtime Code Generation'', or ``Self-Modifying Code'')
Supports multiple target processors
Supports multiple protection domains (address spaces)
Supports signals, exceptions and asynchronous events
Supports system-mode simulation or tracing
Implementation:
Timing simulation
Performance of the tool
Product status

Purpose Of The Tool

Simulation and tracing tools can perform a wide variety of tasks. Here are some common uses:

atr: address tracing
Classical ``address tracing'' gathers a list of instruction and/or data memory references performed by a system. There are many variations, such as tracing only targets of control transfers or tracing other resources.
db: debugging
A simulator can help with debugging because: it runs deterministically and repeatably; it is possible to query system state without disturbing it; the simulator can be backed up to an earlier checkpoint in order to implement reverse execution (``foo is twelve ... what was the value of bar in the routine we just returned from?''); and because a simulator can perform consistency checks that cannot be done on real hardware.
otr: other tracing and event counting
A generalization of address tracing is to trace, count, or categorize events on any kind of processor or system event or resource. For example, a tool may collect the common values of variables; register usage patterns; interrupt or exception event counts, timing information, and so on.
sim: (instruction set) simulation
Simulators commonly implement a processor architecture that does not yet or no longer exists. Simulators can also implement other devices such as memory, bus, I/O devices, user input, and so on.
tb: tool building
Here, ``tool building'' is meant to encompass tools that are used to build other tools, for example, a tool that builds various tracing tools is a tool-building tool, whereas a configurable cache simulator is not. The usual distinction is that a tool-building tool can be extended [NG87, NG88] using a general-purpose programming language (e.g. C, C++, ...), whereas a configurable tool is programmed with a less-powerful language e.g. a list of cache size, line size, associativity, etc.

In addition, some tools are used for

os: operating system (OS) Emulation
Compare OS emulation ``as a purpose'' with simulators that emulate the OS for simplicity (see system-mode simulation or tracing).