Difference between revisions of "Releases"
Jump to navigation
Jump to search
Line 2: | Line 2: | ||
== Recent Sniper Releases == | == Recent Sniper Releases == | ||
+ | |||
+ | === Version 5.2 (September 17th, 2013) === | ||
+ | |||
+ | * Configurable coherency protocol (MSI/MESI/MESIF), made MESI the default | ||
+ | * Add more cache statistics: LRU stack distance historgram, LLC miss latency breakdown | ||
+ | * Implement auxiliary tag directories (ATD) to track constructive/destructive interference in shared caches | ||
+ | * Implement 2-level TLB hierarchy with Nehalem configuration | ||
+ | * New hooks HOOK_APPLICATION_ROI_{BEGIN,END}, called even when ROI markers are not used directly; these hooks can be used to trigger ROI from a script | ||
+ | * Improved stop-by-icount script to support ROI-relative warmup and detailed lengths | ||
+ | * Ondemand routine stack printer: configure routine_tracer/type=ondemand, then send a SIGUSR1 to Sniper to get a per-thread application backtrace | ||
+ | * Emulate leaf 11 of the cpuid instruction to pass topology information to runtimes (used by Intel OpenMP) | ||
+ | * Emulation of sched_* system calls, gettimeofday replacement, cpuid in SIFT mode | ||
+ | * Improve handling of LD_LIBRARY_PATH: use SNIPER_SIM_LD_LIBRARY_PATH for the simulator, SNIPER_APP_LD_LIBRARY_PATH for the application | ||
+ | * Re-implemented BigSmall scheduler to use thread affinity calls rather than the low-level (and error prone) moveThread API | ||
+ | * sim.thread Python interface to interact with threads (get num threads, get appid, get/set affinity) | ||
+ | * Use newest Pin version 2.13.61206 | ||
+ | * Numerous bug fixes and improvements | ||
=== Version 5.1 (June 27th, 2013) === | === Version 5.1 (June 27th, 2013) === |
Revision as of 06:15, 17 September 2013
Contents
- 1 Recent Sniper Releases
- 1.1 Version 5.2 (September 17th, 2013)
- 1.2 Version 5.1 (June 27th, 2013)
- 1.3 Version 5.0 (April 15th, 2013)
- 1.4 Version 4.2 (February 9th, 2013)
- 1.5 Version 4.1 (December 11th, 2012)
- 1.6 Version 4.0 (November 28th, 2012)
- 1.7 Version 3.07 (September 7th, 2012)
- 1.8 Version 3.06 (August 7th, 2012)
- 1.9 Version 3.05 (August 7th, 2012)
- 1.10 Version 3.04 (July 18th, 2012)
- 1.11 Version 3.03 (June 28th, 2012)
- 1.12 Version 3.02 (June 25th, 2012)
- 1.13 Version 3.01 (June 12th, 2012)
- 1.14 Version 3.0 (May 16th, 2012)
Recent Sniper Releases
Version 5.2 (September 17th, 2013)
- Configurable coherency protocol (MSI/MESI/MESIF), made MESI the default
- Add more cache statistics: LRU stack distance historgram, LLC miss latency breakdown
- Implement auxiliary tag directories (ATD) to track constructive/destructive interference in shared caches
- Implement 2-level TLB hierarchy with Nehalem configuration
- New hooks HOOK_APPLICATION_ROI_{BEGIN,END}, called even when ROI markers are not used directly; these hooks can be used to trigger ROI from a script
- Improved stop-by-icount script to support ROI-relative warmup and detailed lengths
- Ondemand routine stack printer: configure routine_tracer/type=ondemand, then send a SIGUSR1 to Sniper to get a per-thread application backtrace
- Emulate leaf 11 of the cpuid instruction to pass topology information to runtimes (used by Intel OpenMP)
- Emulation of sched_* system calls, gettimeofday replacement, cpuid in SIFT mode
- Improve handling of LD_LIBRARY_PATH: use SNIPER_SIM_LD_LIBRARY_PATH for the simulator, SNIPER_APP_LD_LIBRARY_PATH for the application
- Re-implemented BigSmall scheduler to use thread affinity calls rather than the low-level (and error prone) moveThread API
- sim.thread Python interface to interact with threads (get num threads, get appid, get/set affinity)
- Use newest Pin version 2.13.61206
- Numerous bug fixes and improvements
Version 5.1 (June 27th, 2013)
- New Suggestions for Optimization visualization (--viz-aso)
- KCacheGrind-compatible output for profiling simulated applications (--profile)
- Roaming (equal-time) scheduler allowing for thread migrations (scheduler/type=roaming)
- Support for newest Pin version 2.12.58423
- Various bugfixes and improvements
Version 5.0 (April 15th, 2013)
- Periodic sampling infrastructure
- Extensible per-thread statistics infrastructure
- Routine tracing infrastructure and per-function statistics
- NUCA cache model
- Distributed tag directories
- sim.mem Python module for reading application memory
- Various other improvements and bugfixes
Version 4.2 (February 9th, 2013)
- Various accuracy fixes for Nehalem core model
- Add cache replacement policies: NRU, MRU, NMRU, PLRU, S-RRIP, Random
- Add statistical DRAM performance model
- Add syscall enter/exit hooks
- Add topology view to visualization
- Speed up McPAT by caching architecture-specific CACTI results
- Fixes to running multiple multi-threaded workloads
- Multi-programmed mode: end simulation at first/last program end, optional trace/application restart
- PinPlay support
Version 4.1 (December 11th, 2012)
- Visualization support (--viz)
- Minor cleanups and bug fixes
Version 4.0 (November 28th, 2012)
- Thread migration and scheduler support
- Pinned (round-robin), static, random thread schedulers
- Heterogeneous configuration files with tags
- Configurable address2set hash functions for non-power of two sized caches
- Various prefetcher improvements
- DRAM cache model
- One-IPC fast-forward model
- Fault injection framework
- New SQLite3-based statistics format
- ROI support for SIFT
- Support for MPI applications (shared-memory backend)
- Limited support for Jikes/DaCapo benchmarks
- Use newest Pin 2.12.53271
- Add script for generating topology images
- Preserve history in Git repository
- Many cleanups and bugfixes
Version 3.07 (September 7th, 2012)
- Prefetcher improvements, add global history buffer-based prefetcher
- HOOK_PERIODIC_INS: Instruction-based periodic callback
- Implement CLONE_CHILD_CLEARTID syscall interface
- Add example scripts for periodic statistics, periodic McPAT, simulating limited iteration counts
- Support for Pin 2.12
- Fixes to Python environment
- Various bugfixes
Version 3.06 (August 7th, 2012)
- Fix modeled size of network messages
- Build fixes for 32-bit, compiler overrides
Version 3.05 (August 7th, 2012)
- Scheduler: expose application ID
- Add example script roi-iter.py to dynamically select ROI based on SimMarkers
- CPI stacks: --aggregate and --partial support, fixes for heterogeneous configurations
- Traces: support for 32-bit executables
- Build fixes for older Linux versions
Version 3.04 (July 18th, 2012)
- Support for running multiple multi-threaded workloads in a single simulation
- McPAT fixes for heterogeneous configurations
- Build system fixes for newer Linux versions
Version 3.03 (June 28th, 2012)
- Bugfixes in configuration parser, starting of multi-program workloads
Version 3.02 (June 25th, 2012)
- Fixes for specifying heterogeneous configurations
- L2 prefetcher improvements
- Perfect cache modeling
- Self-modifying code support
- PyControl scripting interface
- GCC 4.7 support
- McPAT integration for area, power and energy predictions
Version 3.01 (June 12th, 2012)
- Add heterogeneous cache configuration support
- Emulate pause, sleep system calls
- Improve support for 32-bit applications
- Pin 2.11 support
Version 3.0 (May 16th, 2012)
- Support for heterogeneous core types
- Separate core microarchitectural characteristics into CoreModel class
- Improve CPI stack detail
- Add initial implementation for basic L2 prefetcher
- Optionally access DRAM directly in configurations with a single LLC
- Deprecate replacement of pthread_* synchronization calls
- Support more SYS_futex options
- Remove unused code for Graphite FULL mode
- Fixes to the build system, including parallel builds (make -j)
- Support for building on 32-bit hosts
- Remove configuration defaults from code, require everything to be specified in a configuration file