- Chapter 1: Introduction
- Chapter 2: The Benchmark
- Chapter 3: Performance Tools
- Chapter 4: The Hotspot
- Chapter 5: Processor Architecture
- Chapter 6: Algorithms
- Chapter 7: Branching
- Chapter 8: Memory
- Chapter 9: Loops
- Chapter 10: Slow Operations
- Chapter 11: Floating Point
- Chapter 12: SIMD Technology
- Chapter 13: Automatic Vectorization
- Chapter 14: Processor-Specific Optimizations
- Chapter 15: Introduction to Multiprocessing
- Chapter 16: Multithreading with Open MP
- Chapter 17: Taskqueuing and Advanced Topics
- Chapter 18: Case Study: Threading a Video Codec
- Chapter 19: Designing for Performance
- Chapter 20: Putting it Together: Basic Optimizations
- Chapter 21:Putting it Together: The Last Ten Percent
- References
- Index