原始的文章應該是這篇Johan寫的
http://it.anandtech.com/weblog/showpost.aspx?i=480
Most Games are about fast caches and super integer performance. After all, most of the Floating point action is already happening on the GPU. The Core 2 CPUs were a huge step forward in integer performance (not the least because of memory disambiguation) compared to the CPUs of that time (P4 and K8). Nehalem is only a small step forward in integer performance, and the gains due to slightly increased integer performance are mostly negated by the new cache system. In a previous post I told you that most games really like the huge L2 of the Core family. With Nehalem they are getting a 32KB L1 with a 4 cycle latency, next a very small (compared to the older Intel CPUs) 256KB L2 cache with 12 cycle latency, and after that a pretty slow 40 cycle 8MB L3. When running on Penryn, they used to get a 3 cycle L1 and a 14 cycle 6144KB L2. The Penryn L2 is 24 times larger than on Nehalem!
大多數遊戲在乎的是更快的緩存和更高級的整數性能,特別是現階段的Intel處理器對緩存依賴相比AMD嚴重得多。與P4和K8相比,Core的整數性能有了很大提升,因此遊戲表現更出色,而Nehalem在這方面進步幅度很小,有的地方甚至在倒退。看看新的緩存系統:Nehalem每個核心配備了2×32KB一級緩存和256KB二級緩存,共享8MB三級緩存,延遲分別為4個、12個和40個週期,而Penryn一級緩存延遲僅為3個週期,二級緩存延遲雖然是14個週期,但容量每個核心多達6MB,是Nehalem的24倍。注意這裡說的只是高端的完整版,低端的「閹割版」不予考慮。
在Penryn上大多數遊戲的二級緩存未能命中的幾率很小,而Nehalem就不同了。雖然集成了內存控制器,但幫助並不大,三級緩存的低速度和二級緩存的小容量影響很大。