Rosenholtz proposes to replace the concept of attention with peripheral vision models, such as the Texture Tiling Model (TTM). Here, we show that the TTM fails in many psychophysical studies due to its local, single-stage, feedforward, and low-level processing. Given that both attention and peripheral vision are unsettled fields, we argue that replacing one with the other is unwarranted.