"Using a soft-core porn image as the default test for serious engineering normalizes the exclusion of women. It signals that the lab is a frat house, not a professional environment."
But the web is growing up. New models are trained on diverse, consented, curated datasets. Lena has been retired to the museum of computing—a beautiful, problematic, and utterly foundational piece of engineering history.
The most used image in the history of computer science was never meant to be an image at all. It was a signal. And that signal taught us how to build the visual web.
This is the story of how a single image defined the engineering constraints of the early internet and continues to haunt the ethics of dataset curation. At the University of Southern California’s Signal and Image Processing Institute (SIPI), assistant professor Alexander Sawchuk needed a high-contrast, high-detail image to scan for a colleague’s conference paper. The lab’s flatbed scanner (one of the first) was crude: 100 lines per inch, 6 bits per pixel.
The engineers were tired of the standard test images—stock photos of mandrills and peppers. According to lore, a graduate student named William Pratt walked in with a copy of the November 1972 issue of Playboy he had just bought. They tore out the centerfold, wrapped it around the drum scanner, and digitized a 5.12 x 5.12 cm crop of Lena Forsén’s face and hat.
In 2018, Nature and the IEEE officially discouraged the use of Lena. Computer Vision and Image Understanding banned new submissions using the image. Today’s web models (CLIP, DALL-E, MobileNet) are trained on billions of images from LAION-5B or ImageNet-22k. Lena is irrelevant for training. However, she remains the unit test —the minimal reproducible example.