Visual Thinking by Temple Grandin
Friday, January 31st, 2025 09:30 pmAt the end of December I read Visual Thinking by Temple Grandin. Grandin is one of the most famous people with autism, or perhaps it would be better to say that she's one of the people most well-known for having autism. Grandin thinks in pictures, and Visual Thinking draws on her own mind, her experience working with others, and psychological research to talk about three different types of thinking.
The book's focus is on visual thinkers, people whose main mode of thought is either object-visual thinking or visual-spatial thinking, particularly the former. People on the autism spectrum are often visual thinkers, though not all visual thinkers are autistic or vice versa and neither has a bright-line diagnostic. As an autistic child, Temple Grandin didn't learn to speak until later than many children, and she struggled in many classes in school because the material was presented for verbal thinkers and folks who could do abstract reasoning. She talks about finally connecting with coursework that made sense to her when she got to shop class, which is about working with physical objects. Another key educational experience was going to a boarding school which involved chores like caring for animals; lacking language, animals are naturally visual thinkers.
Grandin talks about the kinds of work that the three types of thinkers do well. You'll often find object visual thinkers in roles that require hands-on work: mechanics, machinists, drafters, builders, equipment operators, repair people. They're excellent at troubleshooting machinery, and they're able to tinker and build things with whatever supplies may be at hand. Visual-spatial thinkers are often drawn to engineering, mathematics, and computers; they design complex systems and find patterns in the world. Verbal thinkers are, naturally, drawn to work that's focused on words or sequential thinking. Lawyers, politicians, authors, and journalists are often verbal thinkers, and project managers can apply sequential thinking to make sure things happen on time. Grandin likes to talk about how the two types of visual thinkers are complimentary when designing something. Abstract visual thinkers can do all the modeling and calculations to ensure a factory building will be sturdy and that the machinery runs reliably. But a factory designed only by abstract thinkers likely won't run very effectively because they tend to overlook problems in the specific details of things work on the factory floor: the awkward movements a worker needs to do at the assembly line, or poor lighting of a workbench because something's blocking the lamps. Visual-spatial thinkers can make great electrical engineers, but you might want an object visualizer as an electrician. In the software space you might want an object visual thinker for a UI designer, a visual-spatial thinker writing code, and a verbal thinker as product manager, figuring out the user journeys the system will support.
A lot of the book talks about how modern American society is letting object visual thinkers down. A lot of schooling is designed for verbal or abstract thinking, and object visualizers struggle. Grandin reports that object visual thinkers often have a very difficult time with algebra, which they find too abstract and this challenge prevents many people from graduating from high school. However, many of these same people excel at geometry and trigonometry because their visual cortex can make sense of the relationships. "Let students take trigonometry if they fail algebra" might sound weird to folks used to the traditional progression of high school, but it might result in having a lot more Americans who are good at math, at least some parts of it. Grandin highlights George W. Bush's No Child Left Behind Act for making this problem worse: schools are graded and funded based on students' standardized test scores, so the incentive is to focus instruction on what will be tested. There's no standardized test that gives subjects a pile of lumber and a box of tools and asks them to assemble a chair, so when budget cuts come, classes like shop and music get taken away. Colleges and universities tend to make things even more abstract and more verbal, and the availability of trade schools, apprenticeships, and other hands-on education has been dwindling. Companies often have a college degree requirement, not realizing they may be cutting object visualizers out of the employment pool. Grandin decries that the result is an America where we're losing the ability to fix things.
This has political implications, too. (Here I'm building on Temple Grandin's foundation, she doesn't frame the book this way.) There are fewer narratives in American politics stronger than the middle class dream of success. After World War II, a man with no education beyond high school could get a good-paying job as a factory worker, a miner, an auto mechanic, doing appliance repair, or many other roles that object visual thinkers do well. These jobs offered the chance to own a home, raise a family, and enjoy leisure time; a combination that was uncommon for their parents' generation. As the 20th Century drew to a close, and accelerating in the 21st Century, the American economy shifted strongly to jobs that reward abstract or verbal thinking. Globalization sent a lot of the manufacturing jobs to lower-cost countries while domestic production became increasingly automated. (Fun fact: the U.S. is still the world's leading manufacturer, even though we employ a fraction of the manufacturing workforce we once did: American manufacturing is mostly done by machines.) Computers and the Internet gave powerful tools to abstract and verbal thinkers who are able to thrive in a service and information economy. High-tech manufacturing also made hands-on work more difficult: in the 1960s and '70s, buying a Heathkit stereo system and assembling it yourself could save significant money and teach you something useful at the same time, and you could easily fix it if a component failed. Today with high-tech automated assembly and microprocessors running everything, the cost to pay someone to fix a damaged Sonos speaker system is probably higher than the price of buying a new one. Starting in 2015, America saw Donald Trump's messaging resonate with older voters who work in these object visualizing jobs, and especially among voters who identify with such work that's been in decline like coal mining and assembly lines. Should it be any surprise in 2024 that Harris outperformed Trump by 13 points among college-educated voters, but Trump was up 24 points among men without a college degree? Millennials living in their parents' basements and playing video games all day was something of a running joke in the late aughts, but that also describes visual thinkers who've been left out of the modern American social structure and have grown to resent folks who were able to follow the abstract and verbal path through college and the modern hands-off economy.
I'd watched Temple Grandin talk about this book at Google, so I was curious how the ideas of a highly visual thinker would come across in a book, a naturally verbal and sequential medium. At times reading the book felt like listening to Temple Grandin give a talk; she often communicates in specific anecdotes, and related stories recur throughout the book. One of the big themes of the book is how people with different thinking styles can collaborate, and in the book she gives praise to her editor who's able to apply strong verbal thinking to organize the scattered notes and fragments that Temple gives her. This is certainly helpful in making this information about visual thinking accessible to verbal thinkers and readers. I found it interesting, though, that a book which is all about thinking in images and spatial relationships contains 400 pages of words and no photographs, illustrations, or diagrams. As a visual-spatial thinker, I was hoping to see some kind of 2D representation of how a visual thinker might operate. I wonder how many visual thinkers would benefit from learning this information, but get turned off by a book that's nothing but words.
I think I first heard about visual-spatial thinking, as distinct from object visualizing, from a previous Temple Grandin talk. I've considered myself on the autism spectrum for over 20 years, since I learned about Asperger's syndrome. When I heard Temple Grandin explain how her brain works I said "that's definitely not how mine works." While I can build intricate and elegant software systems, I'm perennially bad at building physical things. Cognitive tasks like mentally rotating a cube are challenging for me, but I can visualize a map of the whole world or specific areas, and at talent shows I like to recite every country in the world in geographic order. Unlike many kids with autism I didn't have any trouble learning language, and I always read above grade level. But I think my language use shows signs of spatial and abstract thinking: I've always loved puns, which feel to me like I'm demonstrating a multi-dimensional linkage between words that aren't present in the normal linear version. As a kid I often focused on the literal meaning of a set of words, much to the consternation of verbal thinkers who would say "you know what I mean!" (I still do this as an adult sometimes, but I have four decades of building a database of what people actually mean when they say something that a program would interpret differently.) I also tend to be very long-winded: an excellent verbal thinker can convey an idea without using a lot of words, while I tend to say or write down every sentence that comes to my mind related to a topic. Someone might mention a topic at lunch and I'll spend three minutes talking about all the things I learned from Wikipedia that are tangentially related. This is a bit like reading a map out loud: the river we were talking about touches another river, so let's row up stream and see where that one goes.
It's important for people creating tools and products to design with the different modes of thinking in mind. Much as a product that was designed and tested entirely by right-handed people is likely to be awkward for lefties, a product or process that was designed around the way verbal thinkers operate is likely to be frustrating to visual thinkers. Google Maps is an interesting example here. At the core, the data is inherently spatial: where is everything? But there are ways to expose that data that fit different thinking patterns. If someone wants directions from one place to another, Maps could show a path on a map and instructions like "take 6th Avenue to Broadway and turn south, then turn west on Alameda." This is great for spatial thinkers like me; I might ignore the written instructions entirely and just look at the lines on the map; if I take a different turn I can still reach the destination because I've formed a mental model of the space between here and there. Verbal thinkers might focus on the step-by-step instructions, but when they come to each intersection they'll have to figure out "which way is south?" So instead of cardinal directions, let's use left and right turns. Object visualizers still might have trouble, though: the map is an abstract set of lines and rectangles that can be hard to turn into 3D space, and the turn-by-turn instructions are a bunch of street names and numbers that need to be kept in order: "do I turn on Alameda before Broadway, or is it the other way?" So Google Maps recently added navigation by landmarks: "turn right after the McDonalds" and "turn left at the second traffic light." To get even more context, object visualizers can drop into Street View and see what all the buildings look like at an intersection. This multi-modal approach to directions is a big win over paper maps, when everyone but the spatial thinkers was faced with a daunting puzzle. The old stereotype of a woman in the passenger seat complaining that the man in the driver's seat didn't pull over and ask for directions might have been a conflict between verbal and visual thinking…
I was pleased to learn that object visual thinkers do share a trait with visual-spatial thinkers: we organize things by keeping track of their physical location. It might look like a random pile of papers on a desk to a verbal thinker, but ask a visual thinker to retrieve a particular document and they can probably find it quickly. But if a verbal thinker comes in and tries to be helpful by organizing all the papers into an efficient filing system the visual thinker will be unable to find anything, because it's not where they last put it. These two organization approaches are "piling" and "filing." My coworkers are often confused about how I can manage having 20 browser windows open with 50+ tabs in each. I find it quite easy: each window has a loosely-related theme, and I remember where on the desktop each window sits. When my workstation switched to Wayland, Chrome was unable to restore windows to their original locations when it opened and I was completely flustered; I had to spend 20 minutes a week figuring out which window was which and moving them back to their original location. On the other hand, I always wonder how someone who's only got one browser window open can keep track of what they were working on :-)
- Object visual thinking involves mental images of specific things or scenes. Thinking about airplanes involves recalling mental images of specific airplanes the person has seen, in person or in a photograph: perhaps a Boeing 737 or a Cessna 182. Temple Grandin is an object visual thinker, and famously designs livestock handling facilities by visualizing what an animal would see from a particular location.
- Visual-spatial thinking involves thinking in relations, abstractions, and patterns. Thinking about airplanes might involve thinking about the general structure of an airplane—the fuselage connects to wings, the cockpit, tail, etc.—without thinking about any specific plane.
- Verbal thinking involves words and sequences rather than images. Thinking about airplanes might involve the process of flight: boarding, taxiing, acceleration on the runway, liftoff, and so on.
The book's focus is on visual thinkers, people whose main mode of thought is either object-visual thinking or visual-spatial thinking, particularly the former. People on the autism spectrum are often visual thinkers, though not all visual thinkers are autistic or vice versa and neither has a bright-line diagnostic. As an autistic child, Temple Grandin didn't learn to speak until later than many children, and she struggled in many classes in school because the material was presented for verbal thinkers and folks who could do abstract reasoning. She talks about finally connecting with coursework that made sense to her when she got to shop class, which is about working with physical objects. Another key educational experience was going to a boarding school which involved chores like caring for animals; lacking language, animals are naturally visual thinkers.
Grandin talks about the kinds of work that the three types of thinkers do well. You'll often find object visual thinkers in roles that require hands-on work: mechanics, machinists, drafters, builders, equipment operators, repair people. They're excellent at troubleshooting machinery, and they're able to tinker and build things with whatever supplies may be at hand. Visual-spatial thinkers are often drawn to engineering, mathematics, and computers; they design complex systems and find patterns in the world. Verbal thinkers are, naturally, drawn to work that's focused on words or sequential thinking. Lawyers, politicians, authors, and journalists are often verbal thinkers, and project managers can apply sequential thinking to make sure things happen on time. Grandin likes to talk about how the two types of visual thinkers are complimentary when designing something. Abstract visual thinkers can do all the modeling and calculations to ensure a factory building will be sturdy and that the machinery runs reliably. But a factory designed only by abstract thinkers likely won't run very effectively because they tend to overlook problems in the specific details of things work on the factory floor: the awkward movements a worker needs to do at the assembly line, or poor lighting of a workbench because something's blocking the lamps. Visual-spatial thinkers can make great electrical engineers, but you might want an object visualizer as an electrician. In the software space you might want an object visual thinker for a UI designer, a visual-spatial thinker writing code, and a verbal thinker as product manager, figuring out the user journeys the system will support.
A lot of the book talks about how modern American society is letting object visual thinkers down. A lot of schooling is designed for verbal or abstract thinking, and object visualizers struggle. Grandin reports that object visual thinkers often have a very difficult time with algebra, which they find too abstract and this challenge prevents many people from graduating from high school. However, many of these same people excel at geometry and trigonometry because their visual cortex can make sense of the relationships. "Let students take trigonometry if they fail algebra" might sound weird to folks used to the traditional progression of high school, but it might result in having a lot more Americans who are good at math, at least some parts of it. Grandin highlights George W. Bush's No Child Left Behind Act for making this problem worse: schools are graded and funded based on students' standardized test scores, so the incentive is to focus instruction on what will be tested. There's no standardized test that gives subjects a pile of lumber and a box of tools and asks them to assemble a chair, so when budget cuts come, classes like shop and music get taken away. Colleges and universities tend to make things even more abstract and more verbal, and the availability of trade schools, apprenticeships, and other hands-on education has been dwindling. Companies often have a college degree requirement, not realizing they may be cutting object visualizers out of the employment pool. Grandin decries that the result is an America where we're losing the ability to fix things.
This has political implications, too. (Here I'm building on Temple Grandin's foundation, she doesn't frame the book this way.) There are fewer narratives in American politics stronger than the middle class dream of success. After World War II, a man with no education beyond high school could get a good-paying job as a factory worker, a miner, an auto mechanic, doing appliance repair, or many other roles that object visual thinkers do well. These jobs offered the chance to own a home, raise a family, and enjoy leisure time; a combination that was uncommon for their parents' generation. As the 20th Century drew to a close, and accelerating in the 21st Century, the American economy shifted strongly to jobs that reward abstract or verbal thinking. Globalization sent a lot of the manufacturing jobs to lower-cost countries while domestic production became increasingly automated. (Fun fact: the U.S. is still the world's leading manufacturer, even though we employ a fraction of the manufacturing workforce we once did: American manufacturing is mostly done by machines.) Computers and the Internet gave powerful tools to abstract and verbal thinkers who are able to thrive in a service and information economy. High-tech manufacturing also made hands-on work more difficult: in the 1960s and '70s, buying a Heathkit stereo system and assembling it yourself could save significant money and teach you something useful at the same time, and you could easily fix it if a component failed. Today with high-tech automated assembly and microprocessors running everything, the cost to pay someone to fix a damaged Sonos speaker system is probably higher than the price of buying a new one. Starting in 2015, America saw Donald Trump's messaging resonate with older voters who work in these object visualizing jobs, and especially among voters who identify with such work that's been in decline like coal mining and assembly lines. Should it be any surprise in 2024 that Harris outperformed Trump by 13 points among college-educated voters, but Trump was up 24 points among men without a college degree? Millennials living in their parents' basements and playing video games all day was something of a running joke in the late aughts, but that also describes visual thinkers who've been left out of the modern American social structure and have grown to resent folks who were able to follow the abstract and verbal path through college and the modern hands-off economy.
I'd watched Temple Grandin talk about this book at Google, so I was curious how the ideas of a highly visual thinker would come across in a book, a naturally verbal and sequential medium. At times reading the book felt like listening to Temple Grandin give a talk; she often communicates in specific anecdotes, and related stories recur throughout the book. One of the big themes of the book is how people with different thinking styles can collaborate, and in the book she gives praise to her editor who's able to apply strong verbal thinking to organize the scattered notes and fragments that Temple gives her. This is certainly helpful in making this information about visual thinking accessible to verbal thinkers and readers. I found it interesting, though, that a book which is all about thinking in images and spatial relationships contains 400 pages of words and no photographs, illustrations, or diagrams. As a visual-spatial thinker, I was hoping to see some kind of 2D representation of how a visual thinker might operate. I wonder how many visual thinkers would benefit from learning this information, but get turned off by a book that's nothing but words.
I think I first heard about visual-spatial thinking, as distinct from object visualizing, from a previous Temple Grandin talk. I've considered myself on the autism spectrum for over 20 years, since I learned about Asperger's syndrome. When I heard Temple Grandin explain how her brain works I said "that's definitely not how mine works." While I can build intricate and elegant software systems, I'm perennially bad at building physical things. Cognitive tasks like mentally rotating a cube are challenging for me, but I can visualize a map of the whole world or specific areas, and at talent shows I like to recite every country in the world in geographic order. Unlike many kids with autism I didn't have any trouble learning language, and I always read above grade level. But I think my language use shows signs of spatial and abstract thinking: I've always loved puns, which feel to me like I'm demonstrating a multi-dimensional linkage between words that aren't present in the normal linear version. As a kid I often focused on the literal meaning of a set of words, much to the consternation of verbal thinkers who would say "you know what I mean!" (I still do this as an adult sometimes, but I have four decades of building a database of what people actually mean when they say something that a program would interpret differently.) I also tend to be very long-winded: an excellent verbal thinker can convey an idea without using a lot of words, while I tend to say or write down every sentence that comes to my mind related to a topic. Someone might mention a topic at lunch and I'll spend three minutes talking about all the things I learned from Wikipedia that are tangentially related. This is a bit like reading a map out loud: the river we were talking about touches another river, so let's row up stream and see where that one goes.
It's important for people creating tools and products to design with the different modes of thinking in mind. Much as a product that was designed and tested entirely by right-handed people is likely to be awkward for lefties, a product or process that was designed around the way verbal thinkers operate is likely to be frustrating to visual thinkers. Google Maps is an interesting example here. At the core, the data is inherently spatial: where is everything? But there are ways to expose that data that fit different thinking patterns. If someone wants directions from one place to another, Maps could show a path on a map and instructions like "take 6th Avenue to Broadway and turn south, then turn west on Alameda." This is great for spatial thinkers like me; I might ignore the written instructions entirely and just look at the lines on the map; if I take a different turn I can still reach the destination because I've formed a mental model of the space between here and there. Verbal thinkers might focus on the step-by-step instructions, but when they come to each intersection they'll have to figure out "which way is south?" So instead of cardinal directions, let's use left and right turns. Object visualizers still might have trouble, though: the map is an abstract set of lines and rectangles that can be hard to turn into 3D space, and the turn-by-turn instructions are a bunch of street names and numbers that need to be kept in order: "do I turn on Alameda before Broadway, or is it the other way?" So Google Maps recently added navigation by landmarks: "turn right after the McDonalds" and "turn left at the second traffic light." To get even more context, object visualizers can drop into Street View and see what all the buildings look like at an intersection. This multi-modal approach to directions is a big win over paper maps, when everyone but the spatial thinkers was faced with a daunting puzzle. The old stereotype of a woman in the passenger seat complaining that the man in the driver's seat didn't pull over and ask for directions might have been a conflict between verbal and visual thinking…
I was pleased to learn that object visual thinkers do share a trait with visual-spatial thinkers: we organize things by keeping track of their physical location. It might look like a random pile of papers on a desk to a verbal thinker, but ask a visual thinker to retrieve a particular document and they can probably find it quickly. But if a verbal thinker comes in and tries to be helpful by organizing all the papers into an efficient filing system the visual thinker will be unable to find anything, because it's not where they last put it. These two organization approaches are "piling" and "filing." My coworkers are often confused about how I can manage having 20 browser windows open with 50+ tabs in each. I find it quite easy: each window has a loosely-related theme, and I remember where on the desktop each window sits. When my workstation switched to Wayland, Chrome was unable to restore windows to their original locations when it opened and I was completely flustered; I had to spend 20 minutes a week figuring out which window was which and moving them back to their original location. On the other hand, I always wonder how someone who's only got one browser window open can keep track of what they were working on :-)