Follow Slashdot stories on Twitter

 



Forgot your password?
typodupeerror
×
Science

Graphics That Seem Clear Can Easily Be Misread (scientificamerican.com) 54

An anonymous reader shares a report: "A picture is worth a thousand words." That saying leads us to believe that we can readily interpret a chart correctly. But charts are visual arguments, and they are easy to misunderstand if we do not pay close attention. Alberto Cairo, chair of visual journalism at the University of Miami, reveals pitfalls in an example diagrammed here. Learning how to better read graphics can help us navigate a world in which truth may be hidden or twisted. Say that you are obese, and you've grown tired of family, friends and your doctor telling you that obesity may increase your risk for diabetes, heart disease, even cancer -- all of which could shorten your life. One day you see this chart. Suddenly you feel better because it shows that, in general, the more obese people a country has (right side of chart), the higher the life expectancy (top of chart). Therefore, obese people must live longer, you think. After all, the correlation (red line) is quite strong.

The chart itself is not incorrect. But it doesn't really show that the more obese people are, the longer they live. A more thorough description would be: "At the national level -- country by country -- there is a positive association between obesity rates and life expectancy at birth, and vice versa." Still, this does not mean that a positive association will hold at the local or individual level or that there is a causal link. Two fallacies are involved. First, a pattern in aggregated data can disappear or even reverse once you explore the numbers at different levels of detail. If the countries are split by income levels, the strong positive correlation becomes much weaker as income rises. In the highest-income nations (chart on bottom right), the association is negative (higher obesity rates mean lower life expectancy). The pattern remains negative when you look at the U.S., state by state: life expectancy at birth drops as obesity rises. Yet this hides the second fallacy: the negative association can be affected by many other factors. Exercise and access to health care, for example, are associated with life expectancy. So is income.

This discussion has been archived. No new comments can be posted.

Graphics That Seem Clear Can Easily Be Misread

Comments Filter:
  • Until I read the summary, I thought the title was referring to mostly transparent graphs overlaid on a background picture where the picture would somehow obscure the graph. (PowerPoint excitement, don't you know.)
    • Re: (Score:3, Funny)

      by bob4u2c ( 73467 )

      referring to mostly transparent graphs overlaid on a background picture

      DUDE! Get out of my mind!

      I'm waiting for the follow up: "Article headlines that seem clear can easily be misread".

  • by BringsApples ( 3418089 ) on Friday August 23, 2019 @11:58AM (#59116754)

    Information That Seems Clear Can Easily Be Misunderstood

    ftfy

  • Statistics is an art not a science. Basically you choose your conclusion first and then you can always gin up the statistics needed.

    e.g. A cousin of mine used to work for an economics thinktank in an unnamed country. They try to keep on the good side of the party in power.
    They released statistics for the last 5 years showing the economy had grown. Elections happened , the opposition party came to power so they released statistics for the last 10 years which showed over 10 years the economy had stagnated und

    • Re: (Score:2, Funny)

      by Anonymous Coward

      anecdotes with no backing evidence are even more artistic. let's gamble the future of civilization on your misapprehensions

      • As Obama just bought a seaside (really right by the seaside, not even on raised land) villa for $15 million, that sends a pretty clear evidential signal that being cautious around statistics is A-OK.

    • Written in 1954, but still just as relevant today. Should be required reading in schools.

      https://archive.org/details/Ho... [archive.org]

    • Statistics is an art not a science.

      Many years ago, "The Economist" published an article on the dangers of extrapolation.

      They made a century chart of the amount of blades in a razor. For a long, long time only one.

      Then suddenly two . . . then three . . . then four . . . then five over a short period of time.

      Their extrapolation showed that we should have like 43 today.

    • This drivel has been modded up? Sure, you can work backwards from a conclusion, but then you're not applying the science of statistics, you're just telling an elaborate lie.

      Statistics is a science: a series of methods that can be applied to data in order to draw conclusions about that data. Conclusions that have been found to be valid in a wide variety of situations.

      And that includes climate, by the way. Climate scientists have no need to fabricate data. Publicly available data from a wide variety of source

    • Global warming is hoopla until you've lived it. And then you're told it's anecdotal. Regardless of whether there are 4 billion people that have lived the same changes. The cause of Global Warming may not be 100% clear, but it's effects truly are.
      • by ghoul ( 157158 )

        Even if Global Warming is actually happening, there is no clear science on whether it is actually harmful. All we have is anecdotes on some rich guys coastal property will lose value so we need to stop millions of people from developing using coal.

        The cost benefit analysis has been done and been proven that climate change mitigation measures will kill a lot more humans in developing countries than any sea level rise will.

        Millions die today due to lack of clean drinking water, sanitation, easily treatable di

    • I'd say that this shows that statistics is like a science in that it takes training / knowledge to use it correctly, and that it can be used to mislead non experts.

  • I normally at work after generate data sets, I try to encourage people not to treat the data as the call to action, but to look at parts and ask more questions about it.
    Where in turn you pull more detailed focused data, an helps answer the big question then opens up smaller questions.

    For the most part we just want to see a Good vs Bad value. But real life isn't like that.

    Unemployment is low. The numbers show that.
    However why is it so low?
    Are businesses booming and need more workers?
    Are the baby boomers retiring at a rate faster then they can fill the spots?
    Are we transitioning from a duel income family to a single income family again?
    Are people living their lives in moms basement?
    Is there an influx of people going to school full time?
    Are migration policies working to reduce the workforce?
    For the people that are employed, are they working jobs best suited towards their skills?

    Depending the answers to the questions, we could be in a really good place, or a really bad one, or it can be a combination of all the answers making a more complex situation where they issues that need to be addressed, while other things are boosting the average.

    • by Woeful Countenance ( 1160487 ) on Friday August 23, 2019 @01:02PM (#59117078)

      Unemployment is an interesting example. The US Bureau of Labor Statistics releases six different unemployment numbers, called U1 through U6 [bls.gov], based on different assumptions and different definitions of "unemployment". The number always cited is U3, which has an unintuitively narrow definition of unemployment. For example, as I understand it (I don't have a reference for this), anyone who has been paid at least $25 (USD) in the past two weeks in return for labor is considered "employed". And, of course, "discouraged workers", who are no longer looking for jobs, are not counted as unemployed.

  • facepalm (Score:4, Insightful)

    by the_skywise ( 189793 ) on Friday August 23, 2019 @12:05PM (#59116800)
    Oh lookie - we rediscovered why we use WORDS and not pictographs or EMOJI to communicate!
    • we rediscovered why we use WORDS and not pictographs or EMOJI to communicate!

      I only wish it was true.

      I recently designed a website (sad, but true). As I used to design data entry screens in Fortran on VT52's, I have been somewhat pissed that high-end Intel mega-cpus on the latest, most incredibly high performance broadband go slower than a Lear-Siegler ADM3A at the end of a 300 baud dial-up line. As data entry systems go, web sites seem piss-poor. I wondered why.

      After a couple of days wearing my best

      • I had a motorola startac with a hardware keyboard and a low res pixel screen that could do voice recognition, I could never type faster then the phone could keep up, crystal clear calls, never a dead spot, never hot to the touch on charging, perfect GPS, lasted 2 days without a charge. Now, I have a Samsung S8+ that can out type, voice recognition that misunderstands words ("ecoute-moi" is heard as "gwan"), needs charging within 8h of no use, a high res screen that shows more data than my startac but much
  • by methano ( 519830 ) on Friday August 23, 2019 @12:09PM (#59116820)
    So Alberto Cairo has figured out that a lot of people are stupid and you can fool them with statistics, especially when you provide the statistics in a graphical form. That seems like a lot of wasted research money that could be better wasted in my field, cancer research.
    • Seems to be either how we are intrinsically programmed or what we learn after years of x-y plots. It is kind of implied that there's a relationship between the two variables and so we jump right to finding a trendline rather than considering if its a coincidence, both variables are correlated to a 3rd variable that isn't mentioned etc.So in a nut shell I agree, imo (who elses'?) use your words, figure out what the real dependent and independent variables are and then only plot those when presenting things t

      • It is kind of implied that there's a relationship between the two variables

        More than an implication. They call one axis the "dependent variable" for a reason. If there is no real dependence, then the plot is bogus to begin with.

      • They writer gets the graph from a scientist, and gives it to an artist. The artist reads the headline and re-arts the graph to clearly and dramatically support the headline. That's his job, and he's good at it. There are literally people getting paid to make the data look dramatic.
    • No, the only thing Albert Cairo did was fool the editors of a semi-popular magazine into paying him for writing a silly article.

      Even stupid people don’t believe being fatter is healthier. If you show them that graph, their likely takeaway is “science is stoooopid cuz it thinks fatter people be healthier”.

  • Graphing a correlation doesn't prove causation. People assume causation when they see a fancy plot of correlation. This is what passes for News For Nerds these days. Don't nerds already know this?
    • Graphing a correlation doesn't even prove a meaningful correlation exists. If the best you can do is to draw a line through a blob of dots and quote a low correlation coefficient (i.e. less than ~0.7), then it means you haven't properly isolated the variables and drawing conclusions from those trends is more likely to be just confirmation bias rather than a meaningful trend.

  • by Ross Williams ( 4670663 ) on Friday August 23, 2019 @12:15PM (#59116850)
    The problem here does not lie with the graphic representation, but with the statistics themselves. Those problems are so ubiquitous in even reputable science that they barely require mentioning.
  • by pz ( 113803 ) on Friday August 23, 2019 @12:23PM (#59116888) Journal

    Correlation is not causation.

    NONE of the graphs in the article show anything other than a correlation. If the correlation fits your expectations, then you will have a tendency to believe it. You should only believe the presented results when they have been shown to represent a causal link. Causal links are really, really hard to show without controlled experiments, and observational results (i.e., those without experimental manipulation and at least one control group) in general fall short.

    Repeat after me: correlation is not causation.

  • ... this would be a story about how Edward Tufte might have been all wrong about charts. Instead it turns out to be more about the phenomena that the unknowing reader can be swayed by a fancy chart depicting highly questionable conclusions. "Oooh! Pretty colors! I can eat anything I want and live longer!"

    This article [theinnovat...rprise.com] doesn't have the prestigious name of SciAm associated with it but it says much the same thing. (Geez... I recall when I subscribed to SciAm back in college that the articles were significant

  • Move to Hawaii.

    Now, where'd I leave my Whopper?!

  • Comment removed (Score:4, Informative)

    by account_deleted ( 4530225 ) on Friday August 23, 2019 @12:29PM (#59116918)
    Comment removed based on user account deletion
    • by Jaime2 ( 824950 )
      Actually, Geico's slogan is worse. It's actually "15 minutes could save you 15% or more on car insurance". This has no concrete meaning at all since the "or more means that there is no upper bound to the savings and the "could" means that all other outcomes are also possible. It makes no statement that any customer will ever experience any savings.
    • According to the radio show "Under the Influence [www.cbc.ca]", the gecko was created because "a Screen Actors Guild strike prevented advertisers from using live actors."

  • "Visual thinkers" (Score:2, Insightful)

    by hey! ( 33014 )

    That always irritates me. It's not that visual thinkers don't exist, it's that people who describe themselves that way are usually just too lazy to do much thinking at all. They like pictures because they can glance at them and jump to conclusions. If you draw them a diagram and ask them to study it, they'll almost always fixate on some irrelevant graphical presentation detail.

  • Simpson's Paradox (Score:4, Interesting)

    by Woeful Countenance ( 1160487 ) on Friday August 23, 2019 @01:13PM (#59117160)

    ... happens with dissimilar groups are improperly aggregated into one group when analyzing data. Like this: "... a pattern in aggregated data can disappear or even reverse once you explore the numbers at different levels of detail."

    So how do you know which groups are "dissimilar"? That's one reason why study design and analysis are hard.

  • ...for statistics and charts is to make numbers lie. SHOCKING. :facepalm:
  • ...are you suggesting that data can be misrepresented by graphs? Or that people jump to simplistic conclusions?

    No, that's simply impossible.

  • https://www.amazon.com/Mathema... [amazon.com]
    I read this years ago, and how I read and interpret news, statistics, charts, polls, and other numerical claims has been different ever since. It's filled with a great many examples with clear explanations of how a chart or claim could be both true and misleading.
    With this background and an understanding of confirmation bias ( https://en.wikipedia.org/wiki/... [wikipedia.org] ) I've made sure to double-check any stats or charts that not only seem fishy, but ones that either seem too
  • Rule 1 of statistics. Statistics can be used to show any relationship you want. In other words, statisticians can lie and still be accurate. Whether that relationship has predictive value depends on the chi-square test. Statistics without validation is still insanity. Causation is a chain of directly linked events. Correlation is an observation of the environment of that causal chain. And only SOMETIMES very INFREQUENTLY does that correlation have a causative element.
  • The given example showed that countries with a higher obesity rate had a higher life expectancy.

    This is not a problem of bad graphics, but instead a clear case of Correlation not equaling Causation.

    In this case, the explanation was simple - both obesity and life expectancy are highly correlated with wealth of the country. The wealthier the country, the more food they had, so a higher obesity rate. Similarly, the wealthier the country the better the medical care and more importantly, education, both of whi

The truth of a proposition has nothing to do with its credibility. And vice versa.

Working...