Sister blog of Physicists of the Caribbean in which I babble about non-astronomy stuff, because everyone needs a hobby

Tuesday 2 October 2018

The problems of relying only on the data

Nothing underscores the consequences of data analysis gone awry more than the story of Robert McNamara. As the Vietnam conflict escalated and the United States sent more troops, it became clear that this was a war of wills, not of territory. America’s strategy was to pound the Viet Cong to the negotiation table. The way to measure progress, therefore, was by the number of enemy killed. The body count was published daily in the newspapers. To the war’s supporters it was proof of progress; to critics, evidence of its immorality. The body count was the data point that defined an era.

McNamara relied on the figures, fetishized them. With his perfectly combed-back hair and his flawlessly knotted tie, McNamara felt he could comprehend what was happening on the ground only by staring at a spreadsheet—at all those orderly rows and columns, calculations and charts, whose mastery seemed to bring him one standard deviation closer to God.

In 1977, two years after the last helicopter lifted off the rooftop of the U.S. embassy in Saigon, a retired Army general, Douglas Kinnard, published a landmark survey called The War Managers that revealed the quagmire of quantification. A mere 2 percent of America’s generals considered the body count a valid way to measure progress. “A fake—totally worthless,” wrote one general in his comments. “Often blatant lies,” wrote another. “They were grossly exaggerated by many units primarily because of the incredible interest shown by people like McNamara,” said a third.

The use, abuse, and misuse of data by the U.S. military during the Vietnam War is a troubling lesson about the limitations of information as the world hurtles toward the big-data era. The underlying data can be of poor quality. It can be biased. It can be misanalyzed or used misleadingly. And even more damning, data can fail to capture what it purports to quantify.

“It is true enough that not every conceivable complex human situation can be fully reduced to the lines on a graph, or to percentage points on a chart, or to figures on a balance sheet,” said McNamara in a speech in 1967, as domestic protests were growing. “But all reality can be reasoned about. And not to quantify what can be quantified is only to be content with something less than the full range of reason.” If only the right data were used in the right way, not respected for data’s sake.

“When a company is filled with engineers, it turns to engineering to solve problems. Reduce each decision to a simple logic problem. That data eventually becomes a crutch for every decision, paralyzing the company.”

Big data will be a foundation for improving the drugs we take, the way we learn, and the actions of individuals. However, the risk is that its extraordinary powers may lure us to commit the sin of McNamara: to become so fixated on the data, and so obsessed with the power and promise it offers, that we fail to appreciate its inherent ability to mislead.

https://www.technologyreview.com/s/514591/the-dictatorship-of-data/

1 comment:

  1. And then there is the USSR in 1983.

    "Would they stop calling us the Evil Empire? That's hurtful! And needlessly aggressive. In fact, that Able Archer exercise from NATO in West Germany, you know what it reminds me of? Operation Barbarossa. They also said it was large-scale exercises to cover up gathering an invasion army..."
    "Then we'll ask the KGB to check the West for preparations of an attack. Here's a grid with thousands of data points to check that could potentially indicate it."
    ...
    "You know, that grid with random disparate data points is kind of full..."
    "That's it. Mobilize the Red Army. Full nuclear alert. I'll be in the bunker with the nuclear button ready. As soon as we detect their attack, we launch all we've got."
    ...
    Meanwhile, at a Soviet ICBM detection station
    "Uh, we've just detected five ICBMs."
    "Let's see - well, that's got to be a false positive. There is no way they would send only five of them."
    "But aren't we supposed to send the info to high command anyway, and let them interpret it?"
    "... you know what, they're a bit stressed out lately. Let's not bother them."


