Watson Wasn't Perfect: IBM Explains the 'Jeopardy!' Errors

Dawn Kawamoto

February 17, 2011 at 8:15 p.m.·6 min read

Even big brains can have a blip. And IBM's supercomputer Watson is no exception, despite its bank of 90 IBM Power 750 servers that can process the equivalent of 1 million books of information a second.

Those blips, obviously, were few and far between -- and they didn't slow down Watson's highly publicized victory this week against two human champions on the TV game show Jeopardy!. The flesh-and-blood competitors were no slouches, either. Brad Rutter had earned $3.26 million playing Jeopardy!, making him the largest dollar winner in the show's history. And Ken Jennings had the program's longest-running winning streak.

You Can't Blame "Human Error"

On Monday, Day 1 of the three-day contest, Watson tied Rutter at $5,000 each in winnings. Along the way, however, the supercomputer hit a few blips when it came to processing clues given by Jeopardy! host Alex Trebek. Knowing that writing off the mistakes to "human error" wouldn't cut it, Big Blue's (IBM) Watson team have explained the supercomputer's miscalculations in a variety of posts on the Internet.

Sponsored Links

One of those misfires occurred during the "Name that Decade" category. The clue: "The first modern crossword puzzle is published and Oreo cookies are introduced."

Jennings was first to buzz, but gave he wrong answer of "What is the 1920s?". Watson was next to answer, but apparently wasn't listening. The mega-machine repeated Jennings' mistake, prompting a gentle scolding from Trebek. But Watson isn't listening to Trebek, either: The supercomputer has no ears, nor the ability for speech recognition.

Chris Welty works on Watson's algorithms team. According to an Ars Technica post, Welty and his crew thought it wasn't necessary for Watson to crunch through the other contestants' wrong answers.

An Arm and a Leg

Similarly, Watson may have benefited from Jennings' wrong answer in responding to the clue: "It was the anatomical oddity of U.S. Gymnast George Eyser, who won a gold medal on the parallel bars in 1904." Jennings answered Eyser was missing an arm -- and Watson then offered up, "What is a leg?"

Although Watson got the body part right, he was dinged for failing to note it the leg in question was "missing."

In a blog post, David Ferrucci, who heads up the Watson project, noted Watson likely didn't understand the word "oddity." According to Ferrucci: "The computer wouldn't know that a missing leg is odder than anything else."

But Watson's creators note that, over time and by playing more games and by gaining greater exposure to more material, the supercomputer could possibly gain a greater understanding of those concepts, since it's loaded with machine-learning technology. Watson's rivals, in mapping out their strategy to go head-to-head with it, were well aware he seemed to struggle with abstract concepts and short clues.

On Day 2, Watson missed one clue by a country mile -- better make that an entire country. During a Final Jeopardy! segment that included the "U.S. Cities" category, the clue was: "Its largest airport was named for a World War II hero; its second-largest, for a World War II battle."

Watson responded "What is Toronto???," while contestants Jennings and Rutter correctly answered Chicago -- for the city's O'Hare and Midway airports.

In a blog post, Ferrucci pointed to several issues that may have tripped-up Watson:

First, the category names on Jeopardy! are tricky. The answers often do not exactly fit the category. Watson, in his training phase, learned that categories only weakly suggest the kind of answer that is expected, and, therefore, the machine downgrades their significance. The way the language was parsed provided an advantage for the humans and a disadvantage for Watson, as well. "What US city" wasn't in the question. If it had been, Watson would have given US cities much more weight as it searched for the answer. Adding to the confusion for Watson, there are cities named Toronto in the United States and the Toronto in Canada has an American League baseball team. It probably picked up those facts from the written material it has digested. Also, the machine didn't find much evidence to connect either city's airport to World War II. (Chicago was a very close second on Watson's list of possible answers.) So this is just one of those situations that's a snap for a reasonably knowledgeable human but a true brain teaser for the machine.

Learning the Game

Like any artful player, however, Watson developed a sense of when to hold, to fold or to play.

Watson knew the Toronto answer could be big-time bust, so it wagered a mere $947.

According to a blog post by Gerald Tesauro, an IBM researcher, Watson's wagering style largely hinges on two questions: "How likely am I to answer the Daily Double clue correctly?" and "How much will a given bet increase or decrease my winning changes when I get the Daily Double right or wrong?"

As any Jeopardy! fan knows, the Daily Double can make or break a winning streak. A couple of these are hidden on the game board, and players who land one can bet from $5 to their entire holdings on that single clue. A Daily Double not only has the potential for a contestant to double his money but it's also not subject to a rival jumping in with his own answer.

Wonky Wagers

In making a wager, Watson first relies on mathematical models and algorithms, processing the data from its vast database, to determine the likelihood of a correct answer.

Answering the second question is far more involved. Watson uses a Game State Evaluator, a complex model that estimates its chances of winning based on such things as the competitors' scores, the number of remaining Daily Doubles and value of the clues remaining.

That technology also includes an in-category Daily Double confidence level, providing Watson with a view into its odds of winning a game based on a Daily Double bet. And its risk analytics software also weighs the likelihood of winning with a particular bet.

Because of Watson's betting strategy, it often ends up with nontraditional bets that forgo rounded values. Hence, the wonky $947 Toronto bet, versus a $900 or $1,000 bet.

Notes Tesauro: "Such values may make the arithmetic a little more challenging for the humans when computing their bets."

As if that's the only thing humans have to worry about when it comes to Watson. . .

Get info on stocks mentioned in this article:

HuffPost
George Conway Details ‘Oh, It’s Daddy’ Call To Ivanka That Exposed Trump’s Fears
It showed the then-president "was very, very concerned," said the conservative attorney.
4 hours ago
NY Daily News
OJ Simpson did not die surrounded by loved ones, says lawyer
The family of O.J. Simpson announced last week the former football star died on April 10 “surrounded by his children and grandchildren.” But according to Simpson’s longtime lawyer Malcolm LaVergne, the 76-year-old father of four was a sole visitor away from dying alone. LaVergne declined to tell The Associated Press who was at Simpson’s bedside when the acquitted double-murder defendant ...
16 hours ago
HuffPost
Michael Cohen Explains Exactly Why Donald Trump’s Barron Graduation Ban Whine Is ‘Comical’
The former president's onetime right-hand man pointed out Trump's history when it comes to marking his children's educational milestones.
a day ago
Snopes
Fact Check: Rumor Alleges Reba McEntire Faces 'Serious Charges' and Asked for Prayers Regarding Fox News Lawsuit. Here's the Truth
"Martha MacCallum was outraged, saying she will be filing a lawsuit against Reba McEntire and Fox for violating [a] contract," an online article read.
15 hours ago
The Daily Beast
Prince Harry Renounces His British Residency, Says America Is His Home
Photo Illustration by Thomas Levinson/The Daily Beast/GettyPrince Harry has publicly renounced his British residency, in paperwork coinciding with his first public appearance since his sister-in-law, Kate Middleton, was diagnosed with cancer.Harry spoke via video link on Wednesday at the annual general meeting of Travalyst, the sustainable travel organization he founded in 2019, before quitting the royal family.As part of the organization’s year-end procedures, it also filed company returns in w
19 hours ago
Hello!
Victoria Beckham's rarely-seen sister shares incredible photo alongside Spice Girl - and they could be twins
Victoria Beckham marked her 50th birthday on Wednesday and her rarely-seen sister shared a fabulous photo of her Spice Girl sister. See photo.
8 hours ago
The Canadian Press
Police announce nine suspects in $24M gold and cash heist at Toronto Pearson
TORONTO — Two men who worked for Air Canada and an alleged firearms trafficker are among nine people charged in a heist of nearly $24 million in gold and cash from Toronto's Pearson airport a year ago, police said Wednesday, offering new details of what happened in the "sensational" case. Peel Regional Police said their joint investigation – dubbed Project 24K – with the U.S. Alcohol, Tobacco and Firearms Bureau has resulted in a combined 19 criminal charges against the suspects, including multi
22 hours ago
HuffPost
Ex-Aide Reveals What Donald Trump Really Fears In Hush Money Trial
Because it's "not a case that keeps him up at night," claimed Alyssa Farah Griffin.
a day ago
People
Megan Fox Snaps Makeup-Free Selfie in Bra, Boxers and 26-Inch Blue Hair Extensions 'Post-Coachella'
The actress told PEOPLE on April 12 that she lengthened her bob to give it “Coachella energy”
16 hours ago
People
Gigi Hadid Is Back in a Bikini and Mermaid Hair for Victoria's Secret: See the Sexy New Campaign
The supermodel joins Emily Ratajkowski, Paloma Elsesser and Tina Kunakey in the new summer 2024 campaign
19 hours ago
Cosmo
Shania Twain is unrecognisable with butt-skimming peroxide blonde hair
Shania Twain just shared snaps with super long peroxide blonde hair. It's giving 00's Jessica Simpson and we're not mad at it.
2 hours ago
HuffPost
Lara Trump's Take On Father-In-Law's Hush Money Charges Is A Real Doozy
She may have understated the allegations just a touch.
4 hours ago
Yahoo News Canada
Loblaws Canada groceries: Shoppers slam store for green onions with roots chopped off — 'I wouldn't buy those'
A photo of green onions being sold with the roots chopped off at a Toronto Loblaws store is stirring more anger online against the Canadian grocery giant.
2 days ago
HuffPost
'He Pushed Me': Wife’s Dying Words Help Convict Man Of Murder
Moments before her death, a pregnant British attorney said her husband had shoved her off a cliff.
a day ago
HuffPost
Jordan Klepper Has Mind-Melting Encounter With Trump Supporters Outside NY Trial
"The Daily Show" correspondent tried logic on some of the ex-president's fans. He didn't get very far.
a day ago
WWD
Nike’s Women’s Olympic Uniform for USA Track and Field Criticized for Being Too Revealing: The Controversy Explained
All about the controversy and how athletes are responding.
19 hours ago
Hello!
The antiquated rule Lady Louise Windsor has to follow with her brother James, Earl of Wessex
The Duke and Duchess of Edinburgh's daughter Lady Louise Windsor has to follow a very antiquated rule with regards to her younger brother James, Earl of Wessex
3 hours ago
INSIDER
A beheading meme and 'Mark Ruffalo, naked' — how 'anti-Trump' posts got 5 New Yorkers booted from the hush-money trial
Some hilarious "anti-Trump" memes have been read aloud at his hush-money trial. Trump was not laughing.
15 hours ago
Cosmopolitan
This Lip Reading of What Taylor Swift Said to Travis Kelce at Coachella Is So Funny
Check out this hilarious lip reading of what Taylor Swift said to Travis Kelce during Coachella.
a day ago
People
Mariska Hargitay, Dressed in Her “SVU” Gear, Mistaken for Real-Life Police Officer By Young Girl Looking for Her Mom
The actress was filming one of the final episodes of the show's historic 25th season in New York City when she was approached by the child
22 hours ago

Latest Stories