Why Proving You Aren’t a Robot Has Never Been Harder | WSJ Tech Behind

Subtitles section Play video

It's not just you.
Captcha, the online test to tell whether you're a human or a robot, has been getting harder.
This is one of the guys who invented it.
Oh yeah, I fail them all the time.
I never know how much to say there's a traffic light if it's only like a tiny little corner of it.
The problem is that Captcha was designed to keep malicious bots out of certain websites.
But every time you've solved a test, you've actually made those bots smarter.
As more and more data was fed into these perceptual systems, they simply got a lot better at solving the perceptual tasks.
So what does it take to design a puzzle that can outsmart a bot but still be solved by any human?
This is the tech behind Captcha.
So Captcha was first used around the same time that Yahoo began giving out free email addresses.
This was the year 2000.
There were people who were writing programs to abuse different web services.
And there was no easy way to stop them.
And the idea was there should be a test that is really quick that humans can pass but computers could not.
Luis's test looked like this.
A string of letters, slightly warped and distorted, that the end user had to input into a text field.
He called it Captcha, which stands for Completely Automated Public Turing Test to Tell Computers and Humans Apart.
And he took advantage of the fact that computers at the time weren't very good at something called OCR, optical character recognition.
The way it works is a simple game of pairs.
The program scans the image and compares any shapes that emerge, like this letter P, against a database of different letters in different fonts.
The second it finds a match, it has identified the letter.
But if that P was warped, overlapping another letter, or had marks through it, then the program will struggle to find anything in its database that matches and won't be able to identify the letter.
Simple, but enough to keep out the malicious bots.
The technology behind them was relatively primitive when it came to having access to what a letter looked like.
But that advantage wouldn't last long.
Here's Luis from 2010.
In 2007, ReCaptcha was launched, an updated version that also helped to scan books.
At the time, groups like the New York Times and the Internet Archive were in the process of trying to digitize old literature.
Since optical character recognition wasn't very accurate at the time, the digitization had errors.
So, to try and help resolve those errors, ReCaptcha began showing two words from a scammed document.
One was a control word, a word the computer knew, but the other was a word it couldn't quite identify.
In answering the Captcha, half of your answer was being used to pass the test, but the other half was used to tell the computer what word it was looking at, helping to improve OCR.
Not only are you authenticating yourself as a human, but in addition you're helping to digitize books and newspapers.
The technology was so efficient that Google acquired it in 2009.
But a new problem emerged.
Because Captchas relied on machines being bad at reading.
And we just taught them to be very good at it.
Eventually, the bots were able to get Captchas right more frequently than humans.
The test needed to evolve.
So in 2012, Google deployed this image-based Captcha.
There was a switch to go from distorted characters to the harder problem of distinguishing certain things in images, where you have to pick all the ones that have a traffic light or a bicycle or whatever.
Interestingly, when it comes to boxes like this, where it's not exactly clear if this counts as a traffic light or not, there isn't actually a right or wrong answer.
The way to get it right is just what the majority of the human population says.
But for machines, this was a huge new challenge.
After all, they'd only just learned how to read.
See, with text-based Captchas, the machines only had to identify a limited range of variables – letters and numbers – from a black and white background.
But now with image Captchas, they had to be able to identify anything and spot an object in a very busy background.
However, computer vision was just around the corner.
Whereas you and I would process the stream of perceptual input, the computer vision system is basically taking the pixels and processes strings of pixels – vectorised, we would say – and as it processes these images, it's picking up patterns in the images, it's picking up patterns in the pixels.
Now for any computer vision system to perform well, it needs a lot of labelled data.
It's being trained to identify cars by having many, many, many images of cars.
The problem was that image-based Captcha was essentially a data labelling task.
By solving it, you were once again generating data that could help a bot defeat it.
The systems just kept getting better and better because they kept getting bigger and bigger.
A new approach was needed.
So in 2014, Google launched the NoCaptcha.
One simple box.
But this version of Captcha wasn't looking at whether you clicked the box.
It was looking at how you clicked it.
And how you interacted with the rest of the internet.
See, if you write code to make an object move to a certain point, like a cursor, the simplest version will make it move in a straight line at a constant speed, like a robot.
But humans naturally aren't that accurate.
We overshoot.
We don't move in a perfectly straight line.
And that is what this version of Captcha was looking for.
Human flaws.
It was also monitoring things like your internet history and your typing speed.
If the history was just a string of repeated attacks on the same website, you're probably a bot.
But if you stopped halfway through the day to browse for shoes or look at cat videos, you might just be human.
By 2018, Google had done away with the tick box entirely and launched ReCaptcha v3, based solely on that hidden data.
But even flawed human characteristics is something that a smart algorithm can eventually learn to mimic.
So are the bots always destined to win?
Well, perhaps.
See, despite other companies coming up with new and inventive tests, there's a quirk at the very heart of Captcha that means it's likely the bots will always be able to win eventually.
And to understand why, you need to go back to before the internet was invented.
To the man who Captcha is named after, Alan Turing.
He's considered one of the founding fathers of AI after he penned this paper, Computing
Machinery and Intelligence.
In it, he describes a method to test whether you're talking to a human or a machine.
We had a questioner right behind the screen and trying to differentiate between a human and a computer machine giving answers.
This now has famously become known as the Turing test.
The problem with Captcha is that the questioner isn't human.
It's a computer, and therefore any information that's input into the computer has the potential to be used by AI models to train Captcha-defeating bots.
Even Luis's original paper on Captchas says that the technology will act as a security measure for websites and advance the field of AI.
You have the research world almost continually catching up and surpassing the Captcha world.
But the solution may be to take the test out of the computer and into the real world.
We all have mobile phones and they have so many different sensors within them.
Being able to tilt the phone on instruction or being able to take a few steps in one direction or another using the sensors in the phone.
But in order for us to truly prove that we're human online, we may have to find an entirely new approach.
I think the question now is not, are we doomed?
It's how do we seize control again?
I would not be designing a Captcha today.
I think that's a losing battle.
This is too hard.