(cache)Google’s new reCAPTCHA has a dark side | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

		Google’s new reCAPTCHA has a dark side (fastcompany.com)
		718 points by ProAm 8 hours ago \| hide \| past \| web \| favorite \| 419 comments

dessant 7 hours ago [-]

Google has been doing the same with reCAPTCHA v2 [1]. They are aware of the legal risk of outright blocking users from accessing services, so reCAPTCHA v3 contains no user facing UI, Google merely makes a suggestion in the form of a user score, so the responsibility to delay or block access and the legal liability that comes with it falls on websites.

reCAPTCHA v2 is superseded by v3 because it presents a broader opportunity for Google to collect data, and do so with reduced legal risk.

Since reCAPTCHA v3 scripts must be loaded on every page of a site, you must send Google your browsing history and detailed data about how you interact with sites in order to access basic services on the internet, such as paying your bills, or accessing healthcare services.

It's needless to say that the kind of data that is collected by reCAPTCHA v3 is extremely sensitive. Those requests contain data about your motor skills, health issues, and your interests and desires based on how you interact with content. Everything about you that can be inferred or extracted from a website visit is collected and sent to Google.

If you'll refuse to transmit personal data to Google, websites will hinder or block your access.

[1] https://github.com/w3c/apa/issues/25

benreesman 5 hours ago [-]

Your comment adds a lot to the conversation, so I don’t want to be more contrary than necessary.

It’s nonetheless a shame that it’s so universally misunderstood how ad-supported megacorps make their money that even highly sophisticated users of the web still talk about the value of personal data (source: I ran Facebook’s ads backend for years).

Much like the highest information-gain feature for the future price of a security is it’s most recent price: ad historical CTR and user historical CTR (called “clickiness” in the business) are basically the whole show when predicting user cross ad CTR. The big shops like to ham up their data advantage with one hand (to advertisers) while washing the other hand of it (to regulators).

As with so many things Hanlon’s Razor cuts deeply here: if your browsing history can juice CTR prediction then I’ve never seen it. I have seen careers premised on that idea, but I’ve never seen it work.

__jal 4 hours ago [-]

> It’s nonetheless a shame that it’s so universally misunderstood how ad-supported megacorps make their money that even highly sophisticated users of the web still talk about the value of personal data (source: I ran Facebook’s ads backend for years).

That may be the case for some people, but that is not my complaint, nor that of many folks I know.

I simply don't care how FB, Google and other surveillance outfits make money. I don't care about marketers' careers or their CTRs. I don't even care about putting a dollar value on my LTV to them.

I care about denying them visibility into my datastream. It is zero-sum. They have no right to it, and I have every right to try to limit their visibility.

Why? None of your business. Seriously - nobody is owed an explanation for not wanting robots watching.

But I will answer anyway. It is because of future risks. These professional panty sniffers already have the raw material for many thousands of lawsuits, divorces and less legal outcomes in their databases. Who knows what particular bits of information will leak in 10 years, or when FB goes bankrupt? I have no desire to be part of what I suspect will become a massive clusterfuck within our lifetimes.

If you're correct that this data has so little value, then it is more likely it will leak. FB and Google are the equivalent of Superfund sites waiting to happen, and storing that data should be considered criminal.

stickfigure 3 hours ago [-]

They have no right to it, and I have every right to try to limit their visibility.

That's entirely fair! But also: You have no right to use my website, and I have every right to limit your access.

Recaptcha is simply part of this negotiation.

__jal 3 hours ago [-]

> You have no right to use my website

Of course.

> Recaptcha is simply part of this negotiation.

It is only a negotiation if I know it is there.

jjeaff 2 hours ago [-]

I'm sure it will be mentioned in the 40 page privacy and cookie policy that pops up on every website asking you to agree before continuing.

benreesman 2 hours ago [-]

You’re commenting on HN, you know it’s there.

zcid 1 hour ago [-]

And what about the other 99.99999% of people that use the web? Do they also understand what is going on behind the scenes?

snlnspc 1 hour ago [-]

I didn't, but assume this is the case with everything. I mostly care about giving my data away for free (cut me in please), but none of my non-HN commenting roommates knew. Is their privacy less important than mine?

saagarjha 2 hours ago [-]

I do, and I can make an informed choice. Unless your website has a very eclectic audience, I’m not the only one using your services.

ajxs 1 hour ago [-]

Is that so? What about the webmaster who simply wants to combat bots using his page, is the extent of data gathering on Google's behalf just part of the deal? What if selling user data is against the webmaster's ethics? "Don't use it I guess" Sure, except that no one in the exchange was told the extent to which this data is used, or what for. Users of Google's Captcha aren't told about this exchange. I disagree entirely that it's a matter of voluntarily opting in and out of Google's domain. Their business model depends on becoming inescapable, and they're not being honest about how their services collect our data.

p49k 2 hours ago [-]

That's what's great about GDPR. It makes privacy a fundamental right that can't be bargained away, much like you can't sign a contract binding you to slavery and you can't accept a bonus from your employer in exchange from losing your mandated breaks.

ajxs 1 hour ago [-]

If I could upvote this comment twice, I would. This succinctly summarises my views on the subject. We shouldn't have to justify _why_ we don't want our private information harvested by these companies. I would still feel remarkably uneasy even _if_ Facebook and Google were demonstrably benevolent citizens of the online world, but we've seen time and time again how invasive and malicious they can be. The fact that both of these companies have political ambition makes the entire situation much scarier. Count me out.

benreesman 4 hours ago [-]

You could either stop using these services or (as I suspect) you find them too valuable to dismiss entirely quarantine them to a VPN/incognito interaction in less time than it took to type that comment.

I don’t want to single you out personally but there’s a broad trend on HN of bitter-sounding commentary on the surveillance powers of these companies by people who can easily defeat any tracking that it’s economical for them to even attempt let alone execute that reeks of sour grapes that a mediocre employee at one of these places makes 3-20x what anyone makes (as a rank and file employee) anywhere else.

Again, you’re not likely part of that group, but seriously who hangs out on HN and can’t configure a VPN?

kbenson 3 hours ago [-]

> You could either stop using these services or

How do you stop using a service when you have little or no indication that it does something like this before hand, and afterwards the privacy is already gone?

If I use a site and view my profile page and the url contains aa account id or username and some google or facebook analytics is loaded, or a like button is sitting somewhere, how am I to know that before the page is loaded? What if I'm visiting the site for the first time after it's been added?

It doesn't even matter if I have an account on Google or Facebook, they'll create profiles for me aggregating my data anyway.

> quarantine them to a VPN/incognito interaction

Which does very little. I spent a few hours this morning trying to get a system non-unique on panopticlick, but the canvas and WebGL hashing is enough to dwarf all the other metrics. There are extensions to help with that, but for the purpose I was attempting, were sub-optimal (and the one that seemed to do time-based salting of the hashes wasn't working right).

So, I don't have any confidence that a VPN and incognito really does much at all.

benreesman 3 hours ago [-]

I assure you that a clean browser and IP will break any surveillance that I know about.

kbenson 3 hours ago [-]

No, a clean browser and IP with the combination of what fonts I have installed, how my video card renders a canvas and WebGL instance (which may be affected not just by the video card you have, but the driver version used with it), my screen size, and a few other system level items that come through may or may or may not be enough to uniquely identify you. Along with linking to a prior profile if you screw up one time (or load a URL that has identifying information they can use), and you're busted.

So, sure, a clean browser and IP and never logging into a site you're previously visiting might be enough, but who does that, and doesn't that halfway defeat the purpose?

benreesman 3 hours ago [-]

I appreciate the information-theoretic validity of your argument, but if you think that one of these firms cares enough about your buying preferences to burn enough compute to find that correlation then you either work for the CIA or are mistaken.

kbenson 3 hours ago [-]

It doesn't take a lot of compute resources to have multiple profiles, and when evidence of a high assurance level (a referring URL that is known to designate a specific user of a major service) to link it with other profiles that also have that designation.

To me, that seems par for the course for any service that's generating profiles of browsing behavior and trying to make any sort of decisions based on it. It reduces cruft and duplicate profiles while also providing more accurate information. Why wouldn't it be done?

> the information-theoretic validity of your argument

The portion about canvas, WebGL and AudtioContext hashing is not theory at all, it's well known practice from years ago. Jest the other day here there was a story about some advertiser on Stack Overflow trying to use the audio hashing to tracking purposes.

Hell, if you get enough identifiable bits of entropy, you can probably assume weak to strong level matching using a bit-level Levenshtein distance that's low enough.

benreesman 2 hours ago [-]

GitHub is always at your disposal. NV doesn’t sell the consumer cards to enterprises. So on AWS a multi-GPU box will cost you about 12 dollars an hour. If you can disambiguate, let’s just say 85% of profiles absent IP or cookies, well I think you just broke the academic SOTA and I’d love to make some calls.

Cheat sheet: you can’t.

saagarjha 1 hour ago [-]

> GitHub is always at your disposal. NV doesn’t sell the consumer cards to enterprises. So on AWS a multi-GPU box will cost you about 12 dollars an hour.

I don’t see how this is related to the claim, since it doesn’t solve the problem. But the advertising company that I let run code on my website will certainly do the job pretty well, I’d say.

literallycancer 3 hours ago [-]

You use something that blocks scripts (like uMatrix) with an aggressive ruleset. On some sites you'll need to allow things to make them work. If they are loading trackers from the same servers that they load content from, you can't do much without wasting more time than you want. I'd say it breaks most of the tracking though.

More sites than you'd expect work without js or with first-party js only. It's annoying when you need to read a news site, because those are usually bloated garbage. Not a huge loss.

kbenson 2 hours ago [-]

This was already with uBlock Origin. Also tried combinations of Ghostery and Privacy badger. All of it made very little difference for panopticlick, and that's probably a low-bar compared to what's common these days.

amenod 3 hours ago [-]

> You could either stop using these services or ...

Are you serious? Have you tried not using their services? Try blocking Google Analytics, Tag Manager, ReCaptcha, fonts, gstatic,... What you will see is that you can no longer access much of the Internet. Want to participate in StackOverflow? Good luck if you block Google.

My beef is not with them trying to find my data when I'm on their site(s). They are however everywhere, on almost every site I visit. Coupled with their (impressive) technical provess it is beyond creepy, and there is simply no way one can avoid them.

I don't know what the solution is or will be, but as far as I'm concerned, this should be illegal.

Freak_NL 3 hours ago [-]

> Try blocking Google Analytics, Tag Manager

Blocking those two doesn't seem to break much, does it? I have uBlock Origin and/or Privacy Badger block them everywhere.

ReCaptcha on the other hand…

Just this week I needed it to complete the booking of an airline ticket and just now buying a high chair for my son. And today I've completed the blasted thing ten times in a row because of a game installer that was failing at a certain point (GTA V's Social Club thing); each attempt to figure out what was wrong meant completing the ReCaptcha again.

Fire hydrants, parking metres, pedestrian crossings, road signs, hills, chimneys, steps, cyclists, buses — that's what the internet looks like in 2019.

benreesman 3 hours ago [-]

Unfortunately politically acceptable regulation only deters new ventures because it makes the costs of compliance too high.

The right vehicle for this is antitrust, but if you think you can sell that in this climate then I’ve got a great deal for you on the London Bridge.

dmitriid 3 hours ago [-]

The costs of compliance are not too high. Compliance is actually ridiculously easy for new companies: they need to collect only the data they need. That is all there is

benreesman 3 hours ago [-]

https://en.m.wikipedia.org/wiki/General_Data_Protection_Regu...

dmitriid 3 hours ago [-]

Yes. Your point? It’s actually ridiculously easy to be compliant with GDPR.

Edit: That is, ridiculously easy for new companies. Incumbents have been hoarding data for too long and it was actually harder for existing companies to become compliant.

benreesman 2 hours ago [-]

If you don’t think that lawyer fees scale linearly with regulation complexity you’re either an early Uber employee or mistaken.

When you’ve built a social consumer business in Europe that is profitable after compliance, send me a term sheet.

snlnspc 1 hour ago [-]

I enjoyed reading what you said as a different perspective on the backend of ad technology vs privacy up until this comment thread.

I didn't build a profitable social consumer business in Europe after compliance, but I was part of a team that implemented compliance for a long existing company within the US due to them having clients and client's clients in Europe. They're profitable. Do you want my term sheet? Or are you weakly attempting to flex while complaining that people's basic right to privacy is preventing you from earning obscene amounts of money?

losteric 3 hours ago [-]

Most people here can avoid the impact of climate change - do you think we shouldn't talk about that either?

These are societal problems. It's good to care about people beyond yourself, and to talk about the professional ethical responsibilities of software engineers with regards to corporate mass-surveillance.

bad_user 3 hours ago [-]

How about our friends and family? Should we configure a VPN for them too?

Btw the argument you just made applies to any form of surveillance or censorship. Just because your can still find functional VPN services for China, is China's great firewall OK?

And what happens when web services start blocking VPNs?

Netflix does it quite successfully. And I'm sure Cloudflare could provide such a service for free.

benreesman 3 hours ago [-]

I’m not making a moral argument for the surveillance state, I wear Curve25519 on one arm and the word “citizenfour” on the other.

I agree that there is a vast and almost impossible to regulate overreach by these companies. Your argument is extremely compelling.

But when HN users complain about being spied on I smell a FAANG rejection letter.

stOneskull 3 hours ago [-]

People care about others, not just themselves.

benreesman 3 hours ago [-]

I agree, but search “HN levels.fyi” to understand that we’re in the minority on that.

xg15 3 hours ago [-]

Unless the topic is affordable housing, that is.

heartbreak 2 hours ago [-]

> But when HN users complain about being spied on I smell a FAANG rejection letter.

You’re projecting Ben.

saagarjha 1 hour ago [-]

> when HN users complain about being spied on I smell a FAANG rejection letter

I work at a FAANG: here’s my complaint about being spied on.

deckard1 3 hours ago [-]

> You could either stop using these services

No you can't. Facebook creates shadow profiles for every single person in the world. If any single one of your friends has WhatsApp, Facebook has your phone number. They have your phone number and the entire address book of your friend, who probably has friends in common. If two of your friends have WhatsApp and they both have your number...

You see where I'm going here? There are pictures of me on Facebook that I did not put there. From friends or friends of friends.

I'm not even scratching the surface of what Google knows with GPS and WiFi connections.

No one consented to any of this bullshit.

benreesman 3 hours ago [-]

There’s a reasonable argument in there, but it applies to any world in which digital cameras are cheap.

This is in a sense the worst kind of argument: superficially correct but really meant to tap into a popular groundswell of sentiment.

The question isn’t “can FB use an off-the-shelf CNN to identify me personally” but rather:

“If it weren’t FB who would be doing it instead?”

and:

“Should cheap digital cameras be illegal?”

cyphar 24 minutes ago [-]

> The question is “If it weren’t FB who would be doing it instead?” [...] “Should cheap digital cameras be illegal?”

Those are a complete non-sequitur.

Facebook (and Google) analyse every single photo that goes through their system with state-of-the-art ML (it's so good that it almost beat humans at matching faces ~5 years ago). This is a scale of surveillance which the human race has never encountered before in our history[+], and is a serious problem that we (as a society) need to make a decision on. In many countries, car license plates are OCR'd and automatically tracked whenever they travel on almost any main public road. Facial recognition in public places and on public transport is becoming a prevalent problem. And wearing masks is illegal in many countries -- meaning there is no way of "opting out" of the pervasive surveillance in the physical world. None of these things were nearly as commonplace ~30 years ago.

Cheap digital cameras are a completely unrelated topic. And if such large-scale surveillance was made illegal then nobody would be doing it legally, and those doing it would be held accountable for the public health risk they pose.

[+] The Stazi and KGB only really had filing cabinets for tracking people and physical surveillance measures. The Gestapo didn't even have that (the Third Reich had census data which was tabulated using IBM machines in order to track who was Jewish within the Third Reich).

saagarjha 4 hours ago [-]

> reeks of sour grapes that a mediocre employee at one of these places makes 3-20x what anyone makes (as a rank and file employee) anywhere else

This is not an argument and moreover not even true: there are companies that pay well and don’t collect reams of data on their users.

benreesman 3 hours ago [-]

You didn’t address my argument and unless you’ve been on more comp committees than me then I would annotate that as sources needed.

saagarjha 2 hours ago [-]

Like I said: it’s not an argument, it’s an attack. Plus I’m sure that there’d be many people here able to counter your claim regardless of the compensation number you drop.

heartbreak 3 hours ago [-]

This is a ridiculous argument. Advanced technical competency can not be a prerequisite for maintaining personal privacy.

benreesman 3 hours ago [-]

We’re on a site premised on entrepreneurship, and you’re pointing out what sounds like a big market gap. I angel invest now and then, if you have a plausible way to make two billion people care about something that we agree could be better my email is in my profile.

Even from the inside I didn’t see a way, but I’ve been wrong before.

FabHK 1 hour ago [-]

Yes, looks like the industry cannot solve that problem alone, just like the electricity and chemical industries somehow didn't achieve clean air and water out of the goodness of their hearts. Another market gap. Or, wait, a case for government regulation.

__jal 3 hours ago [-]

> or (as I suspect) you find them too valuable to dismiss entirely quarantine

You are wrong. I block the known IP blocks of the big surveillance shops and a lot of the small ones[1].

> sour grapes that a mediocre employee at one of these places makes 3-20x what anyone makes

Are you sincerely saying you believe people who are uneasy about surveillance are just jealous?

[1] Twitter is currently an exception, I was playing with something. But I'm going back to blocking them soon.

13415 3 hours ago [-]

A VPN will not help you against advanced behavioral browser fingerprinting like in this new Captcha. Not only do they have lists of VPN servers anyway, if you inadvertently log into your Google account once from the VPN (e.g. by launching your browser from your normal account), then the VPN IP(s) will be forever associated with your account and normal IPs, and they already know from the Captcha data that you're one and the same person. All the VPN does is adding the information that you sometimes use VPN servers of company such-and-such.

davidgerard 3 hours ago [-]

> sour grapes

seriously?

benreesman 3 hours ago [-]

search “HN levels.fyi”

dang 1 hour ago [-]

I appreciate your comments in this thread very much but could you please stop baiting people on this point? If there's one thing I've learned from running HN it's that the generalizations about the community that people come up with are invariably wrong. They're overgeneralized from a small sample of what the generalizer happened to notice—and since we're far more likely to notice what rubs us the wrong way, the results always have have sharp edges. In other words, people remember what they saw here that they liked the least, then tar the whole with it. To borrow your phrase, the TLDR is less interesting.

luckylion 3 hours ago [-]

> Again, you’re not likely part of that group, but seriously who hangs out on HN and can’t configure a VPN?

Recaptcha tracks users / devices, not IPs. A VPN won't help, it'll only lower your score. At that point: not allowing them to track you just means you can't use large parts of the web.

"You don't want that GPS tracker installed into your skull? Well, we won't force you, of course, but public transportation, government services and most grocery stores can only be used by GPS-skull-people"

benreesman 3 hours ago [-]

Wild speculative hyperbole hurts the case of people like you and I who care about doing something positive on the ground today.

bongobongo 3 hours ago [-]

It is not "wild speculative hyperbole" not to give the benefit of the doubt to companies that have repeatedly demonstrated that they are not entitled to the benefit of the doubt.

benreesman 3 hours ago [-]

GPS tracker installed in people’s skull sounds hyperbolic to me.

gfosco 5 hours ago [-]

I think it's worth pointing out that the comment you replied to didn't mention money, advertising, or CTR. People are concerned about data collection for more reasons than that. You've seen these attempts and entire careers about it without "juicing" CTR, so perhaps that isn't the true intent.

benreesman 5 hours ago [-]

I admit that I inferred the proposed intent for grabbing maximum personal data, but if you’re interested in anecdotes from the trenches: no one below senior director level gets a couple million in stock for any other reason than they pushed CTR by a few basis points. What I was trying to say is that seen through the lens of mechanism design no one is incentivized to query the like button table because there’s no upside in it.

jakobegger 4 hours ago [-]

I'm not sure I understand correctly. Are you saying that all the personal user data is in reality not as valuable as everyone says it is? That is, all those megacorps are collecting terabytes of mostly useless data?

Then why is this data collected and archived in the first place?

benreesman 4 hours ago [-]

I was never involved in those decisions but I suspect that when you’ve got a multi-dollar CPM and your biggest pain in the ass is pouring concrete and running power fast enough that a few PB of spinning disks are cheap enough that you hang onto it in case you ever find a way to make it useful.

ViViDboarder 4 hours ago [-]

That sounds logical. It’s also exactly the reason many of us don’t want to give up our information to these companies. There is absolute uncertainty as to how it will be used in the future.

avarun 4 hours ago [-]

Yes, that's exactly what he's saying.

It's mostly all collected because it's easier for them to collect it than not to collect it, and nobody is stopping them from collecting it.

EpicEng 5 hours ago [-]

The fact that so much potentially sensitive data exists in a few repositories is in itself a bit foreboding. Who knows what companies will be able to glean from it one, five, or twenty years down the road?

My behavior on the web being tracked by corporations with little incentive to do right by me is worrisome.

benreesman 4 hours ago [-]

I’m more concerned that they’re designing the next version of the Web right under our noses than that they know what kind of sneakers I’m 8% more likely to buy.

saagarjha 3 hours ago [-]

However, I’m concerned that the data also allows them to see that I am 93% more likely to vote for a certain political candidate, 22% more likely to contract a chronic disease in the next ten years, and 16% more likely that I will have a friend that homosexual.

EpicEng 2 hours ago [-]

I'm not thinking about ad delivery, I'm thinking about behavioral analysis. Knowing how a person thinks and acts can be a very useful weapon in the wrong hands, and FB and the like have done nothing to make me think their hands are the right ones (I don't think any are really.)

owldimoon 4 hours ago [-]

I'm not sure where to add this comment, but I just wanted to briefly say that I appreciate your contributions to this topic. Both in terms of content and tone/delivery. These seem like constructive and valuable comments to me, so thanks!

ehsankia 4 hours ago [-]

What other reasons do you think the original post was implying for Google to collect all this data?

benreesman 4 hours ago [-]

I think the implication was that the leadership is hanging onto all that data because of an immediate fiduciary obligation. I suspect that it’s more in the nature of when you’re running a business in which a few hundred million QPS is slow that you archive in case it ever becomes useful.

bscphil 4 hours ago [-]

Unless the point of your comment was to deny that Google is collecting this data at all (because, according to you, there's no financial incentive), I don't see the relevancy of your criticism. The complaint of the top level comment was that Google is collecting extremely personal data on us. Your response is that Google doesn't have an immediate financial incentive to do this. If you're not actually denying that Google collects this data, why does that matter? For most of us, the fact that our personal data has some financial value to a corporation is irrelevant to the fact that we don't want them to have it.

smsm42 5 hours ago [-]

That's the annoying part of it. They try to collect everything about me, down to my favorite color and the brand of tea I am drinking, and they can't even deliver a semi-relevant ad. Best they can do is to bombard me with shoe and riding classes ads for 6 month after I search for "weight of a horseshoe" and stuff like that. They kill the privacy, they make 99% of the sites unusable without an ad blocker, and at the end it doesn't even amount to them making relevant ads...

benreesman 4 hours ago [-]

If I were you I’d be more worried that Google de facto controls whatever we’re calling HTTP these days than that they have a BigTable entry that ties a browser you once used to a preference for Earl Grey.

saagarjha 3 hours ago [-]

This is a false dichotomy: I don’t see why I can’t be worried about both.

smsm42 3 hours ago [-]

I am worried by both. In fact, I am worried by more than these two things about Google, but listing them all would probably take this discussion way off course.

dessant 3 hours ago [-]

No offence, but your posts in this thread appear to be projections, and they derail the conversation.

The main topic we discuss is corporate surveillance. We are concerned about all the personal data that leaves our control. We are worried that evading this type of surveillance becomes increasingly difficult.

Some HN users may know how to mitigate these risks, but most people may not know how to defend themselves against corporate surveillance.

This is why me must speak up now, and not just for ourselves.

jackpirate 5 hours ago [-]

Can you provide any evidence that personal data doesn't improve CTR prediction for companies like Google/Facebook?

You state yourself that Google/Facebook publicly claim to advertisers that personal data improves CTR prediction. So I have a hard time believing that personal data isn't useful.

benreesman 5 hours ago [-]

I’m already on a shaky limb being so candid about how the business actually works. If you want the opinion (albeit a little dated but still relevant) of someone who doesn’t give a fuck about who the truth pisses off I recommend a book called “Chaos Monkeys” written by a former YC (exited) founder.

benreesman 3 hours ago [-]

I am falling behind replying to all the comments that this has generated.

For the record I am inked all over with anti-equation group stuff: I agree that these companies are too big and powerful (and I would know).

I just don’t see a solution with the present judiciary. If anyone has a bright idea my email is in my profile.

I will thank you all in advance for not shooting the messenger.

kjaftaedi 5 hours ago [-]

Anectotally, I keep no browser history and do not feel my experience with captchas is different than a user who does.

benreesman 5 hours ago [-]

I would contend that the reason for that is that none of the engineers involved get paid more if that experience is different.

luckylion 3 hours ago [-]

> If your browsing history can juice CTR prediction then I’ve never seen it. I have seen careers premised on that idea, but I’ve never seen it work.

Isn't demographic targeting exactly that, based on your browsing history? Will showing an ad for a car wash have the same CTR for people that liked car products as for people that did not like car products? Or is your point that it still has to be a human that inputs "this is about car things, please show it to people that like car things" and it's not a magic AI that optimizes it automatically? And in that case: isn't that just a matter of time? Build the profile today, build the tech that uses it tomorrow?

anth_anm 3 hours ago [flagged] [-]

> source: I ran Facebook’s ads backend for years

Why would anyone ever trust a goddamn thing you have to say about their data?

Unless they pay your salary and are asking you to give your expertise on hoarding and abusing user data, obviously.

dang 1 hour ago [-]

We've banned this account for repeatedly breaking the site guidelines and ignoring our many requests to stop.

If we allow users to harass and attack people who have genuine expertise for posting here, does that make HN better or worse? Obviously worse. Mob behaviors like this are incompatible with curiosity.

https://news.ycombinator.com/newsguidelines.html

benreesman 3 hours ago [-]

Me spilling tea about the business is far more in the spirit of a whistleblower than a shill.

I have nothing to gain and everything to lose by shedding light on one of the most powerful entities in existence.

But TLDR it’s not as interesting as people like to think.

saagarjha 1 hour ago [-]

You can gain internet points on a social website…

dao- 6 hours ago [-]

I'm not sure there's a significant legal difference in the end. If someone could demonstrate that alternative browsers regularly get a lower score than Chrome, that seems like a pretty good antitrust case.

Or were you referring to the risk that individuals would sue Google for getting blocked from random, potentially essential websites?

bduerst 6 hours ago [-]

Not GP but they most likely meant the second. The V2 prompt blocks people from accessing services, which could be construed as damages at scale.

You do bring up a good point about the V3 being potential antitrust issue, but that has always been a potential problem even with earlier versions of recaptcha. With V3, it's also deferring the liability to the webmaster. The action that the website takes with the score is up to them - in the end it's just a number.

miohtama 3 hours ago [-]

From the service provider and devops perspective I find reCAPTCHA beautiful. It brings down malicious form fill, form spam, user creation and password brute forcing rates.

Also as a VPN user, I found out that migrating to more expensive, higher grade VPN, solved a lot of my problems.

In the end it is not privacy, not your VPN that matters from the service provider point of view. It matters that your IP address is spewing malicious garbage. I do not want to spend time sorting it out, as I can focus my activities to revenue generating tasks. Harming some cheap VPN users in the process is collateral damage, but I rather take it than build a form with a perfect attack mitigation and 10x cost.

I hope to see some alternative for reCAPTCHA that does not come with such a strong privacy oriented risks. hCAPTCHA https://www.hcaptcha.com/ seems to be interesting, also monetization point of view. But they are not yet well established company and I do not know what other risks their approach would bring.

OrgNet 3 hours ago [-]

I don't even use a VPN and have lots of issues solving google's captcha...

miohtama 3 hours ago [-]

Potential other causes

- Your ISP is a source of a lot of malicious traffic

- You have some browser extension or other adjustments that makes it harder to analyse you as a genuine web browser

For example, using a browser automation like Selenium testing triggers "hard" reCAPTCHA. Not sure if this because of some automated API that Selenium exposes, or just because your browser profile looks virgin (no cookies) without any prior reCAPTCHA solves.

OrgNet 3 hours ago [-]

I use pretty standard extensions... uBlockO, decentral eyes, smart referrer... I just wish that companies would stop using Google's reCAPTCHA service.

Also my IP address rarely changes and I don't think that any malicious traffic is coming from it.

And I have Comcast, so I hope that they didn't blacklist all of us...

(I did talk bad about Google a few times though, maybe that's it)

saagarjha 1 hour ago [-]

Those aren’t extensions that an average user would install.

bscphil 4 hours ago [-]

> Since reCAPTCHA v3 scripts must be loaded on every page of a site, you must send Google your browsing history and detailed data about how you interact with sites in order to access basic services on the internet, such as paying your bills, or accessing healthcare services.

> If you'll refuse to transmit personal data to Google, websites will hinder or block your access.

I wonder how true this really is. 20% or so of web users have ad blockers, and most ad blockers block scripts like Google Analytics out of the box. It isn't hard to see that most of them will not make exceptions for a new Google tracking script. So any site that does any kind of testing at all is going to see that ~15% or so of their users drop off if they block users who don't have a reCaptcha v3 score. The only sane business decision in response to this is to go with some alternative.

(Of course, there will be some sites that continue to block users, it's just that they will mostly be the sites that already block users running ad blockers.)

judge2020 3 hours ago [-]

Even UBO doesn't block ReCaptcha by default, so I don't see Rv3 being added to easylist anytime soon.

AgentME 18 minutes ago [-]

It doesn't block it because it's generally not active on all pages of a site. The description of v3 sounds more like Google Analytics and will probably be treated similarly.

stri8ed 6 hours ago [-]

Is it that different from the way Google Analytics works?

frenchyatwork 6 hours ago [-]

It is, in the sense that it's easy to disable Google Analytics by disabling tracking in Firefox, and there's no consequences. If a website uses reCAPTCHA, and you have tracking disabled, the website will break.

behringer 3 hours ago [-]

Works for me. Assuming one uses something like Privacy Badger and it if it were programmed to block reCaptcha, these websites that require recaptcha will go the way of anti-adblocker popups. People will simply say no and hit the X and go to their competitors.

joaobeno 3 hours ago [-]

Sure, my gov (Brazil) uses reCaptcha on the page where you can check your electoral status (For example: if you can vote, where, and if not, what is missing). Where can I find a competitor for that?

behringer 2 hours ago [-]

Ask your political representative why they're relying on a foreign ad service to manage their government websites.

dessant 6 hours ago [-]

You should expect a similar impact on your privacy.

The important difference is that unlike Google Analytics, reCAPTCHA v3 is inescapable. You cannot prevent the collection of your personal data, because then you would loose access to large portions of the web.

tinus_hn 6 hours ago [-]

You can’t block recaptcha!

edraferi 5 hours ago [-]

Why not? Is it always self-hosted?

blacksmith_tb 5 hours ago [-]

I think they meant "you can't block reCAPTCHA and still access services behind it" - technically you could add a rule to uBlock Origin etc. to block it, but then you'd be unable to use those site/services.

thekyle 5 hours ago [-]

> Since reCAPTCHA v3 scripts must be loaded on every page of a site, you must send Google your browsing history and detailed data about how you interact with sites in order to access basic services on the internet, such as paying your bills, or accessing healthcare services.

I don't believe this is true. You only need to include the JavaScript on pages which actively use the reCAPTCHA score. For example, you might only include it on the login and user registration pages.

zymhan 4 hours ago [-]

Did you read the article?

> To make this risk-score system work accurately, website administrators are supposed to embed reCaptcha v3 code on all of the pages of their website, not just on forms or log-in pages.

luckylion 5 hours ago [-]

Isn't the idea that they can decide whether it's a user or a bot based on what the user does in general, not just whether their browser executes JS on this page that you want to protect?

Running headless chrome is trivial, so just having it sit on the one page where you need to check it won't help much. Collecting more data on the user's action on your site will provide a much clearer picture, much like a video from somebody walking through a store will help you make a decision about whether he's trying to steal something than a single picture of him standing at the check out.

judge2020 4 hours ago [-]

The big "if" here is whether or not Google is actually factoring the user's activity into the score. For all we know, there could be a 80/20 split between "Google account activity" and "human-like behavior on website" when Google outputs a trust score.

dessant 5 hours ago [-]

This comment has been detached, originally it was a reply to https://news.ycombinator.com/item?id=20295333.

bad_user 3 hours ago [-]

If the v3 script is supposed to be installed on all pages of the website, in order to track the user's actions, I don't understand how that can be done without explicit user consent under GDPR.

_Codemonkeyism 3 hours ago [-]

"reCAPTCHA v2 is superseded by v3 because it presents a broader opportunity for Google to collect data, and do so with reduced legal risk."

And if you use something to prevent tracking - in my case Brave - reCAPTCHA is a huge pain that often takes dozens of clicks to make it through - delayed by Google to wait out bots.

Some times I think reCAPTCHAs main goal is to bring back those opposing tracking back into the fold of Chrome with painful recaptchas.

modzu 5 hours ago [-]

in other words this is a callout to all webmasters:

please consider not using recaptcha.

superasn 7 hours ago [-]

There are a lot of sites that are totally unusable on Firefox regardless how much you use ff.

I do all my mobile browsing on FF yet when I try to use some websites I always get this Recaptcha failed error(1) while it works flawlessly on chrome though I never use it often. Try it, maybe it will happen for you too.

Same happens on most sites which show you that "checking your browser" page via cloudflare too.

The web is very unusable unless you're using chrome because of such antics.

(1) https://cdn3.imggmi.com/uploads/2019/6/27/0dd96b25707ce6e236...

ulfw 7 hours ago [-]

It's even worse when you're running a VPN (especially one of the major public ones). When I see reCAPTCHA I basically give up as sometimes I have to go through 6 or 7 full sets to be let into a site. It's the evil of the internet this.

oil25 7 hours ago [-]

reCAPTCHA on VPN is difficult, but on the Tor network, they are downright impossible. I've never been able to get past it, even after a few dozen painful attempts. That means Google services are entirely off-limits over Tor, even Search, which is a disgrace.

beefhash 6 hours ago [-]

> That means Google services are entirely off-limits over Tor

If only it was Google services alone. CloudFlare loves serving up a ReCAPTCHA for Tor users before they can even passively read site contents. That hugely expands the damage done.

deftnerd 6 hours ago [-]

Install the PrivacyPass Firefox or Chrome extension. It was developed by Cloudflare, Firefox, and Tor in partnership. It has you answer a ReCAPTCHA and using some crypto magic, generate a bunch of CAPTCHA bypass tokens that can't be traced to your specific computer.

https://support.cloudflare.com/hc/en-us/articles/11500199265...

https://blog.cloudflare.com/cloudflare-supports-privacy-pass...

https://blog.cloudflare.com/privacy-pass-the-math/

https://github.com/privacypass/challenge-bypass-extension

johndough 5 hours ago [-]

Does not work with Tor.

The plugin requires "privacy passes". Those passes can be obtained by solving captchas, but when trying to do so, one is greeted with this message about being blocked: https://i.imgur.com/qXJfl6J.png

beardog 4 hours ago [-]

Try rebuilding your Tor circuit when this happens.

https://tb-manual.torproject.org/managing-identities/

Sirened 22 minutes ago [-]

This sort of breaks tor though, doesn't it? Tor works really well if you stay on the same circuit for a while since it reduces the chances you have a compromised circuit. If you start getting recaptcha to block every exit node except those you control, you essentially have amplified your effective strength on the tor network.

redwards510 5 hours ago [-]

This sounds pretty good, but you still have to pass a captcha in order to get a pass, and sometimes that is impossible (or at least I just give up because I lost interest after 20 puzzles).

If it was developed in conjunction with Tor, how come it doesn't come bundled with the Tor browser or Tails?

baq 6 hours ago [-]

they have a patent on giving out unbeatable challenges when the computer thinks it's dealing with a 'malicious agent'.

https://patents.google.com/patent/US9407661B2/en

Liquix 6 hours ago [-]

So if you're running the wrong combination of addons/VPNs/browser you're denied access to half the web because Big G says so? And now they're aggressively pushing sysadmins to install silent data harvesting scripts on every page of their sites? WTF more will it take to get people interested in breaking up these monopolies?

redwards510 5 hours ago [-]

Not denied access, they just keep serving up challenges to you until you give up, WAY worse than saying "You appear to be a bot, sorry".

pnw_hazor 6 hours ago [-]

I hear Google is a fun place to work and that they pay well.

Until software developers care -- nothing will happen.

Liquix 5 hours ago [-]

Were Carnegie Steel or Bell fun places to work? Probably, they had cash spilling out their ears.

Monopolies need to be broken up because they threaten the free market and consequently our way of life - not because employees revolt.

tjpnz 5 hours ago [-]

From what I've seen (and most of it's anecdotal) things do appear to be changing. There are already people who won't go anywhere near Facebook now for personal ethical reasons, and even concerns that it might hurt future career prospects.

lonelappd 6 hours ago [-]

That's Juniper, not Google.

Juniper patented saying "No" to a client.

baq 6 hours ago [-]

oh, sorry about that, it's conveniently grayed out in the corner :)

neilv 6 hours ago [-]

Tor users don't want to be running reCAPTCHA at all. There's a few privacy problems for people who run that or other ambitious cross-site snooping. Usual stuff (requests, cookies, JS fingerprinting, etc.), behavioral fingerprinting, and very detailed monitoring of what information you were accessing/reading and possibly even entering.

jimmaswell 6 hours ago [-]

You can hardly blame anyone for blocking Tor traffic. You might not be using it for abuse but a large volume of abuse originates from it.

verisimilitudes 5 hours ago [-]

>You can hardly blame anyone for blocking Tor traffic.

Yes I can and do. It's bad enough that some websites won't let you do certain things over Tor, but preventing access to the website entirely is unacceptable. I made this account and comment entirely over Tor.

I don't see how it's okay to block Tor. That generic claim is made, but how are your spam measures doing if you couldn't handle Tor spam?

>You might not be using it for abuse but a large volume of abuse originates from it.

There is infinitely more ''abuse'' coming from Google, and yet it seems most every page I visit contains Google malware.

On principle, I hold the idea that Tor should be a first-class citizen and not disadvantaged in any way. Notice that Google's ''HTTP/3'' is over UDP, which Tor doesn't work with; I don't find that a coincidence.

judge2020 3 hours ago [-]

https://blog.cloudflare.com/the-trouble-with-tor/

> like all IP addresses that connect to our network, we check the requests that they make and assign a threat score to the IP. Unfortunately, since such a high percentage of requests that are coming from the Tor network are malicious, the IPs of the Tor exit nodes often have a very high threat score.

Faark 4 hours ago [-]

And that will never change if significant services keep blocking Tor users. Thus we have a feedback loop effectively fighting privacy...

jimmaswell 4 hours ago [-]

Somehow I doubt most Tor users are really just in it for privacy for general browsing, especially since it's so slow and limited. You can get a VPN for that. Unless you're a total privacy purist, there's not much incentive to use Tor unless you're buying drugs/something else illegal or just curious to look around the dark web.

Faark 1 hour ago [-]

Tor is free with no signup / cc required. This makes a huge difference, especially for younger users. Did for me back then, at least.

Initially it was slow, yes. But totally fine the last few years for normal browsing and reasonable downloads. Speedtest.net, speedtest.googlefiber & fast.com just now gave me 5, 6 & 10Mbps for whatever server in Ghana i got. Only the high ping causes loading times to still be a bit annoying.

But right now the biggest reason not to use Tor for anything "legit" is the many services blocking you, since indeed most current Tor users are not what those services want and the race to the bottom of Tor will continue, if we haven't reached it already.

w1nst0nsm1th 2 hours ago [-]

Tor is slow if you're used to browse with a 50 MB internet connection speed.

My own connection doesn't go over 1.6MB download speed, and only if the weather is clear and I have the wind in the back.

You can now achieve a 500KB or more speed in most Tor connection, which is enough to have a confortable browsing experience, imo.

The real downside is the google captcha, which happens sometimes to even denie you to solve a captcha in the first place for web pages where there is no user input.

BeetleB 3 hours ago [-]

>You might not be using it for abuse but a large volume of abuse originates from it.

Given that Tor is a tiny percentage of Internet traffic, most of the abusive volume out there has little to do with Tor.

ramraj07 6 hours ago [-]

I'm assuming you are not logged into a Google account during this? What happens if you create a throwaway Google account while on Tor? Or is that also impossible?

innot 6 hours ago [-]

I remember that these days google requires a phone number. Finding a throwaway number is hard, especially in some countries.

nullwasamistake 7 hours ago [-]

Sometimes Google will flag you as robot and never allow you to pass no matter how many you get right. It's total horseshit

kemiller2002 7 hours ago [-]

Thanks for verifying that for me. I thought it was just me being horrible at figuring out what they want.

markholmes 7 hours ago [-]

No, the same thing happens to me. I often run ProtonVPN + Firefox with uBlock Origin and a couple other privacy-related addons.

peterlk 6 hours ago [-]

I prefer to see the silver lining in this. If Google wants to break the web for Firefox, fine. I'll keep using (and evangelising) FF, and the sites that are broken won't get FF traffic. I believe that FF is doing the right thing far users, and Google, while in a powerful position is currently on the losing side of history with respect to privacy. Apple is taking that fight to them, and putting budget behind inte convincing average internet users that privacy is cool, and Google abuses your privacy.

The walled garden approach worked for a while for Microsoft, and it's working for now for Google, but eventually, it stops working. Once people leave, walled gardens keep them away.

nprateem 6 hours ago [-]

Only the majority of the Internet isn't a walled garden is it? It's more like a minefield because you don't know whether a site is going to use recaptcha and block/hinder your access.

You can't just opt out of using half the Internet because you value privacy, and nor should you have to. This requires legislation to stop.

pergadad 7 hours ago [-]

I have the same experience, some pages don't work on FF but fine on Chrome. I like to apply Occam's Razor, but with so many users it seems to me as if that's either by design, or certainly there is little desire to fix the issue.

superasn 7 hours ago [-]

Worst part is my chrome installation is 100% fresh with no browsing history and FF has cookies and history older than an year ago.. still google trusts Chrome more than FF?

dmix 7 hours ago [-]

If they looked for identifying information in cookies or browsing history people would be even more upset and spammers would just simulate it with browser bots... which is why I believe it takes a black box approach to each detection regardless of external state. Besides obviously the cookies set within the iframe of the recatcha.

This of course doesn’t help explain why Firefox is so heavily targeted by what’s supposed to be a neutral utility like Google Analytics...

kohtatsu 7 hours ago [-]

I've heard that being signed into your Google account can make the challenges simpler, presumably reducing things like the noise and the slow-fade load animations.

dmix 6 hours ago [-]

That too could be isolated to a single reCAPTCHA session, keeping within the scope of a single iframe or page load.

The idea of tracking your history across multiple reCAPTCHA loads across multiple domains to build a user profile is what sounds like a giant privacy red flag, even though it's entirely possible given the current implementation.

Additionally asking hosts to include JS directly onto their domain which sets 3rd party cookies/data across every page in addition to tracking referring domains is equally a bad idea. reCAPTCHA 2/3 does require loading 3rd party JS directly on page, which I'd imagine is necessary to create callbacks in the frontend upon verification (as iframe content messaging is very awkward):

https://developers.google.com/recaptcha/docs/v3

Ideally the JS simply loads an iframe of the captcha HTML and handles the callbacks from events in the iframe. That's it. It shouldn't be touching anything else on your website. I'd be curious to see a reverse engineering to see how much the JS really does...

antsar 7 hours ago [-]

To be fair, its not super-hard to follow the incentives there...

SquareWheel 7 hours ago [-]

reCaptcha isn't able to read your non-Google cookies or history, so most of that isn't being considered.

megous 4 hours ago [-]

https://codelabs.developers.google.com/codelabs/reCAPTCHA/in...

Yeah, no. It certainly can read non-google cookies on the page (not httpOnly cookies, though).

SquareWheel 3 hours ago [-]

I'm not sure what the link is meant to show, but "cookies on the page" is very different than the years worth of user history and cookies that GP mentioned.

ceejayoz 7 hours ago [-]

I was under the impression sites using Google Analytics were included as a reCaptcha signal.

SquareWheel 7 hours ago [-]

The signals aren't documented (for obvious reasons), but I'd be surprised if Google Analytics were a signal. These things are usually kept separate, and Analytics is a lot less user-specific under GDPR as the anonymizeIP flag is now very common.

That said, I've no evidence one way or the other!

My understanding is that it comes down to information they can read about your browser (does this look like a bot environment?), and heuristically how the user has behaved since the JS has been loaded (mouse movements, time between actions, etc).

tracker1 5 hours ago [-]

I know if I was running a mechanical turk or bot farm, I'd be using a Chrome user agent via puppeteer. I'm not sure WTF they are doing other than being malicious against non-chrome.

paulcarroty 7 hours ago [-]

Also Google punishes Firefox users by forcing them to click on pictures 2-3 times more than Chrome.

1_player 7 hours ago [-]

Same with Brave: I'm logged in into a gmail account and a custom domain hosted on gmail, yet every time there's a reCAPTCHA widget on the site, I have to do it 2 or 3 times before I'm let in.

One trick that seems to help fool that awful piece of tech: click slowly on the images, as if you were thinking a second or two before each click. Maybe click a wrong image and deselect it again. In other words, behave like a slow human, and it seems to work better than if I solve it as quickly as possible.

ballenf 6 hours ago [-]

I also have the feeling that making mistakes — selecting an image that looks like a traffic light but isn’t — sometimes results in faster admittance than being surgically accurate.

Again, being slower and more error prone seems to be rewarded.

incompatible 1 hour ago [-]

I don't even know what the right answer is in a lot of cases. There's a bit of the traffic light casing at the edge of a square, does that count as a traffic light, or only the lamp itself?

washadjeffmad 2 hours ago [-]

Which I intentionally and repeatedly fail anyway, because I'm not training Google's AI so they can sell it for use in drone strikes.

If this reduces the world Google allows me to access, it doesn't diminish mine because of it.

tomc1985 5 hours ago [-]

Other than the occasional reCAPTCHA gaslighting (which does occasionally block some service if it gates logins behind it) that we're all familiar with, I have completely excised Chrome from my life and am able to go to most any website without issue. That's with uBO and Privacy Badger running

mmanfrin 6 hours ago [-]

Kickstarter's login has not worked for me with firefox for over a year. People just don't test with anything but chrome and it shows.

iforgotpassword 6 hours ago [-]

That's odd. I never had issues with it on Firefox. Most of the time I just check the box and it's happy, sometimes I have to do an image puzzle. And that's with ublock origin. Maybe it depends on country or isp? My work place has its own /16.

Fogest 7 hours ago [-]

Funny enough I had to wait on the 5 second Cloudflare check to access that image. However I am using Chrome. That check I have found to be rather annoying. I assumed it would do it once, but it seems I have to go through it daily on sites I use regardless of which browser or device I use.

quickthrower2 7 hours ago [-]

That image link is protected by cloudflare too! Irony intended?

oil25 7 hours ago [-]

The sad and depressing state of the Web in 2019: proprietary third-party JavaScript and cookies are required to view a simple, innocent JPG.

jk2faster 7 hours ago [-]

And it'll only get worse. From the article:

> "If you have a Google account it’s more likely you are human"

So, in the future if we don't keep signed into our google account(and let google know every article we read and every website we browse), we'll be cut off from the half of the internet or even more. The amount of control a handful of companies have over the internet is suffocating to know!

svnpenn 7 hours ago [-]

I have better luck if I log into Gmail :(

cbsks 8 hours ago [-]

You can view your reCaptcha V3 score here: https://recaptcha-demo.appspot.com/recaptcha-v3-request-scor...

I get .7 on my iPhone, I’m guessing that my liberal use of Firefox containers and the cookie auto-delete extension on my desktop will give me a much lower score and cause me to have to jump through extra hoops at websites that implement it, just like the reCaptcha V2 does.

Edit: I also got 0.7 on Firefox with strict content blocking (which is supposed to block fingerprinters), uBlock Origin, and Cookie AutoDelete. I get 0.9 from a container which is logged into Google.

danShumway 8 hours ago [-]

With Firefox fingerprint resisting turned on and with Ublock Origin/UMatrix, I get a score of 0.1. And I'm not even on a VPN; I'm sure on my home network I'd have an even lower score.

To me, it feels like Google's entire strategy behind reCaptcha is to make it harder to protect your privacy. We've basically given up on the idea that there are tasks only humans can do, and to me V3 feels like Google openly saying, "You know how we can prove you're not a robot? Because we literally know exactly who you are." I don't even know if it should be called a captcha -- it feels like it's just identity verification.

I don't think this is an acceptable tradeoff. I know that when reCaptcha shows up on HN there's often a crowd that says, "but how else can we block bots?" I'm gonna draw a personal line in the sand and say that I think protecting privacy is more important than stopping bots. If your website can't stop bots without violating my privacy, then I'm starting to feel like I might be on the bots' side.

dbtx 6 hours ago [-]

> it feels like Google's entire strategy behind reCaptcha is to make it harder to protect your privacy

For the irony, I'm still logged into GMail and it still works perfectly, as basic HTML, even with google.com forbidden to run scripts. But it's the flippin' reCaptchas all over the place that make me temp-allow google.com, and then a reload later, temp-allow gstatic.com and reload again. Only then I get to use someone else's site normally, and I can disallow again... it's irritating. And then, this.

BTW that page plainly says the scores are samples and not related to reality. Refresh a few times and watch it change. 0.3, 0.7, and 0.9 seem to be my lucky numbers. I see everyone else getting those and 0.1.

Please stop reading things into it oh it's too late. Maybe they suddenly started seeing this page hundreds of times in the referrer and added that bit afterward, I don't know.

danShumway 5 hours ago [-]

Dunno if it's changed recently or if I just didn't refresh enough before, but I'm now seeing basically random numbers as well.

If anyone wants a fun weekend project, I would love for there to be a few public sites I can reliably check my production score on.

I'm not sure it matters though, since I'm just ignoring most sites that use reCaptcha now. For sites I can't ignore, I've taken to emailing them with my requests instead -- recently I tried to use Spotify's internal data export tool and it wouldn't let me past. If you're not going to let me use a website to manage my existing account, then your support team can do it for me.

fasicle 7 hours ago [-]

Not sure how much Ublock Origin makes a difference. I have a score of 0.9 with it turned on.

xbkingx 4 hours ago [-]

Same. FF dev, uBlock, Decentraleyes

Changing the FF content policy from Standard to Strict appears to have no impact on the score.

Opening in a Private window drops it to 0.7 for me. I have a bunch of add ons allowed in Private Browsing, so not surprised it only dropped a little.

Of course, if you have 3rd party frames and scripts disabled globally via uBlock, it doesn't even load.

asdff 6 hours ago [-]

I think this score is fishy. Ran the test three times and got three different scores.

ravenstine 5 hours ago [-]

I get the exact same score no matter what browser I use, despite uBlock Origin & Privacy Badger & Decentraleyes, even in private mode and with a VPN connection from a country I normally don't use. Hmmmmm...

megous 4 hours ago [-]

When I just keep reloading, I get either 0.9 or 0.1. I get 0.1 more often. Interesting.

Maybe some browser extension can monitor the score and tell me what it currently is on each page load, when reCaptcha is used on some website. I'd just keep reloading, until it's good, and then try the captcha.

eikenberry 6 hours ago [-]

Ublock Origin + NoScript on FF 60.7.2esr and got 0.9 as well.

[edit] tried in a private window and got the same score.

nprateem 6 hours ago [-]

Does it change if you set privacy.resistFingerprinting=true in about:config?

alteria 5 hours ago [-]

FF private window + UBlock + Resist Fingerprinting = 0.1 for me

In my main FF window with UBlock + Resist Fingerprinting, logged into a ton of Google accounts, I also got 0.1

Going to guess that without fingerprinting data they are probably going to give you a 0.1.

eikenberry 4 hours ago [-]

Do you need to restart FF with that? After setting it to true and using a private window, FF still registers a score of 0.9.

skybrian 6 hours ago [-]

It seems totally reasonable that Google knows you're not a bot if you have a Google account. This isn't the problem, although it hides the problem.

The problem is that they aren't trying harder for users who aren't logged in.

OrgNet 3 hours ago [-]

I also get 0.1 with the same config as you, except that I had uMatrix disabled (which if anything, should improve the score in Google's eyes)...

so why are they having you solve image puzzles if they know that they are going to fail you? even if they know that you are human...

nothis 7 hours ago [-]

I’m just waiting for the AI-generates fake people and whatever way they will come up to monetize that!

diminoten 5 hours ago [-]

Your privacy isn't nearly as important as you think, and as long as you continue to overvalue it, you'll continue to be unwilling to trade it for convenience.

That's on you, not Google.

ronjouch 7 hours ago [-]

Using Firefox with uBlock and Cookie-Autodelete I get 0.1

Using Chrome, even incognito and with uBlock I get 0.7

(╯°□°）╯︵ ┻━┻. F you, Google, this is blatant bullying, technically unjustifyable abuse of your stranglehold over the whole web platform.

rbobrowicz 7 hours ago [-]

To offer a different datapoint:

On FireFox with uBlock on and logged into my corporate gmail I get 0.9, switching to a private tab I get 0.7. This is with every privacy setting turned on in the FF options.

dvaun 7 hours ago [-]

I also have a similar result (0.7) using my browser at work. I am using containers, uBlock, privacy badger and auto-delete cookies.

Loulybob 7 hours ago [-]

Using chrome on my phone I get 0.9, but if I switch to Firefox I get 0.1.

This is essentially going to let Google gatekeep the web if you aren't using their services.

Fogest 6 hours ago [-]

Really? I don't think so. I get a 0.9 on Google Chrome, and a 0.7 on Firefox. I heavily use Chrome and I have not used Firefox apart from maybe testing some local websites. Despite this I still got 0.7 on there. I expected lower since I don't use the browser.

Drdrdrq 6 hours ago [-]

On a flip side: you really should check privacy settings in your Firefox, it seems Google can track you easily there. ;)

Ftuuky 4 hours ago [-]

I use Firefox with Google container and uBlock Origin and Privacy Badger and also get a score of 0.7

How can I get better privacy settings?

llao 7 hours ago [-]

> NOTE:This is a sample implementation, the score returned here is not a reflection on your Google account or type of traffic.

josteink 6 hours ago [-]

This comment should probably be higher up in the thread.

on_and_off 6 hours ago [-]

It is both funny and sad to read this thread.

sleavey 8 hours ago [-]

I get 0.1 continuously, possibly because I have resist fingerprinting enabled in Firefox. I'm not changing anything to compensate that score; it shows I must be doing something right. If I encounter a reCAPTCHA I will continue to (usually) just leave the site it's on.

pbhjpbhj 8 hours ago [-]

Same, the way to look at a low score is "I'm getting privacy right".

shawnz 7 hours ago [-]

Contrary to the results here, using Firefox + uBlock with DNT and tracking protection enabled, I get a score of 0.9. In private browsing mode it's 0.7.

I wonder how many people here are using a VPN or accessing from a non-western country -- I'd bet those are much bigger factors

jedberg 6 hours ago [-]

Were you logged into your Google account? That seems to almost guarantee a .9

shawnz 4 hours ago [-]

Yes, although not when private browsing of course.

eswat 7 hours ago [-]

FF logged into Google account: 0.9

FF incognito window not logged into Google account: 0.7

FF incognito window not logged into Google account through VPN: 0.3

FYI I have uBlock, pi-hole and a bunch of privacy widgets enabled

GordonS 48 minutes ago [-]

0.3 with Brave on Android, no extensions. 0.9 with Chrome on the same device, same connection.

Brave isn't particularly "unusual", and is even based on Chromium - surely this is Google blatantly punishing non-Chrome users?

lucb1e 7 hours ago [-]

This looks like a RNG: I got 0.7, 0.9, and 0.1 successively. It can't make up its mind whether I'm almost certainly not a bot (0.9) or almost certainly a bot (0.1)?

qes 37 minutes ago [-]

> This looks like a RNG

Come on, how is everyone in this chain so blind. It's literally in bold and the single largest block of content on the page:

NOTE:This is a sample implementation, the score returned here is not a reflection on your Google account or type of traffic. In production, refer to the distribution of scores shown in your admin interface and adjust your own threshold accordingly. Do not raise issues regarding the score you see here.

shawnz 7 hours ago [-]

Perhaps the rapid, repeated identical requests outweighed the initial factors which gave you a positive response

lucb1e 4 hours ago [-]

Might very well be. I also get errors on hacker news about "can't process requests that fast". When asking about it (initially because I thought votes didn't work randomly), the limit is a few requests per second. Turns out I click faster than that, either by reading a whole comment thread and making up my mind whose comments were most helpful (to upvote all at once) or by navigating too fast.

on_and_off 6 hours ago [-]

from the link

>the score returned here is not a reflection on your Google account or type of traffic

I got random scores as well. It looks like this is just a sample of the data structure that the service returns, not the actual score.

lucb1e 4 hours ago [-]

That would be a useless site, but that's not how I read it. I understand it as "this is not that Google thinks your account is a bot, it's that this request might be made by a bot. And since you didn't use this site as a normal website, it also doesn't score your type of traffic, just this one request". You might be right, but it really does seem to be doing a request to their API.

on_and_off 1 hour ago [-]

>That would be a useless site

looks like it is a demo of the API for people wanting to consume it. knowing what the payload looks like is not useless at all in this case.

superasn 7 hours ago [-]

I too got 0.1 even though I'm not on a VPN, and have a stock FF installation with just uBlock addon. I think my ISP may have some part in it but still 0.1 score is 100% bot right?

I'm also logged into google and fb which also doesn't affect my score. Only shows how broken their algorithm is :(

edit: just tried it with chrome and my score jumped to 0.9! So definitely not my ISP. It's just my browser that Recaptcha doesn't like. If you put two and two together that's really evil shit, even for Google!

zcid 1 hour ago [-]

So, I still have to whitelist Google in uMatrix and allow cookies for this to work. Even after doing so, I get a 0.1. I reloaded the page to check for variation as some other users mentioned but get the same score each time. I guess Google is saying I shouldn't be allowed to use the internet.

archy_ 5 hours ago [-]

Google is putting a number on us, is honestly some Minority Report level dystopia. Google is already using this to make life hell for anyone who cares about their privacy, we need to do something about this before they finish putting up their iron curtain over the web. Would it be possible to sue website owners for requiring such invasive measures? I'd love to see this ruled as monopoly power and Google broken up but that's probably not very realistic so we would probably do better to make using Google captchas more expensive in court costs alone than just building their own solutions to fight bots.

KumarAseem 7 hours ago [-]

I got 0.7 on FF, 0.3 on Opera and Chrome, all in incognito mode. Maybe they have just a few values and return it based on AND OR logic of 2-4 variable. Or maybe they are just playing around trying to gather some stats, for some "Don't be Evil" purpose!

Grue3 7 hours ago [-]

Work Firefox which I use all the time, no addons (including any adblockers): 0.1

Almost unused Chrome installation, also without addons: 0.7

qqii 7 hours ago [-]

Seeing what everyone else has posted I'm very suprised that I've received a 0.3 using Chrome on Android. I'm logged in to Google and most of my browsing is via Chrome or Chrome based webview. At least on my phone I've never cleared my cookies or done anything special.

xahrepap 7 hours ago [-]

I get .9 in Firefox on my MBP with UBlock Origin installed. I wondered if it was because I was logged in to Google, so I tried Incognito and got .7. In a never-before-used container I also get .7.

del82 8 hours ago [-]

Oscillates between 0.1 and 0.7 for me, and I'm changing nothing on my end (just hitting "Try again"). Does it have to do with refresh speed, I wonder?

Privacy Badger and ABP on my work (less-locked-down) Mac.

SquareWheel 7 hours ago [-]

Hitting the same URL over and over again is bot-like behaviour. When working with reCaptcha on forms I usually start getting hit after 4-5 test submissions.

ixwt 8 hours ago [-]

I get a 0.7 on my computer on Firefox. If I use the same website in Chrome (which is signed into a Google account) I get a 0.9. I guess it's a [0,1] scale?

benologist 8 hours ago [-]

I'm guessing their a-listers came up with something like this:

    // TODO: add impressive-looking math
    if (signedin && trackedEverywhere) {
         return 0.9
    } else {
         return 0.7
    }

I think we give Google way too much credit for their talent. This is the same company that didn't feel like finishing their website for two decades and subsequently stole $75 million from their users even when Google knew [1].

The same company that somehow still doesn't reconcile amounts owed and just keeps the money when they randomly-ban users and hide behind fake support emails, but they did feel like paying $11 million to keep that away from scrutiny [2].

[1] https://www.businessinsider.com/google-emails-adtrader-lawsu...

[2] https://www.searchenginejournal.com/adsense-lawsuit/248135/

asark 7 hours ago [-]

Google consistently gives me the impression of a company that (I suppose) has tons of smart people in it, but has badly broken management & incentive structures leading them to constantly do bafflingly stupid stuff at both large and small scales, even by the standards of a bigcorp, to the point that they survive only because they've got one hell of a golden goose.

stirfrykitty 7 hours ago [-]

Good info. Thank you.

And in keeping with recent revelations on Google's manipulation of search results, I think they have really gone beyond the pale. I un-archived my old iPhone two days ago and went back to iOS after the James O'Keefe/Project Veritas revelations. I now cannot, in good conscience, use anything Google. I always knew about the tracking and all that because, after all, they are an ad company. I'm now in the process of moving all of my domains over to Fastmail, which I've used since 2002. I'm using Qwant, Startpage, and DDG for search. FF for browser with many about:config tweaks and several add-ons.

memmcgee 2 hours ago [-]

You know Project Veritas is a load of shit right?

Twirrim 7 hours ago [-]

If I sign out of my google account in Chrome it drops from 0.9 to 0.7.

I could have sworn I'd never signed in to Chrome using my google account, but I guess I must have mistakenly signed in to gmail or something.

I use FF as my main browser, only ever drop back to Chrome sporadically, or when I really want tabs to be completely isolated (there are some annoyingly CPU/power intensive stuff I do from time to time, and I can just renice Chrome while I get on with other stuff.)

oil25 7 hours ago [-]

> I could have sworn I'd never signed in to Chrome using my google account, but I guess I must have mistakenly signed in to gmail or something.

Chrome 69 tricked users into signing into the browser, myself included - https://lifehacker.com/how-to-disable-chromes-automatic-sign...

That was the last straw to uninstall Chrome from all my devices and I've been a happy Firefox user ever since. Well, except now reCAPTCHA hardly ever works.

lostmyoldone 7 hours ago [-]

I believe that's a "feature" they added a while back, auto-signing you into chrome as soon as you was logged into gmail.

btown 6 hours ago [-]

The GP post's IP address or other fingerprint may be validated from other Google properties they might have visited, so I wouldn't put so much stock in the 0.7.

Honestly... if it's the same team that did ReCaptcha 2.0, this is a team that pulls out all the stops. Per https://github.com/neuroradiology/InsideReCaptcha ... they implemented a freaking VM in Javascript to obfuscate the code that combines various signals. There's a lot going on here that's likely highly obfuscated and quantized before it's displayed to us.

EDIT: non-paywall link for [1] in the parent post: https://outline.com/aA7HS5

owaislone 7 hours ago [-]

I get 0.9 on Firefox which is my main browser and 0.7 on Chrome which I use only for hangouts.

bluetidepro 7 hours ago [-]

I got a 0.9. What's it out of? 1? Sorry if I completely missed that somewhere already.

jk2faster 6 hours ago [-]

Yes, it is out of 1. From https://developers.google.com/recaptcha/docs/v3, > reCAPTCHA v3 returns a score (1.0 is very likely a good interaction, 0.0 is very likely a bot).

jedberg 6 hours ago [-]

From my computer, where I browse fairly equally with all three of Chrome, Safari, and Firefox (albeit different sites), I get the following scores:

Chrome: .9

Safari: .7

Firefox: .1

I have adblock running on all three, and I use containers on Firefox.

minieggs 6 hours ago [-]

Stock Qutebrowser 0.7, FF w/ all the usual extensions (ublock origin) 0.7. Don't know if it matters but I'm rolling Arch. Just adding another point of data for those curious.

jermaustin1 7 hours ago [-]

What is most odd is I get 0.7 on iOS Safari which I use for 100% of my purposeful mobile browsing, but I get .9 on iOS Chrome, which is only used when I accidentally click on links from gmail (so very, very rarely).

djrogers 7 hours ago [-]

Not really odd at all - if you're using the gmail app, there's a shared authentication cookie in all Google apps - including Chrome, so Google knows who you are in Chrome.

lostlogin 7 hours ago [-]

It seems a lot is iOS users get 0.7.

zhte415 7 hours ago [-]

A consistent 0.3.

> error-codes": ["score-threshold-not-met"]

Not sure if happy or not happy with that. I will conclude happy enough.

Linux, on VPN, Firefox. Not logged into any Google services. Cleared caches (still same IP), no difference.

ilikehurdles 6 hours ago [-]

With desktop Chrome I get a 0.3. My browser sends Do Not Track, has PrivacyBadger extension, and has that useless google-profile-in-the-browser feature disabled.

fybe 8 hours ago [-]

Interesting.

I get a 0.7 on Chrome with no account logged in and uBlock Origin installed.

Same browser, same plugin but incognito it's 0.1.

Papa google needs my data to trust me. Makes complete sense but still interesting that you can affect your score by giving in.

fibers 6 hours ago [-]

Interestingly enough I got .9 on Edge with Ublock origin installed. Perhaps this has something to do with how Edge is using webkit now?

sarathyweb 5 hours ago [-]

I got 0.9 in my Android phone running chrome. When I opened it in incognito mode, my score was reduced to 0.7

Zekio 7 hours ago [-]

interesting my score is 0.9 if I allowed google to track me using cookies, if I block the cookies it goes to 0.7 and if I enable content blocking in Firefox it drops to 0.1

fisherjeff 7 hours ago [-]

Using desktop Safari incognito without a Google account and Ghostery enabled, I get 0.7 too. Interestingly, disabling CSS drops me to 0.1...

denzil_correa 6 hours ago [-]

It gives me 0.7 on Safari (uBlock Origin) while 0.3 on Chrome (uBlock Origin) - both macOS Mojave.

wil421 5 hours ago [-]

The first time it failed the second time I got a .7 iPhone Xs.

helper 7 hours ago [-]

I get 0.7 in both desktop (linux) chrome and firefox. I get 0.3 from android chrome.

gpm 7 hours ago [-]

Firefox mobile w/ ublock: 0.9

keiru 7 hours ago [-]

>Please upgrade to a supported browser to get a reCAPTCHA challenge

I guess this is a 0 for me then

TheArcane 7 hours ago [-]

I use the same extensions on desktop and get 0.3 on my android Firefox

pdimitar 6 hours ago [-]

iPhone with a good (not amazing) adblocker: 0.7

Safari macOS with the same adblocker: 0.7

Firefox macOS with a lot of adblockers: 0.1

chenshuiluke 6 hours ago [-]

I get 0.9 on my Firefox

qwsxyh 7 hours ago [-]

Firefox with uBlock O I get 0.9. Don't know what everyone else here is talking about.

sdegutis 7 hours ago [-]

It didn't load for me and I couldn't figure out why.

Then I remembered that I put this in my /etc/hosts a few weeks ago and forgot about it.

    127.0.0.1       google.com
    127.0.0.1       www.google.com

[Edit] So if nothing shows up for you on that page, check for that. Also I just generally recommend it. Google has some unethical practices and duckduckgo.com is pretty good.

Guidelines | FAQ | Support | API | Security | Lists | Bookmarklet | Legal | Apply to YC | Contact