LinkedIn sues software company allegedly scraping data from profiles

0cf8612b2e1e · 2025-10-03T19:39:09 1759520349

Yeah, only Microsoft is allowed to indiscriminately scrape the web!

I somehow want both parties to lose.

hbn · 2025-10-03T21:20:04 1759526404

LinkedIn is the only website on the internet I want scraped so I can view it without it sending a notification to every person whose profile I look at

MisterSandman · 2025-10-03T21:56:53 1759528613

You can turn on Private Browsing, even on a free account. It also prevents YOU from seeing who viewed you, though, unless you buy premium.

ares623 · 2025-10-03T19:22:14 1759519334

Can the company just claim it’s for AI training and it’s fair use?

ashu1461 · 2025-10-03T20:18:13 1759522693

It has started to backfire.

Claude also had to a pay almost 1.5b for illegally training / scrapping.

https://www.cnn.com/2025/09/05/business/anthropic-ai-settlem...

teachrdan · 2025-10-03T20:22:09 1759522929

IIUC that was for illegally downloading ebooks and other media -- it had nothing to do with training per se. Scraping publicly accessible data is generally legal, although Microsoft/LinkedIn clearly think they have enough of a leg to stand on to at least litigate this.

woodrowbarlow · 2025-10-03T20:21:59 1759522919

anthropic was _not_ sued for including data scraped from public websites. they were sued for including data extracted from pirated books.

sfifs · 2025-10-04T02:04:06 1759543446

Not an expert but there was a court ruling in the US I think last year where circumventing login protection through bot operated accounts when the login is intended for human use was ruled as violation of CFAA. The current state of litigation in the US seems to be that scraping public facing data/websites has been considered as permissible by the courts but data behind a login intended for humans is not. I think there's still a split between the circuits, so this will go through some years of appeal yet.

tracker1 · 2025-10-03T20:21:24 1759522884

The company that put an Email proxy on people's phones to scrape all email going in and out has a complaint about scraping?

pona-a · 2025-10-03T20:28:11 1759523291

I haven't heard of it and I couldn't find the story by these keywords. Can you tell me more? I'm genuinely interested.

openmosix · 2025-10-06T16:33:39 1759768419

A product called "Linkedin Intro" that was killed within 6 months due to backlash and significant security flaws. It was somehow creating a reverse imap proxy to intercept your email traffic, and "decorate" emails with someone's linkedin profile.

It was ~12 years ago, so there's not much left around, but here is an engineering blog post from Linkedin talking about how they architected it https://engineering.linkedin.com/mobile/linkedin-intro-doing...

tracker1 · 2025-10-06T16:03:43 1759766623

https://marco.org/2013/10/25/linkedin-intro-insecurity

I don't recall all of the specific details, but I just remember reading about it at the time and how they bypassed some of iOS security protections to do it. Adn that they didn't get perma-banned from the various app stores back then is beyond me. It's a huge part of why I avoid installing apps on my phone in general.

openmosix · 2025-10-06T16:34:07 1759768447

https://engineering.linkedin.com/mobile/linkedin-intro-doing...

spindump8930 · 2025-10-04T00:02:55 1759536175

Is the proxy here linkedin messaging/mail instead of direct email?

tracker1 · 2025-10-06T16:03:52 1759766632

https://marco.org/2013/10/25/linkedin-intro-insecurity

I don't recall all of the specific details, but I just remember reading about it at the time and how they bypassed some of iOS security protections to do it. Adn that they didn't get perma-banned from the various app stores back then is beyond me. It's a huge part of why I avoid installing apps on my phone in general.

callc · 2025-10-03T20:36:01 1759523761

Whoa, really? That is diabolical. Can you provide more info?

tracker1 · 2025-10-06T16:05:39 1759766739

https://marco.org/2013/10/25/linkedin-intro-insecurity

I don't recall all of the specific details, but I just remember reading about it at the time and how they bypassed some of iOS security protections to do it. Adn that they didn't get perma-banned from the various app stores back then is beyond me. It's a huge part of why I avoid installing apps on my phone in general.

Poomba · 2025-10-03T19:38:19 1759520299

Why are they going after the small fish?

If they really want to put a dent into this, go after the biggest players scraping LinkedIn: PeopleDataLabs and Apollo.io (and no, taking down their company page does not count)

tomkarho · 2025-10-03T20:02:59 1759521779

Victory against small fish => establish legal precedence

legal precedence => Surer victory in the future for similar lawsuits

ashu1461 · 2025-10-03T20:20:37 1759522837

Reminds me of the Apple vs Pear law suit

https://www.entrepreneur.com/business-news/apple-sues-small-...

The dispute was settled because Pear agreed to slightly alter its logo, instead of continuing full litigation (maybe because of resources / dollars it would consume)

imglorp · 2025-10-03T21:03:19 1759525399

Seems there is a scraping precedent already, set by Linkedin v HiQ

https://www.fbm.com/publications/what-recent-rulings-in-hiq-...

deepsun · 2025-10-03T20:45:57 1759524357

Only if the case goes to trial.

If they settle, or the case got dismissed -- no precedent is set.

BolexNOLA · 2025-10-03T20:56:26 1759524986

If that’s going to happen with a small fish then it was certainly going to happen against a big fish. Cheaper, faster, and easier to attack a smaller business first. There is literally no reason to go after a big dog unless they did something particularly egregious and/or distinct that you can anchor your argument with. Unless your goal is just to waste their time and that of their lawyers I guess, though I think we would all assume the goal is to win ultimately.

stackskipton · 2025-10-03T21:08:57 1759525737

Even the legal filing and motions can help shape a case since they get rulings and such back. If a judge rejects a motion, maybe they need to approach it a different way when they go after big fish.

Only way this is not beneficial is if software company settle or gets dismissed right away.

RobRivera · 2025-10-03T20:05:54 1759521954

Against bigger fish.

And there's always a bigger fish.

deadbabe · 2025-10-03T19:40:23 1759520423

Go after small fish that no one cares about first to normalize the activity, then move up to bigger and bigger targets until you become inevitable.

el_benhameen · 2025-10-03T20:04:25 1759521865

Or, go after the small fish who can’t afford to have a biglaw team on retainer, bulldoze them to get a legal precedent set, and then use the example to extract concessions from the bigger players.

Jach · 2025-10-03T20:10:28 1759522228

A smaller company without a big legal team is probably more likely to settle than a big company. Settlements don't establish precedent.

deadbabe · 2025-10-03T20:42:30 1759524150

So you get money on the way up until you find a company willing to battle in court and lose.

Goofy_Coyote · 2025-10-03T19:44:28 1759520668

Because they either have side deals with the big names, or they want to set precedent for going after them.

Not trying to be a conspiracy theorist here, but my bet is on having a deal with the big players, we allow you to scrape us (or we give you a pipe you can consume out of), and you pay us in monetary or non-monetary terms; like how many business exchanges work

Poomba · 2025-10-03T20:05:47 1759521947

I doubt they have side deals. They took action on some of them by removing their company page, but that is like a slap in the hand.

If you want to make a big deal about this, tell us you at least sent a letter to the big players too. Otherwise, dont put up such a huge show

altairprime · 2025-10-03T20:07:31 1759522051

They have a trademark ridealong whose chances improve against a less-recognized company.

nextworddev · 2025-10-03T19:29:42 1759519782

A bunch of GTM and Sales APIs recently stopped offering their LinkedIn APis. Seems like the lawsuits are working to scare them off.

Prediction: this will be a very much pay to play market

Poomba · 2025-10-03T19:38:50 1759520330

Examples?

1vuio0pswjnm7 · 2025-10-03T22:36:28 1759530988

Complaint

https://storage.courtlistener.com/recap/gov.uscourts.cand.45...

mtlynch · 2025-10-03T20:03:15 1759521795

This happened before in hiQ Labs v. LinkedIn.[0]

I've heard a lot of people cite this case as proof that scraping is legal, but it seems like the decision kept going back and forth in appeals, and I never understood what precedent it set, if any, around the legality of scraping.

[0] https://en.wikipedia.org/wiki/HiQ_Labs_v._LinkedIn

sorum · 2025-10-04T12:05:44 1759579544

This one seems different from the (correct) ruling in favor in hiQ Labs, where the courts were quite clear that scraping the public Internet was completely legal.

This is a case of a company creating millions of fake user accounts, so they’re behind the login wall and not on the public side of the Internet anymore. At least, that’s how I’m reading this.

johnnienaked · 2025-10-03T20:24:55 1759523095

Only a linkedin executive could consider user submitted personal information to be "their" data

dylan604 · 2025-10-03T20:40:22 1759524022

They are responsible for it. If people are gaining access to that data in ways other than what the users were led to believe, it is LI's problem

johnnienaked · 2025-10-03T22:13:11 1759529591

Can't you gain access simply by making a free account?

dylan604 · 2025-10-04T15:45:45 1759592745

Not sure your point, because of course you can. But when you make that account you agree to terms. Those terms do not permit you to take the data presented to be stored in your own database to monetize on your end. Make your own website to collect data. You’re being obtuse about this. Is it deliberate?

johnnienaked · 2025-10-05T07:34:36 1759649676

This is just gatekeeping as a business model, and it's a bad one.

myzie · 2025-10-03T22:53:42 1759532022

Related research on past litigation in this area for anyone that wants to go deeper:

https://deepnoodle.ai/research/linkedin-legal-battles-tos-vi...

Simulacra · 2025-10-03T22:14:48 1759529688

Oh dear, my office has been scraping LinkedIn forever. We use it to make visual networks of contacts in our industry, and relate that to whom we have working for the company. oops.

polishdude20 · 2025-10-04T04:19:27 1759551567

On that note, I've noticed an uptick in past coworkers as Facebook recommended friends. How does it know about these people I've worked with?

atonse · 2025-10-03T20:21:53 1759522913

I'm old enough to remember when pretty much every single social media company had really nice APIs so third party clients could be built.

Oh man, a lot of the web really feels very enshittified these days.

repeek · 2025-10-03T21:03:15 1759525395

Curious if Dex (YC 19) (getdex.com) is at risk — their LinkedIn integration requires a chrome extension to scrape data rather than LinkedIn APIs.

myzie · 2025-10-04T01:03:35 1759539815

The Chrome extension approach may shift some (most?) of the risk to the end user, since technically they are now the one scraping. Theoretically getdex would be relatively better off in this arrangement, while putting their customers into a legal gray area.

BenGosub · 2025-10-04T10:06:16 1759572376

There are already many companies offering bots creation for social media, they might not sell the data, but they do sell the bots.

nathan_compton · 2025-10-03T20:33:32 1759523612

If I had the Infinity Gems but I could only use them once, I would strongly consider snapping LinkedIn out of existence.

dylan604 · 2025-10-03T20:37:36 1759523856

please, go bigger and do all social media types

realaaa · 2025-10-05T02:00:31 1759629631

they could have instead try to understand what are they missing (what / how is driving that scraping demand?) - and maybe try to do that themselves

or partner up to amplify that other use case

but I guess we are in the lawyers divide and conquer mentality these days

SilverElfin · 2025-10-03T15:50:26 1759506626

I don’t get why LinkedIn should be gatekeeping this data that it doesn’t create. It’s bad for society.

brailsafe · 2025-10-03T19:56:55 1759521415

They also make it difficult to destroy. Try deleting your post or comment history, and you can only do it slowly one by one, with only a few sketchy tools for making it faster that go against their terms of service.

cwnyth · 2025-10-03T21:21:27 1759526487

Compared to HN, which doesn't allow for any comments to be deleted?

type0 · 2025-10-03T23:56:57 1759535817

HN doesn't require you to give out your name and email

iamleppert · 2025-10-03T20:33:53 1759523633

Have ChatGPT code up a script for you, that you can paste into developer tools. It's how I deleted all my content from there.

SilverElfin · 2025-10-03T20:25:24 1759523124

Other social media do it too. At best you can only delete your entire account.

motoxpro · 2025-10-03T19:26:28 1759519588

I think most users don't want their data to be used by anyone and everyone. I sure don't. If one user needs access to their own data, they can always export it and take it where they please.

For most people the dangers of openness (see Cambridge Analytica), the lack of upside and the lack of security in small players mean that walled gardens are the best solution for the majority of people.

This lawsuit is exactly why people trust walled gardens to keep their data walled off. Because I trusted LinkedIn, not ProAPI and whatever malicious actors they sell to.

neilv · 2025-10-03T19:36:11 1759520171

> This lawsuit is exactly why people trust walled gardens to keep their data walled off. Because I trusted LinkedIn, not [...]

Obviously LinkedIn is also in the business of selling the data about you, and also access to you.

LinkedIn just doesn't like this other company leeching off that data LinkedIn got about you, and then competing with LinkedIn in making money off that data (including access).

motoxpro · 2025-10-03T19:51:06 1759521066

Selling data inside their walled garden in a way I am OK with in exchange for a free service.

Not a 3rd party selling my information to a scam farm in a foreign land that has no laws that will use all of that information to extract money from my parents.

LamaOfRuin · 2025-10-03T19:39:48 1759520388

But linkedin is doing so in accordance with the legal agreement you have with them, which I am able to exit at any time and instruct them to remove my data. I can't do this for every company that illegally (in many jurisdictions) hordes information about me.

add-sub-mul-div · 2025-10-03T19:43:46 1759520626

You're currently on one of the very few sites with no delete/edit button for your own content (after a short initial period.) It's the only site I can think of that hoards my data like that. Which is why I only post anonymous throwaway content here.

reorder9695 · 2025-10-03T21:21:39 1759526499

I think trusting data you post publicly to only remain exactly where you publish it is naive at best. I think it's much more sensible to think that as soon as you put something public, it will exist somewhere forever, and it's foolish to believe otherwise.

MangoToupe · 2025-10-03T19:34:18 1759520058

I don't even trust LinkedIn, but it's not like I can sue them for offering antisocial terms, let alone force them to a negotiation table. It's just a shitty situation all around. At the very least they should pay me to use the site if they're making money off of it.

motoxpro · 2025-10-03T19:55:26 1759521326

If everyone has access to your data it becomes even more worthless and you will definitely not get aid for it. At least now I can keep it somewhere and they can use it to fund engineers to keep the service up, lawyers to make sure your data stays safe, etc.

You are free to leave and delete your data, unlike if everyone has access to it then it is out there in perpetuity.

You definitely can't sue a data broker to pay you/stop using your data.

singlepaynews · 2025-10-03T19:47:35 1759520855

I sure do! If LinkedIn can't market my resume to open roles then letting recruiters roll their own scrapers against it is the next best thing. I understand that LI owns my data, I just wish they were effective in using it!

(edit: "my" data, as in the data I post there.)

motoxpro · 2025-10-03T19:56:41 1759521401

I guess that was my point, YOU are free to export your data and post it on the internet, but don't make everyone (me) do the same.

singlepaynews · 2025-10-05T18:11:12 1759687872

I don't see how I (or LinkedIn) is making you do anything? LinkedIn is a place I can post data. I choose to do so in an attempt to market my resume. I fully expect that the data I post on LinkedIn's server becomes and is the property of LinkedIn, and wish it was more effective at extracting value from it?

Because LinkedIn is less effective than I'd like, I support 3rd parties scraping the data I posted there, again on the hope that they'd be more successful at marketing that data, which I would benefit from as the data is my resume.

motoxpro · 2025-10-07T09:44:56 1759830296

We're in agreement. I was just saying I don't support 3rd parties scraping my data so as they just get yours and not mine then have at it!

animitronix · 2025-10-03T19:51:18 1759521078

So are they gonna go after pitchbook and crunchbase too or nah?

ozim · 2025-10-03T20:25:37 1759523137

Well maybe I can get that company to backup my LinkedIn posts because it is utterly broken to download anything about my profile to make a backup.

There is an API option but endpoints from documentation just return 404. There is Data Privacy "download my data" I wanted really data like my posts, photos not crappy CSV having basic properties. In the end there is "View the rich media" but also I have to click one by one and there is no text for posts on the images - I can do that going one by one of my posts and copy pasting. It sucks despite "your data belongs to you" texts on the labels.

phoronixrly · 2025-10-03T20:26:39 1759523199

Back up your linkedin posts? What valuable information was ever contained in a linkedin post?

ozim · 2025-10-03T21:50:32 1759528232

These are my posts I have personal attachment to what I wrote.

Most of what I wrote I have in my notes anyway — but still if they say it is my data and I can always download it, I really want to download it and not like that someone just puts up lies on their website like "data is yours you can always download it".

subscribed · 2025-10-04T10:59:46 1759575586

LOL, what sort of snarky and patronising response is this?

All the response was in the comment you try to ridicule.

xyst · 2025-10-03T20:10:52 1759522252

basically, linkedin is just pissed off they weren't getting a cut of the profits this small company made on linkedins (already public?) data.

The winners here are the law firms on both the plaintiff and defendant sides. Drag this through the court system for as long as possible. PR. PR. PR. Then settle out of court for an "undisclosed amount."

This is the mafia equivalent of "sending a message" in corporate land. Yawn.

_imnothere · 2025-10-04T15:45:59 1759592759

So tired of their auth wall, screw 'em.

saltyoldman · 2025-10-03T15:45:44 1759506344

They're owned by Microsoft and poorly managed. Hundreds of people get locked out daily and can no longer access or change their OWN data. I say, let the scrapers take them down. We need to stop the walled in gardens of data these companies DONT own - it's the user's data.

anfilt · 2025-10-03T19:23:42 1759519422

I hope linkedIn looses.