We’re automating web scraping at scale with AI. Web scraping used to be the same for decades (writing and maintaining rule-based scripts for each source), and we're fully automating that process with LLMs.
We're also heavily focused on making ethical scraping the default (robots.txt checks, rate limiting, any custom compliance rules).
But we can't do it alone. We are looking for people who share our passion for software craftsmanship, data, and AI. We are growing fast, have a “no-bullshit” policy, and try to minimize the distance between the code you write and the customers who use it. We iterate to greatness :)
If that sounds like you, please email me at (adrian at kadoa dot com) and mention HN in the subject line. Ideally you include an ETL or scraping project you've been working on in the past.
What's the endgame of this increasing arms race? A gated web where you need to log in everywhere? Even more captchas and Cloudflare becoming the gateway to the internet? There must be a better way.
We're somehow still stuck with CAPTCHAs (and other challenges), a 25 years old concept that wastes millions of human hours and billions in infra costs [0].
Why does everyone think chat is better UX than traditional interfaces? I get the AI hype, but so many products are not a fit for chat interfaces.
Why would I use a chat to do what could be done quicker with a simple and intuitive button/input UX (e.g. Booking or Zillow search/filter)?
Chat also has really poor discoverability of what I can actually do with it.
But we can't do it alone. We are looking for people who share our passion for software craftsmanship, data, and AI. We are growing fast, have a “no-bullshit” policy, and try to minimize the distance between the code you ship and the customers who use it.
While sweden has a lot of straight road stretches specifically designed to serve as emergency airfields, it is a lot easier to find 500 m of suitable road than 1600.
Meta's goal with Llama was to target OpenAI with a "scorched earth" approach by releasing powerful open models to disrupt the competitive landscape. Looks like OpenAI is now using the same playbook.
It seems like the various Chinese companies are far outplaying Meta at that game. It remains to be seen if they’re able to throw money at the problem to turn things around.
Good move for China. No one was going to trust their models outright, now they not only have a track record, but they were able to undercut the value of US models at the same time.
But we can't do it alone. We are looking for people who share our passion for software craftsmanship, data, and AI. We are growing fast, have a “no-bullshit” policy, and are very execution focused.