Hacker Newsnew | past | comments | ask | show | jobs | submit | _chse_'s commentslogin


Perfect! Cheers!

No worries, enjoy!

What would you like to see?

Data is awesome. It is big enough to extrapolate usage to larger internet imho.

A simple contingency table sorted by usage would be nice - make it as an image, no extra interactivity needed.

Thanks.


Can do, I'll find another spot.

I'm a long time lurker who has only recently started posting. What's in the archives themselves are just JSON files. I'll post an article next time with what's here, that way it isn't just Dropbox links.

Sure. But you do understand how it looks right? Nothing against you.

This dataset does include Bloomreach Discovery, Coveo and Algolia. These were detected by looking through HTTP responses for publicly available web pages. For example, Coveo was detected by searching a script tag's src attribute for "static.cloud.coveo.com".

You can check out everything that was detected in the full version here: https://versiondb.io/detection_list.json

If you'd like to know how the others were detected, I can go through that as well. See if it's what you're after.


I had just released https://versiondb.io a few hours ago. It's something where you're able to get a slice of what's running on the web without breaking the bank. The full version contains over 4M domains and over 3K detected technologies.

Have fun and I hope you guys find it useful.


It feels like the type of tool that the HN crowd would be building...

Bang on, I've been working on something that I've intended to be a cost-effective alternative to BuiltWith. I was thinking about just selling the datasets and allowing users to extract whatever they need from it. What technologies are you after?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: