Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
druskacik
on Aug 21, 2024
|
parent
|
context
|
favorite
| on:
The semantic web is now widely adopted
There's a project [0] that parses Commoncrawl data for various schemas, it contains some interesting datasets.
[0]
http://webdatacommons.org/
undefinedblog
on Aug 21, 2024
[–]
That’s a really useful link, thanks for sharing. We’re building a scrapping service and only parsing rely on native html tags and open graph metadata, based on this link we should definitely take a step forward to parse JSON-LD as well.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
[0] http://webdatacommons.org/