Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
lloydatkinson
on Sept 17, 2023
|
parent
|
context
|
favorite
| on:
New York Times doesn’t want its stories archived
That sounds really interesting and a reflection of the BBCs generally worsening journalism. Do you have the script and repo available publicly? It would be interesting.
baz00
on Sept 17, 2023
[–]
I would be embarrassed to publish it in its current state if I’m honest as it was written when I was drunk, cynical and after a breakup.
Please take the idea in concept and do a better job that I did!
lloydatkinson
on Sept 17, 2023
|
parent
[–]
I’m tempted! How did you store the content, as raw HTML or plain text? I imagine raw HTML would make automated diffs harder.
baz00
on Sept 17, 2023
|
root
|
parent
[–]
It was python so I used Beautiful Soup to scrape and extract just the text.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: