
Interesting!

I wonder what the use case is compared to extracting this information in application code and then storing it alongside the PDF in separate table columns?



It can be useful for improving an ingestion pipeline: put your PDF collection in a temp table, then extract the information you want with pure SQL.
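
A rough sketch of that pattern, in case it helps. This is not the extension's actual API: it uses Python's built-in sqlite3 as a stand-in database, and `pdf_title`, `pdf_staging`, and `documents` are hypothetical names. The "extractor" is a toy regex over the raw bytes, only there so the example runs without a PDF library.

```python
import re
import sqlite3


def pdf_title(data: bytes):
    """Toy stand-in for a real PDF function: read the /Title entry from raw bytes."""
    match = re.search(rb"/Title\s*\((.*?)\)", data)
    return match.group(1).decode("latin-1") if match else None


conn = sqlite3.connect(":memory:")
# Expose the extractor to SQL (hypothetical function name).
conn.create_function("pdf_title", 1, pdf_title)

# Stage the raw files in a temp table, then fill the real table with one SQL statement.
conn.execute("CREATE TEMP TABLE pdf_staging (name TEXT, data BLOB)")
conn.execute("CREATE TABLE documents (name TEXT, title TEXT)")

# Fake byte strings for illustration, not valid PDF files.
fake_pdfs = [
    ("a.pdf", b"%PDF-1.4\n1 0 obj << /Title (Annual Report) >> endobj"),
    ("b.pdf", b"%PDF-1.4\n1 0 obj << /Producer (x) >> endobj"),
]
conn.executemany("INSERT INTO pdf_staging VALUES (?, ?)", fake_pdfs)

# The extraction step itself is plain SQL.
conn.execute("INSERT INTO documents SELECT name, pdf_title(data) FROM pdf_staging")

rows = conn.execute("SELECT name, title FROM documents ORDER BY name").fetchall()
print(rows)
```

The point is that once the files are staged, the extraction is a single INSERT ... SELECT, so there is no separate application step to keep in sync with the table.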



