Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

As a data scientist, I’m happy that most data continues to be so terribly formatted and inconsistent as to break and confuse AI. But for how long that’s true, who knows!



I would've said that understanding and fixing the data would be one of the best usecases for the AI.


Unfortunately, there are still many ways to “fix” things that have a lot of trade-offs or downstream consequences for analysis. For most basic cleaning tasks, LLMs are also still way too slow.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: