
Yes it's old, but worse, it is not a well-argued review. Yes, Bayesian statistics are slowly gaining the upper hand at higher levels of statistics, but you know what should be taught to first-year undergrads in science? Exploratory data analysis! One of the first stats books I voluntarily read was Mosteller and Tukey's Data Analysis and Regression. A gem. Another great book is Judea Pearl's Book of Why.


On the subject of prioritizing EDA:

I need to look this up, but I recall that in the 90s a social psychology journal briefly had a policy of "if you show us you're handling your data ethically, you can just show us a self-explanatory plot for simple comparisons instead of NHST". That came after some early discussions about statistical reform in the 90s - Cohen's "The Earth is round (p < .05)" kick-started things, I believe.


Definitely. It always amazes me that in many situations, I'm applying some stats algorithm just to conclude: let's look at these data some more...


Yes. And the same goes for DS/ML people, please. The number of ML people who can meaningfully drill down and actually understand the data is surprisingly low sometimes. Even worse for being able to understand a phenomenon _using data_.


When you have a lot of fancy metrics/models/bootstraps to throw at the wall, people will just see what sticks.


Happens all the time. Problems come quickly when the datasets used for evaluation are not clean, or the evaluation is incorrect - data leakage, problematic imbalance between groups, distribution shifts vs. the actual production data. Or people check only the average performance, not the typical or worst case. I've seen many people run in circles chasing metrics that are meaningless to the task they are supposed to be solving.
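To make the average-vs-worst-case point concrete, here's a minimal sketch (entirely hypothetical data and group labels) showing how a healthy-looking overall accuracy can hide a badly-served subgroup:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical evaluation set: labels, predictions, and a group id per sample
# (e.g. data source or customer segment).
y_true = rng.integers(0, 2, size=1000)
group = rng.integers(0, 5, size=1000)

# Simulate a model that is ~95% accurate everywhere except group 3 (~60%).
acc_by_group = np.where(group == 3, 0.60, 0.95)
correct = rng.random(1000) < acc_by_group
y_pred = np.where(correct, y_true, 1 - y_true)

# The headline number vs. what each group actually sees.
overall = (y_true == y_pred).mean()
per_group = {g: (y_true[group == g] == y_pred[group == g]).mean()
             for g in np.unique(group)}
worst = min(per_group.values())

print(f"overall accuracy: {overall:.3f}")
print(f"worst-group accuracy: {worst:.3f}")
```

The worst-group accuracy is always at or below the overall average (the average is a weighted mean of the per-group values), so reporting only the average is guaranteed to look at least as good as the group that's being failed.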





