Having worked as a data scientist at multiple companies (From FANG to startup), the first thing I look at when I get my hands on data is the existence of the Pareto principle.
I still haven’t found one company where this principle didn’t show up.
What does this prove? If you have lots of data and dimensions, I bet you could just as likely find distributions that are roughly 50/50, 60/40, 90/10, 100/0 if you looked for them.
I still haven’t found one company where this principle didn’t show up.