Twitter | Search | |
Jason N. Doctor
An urn contains 10K red & blue marbles of unknown proportion. Win $1M if you estimate the correct proportion. You can either: (1) sample 80%, or, (2) stir the marbles & sample 1%. Which do you do? Meng shows it is better to stir! Big Data is biased.
Reply Retweet Like More
Jason N. Doctor Nov 2
Replying to @jasndoc
Money invested in Big Data may be better spent on representative sampling.
Reply Retweet Like
Steve Pitts Nov 3
Replying to @jasndoc @SimonDeDeo
This is why the NCHS sample surveys are often better than AHRQ claims data, with 100/th sample size. I worry that increasing institutional non-response will kill this advantage.
Reply Retweet Like
Curtis Atkisson Nov 3
Replying to @jasndoc @SimonDeDeo
This paper is magnificent and has repercussions far beyond Big Data.
Reply Retweet Like
Curtis Atkisson Nov 3
Replying to @jasndoc @SimonDeDeo
Our group that studies human behavioral evolution at UC Davis worry this is a major complication for the study of the evolution of human behavior. I'm interested in doubt a more thorough analysis on this issue in this field.
Reply Retweet Like
Protik Islam-Jakobsson Nov 4
Reply Retweet Like
Oscar D Torson Nov 4
Replying to @ikitorp @jasndoc and 2 others
that is non-sense of course, in the sense that it depends on a lot of unstated assumptions.
Reply Retweet Like
Nathan Oseroff-Spicer Nov 4
Replying to @jasndoc @gztstatistics
What if the marbles are different sizes?
Reply Retweet Like
Edward Patrick Vogel Nov 4
All the more reason to stir thoroughly. But yeah. Going to need to test on the size variations
Reply Retweet Like
Jarrod Dalton Nov 4
Replying to @jasndoc
Corollary: “Big data” are heterogeneous. Understand up front the nature of the populations that you are studying so that you avoid generalizing effects across diverse groups.
Reply Retweet Like
john stuart thrill Nov 4
Replying to @jasndoc
Are you building in the assumption that the 80% sample doesn’t stir? Because if not it seems like there is a 50/50 shot of that one being way better
Reply Retweet Like