Twitter | Search | |
Xan Gregg
Software development . Creator of & . , , . Views my own.
1,251
Tweets
771
Following
655
Followers
Tweets
Xan Gregg 7h
Replying to @danz_68 @jburnmurdoch
If you're planning an analysis: 2. was vs. stacked bars (and maybe 3). 4. was vs. stacked areas (harsher peaks lead to greater slopes) +5. Less accurate reading of total by year, which is a distraction here. Also, I imagine the lumpy edges make the interiors less lumpy.
Reply Retweet Like
Xan Gregg 13h
Nice insight on confidence intervals from on the blog: "The 'confidence' that we have is in the *process* of calculating confidence intervals...not in our specific confidence interval."
Reply Retweet Like
Xan Gregg Aug 15
I think this is the paper (from ) I saw on it at JSM. Includes references to related material. PDF:
Reply Retweet Like
Xan Gregg Aug 15
Are you thinking of the sine illusion for assessing vertical distance between the lines?
Reply Retweet Like
Xan Gregg Aug 15
Replying to @danz_68 @jburnmurdoch
I put data inferred from the stacked bars on github for experimenting: For instance, here's stacked, ordered and slightly smoothed areas.
Reply Retweet Like
Xan Gregg Aug 13
If I see a button like this, I will click it. Can't help it.
Reply Retweet Like
Xan Gregg Aug 13
Replying to @danz_68
Very nice views. Thanks.
Reply Retweet Like
Xan Gregg Aug 13
Replying to @danz_68
Did you expect the error imbalance? Is it that your algorithm goes left-to-right and then reverses, so positive error is favored?
Reply Retweet Like
Xan Gregg retweeted
Daniel Zvinca Aug 13
Replying to @xangregg
Forcing the same deviation range, a kde dot plot with smaller dot size will look something like below
Reply Retweet Like
Xan Gregg retweeted
Daniel Zvinca Aug 13
Replying to @xangregg
In case you want to estimate the "error" of your histogram replace the bars with dots with diameter=bar width and stack them in the bar space. Each dot deviation from real value is color encoded
Reply Retweet Like
Xan Gregg Aug 13
Replying to @danz_68
I also tried to take packet headers into account. Based on a Cisco doc, I guessed 38 header bytes for every 500 content bytes.
Reply Retweet Like
Xan Gregg Aug 13
I put my internet speed test data on github in case anyone wants to try different views of the bimodal distribution (KDE dots ?)
Reply Retweet Like
Xan Gregg Aug 12
Replying to @danz_68
The data is the 2018 data set at . Plus, I split off category from their main_category when count was >= 5000.
Reply Retweet Like
Xan Gregg Aug 12
Replying to @jbaysdon
Didn't track the tie-dye! 😀 Nor the rainstorms we've been having.
Reply Retweet Like
Xan Gregg Aug 12
Replying to @xangregg
Looking at date and time separately with a heatmap of download speeds. Some bad days, but no strong pattern.
Reply Retweet Like
Xan Gregg Aug 12
Replying to @xangregg
Here's the breakdown by 4-hour window. It's not just evenings that have slow times.
Reply Retweet Like
Xan Gregg Aug 12
Distribution of my home internet speeds on my "up to 60Mb/s" plan. Better than I expected -- the slow times leave a big impression.
Reply Retweet Like
Xan Gregg Aug 11
For an unsummarized view of the Kickstarter projects, here's one dot per project for the 250K projects with a pledge and funding amount within a factor of 100. Interesting drop-off just before 1: not many end just short of goal.
Reply Retweet Like
Xan Gregg Aug 11
Average Kickstarter funding by category. It was beautiful when I made it but now seems terrible: averaging hides the skewed distributions, scales put most info into the corner, labeling difficult, too many colors to distinguish...
Reply Retweet Like
Xan Gregg Aug 11
Kickstarter success rates by category as a mosaic chart. Tabletop Games do much better than Video Games. Data (small categories were grouped into their main category):
Reply Retweet Like