TFW the top /r/dataisbeautiful post has data all wrong (How much do different subreddits value comments?) [OC]

Image from external-preview.redd.it and submitted by fhoffa
image showing TFW the top /r/dataisbeautiful post has data all wrong (How much do different subreddits value comments?) [OC]

fhoffa on March 3rd, 2020 at 21:51 UTC »

By @felipehoffa

Made with BigQuery and Data Studio

Data collected by /u/Stuck_In_the_Matrix

The original has huge sampling problems:

/r/askreddit is depicted as <50%, but the real number is 93%. /r/politics is depicted as <10%, but the real number is 51%. etc

Based on this dataisbeautiful post.

Here with all data from 2019-08:

160 subs: https://i.imgur.com/Edc2px1.png

More details on /r/bigquery.

Lurkers-gotta-post on March 3rd, 2020 at 22:03 UTC »

This needs to get big. If there's one thing that drives me up the wall, it's misinformation touted as fact. Posts like these get recited over and over throughout reddit, so great job OP.

MansfromDaVinci on March 3rd, 2020 at 23:19 UTC »

I thought the data looked wonky ty for doing this