[OC] Word cloud comparison between user comments on /r/The_Donald and /r/SandersForPresident subreddits

This comment was posted to reddit on May 28, 2020 at 4:09 pm and was deleted within 7 hour(s) and 33 minutes.

[OC] Word cloud comparison between user comments on /r/The_Donald and /r/SandersForPresident subreddits

To be fair, it is actually a poor and skewed dataset, no need for irony.

OP says they used the top 15 posts of all time to make this. That is... not a great choice. Why?

Well I just took the trip over there and CTRL-F'ed the top 5 all-time posts there. Only 1 thread contains "newsfake" and "cnncnn".

And yeah, those words appear a lot in that thread. Because it's a meme thread... (like the "Comcast sucks. Upvote this thread and it'll show up on Google").

CNNCNN (395 matches and I only loaded 1 extra page of comments): https://i.imgur.com/AsGOJQb.png

NEWSFAKE (>1000 matches): https://i.imgur.com/vTDnKWL.png

(obviously, "FAKE NEWS" is also written on there an absurd amount of times since there's like 10 "FAKE NEWS" for every missing-a-space "NEWSFAKE", which probably contributes to it being the to 2 words in the cloud)

I'm pretty sure the whole point of a word cloud is defeated if you include data where people intentionally wrote a couple words thousands of times as a joke. Also pretty sketchy imo that OP didn't know this and instead said that they were hashtags. Those are obviously not hashtags...

/r/dataisbeautiful Thread Parent Link - i.redd.it

[OC] Word cloud comparison between user comments on /r/The_Donald and /r/SandersForPresident subreddits

Recently removed from /r/dataisbeautiful

More Random Comments