The news sources that reddit prefers
What sources does each subreddit prefer for its news? Look at these screenshots, and try out the interactive version.
Based on a post by /u/subroutines on /r/dataisbeautiful. Data in BigQuery, provided by pushshift.io. Interactive version hosted on Google Data Studio. Try it and share — Data Studio is now globally available, and for free!
/r/politics + /r/The_Donald + /r/worldnews + /r/news
/r/EnoughTrumpSpam + /r/conspiracy + /r/uncensorednews + /r/Conservative
/r/technology + /r/Futurology + /r/science
/r/unitedkingdom + /r/australia + /r/canada + /r/europe
/r/nfl + /r/soccer + /r/hockey + /r/nba
/r/WayOfTheBern + /r/hillaryclinton + /r/democrats
/r/movies + /r/television
/r/nottheonion + /r/UpliftingNews + /r/environment
/r/Conservative + /r/progressive + /r/Libertarian
Reverse: Top subs for media
/r/The_Donald favorites:
Same, without /r/The_Donald:
MSM news:
Same, without /r/politics:
Source
Make your own (w/ Data Studio), or change the rules by running a variation of this BigQuery query:
#standardSQL
SELECT domain, subreddit, count_dom, COUNT(*) posts FROM (
  -- count_dom: number of qualifying posts per domain, across all subreddits
  SELECT id, domain, subreddit, COUNT(*) OVER(PARTITION BY domain) count_dom
  FROM `fh-bigquery.reddit_posts.2017_01`
  WHERE score>25
  -- skip image, video, social and file hosts: they are not news sources
  AND domain NOT IN (
    'puu.sh', 'zkillboard.com', 'gifsound.com', 'youtu.be', 'bato.to', 'archive.is', 'archive.fo',
    'pbs.twimg.com', 'streamable.com', 'cdn.awwni.me')
  AND NOT domain LIKE 'self.%'
  AND NOT domain LIKE '%redd.it%'
  AND NOT domain LIKE '%sli.mg%'
  AND NOT domain LIKE '%instagram%'
  AND NOT domain LIKE '%steamcommunity%'
  AND NOT domain LIKE '%gfycat%'
  AND NOT domain LIKE '%fav.me%'
  AND NOT domain LIKE '%steampower%'
  AND NOT domain LIKE '%amazon%'
  AND NOT domain LIKE '%twitch%'
  AND NOT domain LIKE '%blogspot%'
  AND NOT domain LIKE '%mixtape%'
  AND NOT domain LIKE '%spotify%'
  AND NOT domain LIKE '%prntscr%'
  AND NOT domain LIKE '%akamai%'
  AND NOT domain LIKE '%vid.me%'
  AND NOT domain LIKE '%github%'
  AND NOT domain LIKE '%google%'
  AND NOT domain LIKE '%vimeo%'
  AND NOT domain LIKE '%medium%'
  AND NOT domain LIKE '%upload%'
  AND NOT domain LIKE '%imgur%'
  AND NOT domain LIKE '%youtube%'
  AND NOT domain LIKE '%giphy%'
  AND NOT domain LIKE '%reddit%'
  AND NOT domain LIKE '%twitter%'
  AND NOT domain LIKE '%tumblr.com'
  AND NOT domain LIKE '%flickr%'
  AND NOT domain LIKE '%deviantart.com'
  AND NOT domain LIKE '%facebook%'
  AND NOT domain LIKE '%pinimg%'
  AND NOT domain LIKE '%gyazo%'
  AND NOT domain LIKE '%artstation%'
  AND NOT domain LIKE '%ytimg%'
  AND NOT domain LIKE '%imgflip%'
  AND NOT domain LIKE '%soundcloud%'
  AND NOT domain LIKE '%ppy.sh%'
  AND NOT over_18
)
WHERE count_dom>20
GROUP BY 1, 2, 3
ORDER BY 4 DESC
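As an example of changing the rules, here is a rough sketch of a variation that keeps only the four subreddits from the first screenshot. Treat it as a starting point, not as the query behind the charts. It uses a much shorter exclusion list and a made-up threshold of 5 posts per domain and subreddit, and it assumes the subreddit column stores names without the /r/ prefix.

```sql
#standardSQL
-- Hypothetical variation: top domains for four subreddits only.
-- The subreddit spelling and the HAVING threshold are assumptions.
SELECT domain, subreddit, COUNT(*) AS posts
FROM `fh-bigquery.reddit_posts.2017_01`
WHERE score > 25
  AND subreddit IN ('politics', 'The_Donald', 'worldnews', 'news')
  AND NOT domain LIKE 'self.%'    -- text posts
  AND NOT domain LIKE '%redd.it%' -- reddit-hosted media
  AND NOT domain LIKE '%imgur%'   -- image host
  AND NOT over_18
GROUP BY domain, subreddit
HAVING COUNT(*) > 5
ORDER BY posts DESC
```

Swap the names in the IN list to compare any other group of subreddits, and add back exclusions from the full query above as non-news domains show up in the results.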
Want more?
Want more stories? Check my Medium, follow me on Twitter, and subscribe to reddit.com/r/bigquery. And try BigQuery: every month you get a full terabyte of analysis for free.
Which subreddits have the most energy - how upvotes translate into pageviews
How much is a reddit upvote worth? We’ll find how much an upvote is worth, measured by imgur views. (medium.com)
Visualizing reddit’s activity and uptime over the past 8 years
How hard would it be to visualize reddit’s uptime from 2008 to 2016? Let’s do it here, while going through most of its… (medium.freecodecamp.com)