console.cloud.google.com uses cookies from Google to deliver and enhance the quality of its services and to analyze traffic. Learn more

Start your Free Trial with $300 in credit. Don’t worry—you won’t be charged if you run out of credits. Learn more
Skip to main content Accessibility Help Accessibility Feedback
Console Logo
Console Logo

Product details

Hacker News

Y Combinator

Stories and comments since 2006

Overview
Samples
Related Products

Overview

This dataset contains all stories and comments from Hacker News from its launch in 2006 to present. Each story contains a story ID, the author that made the post, when it was written, and the number of points the story received.

This public dataset is hosted in Google BigQuery and is included in BigQuery's 1TB/mo of free tier processing. This means that each user receives 1TB of free BigQuery processing every month, which can be used to run queries on this public dataset. Watch this short video to learn how to get started quickly using BigQuery to access public datasets. What is BigQuery .

Additional details

  • Type: Data
  • Category: Encyclopedic, Social
  • Dataset source: Hacker News API
  • Cloud service: BigQuery
  • Expected update frequency: Daily

    Samples

    Here are some examples of SQL queries you can run on this data in BigQuery.

    How are Hacker News story points distributed?
    If you use the score as a dimension (group by score, in SQL) and count the number of posts with each score, you can get an idea about how likely a story is to get a given score. Run this query

    Where do the stories live?
    By parsing out the host from the URL you can see where Hacker News stories originate. Run this query

    Terms of Service

    This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - https://github.com/HackerNews/API - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.

    Related Products

    Customers who use this product also use the following products

    GitHub Activity Data
    GitHub

    Includes activity from over 3M open source GitHub repositories

    Google Trends
    BigQuery Public Datasets Program

    Daily top 25 and top 25 rising terms in the United States

    BigQuery API
    Google Enterprise API

    A data platform for customers to create, manage, share and query data.

    Compute Engine API
    Google Enterprise API

    Compute Engine API

    Gemini API
    Google

    Build with latest models from Google Deepmind using the Gemini API for Developers

    Google Drive API
    Google Enterprise API

    Create and manage resources in Google Drive.

    Your page may be loading slowly because you're building optimized sources. If you intended on using uncompiled sources, please click this link.

    Hide the shortcuts helper

    Google Cloud Console has failed to load JavaScript sources from www.gstatic.com.
    Possible reasons are:

    • www.gstatic.com or its IP addresses are blocked by your network administrator
    • Google has temporarily blocked your account or network due to excessive automated requests
    Please contact your network administrator for further assistance.

    Help
    Cloud Hub
    Solutions
    Billing
    IAM & Admin
    Marketplace
    APIs & Services
    Vertex AI
    Compute Engine
    Kubernetes Engine
    Cloud Storage
    Security
    BigQuery
    Monitoring
    Cloud Run
    VPC Network
    Cloud SQL
    Google Maps Platform
    Click to view dataset