GitHub Scraper API

Turn any GitHub repository, topic, or trending page into structured JSON: stars, forks, open issues, language breakdown, latest release, contributors, and topics.

▸ Overview

GitHub is the center of open-source software, where a project's stars, release cadence, and trending placement signal its momentum better than any store rank. The GitHub Scraper API resolves repository pages, release lists, topic feeds, and the daily trending board into validated JSON keyed to owner and repo.

Developer-tooling vendors, VCs scouting open source, and OSS maintainers use it to watch star growth, catch new releases, and discover rising projects by language. Star and fork counts are read at request time, and the trending board is captured per language and time window.

GitHub Scraper API · request

# POST a target and get validated JSON back
curl https://api.crawlzo.com/v4/scrape \
  -H "Authorization: Bearer $CRAWLZO_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://www.github.com/",
    "type": "app",
    "geo": "us"
  }'

// ← response
{
  "status": "ok",
  "data": {
    "title": "...",
    "developer": "...",
    "rating": 4.7,
    "ratings_count": 184220,
    "price": "Free",
    "category": "Productivity",
    "rank": 12
  }
}
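
The same call can be made from any HTTP client. The sketch below mirrors the curl example in Python using only the standard library. The endpoint, the Authorization header, and the body fields are copied from that example; the helper names (`build_request`, `parse_response`, `scrape`) and the choice to treat any status other than "ok" as an error are assumptions for illustration, not documented Crawlzo behaviour.

```python
import json
import urllib.request

API_URL = "https://api.crawlzo.com/v4/scrape"  # endpoint from the curl example


def build_request(api_key: str, url: str, type_: str = "app", geo: str = "us") -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl example."""
    body = json.dumps({"url": url, "type": type_, "geo": geo}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            # Standard header for a JSON body; assumed to be accepted by the API.
            "Content-Type": "application/json",
        },
    )


def parse_response(raw: bytes) -> dict:
    """Return the `data` object; raise if `status` is not "ok" (assumed convention)."""
    payload = json.loads(raw)
    if payload.get("status") != "ok":
        raise RuntimeError(f"scrape failed: {payload.get('status')!r}")
    return payload["data"]


def scrape(api_key: str, url: str) -> dict:
    """Send the request and return the parsed `data` object."""
    with urllib.request.urlopen(build_request(api_key, url), timeout=30) as resp:
        return parse_response(resp.read())
```

Keeping request construction and response parsing separate from the network call makes both easy to unit-test without touching the API.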

▸ What you can extract

Every public field, structured for you.

GitHub data parsed into clean, validated JSON. Pull any group below on its own, or combine them in a single request.

Listing details

Title, developer, description, icon
Category, price, in-app purchases
Version, size, update date
Screenshots and preview media

Ratings & reviews

Aggregate rating and ratings count
Review text, rating, author, version
Rating distribution and developer replies
Full review pagination

Charts & rankings

Top free / paid / grossing rank
Category and country charts
Rank history over time

Developer details

Developer name and other titles
Website and support links
Privacy and data-use labels

Search results

Ranked apps per keyword + country
Ad vs. organic placement
Keyword visibility signals

▸ Built on the Crawlzo engine

The hard parts, already solved.

Live rank & rating (LIVE)
Chart position and rating resolved at request time, per country.

Anti-bot bypass built in (STEALTH)
Residential fingerprints, TLS evasion, behaviour modelling, and CAPTCHA solving on every request.

Schema-validated JSON (SCHEMA)
Hand us a schema; runs that don't conform are rejected and never billed.

Pay for data delivered ($0 / FAIL)
Valid rows only. Retries, blocks, and 5xxs are on us.
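
To make the schema guarantee concrete, here is a minimal client-side sketch of the same idea: check each returned row against a declared set of fields and types, and keep only the rows that conform. The schema format (a plain dict mapping field name to Python type) and the function names are hypothetical; this page does not specify how Crawlzo schemas are written. The field names are taken from the sample response.

```python
from typing import Any

# Hypothetical schema: field name -> expected Python type(s).
# Field names come from the sample response; the format is an assumption.
SCHEMA = {
    "title": str,
    "developer": str,
    "rating": (int, float),
    "ratings_count": int,
}


def conforms(row: dict[str, Any], schema: dict = SCHEMA) -> bool:
    """Return True if every schema field is present with an accepted type."""
    for field, expected in schema.items():
        if field not in row:
            return False
        value = row[field]
        # bool is a subclass of int in Python; reject it unless explicitly expected.
        if isinstance(value, bool) and expected is not bool:
            return False
        if not isinstance(value, expected):
            return False
    return True


def split_rows(rows: list[dict[str, Any]]) -> tuple[list[dict], list[dict]]:
    """Partition rows into (valid, rejected) according to the schema."""
    valid = [row for row in rows if conforms(row)]
    rejected = [row for row in rows if not conforms(row)]
    return valid, rejected
```

Under the billing model described here, only the rows in the first list would count as delivered data.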

▸ What teams build with it

Common use cases.

[ 01 ]

Open-source star and fork growth tracking

[ 02 ]

Release and tag monitoring for dependencies

[ 03 ]

Trending-repo discovery by language

[ 04 ]

Developer-tool and ecosystem market research
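
As an illustration of the first use case, star growth can be derived by comparing snapshots of the same repository taken at different times. The sketch assumes each snapshot is a dict with `captured_at` (an ISO 8601 timestamp with offset) and `stars` keys; both names are hypothetical and are not taken from a documented response schema.

```python
from datetime import datetime


def stars_per_day(snapshots: list[dict]) -> float:
    """Average daily star growth between the earliest and latest snapshot.

    Each snapshot is assumed to look like:
        {"captured_at": "2024-01-01T00:00:00+00:00", "stars": 1000}
    """
    if len(snapshots) < 2:
        raise ValueError("need at least two snapshots")
    # Sort by capture time so the input order does not matter.
    ordered = sorted(snapshots, key=lambda s: datetime.fromisoformat(s["captured_at"]))
    first, last = ordered[0], ordered[-1]
    elapsed = (
        datetime.fromisoformat(last["captured_at"])
        - datetime.fromisoformat(first["captured_at"])
    )
    days = elapsed.total_seconds() / 86400
    if days <= 0:
        raise ValueError("snapshots must span a positive time interval")
    return (last["stars"] - first["stars"]) / days
```

The same pattern applies to fork counts by swapping the field that is compared.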

▸ FAQ

GitHub scraping, answered.

Can I monitor releases and trending repos?

Yes. We return the latest release tag and notes per repository, and the trending board scoped by language and by daily, weekly, or monthly window.
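
One plausible way to request a scoped trending board is to pass GitHub's public trending URL as the target. The sketch below builds that URL from a language and one of the three windows named above, then wraps it in a request body using the `url` and `geo` fields from the request example. Whether the API expects the trending URL in this form, and which `type` value applies to GitHub targets, are assumptions; any extra fields such as `type` are left to the caller.

```python
from typing import Optional
from urllib.parse import quote, urlencode

WINDOWS = ("daily", "weekly", "monthly")  # the windows named in the answer above


def trending_url(language: Optional[str] = None, window: str = "daily") -> str:
    """Build the public GitHub trending URL for a language and time window."""
    if window not in WINDOWS:
        raise ValueError(f"window must be one of {WINDOWS}")
    url = "https://github.com/trending"
    if language:
        # Percent-encode so names such as "c++" are safe in a URL path.
        url += "/" + quote(language.lower(), safe="")
    return f"{url}?{urlencode({'since': window})}"


def trending_payload(language: Optional[str], window: str, geo: str = "us", **extra) -> dict:
    """Request body for a trending scrape; `extra` carries caller-supplied fields."""
    return {"url": trending_url(language, window), "geo": geo, **extra}
```

For example, `trending_payload("Python", "weekly")` targets the weekly Python board, and looping over several languages yields one request body per board.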

How is the data delivered?

Structured JSON straight from the API, or pushed to your stack natively — S3, BigQuery, Snowflake, Postgres, Kafka, or any HTTPS webhook. Call it from Python, Node, Go, Rust, or any HTTP client. The data lands where your pipeline already lives.

Do I pay for blocked or failed requests?

No. You pay for valid, schema-passing rows only. Retries, blocks, CAPTCHAs, and 5xxs are on us. If a run doesn't return data that conforms to the schema, it isn't billed.

How do you handle anti-bot protection?

Every request routes through the same engine behind our Web Unblocker API: compliant residential IPs, real browser fingerprints, TLS-level evasion, behaviour modelling, and built-in CAPTCHA solving. Hard targets become routine.

Is this compliant, and is the data mine?

Yes. We respect robots policies, rate budgets, and ToS-aware allow/deny lists. We deliver and move on, with no row-level retention beyond your replay window. A GDPR DPA, PII redaction, and custom data residency are available on request.

Related scrapers.

Apple App Store Scraper API

Google Play Scraper API

Steam Scraper API

Product Hunt Scraper API

Start pulling GitHub data this week.