Crawlzo / Products / Social Media / Mastodon

Mastodon Scraper API

Pull any public Mastodon account or post into structured JSON — profile details, toot text and media, boost and favourite counts, plus the instance the content lives on.

Profiles · postsVideos & engagementComments & followersPublic data only

Talk to our team ↗All products

▸ Overview

Mastodon is a federated microblogging platform made of thousands of independently run instances that share content over ActivityPub. The Mastodon Scraper API handles this fragmentation for you, resolving public accounts and posts on any instance and returning a unified record that includes the home server, so cross-instance data stays comparable.

Open-web researchers and trust-and-safety teams use it to observe how topics spread across the fediverse without standing up their own server. Only public toots and profiles are collected, with boost, favourite, and reply counts normalized into one schema regardless of which instance served them.

Mastodon Scraper API · request

# POST a target — get validated JSON back
curl https://api.crawlzo.com/v4/scrape \
  -H "Authorization: Bearer $CRAWLZO_KEY" \
  -d '{
  "url": "https://www.mastodon.com/",
  "type": "profile",
  "include": "recent_posts"
  }'

// ← response
{
  "status": "ok",
  "data": {
    "username": "...",
    "followers": 1840221,
    "posts": 412,
    "verified": true,
    "recent_posts": [
      { "id": "…", "likes": 21044, "comments": 882 }
    ]
  }
}

▸ What you can extract

Every public field, structured for you.

Mastodon data parsed into clean, validated JSON. Pull any group below on its own, or combine them in a single request.

Profile details

Username, display name, user ID, bio
Followers, following, post/content count
Verified / creator / business flags
Avatar, banner, external links

Post & content details

Text / caption, media URLs, timestamp
Likes, comments, shares, reactions
Hashtags, mentions, links
Content type and permalink

Video & engagement

View / play count, watch metrics
Video URL, duration, thumbnail
Engagement-rate calculation

Comments

Comment text, author, timestamp
Reply threads and like counts
Top vs. recent ordering

Followers & connections

Follower / following list samples
Audience growth over time
Mutual-connection signals

Topic & hashtag feeds

Top / recent content per topic or tag
Volume and trend signals
Related tag discovery

▸ Built on the Crawlzo engine

The hard parts, already solved.

Public engagement metrics
Followers, likes, views, comments — normalized across the network.METRICS
Anti-bot bypass built in
Residential fingerprints, TLS evasion, behaviour modelling, CAPTCHA solving — every request.STEALTH
Schema-validated JSON
Hand us a schema; runs that don't conform are rejected and never billed.SCHEMA
Pay for data delivered
Valid rows only. Retries, blocks, and 5xxs on us.$0 / FAIL

▸ What teams build with it

Common use cases.

[ 01 ]

Fediverse and ActivityPub research

[ 02 ]

Cross-instance topic monitoring

[ 03 ]

Community migration analysis

[ 04 ]

Public account and engagement tracking

▸ FAQ

Mastodon scraping, answered.

Does it work across different Mastodon instances?

Yes. Pass an account or post URL from any instance and we resolve it on that server, returning the originating instance alongside the content so federated data lands in a single consistent schema.

How is the data delivered?

Structured JSON straight from the API, or pushed to your stack natively — S3, BigQuery, Snowflake, Postgres, Kafka, or any HTTPS webhook. Call it from Python, Node, Go, Rust, or any HTTP client. The data lands where your pipeline already lives.

Do I pay for blocked or failed requests?

No. You pay for valid, schema-passing rows only. Retries, blocks, CAPTCHAs, and 5xxs are on us. If a run doesn't return data that conforms to the schema, it isn't billed.

How do you handle anti-bot protection?

Every request routes through the same engine behind our Web Unblocker API: compliant residential IPs, real browser fingerprints, TLS-level evasion, behaviour modelling, and built-in CAPTCHA solving. Hard targets become routine.

Is this compliant and is the data mine?

Yes. We respect robots policies, rate budgets, and ToS-aware allow/deny lists. We deliver and move on — no row-level retention beyond your replay window. GDPR DPA, PII redaction, and custom data residency available on request.

Mastodon Scraper API

Every public field, structured for you.

Profile details

Post & content details

Video & engagement

Comments

Followers & connections

Topic & hashtag feeds

The hard parts, already solved.

Common use cases.

Mastodon scraping, answered.

Related scrapers.

Instagram Scraper API

Facebook Scraper API

LinkedIn Scraper API

TikTok Scraper API

Start pulling Mastodon data this week.