Tumblr Scraper API

Convert any public Tumblr blog or tag feed into structured JSON — post bodies across text, photo, quote, and link types, plus tags, reblog trails, and note counts.

Profiles · posts · Videos & engagement · Comments & followers · Public data only

▸ Overview

Tumblr is a microblogging community built around themed blogs, remix culture, and dense tag-based discovery. The Tumblr Scraper API walks public blog archives and tag pages, parsing each post into its native type and capturing the reblog trail that shows how content spread from one blog to the next.

Fandom researchers, trend forecasters, and culture analysts rely on it to map niche communities that rarely surface on mainstream networks. We collect public posts only and standardize note counts, tag lists, and post formats into one consistent schema.

Tumblr Scraper API · request

# POST a target — get validated JSON back
curl https://api.crawlzo.com/v4/scrape \
  -H "Authorization: Bearer $CRAWLZO_KEY" \
  -H "Content-Type: application/json" \
  -d '{
  "url": "https://www.tumblr.com/",
  "type": "profile",
  "include": "recent_posts"
  }'

// ← response
{
  "status": "ok",
  "data": {
    "username": "...",
    "followers": 1840221,
    "posts": 412,
    "verified": true,
    "recent_posts": [
      { "id": "…", "likes": 21044, "comments": 882 }
    ]
  }
}
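
The same call can be made from Python. The sketch below mirrors the curl example using only the standard library. The endpoint, bearer header, and body fields are copied from that example; the helper names `build_request` and `scrape_profile` and the JSON `Content-Type` header are assumptions made for illustration, not part of a documented client.

```python
import json
import urllib.request

API_URL = "https://api.crawlzo.com/v4/scrape"  # endpoint from the curl example


def build_request(target_url: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl example."""
    body = {
        "url": target_url,
        "type": "profile",
        "include": "recent_posts",
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            # Assumption: the API expects the body to be declared as JSON.
            "Content-Type": "application/json",
        },
        method="POST",
    )


def scrape_profile(target_url: str, api_key: str, timeout: float = 30.0) -> dict:
    """Send the request and return the decoded JSON response."""
    request = build_request(target_url, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `scrape_profile(url, key)` would return the decoded response, so `result["data"]["recent_posts"]` would hold the post list shown above. Keeping request construction separate from sending makes the payload easy to inspect before any network call.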

▸ What you can extract

Every public field, structured for you.

Tumblr data parsed into clean, validated JSON. Pull any group below on its own, or combine them in a single request.

Profile details

Username, display name, user ID, bio
Followers, following, post/content count
Verified / creator / business flags
Avatar, banner, external links

Post & content details

Text / caption, media URLs, timestamp
Likes, comments, shares, reactions
Hashtags, mentions, links
Content type and permalink

Video & engagement

View / play count, watch metrics
Video URL, duration, thumbnail
Engagement-rate calculation
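
The page does not spell out how the engagement rate is computed. One common definition is total interactions divided by follower count, expressed as a percentage. The sketch below assumes that definition; the function and parameter names are hypothetical, and the API's own calculation may differ.

```python
def engagement_rate(likes: int, comments: int, shares: int, followers: int) -> float:
    """Return interactions per follower as a percentage.

    Assumed formula: 100 * (likes + comments + shares) / followers.
    Returns 0.0 when the follower count is zero or negative.
    """
    if followers <= 0:
        return 0.0
    return 100.0 * (likes + comments + shares) / followers
```

For example, a post with 50 likes, 30 comments, and 20 shares on a blog with 1,000 followers comes out at 10.0 percent under this definition.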

Comments

Comment text, author, timestamp
Reply threads and like counts
Top vs. recent ordering

Followers & connections

Follower / following list samples
Audience growth over time
Mutual-connection signals

Topic & hashtag feeds

Top / recent content per topic or tag
Volume and trend signals
Related tag discovery

▸ Built on the Crawlzo engine

The hard parts, already solved.

Public engagement metrics · METRICS
Followers, likes, views, comments, normalized across the network.

Anti-bot bypass built in · STEALTH
Residential fingerprints, TLS evasion, behaviour modelling, CAPTCHA solving on every request.

Schema-validated JSON · SCHEMA
Hand us a schema; runs that don't conform are rejected and never billed.

Pay for data delivered · $0 / FAIL
Valid rows only. Retries, blocks, and 5xxs are on us.
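
To make the schema-validation idea concrete, here is a minimal client-side sketch of checking rows against a schema and separating conforming rows from rejected ones. It is an illustration only, not Crawlzo's validator: the `POST_SCHEMA` format (field name mapped to a Python type) and both function names are assumptions, and the field names are borrowed from the sample response.

```python
from typing import Any

# Hypothetical schema format: field name -> required Python type.
POST_SCHEMA: dict[str, type] = {
    "id": str,
    "likes": int,
    "comments": int,
}


def conforms(row: dict[str, Any], schema: dict[str, type]) -> bool:
    """True when every schema field is present with the expected type."""
    return all(
        name in row and isinstance(row[name], expected)
        for name, expected in schema.items()
    )


def split_rows(
    rows: list[dict[str, Any]], schema: dict[str, type]
) -> tuple[list[dict[str, Any]], list[dict[str, Any]]]:
    """Partition rows into (valid, rejected) according to the schema."""
    valid: list[dict[str, Any]] = []
    rejected: list[dict[str, Any]] = []
    for row in rows:
        (valid if conforms(row, schema) else rejected).append(row)
    return valid, rejected
```

Under the billing model described here, only the rows in the `valid` list would count as delivered data.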

▸ What teams build with it

Common use cases.

[ 01 ] Fandom and subculture research

[ 02 ] Tag-based trend discovery

[ 03 ] Reblog and meme propagation analysis

[ 04 ] Niche community content archiving
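
As an illustration of reblog propagation analysis, the sketch below counts how often content moved from one blog to another. It assumes each post carries a `reblog_trail` list of blog names ordered from the original poster to the latest reblogger; that field name and ordering are hypothetical and should be adjusted to the actual response schema.

```python
from collections import Counter
from typing import Iterable


def propagation_edges(posts: Iterable[dict]) -> Counter:
    """Count (source_blog, reblogging_blog) hops across all reblog trails."""
    edges: Counter = Counter()
    for post in posts:
        # Assumed field: ordered blog names, original poster first.
        trail = post.get("reblog_trail", [])
        # Each adjacent pair in the trail is one hop of the content.
        for source, target in zip(trail, trail[1:]):
            edges[(source, target)] += 1
    return edges
```

Calling `propagation_edges(posts).most_common(10)` would list the ten most frequent hops, which is a reasonable starting point for building a propagation graph.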

▸ FAQ

Tumblr scraping, answered.

Can I scrape every post under a specific tag?

Yes. Pass a tag and we page through the public tag feed, returning each post with its blog, type, body, note count, and timestamp so you can track a topic across many blogs at once.
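
Once a tag feed has been collected, tracking the topic across blogs can be as simple as aggregating by blog. The sketch below totals note counts per blog and ranks the results; the key names `blog` and `note_count` are assumed for illustration and may not match the delivered field names.

```python
from collections import defaultdict
from typing import Iterable


def notes_by_blog(posts: Iterable[dict]) -> list[tuple[str, int]]:
    """Sum note counts per blog and return (blog, total) pairs, highest first."""
    totals: dict[str, int] = defaultdict(int)
    for post in posts:
        # Assumed keys: "blog" (blog name) and "note_count" (integer).
        totals[post["blog"]] += post.get("note_count", 0)
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)
```

The same pattern extends to grouping by post type or by day, using the type and timestamp fields returned with each post.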

How is the data delivered?

Structured JSON straight from the API, or pushed to your stack natively — S3, BigQuery, Snowflake, Postgres, Kafka, or any HTTPS webhook. Call it from Python, Node, Go, Rust, or any HTTP client. The data lands where your pipeline already lives.

Do I pay for blocked or failed requests?

No. You pay for valid, schema-passing rows only. Retries, blocks, CAPTCHAs, and 5xxs are on us. If a run doesn't return data that conforms to the schema, it isn't billed.

How do you handle anti-bot protection?

Every request routes through the same engine behind our Web Unblocker API: compliant residential IPs, real browser fingerprints, TLS-level evasion, behaviour modelling, and built-in CAPTCHA solving. Hard targets become routine.

Is this compliant, and is the data mine?

Yes. We respect robots policies, rate budgets, and ToS-aware allow/deny lists. We deliver and move on — no row-level retention beyond your replay window. GDPR DPA, PII redaction, and custom data residency available on request.

Related scrapers.

Instagram Scraper API

Facebook Scraper API

LinkedIn Scraper API

TikTok Scraper API

Start pulling Tumblr data this week.