Convert any public Tumblr blog or tag feed into structured JSON — post bodies across text, photo, quote, and link types, plus tags, reblog trails, and note counts.
Tumblr is a microblogging community built around themed blogs, remix culture, and dense tag-based discovery. The Tumblr Scraper API walks public blog archives and tag pages, parsing each post into its native type and capturing the reblog trail that shows how content spread from one blog to the next.
Fandom researchers, trend forecasters, and culture analysts rely on it to map niche communities that rarely surface on mainstream networks. We collect public posts only and standardize note counts, tag lists, and post formats into one consistent schema.
# POST a target — get validated JSON back
curl https://api.crawlzo.com/v4/scrape \
-H "Authorization: Bearer $CRAWLZO_KEY" \
-d '{
"url": "https://www.tumblr.com/",
"type": "profile",
"include": "recent_posts"
}'
// ← response
{
"status": "ok",
"data": {
"username": "...",
"followers": 1840221,
"posts": 412,
"verified": true,
"recent_posts": [
{ "id": "…", "likes": 21044, "comments": 882 }
]
}
} "type": "profile",
"include": "recent_posts"Tumblr data parsed into clean, validated JSON. Pull any group below on its own, or combine them in a single request.
Fandom and subculture research
Tag-based trend discovery
Reblog and meme propagation analysis
Niche community content archiving
Yes. Pass a tag and we page through the public tag feed, returning each post with its blog, type, body, note count, and timestamp so you can track a topic across many blogs at once.
Structured JSON straight from the API, or pushed to your stack natively — S3, BigQuery, Snowflake, Postgres, Kafka, or any HTTPS webhook. Call it from Python, Node, Go, Rust, or any HTTP client. The data lands where your pipeline already lives.
No. You pay for valid, schema-passing rows only. Retries, blocks, CAPTCHAs, and 5xxs are on us. If a run doesn't return data that conforms to the schema, it isn't billed.
Every request routes through the same engine behind our Web Unblocker API: compliant residential IPs, real browser fingerprints, TLS-level evasion, behaviour modelling, and built-in CAPTCHA solving. Hard targets become routine.
Yes. We respect robots policies, rate budgets, and ToS-aware allow/deny lists. We deliver and move on — no row-level retention beyond your replay window. GDPR DPA, PII redaction, and custom data residency available on request.