Turn any Goodreads book or author page into JSON: the average star rating, ratings and reviews counts, editions and ISBNs, reader reviews, and author bibliographies.
Goodreads is the largest community for readers, pairing book metadata with millions of star ratings and written reviews across editions. The Goodreads Scraper API resolves book pages, author profiles, and review streams into validated JSON with average ratings, ISBNs, publication details, and review text.
It serves publishing analytics and reading-recommendation tools that need reader sentiment and bibliographic data together. Average ratings resolve at request time and reader reviews paginate fully across each edition.
# POST a target — get validated JSON back
curl https://api.crawlzo.com/v4/scrape \
-H "Authorization: Bearer $CRAWLZO_KEY" \
-d '{
"url": "https://www.goodreads.com/",
"type": "title",
"geo": "us"
}'
// ← response
{
"status": "ok",
"data": {
"title": "...",
"year": 2024,
"rating": 8.4,
"votes": 184220,
"genres": ["Drama"],
"runtime_min": 128
}
} "type": "title",
"geo": "us"Goodreads data parsed into clean, validated JSON. Pull any group below on its own, or combine them in a single request.
Publishing and title-performance analytics
Reader sentiment and review mining
Author bibliography and edition mapping
Book recommendation and discovery models
Yes. Each book returns the average rating, ratings and reviews counts, publication date, and the list of editions with their ISBNs, plus paginated reader reviews.
Structured JSON straight from the API, or pushed to your stack natively — S3, BigQuery, Snowflake, Postgres, Kafka, or any HTTPS webhook. Call it from Python, Node, Go, Rust, or any HTTP client. The data lands where your pipeline already lives.
No. You pay for valid, schema-passing rows only. Retries, blocks, CAPTCHAs, and 5xxs are on us. If a run doesn't return data that conforms to the schema, it isn't billed.
Every request routes through the same engine behind our Web Unblocker API: compliant residential IPs, real browser fingerprints, TLS-level evasion, behaviour modelling, and built-in CAPTCHA solving. Hard targets become routine.
Yes. We respect robots policies, rate budgets, and ToS-aware allow/deny lists. We deliver and move on — no row-level retention beyond your replay window. GDPR DPA, PII redaction, and custom data residency available on request.