Install
openclaw skills install @browseract-cli/trustpilot-reviewsTrustpilot customer reviews scraper for any company listed on trustpilot.com — given a company domain (e.g. shopify.com, apple.com, shopwagandtail.com) plus optional filters (page number, single star rating 1-5, single language ISO code, verified-only flag, has-company-reply flag, date period last30days/last3months/last6months/last12months, free-text keyword search, and reviewer countryCode client-side filter), extracts paginated reviews with 20+ fields per review: reviewId, reviewUrl, reviewTitle, reviewDescription, reviewRatingScore (1-5), reviewDate (ISO publishedDate), reviewDateOfExperience, reviewLabel, isReviewVerified, reviewer (display name), reviewerId, reviewersCountry (ISO Alpha-2), reviewLanguage, reviewCompanyResponse (the company reply text), plus pagination metadata (currentPage, totalPages, totalCount). Optional flags add reviewer profile metadata (reviewerNumberOfReviews, reviewerProfileIsVerified), reply analysis dates (companyReplyPublishedDate, companyReplyUpdatedDate), extended metadata (reviewSource, reviewVerificationSource, reviewLikes), and review photos. Use when user mentions Trustpilot reviews, scrape Trustpilot, get reviews from Trustpilot, Trustpilot review scraper, Trustpilot customer reviews, extract Trustpilot reviews, Trustpilot review data, download Trustpilot reviews, paginate Trustpilot, paginated Trustpilot reviews, filter Trustpilot reviews by star, filter Trustpilot by language, verified reviews Trustpilot, has reply Trustpilot, company response Trustpilot, Trustpilot reply analysis, brand monitoring Trustpilot, sentiment analysis Trustpilot, competitor reviews Trustpilot, bulk reviews from a Trustpilot page, batch download reviews, trustpilot.com reviews scraping, fake review research Trustpilot, Trustpilot dataset, persona research from Trustpilot reviews, Trustpilot keyword search reviews, Trustpilot date range reviews, review export Trustpilot. Also applies to comparing review trends across competitors, training data for LLM sentiment classifiers, monitoring competitor response time, exporting reviews for BI dashboards, lead generation from active reviewers, ML / fake-review detection, and aggregator sites that display third-party reviews.
openclaw skills install @browseract-cli/trustpilot-reviewsInput a company domain plus optional filters → output paginated review records with reviewer info, ratings, dates, company replies, and 20+ structured fields per review.
All process output to user (progress updates, process notifications) follows the user's language.
Extract one page of customer reviews from trustpilot.com/review/{domain} with the chosen filters applied. Designed to be called in a loop across pages to collect the full filtered review set.
https://www.trustpilot.com/review/{domain}If browser-act has been confirmed available in the current session → skip this step.
Invoke browser-act via Skill tool to load usage. If installation or configuration issues arise, follow its guidance to resolve then retry.
This Skill's operational boundary = what the user can manually do in their browser. It only reads data already displayed to the user on the page, never bypassing authentication or access controls. JS code is encapsulated in Python files under the
scripts/directory, invoked viaeval "$(python scripts/xxx.py {params})".$(...)is bash syntax; it is recommended to use the bash tool for execution.
Trustpilot accepts filter values as URL query parameters. Build the URL before navigating:
https://www.trustpilot.com/review/{domain}?page={page}&languages={lang}&sort=recency&stars={stars}&verified={true|false}&replies={true|false}&date={period}&search={keyword}
All parameters except domain are optional. Recommended construction rules:
page: integer ≥ 1 (omit for page 1; pagination defaults to 20 per page)languages: single ISO code (e.g. en, da, de, es, fr); omit or use all for no language filtersort: only recency is currently honoured by the public review page (older relevancy values are stripped on redirect)stars: single value 1, 2, 3, 4, or 5 — only one star tier can be selectedverified: true to keep only verified reviewsreplies: true to keep only reviews that received a company responsedate: one of last30days, last3months, last6months, last12monthssearch: URL-encoded keyword to search inside review textCountry-of-reviewer filtering is not supported by Trustpilot's URL — it is applied client-side after extraction using --filter-country (see below).
The Trustpilot review page is server-rendered with Next.js; the full reviews array is embedded in <script id="__NEXT_DATA__">. Extraction reads it directly — no separate API call needed.
Steps:
navigate {built-url}wait stable --timeout 30000eval "$(python scripts/extract-reviews.py [options])"Parameters (all optional flags / values):
--include-reviewer-metadata: adds reviewerNumberOfReviews, reviewerProfileIsVerified.--include-reply-analysis: adds companyReplyPublishedDate, companyReplyUpdatedDate (null when no reply).--include-extended-metadata: adds reviewSource (e.g. Organic, Shopify, InvitationApi), reviewVerificationSource (e.g. invitation, complianceDocumentation), reviewLikes.--include-review-photos: adds reviewPhotoUrls array via best-effort DOM scan of the review card; returns [] when no photos.--no-experience-date: omit the default reviewDateOfExperience field.--filter-country {ISO2}: client-side filter — keep only reviews whose reviewer countryCode matches the given ISO Alpha-2 code (e.g. US, GB, DK). Case-insensitive. Applied after the page-level filters from the URL.Output example (one page, with all optional flags):
{
"reviews": [ // array of review objects (≤ perPage items)
{
"companyName": "Wag + Tail",
"companyPageUrl": "https://www.trustpilot.com/review/shopwagandtail.com?page=1&languages=en",
"businessUnitId": "624c24851220e2743a4d7916", // Trustpilot internal company ID
"reviewId": "65fd6a055de8c560404349e1",
"reviewUrl": "https://www.trustpilot.com/reviews/65fd6a055de8c560404349e1",
"reviewTitle": "Great product looks so cute on my girl",
"reviewDescription": "Great product looks so cute on my girl and fits so well",
"reviewRatingScore": 5, // 1-5
"reviewDate": "2024-03-22T13:22:45.000Z", // when review was posted
"reviewLabel": "verified", // "verified" or "not-verified"
"isReviewVerified": true,
"reviewer": "Angela Rosalie",
"reviewerId": "656cc367ad3fb4001307cba8",
"reviewersCountry": "US", // ISO Alpha-2
"reviewLanguage": "en",
"reviewCompanyResponse": "", // empty string when no reply
"scrapedDateTime": "2026-06-26T04:13:03.026Z",
"scrapedAtReviewPageNumber": 1,
"reviewDateOfExperience": "2024-01-27T00:00:00.000Z", // default; remove with --no-experience-date
"reviewerNumberOfReviews": 2, // --include-reviewer-metadata
"reviewerProfileIsVerified": false, // --include-reviewer-metadata
"companyReplyPublishedDate": null, // --include-reply-analysis (null when no reply)
"companyReplyUpdatedDate": null, // --include-reply-analysis
"reviewSource": "Shopify", // --include-extended-metadata
"reviewVerificationSource": "invitation", // --include-extended-metadata
"reviewLikes": 0, // --include-extended-metadata
"reviewPhotoUrls": [] // --include-review-photos
}
],
"pagination": {
"currentPage": 1,
"perPage": 20,
"totalPages": 12,
"totalCount": 231 // total reviews matching the active filters
},
"activeFilters": { // what Trustpilot actually applied (after URL normalization)
"languages": "en",
"sort": "recency",
"stars": null,
"verified": false,
"replies": false,
"date": null,
"search": null,
"countryFilter": null // mirrors --filter-country when set
},
"companyContext": { // company snapshot for joining with company-info output
"businessUnitId": "624c24851220e2743a4d7916",
"companyName": "Wag + Tail",
"identifyingName": "shopwagandtail.com",
"trustScore": 4.7,
"stars": 4.5,
"totalReviews": 258
},
"totalFiltered": 231, // duplicate of pagination.totalCount, kept for clarity
"scrapedDateTime": "2026-06-26T04:13:03.026Z",
"isCompanyFound": true
}
Output example (company not found):
{
"isCompanyFound": false,
"companyPageUrl": "https://www.trustpilot.com/review/nonexistent-xyz-9999.com",
"searchedDomain": "nonexistent-xyz-9999.com",
"scrapedDateTime": "2026-06-26T04:13:45.151Z",
"reviews": [],
"pagination": null
}
Error handling: when the script fails to find __NEXT_DATA__ or businessUnit (and the page is not the 404 page) it returns {"error": true, "message": "..."}. Resolution sequence: (1) confirm wait stable finished; (2) re-run the extraction; (3) if Trustpilot served an anti-bot challenge, switch to a stealth browser with proxy and retry once.
[DOM] languages — read from the __NEXT_DATA__ SSR data on any Trustpilot review page:
eval "(() => { const s=document.querySelector('script#__NEXT_DATA__'); const d=JSON.parse(s.textContent); const langs=d.props.pageProps.filters.reviewStatistics.reviewLanguages; return JSON.stringify(langs.map(l => ({ code: l.isoCode, name: l.displayName, count: l.reviewCount }))); })()"
Each entry: { code, name, count }. The literal all is also accepted as the URL value to disable the language filter.
[Static] sort: recency (currently the only honoured value on the public review page; older relevancy is stripped on redirect).
[Static] stars: 1, 2, 3, 4, 5 — only one value at a time.
[Static] date: last30days, last3months, last6months, last12months.
[Static] verified, replies: true or false.
[Static] filterByCountryOfReviewers: any ISO Alpha-2 country code (e.g. US, GB, DE, DK). Applied client-side via --filter-country.
URL Pagination: URL pattern https://www.trustpilot.com/review/{domain}?page={N}&{otherFilters}. Next page = increment page by 1. Termination conditions: (a) pagination.currentPage >= pagination.totalPages, OR (b) reviews.length === 0 on the current page, OR (c) caller-supplied endAtPageNumber reached.
Recommended loop pattern (bash):
PAGE=1
END=0 # 0 means "until totalPages"
while : ; do
URL="https://www.trustpilot.com/review/${DOMAIN}?page=${PAGE}&languages=en&sort=recency"
browser-act --session {S} navigate "$URL"
browser-act --session {S} wait stable --timeout 30000
RESP=$(browser-act --session {S} eval "$(python scripts/extract-reviews.py --include-extended-metadata)")
# append RESP.reviews to a JSONL file
TOTAL=$(echo "$RESP" | python -c "import sys,json; print(json.loads(sys.stdin.read())['pagination']['totalPages'])")
PAGE=$((PAGE+1))
if [ "$PAGE" -gt "$TOTAL" ]; then break; fi
if [ "$END" -gt 0 ] && [ "$PAGE" -gt "$END" ]; then break; fi
sleep 2
done
isCompanyFound === true && Array.isArray(reviews) && pagination.currentPage === expected_page && (reviews.length > 0 || pagination.totalCount === 0) — for an existing company; isCompanyFound === false && searchedDomain !== null && reviews.length === 0 — for a not-found domain.
sort=relevancy is no longer honoured on the public review page; only recency is applied. Build URLs accordingly.filterByCountryOfReviewers is client-side only: a page may return up to perPage reviews of which fewer match the country filter, so the returned list per page can be sparse. To collect a fixed-size sample by country, page deeper or remove other restrictive filters.pagination.totalCount represents reviews after server-side filters and may differ from companyContext.totalReviews (Trustpilot's official count may include hidden / machine-filtered reviews).reviewPhotoUrls is best-effort: Trustpilot does not consistently expose review photos in the SSR JSON, so the script falls back to scanning known photo-host URL patterns inside the review card DOM. Reviews without photos return []; some valid photos may be missed on layout changes.page=1 to the bare URL — pagination.currentPage still reports 1, so callers should trust that field rather than the URL.Path: {working-directory}/browser-act-skill-forge-memories/trustpilot-reviews-scraper-trustpilot-reviews.memory.md (working directory is determined by the Agent running the Skill, typically the project root or current working directory)
Before execution: If the file exists, read it first — it records unexpected situations encountered during past executions (e.g., a strategy has become ineffective); adjust strategy order accordingly.
After execution: If an unexpected situation is encountered (strategy became ineffective, page redesigned, anti-scraping upgraded, better path discovered), append a line:
{YYYY-MM-DD}: {what happened} → {conclusion}
Normal execution does not write to the file. Do not record what keywords were used or how many results were returned — those are task outputs, not experience.