Yelp Scraping Parameters
Web Scraper API Yelp Scraping Parameters
Configure Yelp Scraping Parameters using Thordata's Web Scraper API, including Search URL, Business URL, Category, Location, and other parameters.
Unique Identifier:
token,Access token(required)
This parameter is used as the API access token to ensure the legitimacy of the scraping request.
Request examples:
Authorization: Bearer ********************
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer ********************" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-search-filters" ^
-d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
Product - Scrape business information:
1. Yelp - Scrape business information by Search Filters
spider_id ,Scraper tool (required)
Defines which scraper tool to use.
Request examples:
spider_id=yelp_business-overview_by-search-filters
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-search-filters" ^
-d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
category ,Category (required)
This parameter specifies the category to be scraped.
Request examples:
"category": "cafe"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-search-filters" ^
-d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
location ,Location (required)
This parameter specifies the location to be scraped.
Request examples:
"location": "new york"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-search-filters" ^
-d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
business_page_turning,Max business pages (optional)
This parameter specifies the pages to collect.
Request examples:
"business_page_turning": "1"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-search-filters" ^
-d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
2. Yelp - Scrape business information by Search URL
spider_id ,Scraper tool (required)
Defines which scraper tool to use.
Request examples:
spider_id=yelp_business-overview_by-search-url
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-search-url" ^
-d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
search_url ,Search URL (required)
This parameter specifies the Search URL to be scraped.
Request examples:
"search_url": "https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-search-url" ^
-d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
business_page_turning,Max business pages (optional)
This parameter specifies the pages to collect.
Request examples:
"business_page_turning": "1"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-search-url" ^
-d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
3. Yelp - Scrape business information by Business URL
spider_id ,Scraper tool (required)
Defines which scraper tool to use.
Request examples:
spider_id=yelp_business-overview_by-business-url
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-business-url" ^
-d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
business_url ,Business URL (required)
This parameter specifies the Business URL to search.
Request examples:
"business_url": "https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-business-url" ^
-d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
Product - Scrape business reviews information:
1. Yelp - Scrape business reviews information by Search Filters
spider_id ,Scraper tool(required)
Defines which scraper tool to use.
Request examples:
spider_id=yelp_business-reviews_by-search-filters
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-reviews_by-search-filters" ^
-d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
category, Category(required)
This parameter specifies the business category for crawling.
Request examples:
"category": "cafe"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-reviews_by-search-filters" ^
-d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
location,Location(required)
This parameter specifies the business location for crawling.
Request examples:
"location": "Stowe, VT"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-reviews_by-search-filters" ^
-d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
business_page_turning,Max business pages(optional)
This parameter specifies the max business pages for crawling.
Request examples:
"business_page_turning": "1"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-reviews_by-search-filters" ^
-d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
review_page_turning,Max review pages(optional)
This parameter specifies the max review pages for crawling.
Request examples:
"max review pages": "1"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-reviews_by-search-filters" ^
-d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
2. Yelp - Scrape business reviews information by Search URL
spider_id ,Scraper tool (required)
Defines which scraper tool to use.
Request examples:
spider_id=yelp_business-overview_by-search-url
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-search-url" ^
-d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
search_url,Search URL (required)
This parameter specifies the Search URL for crawling.
Request examples:
"search_url": "https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-search-url" ^
-d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
3. Yelp - Scrape business reviews information by Business URL
spider_id ,Scraper tool (required)
Defines which scraper tool to use.
Request examples:
spider_id=yelp_business-overview_by-business-url
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-overview_by-business-url" ^
-d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
business_url,Business URL(required)
This parameter specifies the Business URL for crawling.
Request examples:
"business_url": "https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-reviews_by-business-url" ^
-d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
unrecommended_reviews,Unrecommended Reviews(required)
This parameter is used to specify whether to capture unrecommended comments.
Request examples:
"unrecommended_reviews": "yes"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-reviews_by-business-url" ^
-d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
sort_by,Sort by(optional)
This parameter is used to specify the sorting method for capturing comments.
Request examples:
"sort_by": "DATE_DESC"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-reviews_by-business-url" ^
-d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
review_page_turning,Max review pages(optional)
This parameter is used to specify the max review pages to collect.
Request examples:
"review_page_turning": "1"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=yelp.com" ^
-d "spider_id=yelp_business-reviews_by-business-url" ^
-d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
If you need further assistance, please contact us via email at [email protected].
Last updated
Was this helpful?