Yelp 抓取參數

Web Scraper API Yelp 抓取參數

使用 Thordata 的 Web Scraper API 配置 Yelp 抓取參數，包括包括搜尋URL、商家URL、類別、位置和其他參數。

唯一标識：

token ，訪問令牌（必填）

此參數用作 API 存取令牌，以確保抓取的合法性。

請求範例：

Authorization: Bearer ********************

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer ********************" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

一、產品-抓取商家資訊：

1. Yelp - 透過搜尋過濾器抓取商家訊息

spider_id ，所屬抓取工具 (必填)

它定義了要使用的抓取工具。

請求範例：

spider_id=yelp_business-overview_by-search-filters

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

category ，類別（必填）

此參數指定要抓取的類別。

請求範例:

"category": "cafe"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

location ，地點（必填）

此參數指定要抓取的位置。

請求範例:

"location": "new york"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

business_page_turning，最多商家清單頁數（可選）

此參數指定要收集的商家清單頁數。

請求範例:

"business_page_turning": "1"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

2. Yelp - 透過搜尋網址抓取商業訊息

spider_id ，所屬抓取工具 (必填)

它定義了要使用的抓取工具。

請求範例：

spider_id=yelp_business-overview_by-search-url

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-url" ^
  -d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

search_url ，搜尋網址（必填）

此参数指定要抓取的搜索网址。

請求範例:

"search_url": "https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-url" ^
  -d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

business_page_turning，最多商家清單頁數（可選）

此參數指定要收集的商家清單頁數。

請求範例:

"business_page_turning": "1"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-url" ^
  -d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

3. Yelp - 透過商家URL抓取商業訊息

spider_id ，所屬抓取工具 (必填)

它定義了要使用的抓取工具。

請求範例：

spider_id=yelp_business-overview_by-business-url

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

business_url ，商家URL(必填)

此參數指定要搜尋的商家網址。

請求範例:

"business_url": "https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

產品 - 抓取商家評論資訊：

1. Yelp - 透過搜尋篩選器抓取商家評論訊息

spider_id ，所屬抓取工具 (必填)

它定義了要使用的抓取工具。

請求範例：

spider_id=yelp_business-reviews_by-search-filters

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

category， 類別（必填）

此參數指定要抓取的業務類別。

請求範例:

"category": "cafe"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

location，地點（必填）

此參數指定要抓取特定位置的商家。

請求範例:

"location": "Stowe, VT"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

business_page_turning，最多商家清單頁數（可選）

此參數指定要收集的商家清單頁數。

請求範例:

"business_page_turning": "1"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

review_page_turning，最多評論頁數（可選）

此參數指定要抓取的最大評論頁數。

請求範例:

"max review pages": "1"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

2. Yelp - 透過搜尋網址抓取商家評論訊息

spider_id ，所屬抓取工具 (必填)

它定義了要使用的抓取工具。

請求範例：

spider_id=yelp_business-overview_by-search-url

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-url" ^
  -d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

search_url，搜尋URL（必填）

此參數指定要抓取的搜尋 URL。

請求範例:

"search_url": "https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-url" ^
  -d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

3. Yelp - 透過商家網址抓取商家評論訊息

spider_id ，所屬抓取工具 (必填)

它定義了要使用的抓取工具。

請求範例：

spider_id=yelp_business-overview_by-business-url

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

business_url，商家URL（必填）

此參數指定要抓取的商家URL。

請求範例:

"business_url": "https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

unrecommended_reviews，不推薦評論（必填）

此參數用於指定是否抓取不建議的評論。

請求範例:

"unrecommended_reviews": "yes"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

sort_by，排序方式（可選）

此參數用於指定抓取評論結果的排序方法。

請求範例:

"sort_by": "DATE_DESC"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

review_page_turning，最多評論頁數（可選）

此參數用於指定要收集的最大評論頁數。

請求範例:

"review_page_turning": "1"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

如果您需要進一步的協助，請透過電子郵件聯繫 [email protected]。

PreviousIndeed 抓取參數 NextGithub 抓取參數

Last updated 3 months ago