# Yelp 抓取参数

Web Scraper API Yelp 抓取参数

\
使用 Thordata 的 Web Scraper API 配置 Yelp 抓取参数，包括搜索URL、商家URL、类别、位置和其他参数。

**唯一标识:**

<details>

<summary><code>token</code>，<strong>访问令牌（必填）</strong></summary>

此参数用作 API 访问令牌，以确保抓取的合法性。

**请求示例：**

`Authorization: Bearer ********************`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer ********************" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

**产品 - 抓取商家信息**

1\. Yelp - 通过搜索过滤器抓取商家信息

<details>

<summary><code>spider_id</code> ，<strong>抓取工具（必填）</strong></summary>

定义要使用的抓取工具。

**请求示例:**&#x20;

`spider_id=yelp_business-overview_by-search-filters`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary><code>category</code> ，<strong>类别（必填）</strong></summary>

此参数指定要抓取的类别。

**请求示例:**&#x20;

`"category": "cafe"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary><code>location</code> ，<strong>地点（必填）</strong></summary>

此参数指定要抓取的位置。

**请求示例:**

`"location": "new york"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary><code>business_page_turning</code>，<strong>最多商家列表页数（可选）</strong></summary>

此参数指定要收集的商家列表页数。

**请求示例:**

`"business_page_turning": "1"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"new york\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

2\. Yelp - 通过搜索网址抓取商业信息

<details>

<summary><code>spider_id</code> ，<strong>抓取工具（必填）</strong></summary>

定义要使用的抓取工具。

**请求示例:**

`spider_id=yelp_business-overview_by-search-url`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-url" ^
  -d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary><code>search_url</code> ，<strong>搜索网址（必填）</strong></summary>

此参数指定要抓取的搜索网址。

**请求示例:**

`"search_url": "https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-url" ^
  -d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary><code>business_page_turning</code>，<strong>最多商家列表页数（可选）</strong></summary>

此参数指定要收集的商家列表页数。

**请求示例:**

`"business_page_turning": "1"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-url" ^
  -d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

3\. Yelp - 通过商家URL抓取商业信息

<details>

<summary><code>spider_id</code> ，<strong>抓取工具（必填）</strong></summary>

定义要使用的抓取工具。

**请求示例:**

`spider_id=yelp_business-overview_by-business-url`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary><code>business_url</code> ，<strong>商家URL (必填)</strong></summary>

此参数指定要搜索的商家网址。

**请求示例:**

`"business_url": "https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
```

</details>

**产品 - 抓取商家评论信息:**

1\. Yelp - 通过搜索筛选器抓取商家评论信息

<details>

<summary><code>spider_id</code> ，<strong>抓取工具（必填）</strong></summary>

定义要使用的抓取工具。

**请求示例:**

`spider_id=yelp_business-reviews_by-search-filters`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary> <code>category</code>， <strong>类别（必填）</strong></summary>

此参数指定要抓取的业务类别。

**请求示例:**

`"category": "cafe"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary> <code>location</code>，<strong>地点（必填）</strong></summary>

此参数指定要抓取特定位置的商家。

**请求示例:**

`"location": "Stowe, VT"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary> <code>business_page_turning</code>，<strong>最多商家列表页数（可选）</strong></summary>

此参数指定要收集的商家列表页数。

**请求示例:**

`"business_page_turning": "1"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary> <code>review_page_turning</code>，<strong>最多评论页数（可选）</strong></summary>

此参数指定要抓取的最大评论页数。

**请求示例:**

`"max review pages": "1"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-search-filters" ^
  -d "spider_parameters=[{\"category\": \"cafe\",\"location\": \"Stowe, VT\",\"business_page_turning\": \"1\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

2\. Yelp - **通过搜索网址抓取商家评论信息**

<details>

<summary><code>spider_id</code> ，<strong>抓取工具（必填）</strong></summary>

定义要使用的抓取工具。

**请求示例:**

`spider_id=yelp_business-overview_by-search-url`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-url" ^
  -d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary><code>search_url</code>，<strong>搜索URL（必填）</strong></summary>

此参数指定要抓取的搜索 URL。

**请求示例:**

`"search_url": "https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-search-url" ^
  -d "spider_parameters=[{\"search_url\": \"https://www.yelp.com/search?find_desc=Cafes%26find_loc=Stowe%2C%2BVT\",\"business_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

3\. Yelp - **通过商家网址抓取商家评论信息**

<details>

<summary><code>spider_id</code> ，<strong>抓取工具（必填）</strong></summary>

定义要使用的抓取工具。

**请求示例:**

`spider_id=yelp_business-overview_by-business-url`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-overview_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary><code>business_url</code>，<strong>商家URL（必填）</strong></summary>

此参数指定要抓取的商家URL。

**请求示例:**

`"business_url": "https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary><code>unrecommended_reviews</code>，<strong>不推荐评论（必填）</strong></summary>

此参数用于指定是否抓取不推荐的评论。

**请求示例:**

`"unrecommended_reviews": "yes"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary><code>sort_by</code>，<strong>排序方式（可选）</strong></summary>

此参数用于指定抓取评论结果的排序方法。

**请求示例:**

`"sort_by": "DATE_DESC"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

<details>

<summary><code>review_page_turning</code>，<strong>最多评论页数（可选）</strong></summary>

此参数用于指定要收集的最大评论页数。

**请求示例:**

`"review_page_turning": "1"`

```sh
curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=yelp.com" ^
  -d "spider_id=yelp_business-reviews_by-business-url" ^
  -d "spider_parameters=[{\"business_url\": \"https://www.yelp.com/biz/the-round-hearth-caf%C3%A9-and-marketplace-stowe?osq=Sandwiches\",\"unrecommended_reviews\": \"yes\",\"sort_by\": \"DATE_DESC\",\"review_page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

```

</details>

​

如果您需要进一步的帮助，请通过电子邮件联系 <support@thordata.com>。
