Amazon 抓取参数
Web Scraper API Amazon 抓取参数
使用 Thordata 的 Web Scraper API 配置 Amazon 抓取参数,包括产品、Token、抓取URLs、卖家URLs、域名、关键词、排序方式、UPC、抓取最大页数、子类别等参数。
唯一标识:
token
,访问令牌(必填)
该参数用于确保爬取请求合法性的 API 访问令牌。
示例请求:
header "Authorization: Bearer ********************************"
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=3 --header "Authorization: Bearer ********************************" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/HISDERN-Checkered-Handkerchief-Classic-Necktie/dp/B0BRXPR726\"}],\"spider_id\":\"3\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
产品-抓取产品信息工具: 1. Amazon - 通过 URL 收集产品信息
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=3
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=3 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/HISDERN-Checkered-Handkerchief-Classic-Necktie/dp/B0BRXPR726\"}],\"spider_id\":\"3\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
url
,抓取目标链接(必填)
该参数用于指定待抓取的 Amazon 的访问 URL 地址。
示例请求:
[{"url":"https://www.amazon.com/HISDERN-Checkered-Handkerchief-Classic-Necktie/dp/B0BRXPR726"}]
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=3 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/HISDERN-Checkered-Handkerchief-Classic-Necktie/dp/B0BRXPR726\"}],\"spider_id\":\"3\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
Amazon - 通过畅销商品 URL 收集产品信息
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=366
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=366 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/Best-Sellers-Tools-Home-Improvement-Kitchen-Bath-Fixtures/zgbs/hi/3754161/ref=zg_bs_unv_hi_2_680350011_1\",\"collect_child_categories\":\"\"}],\"spider_id\":\"366\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
url
,抓取类别目标链接(必填)
该参数用于指定待抓取的 Amazon 的访问 URL 地址。
示例请求:
[{"url":"https://www.amazon.com/Best-Sellers-Tools-Home-Improvement-Kitchen-Bath-Fixtures/zgbs/hi/3754161/ref=zg_bs_unv_hi_2_680350011_1"}]
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=366 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/Best-Sellers-Tools-Home-Improvement-Kitchen-Bath-Fixtures/zgbs/hi/3754161/ref=zg_bs_unv_hi_2_680350011_1\",\"collect_child_categories\":\"\"}],\"spider_id\":\"366\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
collect_child_categories
,子类别(可选)
该参数用于指定在主类别下要采集的子类别商品范围。
示例请求:
"collect_child_categories":""
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=366 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/Best-Sellers-Tools-Home-Improvement-Kitchen-Bath-Fixtures/zgbs/hi/3754161/ref=zg_bs_unv_hi_2_680350011_1\",\"collect_child_categories\":\"\"}],\"spider_id\":\"366\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
Amazon - 通过类别 URL 收集产品数据
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=353
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=353 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/s?i=luggage-intl-ship\",\"sort_by\":\"Best Sellers\"}],\"spider_id\":\"353\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
url
,抓取类别目标链接(必填)
该参数用于指定待抓取的 Amazon 的访问 URL 地址。
示例请求:
"url":"https://www.amazon.com/s?i=luggage-intl-ship"
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=353 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/s?i=luggage-intl-ship\",\"sort_by\":\"Best Sellers\"}],\"spider_id\":\"353\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
sort_b
,排序方式(可选)
该参数用于配置类别页面抓取时的排序方式,支持按畅销品、最新到货、客户评论、价格或特色排序。参数值:
Best Sellers
Newest Arrivals
Average Rating
Price: High to Low
Price: Low to High
Featured Recommendations
示例请求:
"sort_by":"Best Sellers"
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=353 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/s?i=luggage-intl-ship\",\"sort_by\":\"Best Sellers\"}],\"spider_id\":\"353\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
Amazon - 通过关键词收集产品信息
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=352
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=352 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"keyword\":\"Apple Watch\"}],\"spider_id\":\"352\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
keyword
,关键词(必填)
该参数用于指定抓取 Amazon 产品时使用的搜索关键词。
示例请求:
[{"keyword":"Apple Watch"}]
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=352 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"keyword\":\"Apple Watch\"}],\"spider_id\":\"352\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
Amazon - 通过 UPC 收集产品信息
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=367
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=367 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"upc\":\"841710108224\"}],\"spider_id\":\"367\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
upc
,UPC(必填)
Amazon 产品所对应的UPC。需注意,每个 UPC 可能关联多个产品。
示例请求:
[{"upc":"841710108224"}]
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=367 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"upc\":\"841710108224\"}],\"spider_id\":\"367\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
产品-抓取产品评论工具 1. Amazon - 通过 URL 收集产品评论
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=368
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=368 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/HISDERN-Checkered-Handkerchief-Classic-Necktie/dp/B0BRXPR726\"}],\"spider_id\":\"368\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
url
,抓取目标链接(必填)
该参数用于指定待抓取的 Amazon 的访问 URL 地址。
示例请求:
[{"url":"https://www.amazon.com/HISDERN-Checkered-Handkerchief-Classic-Necktie/dp/B0BRXPR726"}]
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=368 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/HISDERN-Checkered-Handkerchief-Classic-Necktie/dp/B0BRXPR726\"}],\"spider_id\":\"368\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
产品-抓取全球产品信息工具 1. Amazon - 通过 URL 收集全球产品数据
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=377
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=377 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/dp/B0CHHSFMRL/\"}],\"spider_id\":\"377\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
url
,抓取目标链接(必填)
该参数用于指定待抓取的 Amazon 的访问 URL 地址。
示例请求:
[{"url":"https://www.amazon.com/dp/B0CHHSFMRL/"}]
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=377 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/dp/B0CHHSFMRL/\"}],\"spider_id\":\"377\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
2.Amazon - 通过品牌收集全球产品数据
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=376
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=376 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/s?k=football^&i=fashion^&rh=n%3A7141123011%2Cp_123%3A233083\"}],\"spider_id\":\"376\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
url
,抓取目标链接(必填)
该参数用于指定待抓取的 Amazon 的访问 URL 地址。
示例请求:
[{"url":"https://www.amazon.com/s?k=football^&i=fashion^&rh=n%3A7141123011%2Cp_123%3A233083"}]
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=376 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/s?k=football^&i=fashion^&rh=n%3A7141123011%2Cp_123%3A233083\"}],\"spider_id\":\"376\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
Amazon - 通过类别 URL 收集全球产品数据
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=375
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=375 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/s?i=luggage-intl-ship\",\"sort_by\":\"Best Sellers\",\"get_sponsored\":\"true\"}],\"spider_id\":\"375\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
sort_b
,排序方式(可选)
该参数用于配置类别页面抓取时的排序方式,支持按畅销品、最新到货、客户评论、价格或特色排序。 参数值:
Best Sellers
Newest Arrivals
Average Rating
Price: High to Low
Price: Low to High
Featured Recommendations
示例请求:
"sort_by":"Best Sellers"
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=375 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/s?i=luggage-intl-ship\",\"sort_by\":\"Best Sellers\",\"get_sponsored\":\"true\"}],\"spider_id\":\"375\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
get_sponsored
,获取赞助商品(可选)
该参数用于配置 勾选 “True” 可同时收集赞助商品,即使它们不属于该类别。
参数值:true
false
示例请求:
"get_sponsored":"true"
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=375 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/s?i=luggage-intl-ship\",\"sort_by\":\"Best Sellers\",\"get_sponsored\":\"true\"}],\"spider_id\":\"375\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
Amazon - 通过关键词收集全球产品数据
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=354
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=354 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"keyword\":\"Mitutoyo\",\"domain\":\"https://www.amazon.com\",\"page_turning\":\"1\"}],\"spider_id\":\"354\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
keyword
,关键词(必填)
该参数用于指定抓取 Amazon 产品时使用的搜索关键词。
示例请求:
"keyword":"Mitutoyo"
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=354 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"keyword\":\"Mitutoyo\",\"domain\":\"https://www.amazon.com\",\"page_turning\":\"1\"}],\"spider_id\":\"354\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
domain
,主域名(必填)
该参数用于指定卖家主页的 URL,以便抓取该卖家所售产品的数据。
示例请求:
"domain":"https://www.amazon.com"
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=354 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"keyword\":\"Mitutoyo\",\"domain\":\"https://www.amazon.com\",\"page_turning\":\"1\"}],\"spider_id\":\"354\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
page_turning
,回溯页数(可选)
该参数用于设置抓取搜索结果的页数。输入数字 n 表示抓取从第一页到第 n 页的所有产品数据。
示例请求:
"page_turning":"1"
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=354 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"keyword\":\"Mitutoyo\",\"domain\":\"https://www.amazon.com\",\"page_turning\":\"1\"}],\"spider_id\":\"354\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
Amazon - 通过卖家收集全球产品数据
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=374
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=374 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/stores/page/4750579C-0CF4-4BF9-B7E8-F782E52D7683?ingress=0^&visitId=63ba151f-9b4f-4938-b6a3-fd23c249b72f\"}],\"spider_id\":\"374\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
url
,抓取目标链接(必填)
该参数用于指定待抓取的 Amazon 的访问 URL 地址。
示例请求:
[{"url":"https://www.amazon.com/stores/page/4750579C-0CF4-4BF9-B7E8-F782E52D7683?ingress=0^&visitId=63ba151f-9b4f-4938-b6a3-fd23c249b72f"}]
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=374 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/stores/page/4750579C-0CF4-4BF9-B7E8-F782E52D7683?ingress=0^&visitId=63ba151f-9b4f-4938-b6a3-fd23c249b72f\"}],\"spider_id\":\"374\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
产品-抓取卖家信息工具 1.Amazon - 通过 URL 收集卖家信息
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=369
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=369 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/sp?ie=UTF8^&seller=ADZ7LD48GVFQJ^&asin=B07H56J7K1^&ref_=dp_merchant_link^&isAmazonFulfilled=1\"}],\"spider_id\":\"369\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
url
,抓取目标链接(必填)
该参数用于指定待抓取的 Amazon 的访问 URL 地址。
示例请求:
[{"url":"https://www.amazon.com/sp?ie=UTF8^&seller=ADZ7LD48GVFQJ^&asin=B07H56J7K1^&ref_=dp_merchant_link^&isAmazonFulfilled=1"}]
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=369 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"url\":\"https://www.amazon.com/sp?ie=UTF8^&seller=ADZ7LD48GVFQJ^&asin=B07H56J7K1^&ref_=dp_merchant_link^&isAmazonFulfilled=1\"}],\"spider_id\":\"369\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
产品-抓取产品列表 1.Amazon - 通过关键词收集产品列表
prodect_id
,所属抓取工具(必填)
定义了使用哪个抓取工具。
示例请求:
prodect_id=370
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=370 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"keyword\":\"Dry Dog Food\",\"domain\":\"https://www.amazon.com/\",\"page_turning\":\"1\"}],\"spider_id\":\"370\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
keyword
,关键词(必填)
该参数用于指定抓取 Amazon 产品时使用的搜索关键词。
示例请求:
"keyword":"Dry Dog Food"
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=370 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"keyword\":\"Dry Dog Food\",\"domain\":\"https://www.amazon.com/\",\"page_turning\":\"1\"}],\"spider_id\":\"370\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
domain
,主域名(必填)
该参数用于指定卖家主页的 URL,以便抓取该卖家所售产品的数据。
示例请求:
"domain":"https://www.amazon.com/"
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=370 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"keyword\":\"Dry Dog Food\",\"domain\":\"https://www.amazon.com/\",\"page_turning\":\"1\"}],\"spider_id\":\"370\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
page_turning
,回溯页数(可选)
该参数用于设置抓取搜索结果的页数。输入数字 n 表示抓取从第一页到第 n 页的所有产品数据。
示例请求:
"page_turning":"1"
curl --request POST --url https://scraperapi.thordata.com/builder?prodect_id=370 --header "Authorization: Bearer Token-ID" --header "content-type: application/json" --data "{\"spider_info\":{\"spider_parameters\":[{\"keyword\":\"Dry Dog Food\",\"domain\":\"https://www.amazon.com/\",\"page_turning\":\"1\"}],\"spider_id\":\"370\",\"spider_errors\":true},\"spider_name\":\"amazon.com\"}"
如果您需要更多帮助,请通过邮箱support@thordata.com联系我们。
Last updated
Was this helpful?