Walmart 抓取参数

Web Scraper API Walmart 抓取参数

使用 Thordata 的 Web Scraper API 配置 Walmart 抓取参数,包括 URL 、类别 URL、所有变体、关键词、主域名、SKU 等参数。

唯一标识:

token 访问令牌(必填)

此参数用作 API 访问令牌,以确保抓取请求的合法性。

请求示例:

Authorization: Bearer ********************

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer ********************" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.walmart.com/ip/Starburst-Pink-Yellow-Mix-Chewy-Fruit-Candy-3-lbs-Original-Strawberry-Lemon-Soft-Chew-Candies-Bulk-Family-Size-Assortment-Pack-Individually-Wrapped-4/1016450755?adsRedirect=true\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

一、产品-抓取 Walmart 产品信息:

  1. Walmart - 通过 URL 抓取产品信息

spider_id所属抓取工具(必填)

它定义了要使用的抓取工具。

请求示例:

spider_id=walmart_product_by-url

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.walmart.com/ip/Starburst-Pink-Yellow-Mix-Chewy-Fruit-Candy-3-lbs-Original-Strawberry-Lemon-Soft-Chew-Candies-Bulk-Family-Size-Assortment-Pack-Individually-Wrapped-4/1016450755?adsRedirect=true\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
urlURL(必填)

该参数用于指定待抓取的 Walmart 产品 URL。

请求示例:

"url": "https://www.walmart.com/ip/Starburst-Pink-Yellow-Mix-Chewy-Fruit-Candy-3-lbs-Original-Strawberry-Lemon-Soft-Chew-Candies-Bulk-Family-Size-Assortment-Pack-Individually-Wrapped-4/1016450755?adsRedirect=true"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.walmart.com/ip/Starburst-Pink-Yellow-Mix-Chewy-Fruit-Candy-3-lbs-Original-Strawberry-Lemon-Soft-Chew-Candies-Bulk-Family-Size-Assortment-Pack-Individually-Wrapped-4/1016450755?adsRedirect=true\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
all_variations, 所有变体(可选)

该参数用于指定是否收集所有产品变量,设置为 true 为收集。 参数值:truefalse

请求示例:

"all_variations": "false"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.walmart.com/ip/Starburst-Pink-Yellow-Mix-Chewy-Fruit-Candy-3-lbs-Original-Strawberry-Lemon-Soft-Chew-Candies-Bulk-Family-Size-Assortment-Pack-Individually-Wrapped-4/1016450755?adsRedirect=true\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
  1. Walmart - 通过类别 URL 抓取产品信息

spider_id所属抓取工具(必填)

它定义了要使用的抓取工具。

请求示例:

spider_id=walmart_product_by-category-url

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-category-url" ^
  -d "spider_parameters=[{\"category_url\": \"https://www.walmart.com/shop/deals/food/beverages?povid=976759_itemcarousel_976782_Drinkupthesavings_Viewall_Rweb_May_08\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
category_url, 类别URL(必填)

该参数用于指定 Walmart 的特定类别网址来查找新产品。

请求示例:

"category_url": "https://www.walmart.com/shop/deals/food/beverages?povid=976759_itemcarousel_976782_Drinkupthesavings_Viewall_Rweb_May_08"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-category-url" ^
  -d "spider_parameters=[{\"category_url\": \"https://www.walmart.com/shop/deals/food/beverages?povid=976759_itemcarousel_976782_Drinkupthesavings_Viewall_Rweb_May_08\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
all_variations, 所有变体(可选)

该参数用于指定是否收集所有产品变量,设置为 true 为收集。 参数值:truefalse

请求示例:

"all_variations": "false"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-category-url" ^
  -d "spider_parameters=[{\"category_url\": \"https://www.walmart.com/shop/deals/food/beverages?povid=976759_itemcarousel_976782_Drinkupthesavings_Viewall_Rweb_May_08\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
  1. Walmart - 通过关键词抓取产品信息

spider_id所属抓取工具(必填)

它定义了要使用的抓取工具。

请求示例:

spider_id=walmart_product_by-keywords

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-keywords" ^
  -d "spider_parameters=[{\"keyword\": \"leggins\",\"domain\": \"https://www.walmart.com/\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
keyword, 关键词(必填)

该参数用于指定抓取 Walmart 产品的搜索关键词。

请求示例:

"keyword": "leggins"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-keywords" ^
  -d "spider_parameters=[{\"keyword\": \"leggins\",\"domain\": \"https://www.walmart.com/\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
domain, 主域名(可选)

该参数用于指定抓取 Walmart 产品信息的主域名。

请求示例:

"domain": "https://www.walmart.com/"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-keywords" ^
  -d "spider_parameters=[{\"keyword\": \"leggins\",\"domain\": \"https://www.walmart.com/\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
all_variations, 所有变体(可选)

该参数用于指定是否收集所有产品变量,设置为 true 为收集。 参数值:truefalse

请求示例:

"all_variations": "false"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-keywords" ^
  -d "spider_parameters=[{\"keyword\": \"leggins\",\"domain\": \"https://www.walmart.com/\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
  1. Walmart - 通过 SKU 抓取产品信息

spider_id所属抓取工具(必填)

它定义了要使用的抓取工具。

请求示例:

spider_id=walmart_product_by-sku

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-sku" ^
  -d "spider_parameters=[{\"sku\": \"439179861\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
sku, SKU(必填)

该参数用于指定待抓取的 SKU 产品唯一代码。

请求示例:

"sku": "439179861"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-sku" ^
  -d "spider_parameters=[{\"sku\": \"439179861\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
all_variations, 所有变体(可选)

该参数用于指定是否收集所有产品变量,设置为 true 为收集。 参数值:truefalse

请求示例:

"all_variations": "false"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=walmart.com" ^
  -d "spider_id=walmart_product_by-sku" ^
  -d "spider_parameters=[{\"sku\": \"439179861\",\"all_variations\": \"false\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

如果您需要进一步的帮助,请通过电子邮件联系 [email protected]

Last updated

Was this helpful?