Indeed Scraping Parameters

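Configure Indeed scraping parameters with Thordata's Web Scraper API, including the company URL, job URL, job listing URL, keyword, and region.

Unique identifier:

token, access token (required)

This parameter is the API access token. It authorizes the scraping request.

Request example: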

Authorization: Bearer ********************

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer ********************" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-keyword" ^
  -d "spider_parameters=[{\"country\": \"BR\",\"domain\": \"indeed.com\",\"keyword\": \"analyst\",\"location\": \"New York, NY\",\"date_posted\": \"The past 24 hours\",\"posted_by\": \"Employer\",\"pay\": \"50\",\"location_radius\": \"Exact location only\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
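
The curl examples on this page use `^` line continuations, which work in the Windows command prompt. As a cross-platform alternative, the same request can be assembled in Python with only the standard library. This is a minimal sketch: the helper name `build_request` and the placeholder token are illustrative assumptions, not part of the Thordata API, while the endpoint, header names, and form fields are copied from the curl example above. The request is only constructed here, not sent.

```python
import json
import urllib.parse
import urllib.request

API_URL = "https://scraperapi.thordata.com/builder"  # endpoint from the curl example


def build_request(token, spider_id, spider_parameters, file_name="{{TasksID}}"):
    """Assemble (but do not send) the POST request shown in the curl example.

    build_request is a hypothetical helper name used for illustration only.
    """
    form = {
        "spider_name": "indeed.com",
        "spider_id": spider_id,
        # spider_parameters is a JSON array serialized into a single form field
        "spider_parameters": json.dumps(spider_parameters),
        "spider_errors": "true",
        "file_name": file_name,
    }
    body = urllib.parse.urlencode(form).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/x-www-form-urlencoded",
        },
    )


request = build_request(
    token="YOUR_TOKEN",  # placeholder; substitute your own access token
    spider_id="indeed_job-listings_by-keyword",
    spider_parameters=[{"country": "BR", "domain": "indeed.com",
                        "keyword": "analyst", "location": "New York, NY"}],
)
# To actually submit the task: urllib.request.urlopen(request)
```

Building the body with `urllib.parse.urlencode` avoids the manual quote escaping that the curl form requires.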

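I. Product - Scrape company information:

1. Indeed - Scrape company information by company URL

spider_id, scraping tool (required)

Defines which scraping tool to use.

Request example: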

spider_id=indeed_companies-info_by-company-url

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_companies-info_by-company-url" ^
  -d "spider_parameters=[{\"company_url\": \"https://www.indeed.com/cmp/Allstate-Insurance\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

company_url, company URL (required)

This parameter specifies the URL of the company to scrape.

Request example:

"company_url": "https://www.indeed.com/cmp/Allstate-Insurance"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_companies-info_by-company-url" ^
  -d "spider_parameters=[{\"company_url\": \"https://www.indeed.com/cmp/Allstate-Insurance\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

2. Indeed - Scrape company information by company list URL

spider_id, scraping tool (required)

Defines which scraping tool to use.

Request example:

spider_id=indeed_companies-info_by-company-list-url

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_companies-info_by-company-list-url" ^
  -d "spider_parameters=[{\"company_list_url\": \"https://www.indeed.com/companies/browse-companies\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

company_list_url, company list URL (required)

This parameter specifies the URL of the company list to scrape.

Request example:

"company_list_url": "https://www.indeed.com/companies/browse-companies"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_companies-info_by-company-list-url" ^
  -d "spider_parameters=[{\"company_list_url\": \"https://www.indeed.com/companies/browse-companies\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

3. Indeed - Scrape company information by keyword

spider_id, scraping tool (required)

Defines which scraping tool to use.

Request example:

spider_id=indeed_companies-info_by-keyword

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_companies-info_by-keyword" ^
  -d "spider_parameters=[{\"keyword\": \"openai\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

keyword, keyword (required)

This parameter specifies the search keyword.

Request example:

"keyword": "openai"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_companies-info_by-keyword" ^
  -d "spider_parameters=[{\"keyword\": \"openai\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

4. Indeed - Scrape company information by industry and region

spider_id, scraping tool (required)

Defines which scraping tool to use.

Request example:

spider_id=indeed_companies-info_by-industry-and-state

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_companies-info_by-industry-and-state" ^
  -d "spider_parameters=[{\"industry\": \"Advertising %26 Public Relations\",\"state\": \"United States\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

industry, industry (required)

This parameter specifies the industry the company belongs to.

Request example:

"industry": "Advertising %26 Public Relations"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_companies-info_by-industry-and-state" ^
  -d "spider_parameters=[{\"industry\": \"Advertising %26 Public Relations\",\"state\": \"United States\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

state, region (optional)

This parameter specifies the region where the company is located.

Request example:

"state": "United States"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_companies-info_by-industry-and-state" ^
  -d "spider_parameters=[{\"industry\": \"Advertising %26 Public Relations\",\"state\": \"United States\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
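
The industry value in these examples is written as `Advertising %26 Public Relations` because a raw `&` would be read as a field separator in an `application/x-www-form-urlencoded` body. The sketch below, using only Python's standard library, shows a form encoder producing that escaping automatically, so the value can be written with a plain `&`. The variable names are illustrative.

```python
import json
import urllib.parse

# Write the industry with a plain "&"; the encoder escapes it for the form body.
params = [{"industry": "Advertising & Public Relations", "state": "United States"}]

body = urllib.parse.urlencode({
    "spider_id": "indeed_companies-info_by-industry-and-state",
    "spider_parameters": json.dumps(params),
})

# The ampersand inside the JSON value is percent-encoded as %26,
# so it cannot be mistaken for the "&" that separates form fields.
decoded = urllib.parse.parse_qs(body)
round_trip = json.loads(decoded["spider_parameters"][0])
print(round_trip[0]["industry"])
```

Decoding the body recovers the original value, which is what the receiving server would be expected to see.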

II. Product - Scrape job information:

1. Indeed - Scrape job information by job URL

spider_id, scraping tool (required)

Defines which scraping tool to use.

Request example:

spider_id=indeed_job-listings_by-job-url

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-job-url" ^
  -d "spider_parameters=[{\"job_url\": \"https://fr.indeed.com/viewjob?jk=55b3e5dfa0c2ff66\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

job_url, job URL (required)

This parameter specifies the URL of the job to scrape.

Request example:

"job_url": "https://fr.indeed.com/viewjob?jk=55b3e5dfa0c2ff66"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-job-url" ^
  -d "spider_parameters=[{\"job_url\": \"https://fr.indeed.com/viewjob?jk=55b3e5dfa0c2ff66\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

2. Indeed - Scrape job information by keyword

spider_id, scraping tool (required)

Defines which scraping tool to use.

Request example:

spider_id=indeed_job-listings_by-keyword

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-keyword" ^
  -d "spider_parameters=[{\"country\": \"BR\",\"domain\": \"indeed.com\",\"keyword\": \"analyst\",\"location\": \"New York, NY\",\"date_posted\": \"The past 24 hours\",\"posted_by\": \"Employer\",\"pay\": \"50\",\"location_radius\": \"Exact location only\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

keyword, keyword (required)

This parameter specifies the search keyword to scrape.

Request example:

"keyword": "sale"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-keyword" ^
  -d "spider_parameters=[{\"country\": \"BR\",\"domain\": \"fr.indeed.com\",\"keyword\": \"sale\",\"location\": \"Tokyo\",\"date_posted\": \"The past 24 hours\",\"posted_by\": \"Employer\",\"pay\": \"50\",\"location_radius\": \"Exact location only\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

country, country (required)

This parameter specifies the country of the jobs to scrape.

Request example:

"country": "BR"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-keyword" ^
  -d "spider_parameters=[{\"country\": \"BR\",\"domain\": \"fr.indeed.com\",\"keyword\": \"sale\",\"location\": \"Tokyo\",\"date_posted\": \"The past 24 hours\",\"posted_by\": \"Employer\",\"pay\": \"50\",\"location_radius\": \"Exact location only\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

domain, domain (required)

This parameter specifies the Indeed domain to scrape.

Request example:

"domain": "fr.indeed.com"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-keyword" ^
  -d "spider_parameters=[{\"country\": \"BR\",\"domain\": \"fr.indeed.com\",\"keyword\": \"sale\",\"location\": \"Tokyo\",\"date_posted\": \"The past 24 hours\",\"posted_by\": \"Employer\",\"pay\": \"50\",\"location_radius\": \"Exact location only\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

location, location (required)

This parameter restricts the scrape to jobs in a specific location.

Request example:

"location": "Tokyo"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-keyword" ^
  -d "spider_parameters=[{\"country\": \"BR\",\"domain\": \"fr.indeed.com\",\"keyword\": \"sale\",\"location\": \"Tokyo\",\"date_posted\": \"The past 24 hours\",\"posted_by\": \"Employer\",\"pay\": \"50\",\"location_radius\": \"Exact location only\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

date_posted, date posted (optional)

This parameter filters jobs by posting date.

Request example:

"date_posted": "The past 24 hours"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-keyword" ^
  -d "spider_parameters=[{\"country\": \"BR\",\"domain\": \"fr.indeed.com\",\"keyword\": \"sale\",\"location\": \"Tokyo\",\"date_posted\": \"The past 24 hours\",\"posted_by\": \"Employer\",\"pay\": \"50\",\"location_radius\": \"Exact location only\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

posted_by, posted by (optional)

This parameter filters jobs by who posted them.

Request example:

"posted_by": "Employer"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-keyword" ^
  -d "spider_parameters=[{\"country\": \"BR\",\"domain\": \"fr.indeed.com\",\"keyword\": \"sale\",\"location\": \"Tokyo\",\"date_posted\": \"The past 24 hours\",\"posted_by\": \"Employer\",\"pay\": \"50\",\"location_radius\": \"Exact location only\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

pay, pay (optional)

This parameter filters jobs by pay.

Request example:

"pay": "50"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-keyword" ^
  -d "spider_parameters=[{\"country\": \"BR\",\"domain\": \"fr.indeed.com\",\"keyword\": \"sale\",\"location\": \"Tokyo\",\"date_posted\": \"The past 24 hours\",\"posted_by\": \"Employer\",\"pay\": \"50\",\"location_radius\": \"Exact location only\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

location_radius, location radius (optional)

This parameter restricts the scrape to jobs within a given radius of the location.

Request example:

"location_radius": "Exact location only"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=indeed.com" ^
  -d "spider_id=indeed_job-listings_by-keyword" ^
  -d "spider_parameters=[{\"country\": \"BR\",\"domain\": \"fr.indeed.com\",\"keyword\": \"sale\",\"location\": \"Tokyo\",\"date_posted\": \"The past 24 hours\",\"posted_by\": \"Employer\",\"pay\": \"50\",\"location_radius\": \"Exact location only\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
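
To summarize the parameters documented on this page, the following sketch collects each spider_id with its required and optional fields and checks a parameter object before submission. The table is transcribed from the sections above. The names `SPIDERS` and `check_parameters` are hypothetical helpers for illustration, and rejecting undocumented fields is a choice made in this sketch; the real API may apply rules that this page does not describe.

```python
# Required and optional fields per spider_id, as documented on this page.
SPIDERS = {
    "indeed_companies-info_by-company-url": ({"company_url"}, set()),
    "indeed_companies-info_by-company-list-url": ({"company_list_url"}, set()),
    "indeed_companies-info_by-keyword": ({"keyword"}, set()),
    "indeed_companies-info_by-industry-and-state": ({"industry"}, {"state"}),
    "indeed_job-listings_by-job-url": ({"job_url"}, set()),
    "indeed_job-listings_by-keyword": (
        {"keyword", "country", "domain", "location"},
        {"date_posted", "posted_by", "pay", "location_radius"},
    ),
}


def check_parameters(spider_id, parameters):
    """Raise ValueError if required fields are missing or unknown fields appear."""
    required, optional = SPIDERS[spider_id]
    given = set(parameters)
    missing = required - given
    unknown = given - required - optional
    if missing:
        raise ValueError(f"missing required fields: {sorted(missing)}")
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    return parameters


# Optional fields may be left out; here only "pay" is supplied.
checked = check_parameters(
    "indeed_job-listings_by-keyword",
    {"country": "BR", "domain": "fr.indeed.com",
     "keyword": "sale", "location": "Tokyo", "pay": "50"},
)
```

Running a check like this locally catches a missing required field before a task is submitted.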

If you need further help, contact us by email at [email protected].

Last updated 3 months ago