LinkedIn 抓取参数
Web Scraper API LinkedIn 抓取参数
使用 Thordata 的 Web Scraper API 配置 LinkedIn 抓取参数,包括URL、职位URL,职位列表URL,关键词等参数。
唯一标识:
token ,访问令牌(必填)
此参数用作 API 访问令牌,以确保抓取的合法性。
请求示例:
Authorization: Bearer ********************
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer ********************" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-job-listing-url" ^
-d "spider_parameters=[{\"job_listing_url\": \"https://www.linkedin.com/jobs/reddit-inc.-jobs-worldwide?f_C=150573\",\"page_turning\": \"\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"一、产品-抓取公司信息:
1. LinkedIn -通过公司URL抓取公司信息
spider_id ,所属抓取工具 (必填)
它定义了要使用的抓取工具。
请求示例:
spider_id=linkedin_company_information_by-url
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_company_information_by-url" ^
-d "spider_parameters=[{\"url\": \"https://www.linkedin.com/company/dynamo-software\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
url ,URL (必填)
该参数用于指定抓取的公司URL。
请求示例:
"url": "https://www.linkedin.com/company/dynamo-software"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_company_information_by-url" ^
-d "spider_parameters=[{\"url\": \"https://www.linkedin.com/company/dynamo-software\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"二、产品-抓取职位信息:
1. LinkedIn -通过职位列表URL抓取职位信息
spider_id ,所属抓取工具 (必填)
它定义了要使用的抓取工具。
请求示例:
spider_id=linkedin_job_listings_information_by-job-listing-url
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-job-listing-url" ^
-d "spider_parameters=[{\"job_listing_url\": \"https://www.linkedin.com/jobs/reddit-inc.-jobs-worldwide?f_C=150573\",\"page_turning\": \"\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"job_listing_url, 职位列表URL(必填)
此参数指定要抓取的职位列表 URL。
请求示例:
"job_listing_url": "https://www.linkedin.com/jobs/reddit-inc.-jobs-worldwide?f_C=150573"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-job-listing-url" ^
-d "spider_parameters=[{\"job_listing_url\": \"https://www.linkedin.com/jobs/reddit-inc.-jobs-worldwide?f_C=150573\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"page_turning,翻页(可选)
此参数指定抓取时的翻页数量。
请求示例:
"page_turning": "1"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-job-listing-url" ^
-d "spider_parameters=[{\"job_listing_url\": \"https://www.linkedin.com/jobs/reddit-inc.-jobs-worldwide?f_C=150573\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
2. LinkedIn - 通过职位URL抓取职位信息
spider_id ,所属抓取工具 (必填)
它定义了要使用的抓取工具。
请求示例:
spider_id=linkedin_job_listings_information_by-job-listing-url
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-job-url" ^
-d "spider_parameters=[{\"job_url\": \"https://www.linkedin.com/jobs/view/senior-client-partner-large-customer-sales-gaming-at-reddit-inc-4303761033?position=10%26pageNum=0%26refId=kHRQtl6Ws14VG9y3UloI4w%3D%3D%26trackingId=%2Fb2esqHHEjp1FoEkC8PfuQ%3D%3D\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
job_url,职位URL (required)
此参数指定要抓取的职位URL。
请求示例:
"job_url": "https://www.linkedin.com/jobs/view/senior-client-partner-large-customer-sales-gaming-at-reddit-inc-4303761033?position=10&pageNum=0&refId=kHRQtl6Ws14VG9y3UloI4w%3D%3D&trackingId=%2Fb2esqHHEjp1FoEkC8PfuQ%3D%3D"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-job-url" ^
-d "spider_parameters=[{\"job_url\": \"https://www.linkedin.com/jobs/view/senior-client-partner-large-customer-sales-gaming-at-reddit-inc-4303761033?position=10%26pageNum=0%26refId=kHRQtl6Ws14VG9y3UloI4w%3D%3D%26trackingId=%2Fb2esqHHEjp1FoEkC8PfuQ%3D%3D\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"3. LinkedIn -通过关键词抓取职位信息
spider_id ,所属抓取工具 (必填)
它定义了要使用的抓取工具。
请求示例:
spider_id=linkedin_job_listings_information_by-keyword
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"\",\"experience_level\": \"\",\"job_type\": \"\",\"remote\": \"\",\"company\": \"\",\"selective_search\": \"\",\"jobs_to_not_include\": \"\",\"location_radius\": \"\",\"page_turning\": \"\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"location,职位地点(必填)
此参数指定抓取的职位的位置。
请求示例:
"location": "New York"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"Any_time\",\"experience_level\": \"Internship\",\"job_type\": \"Full_time\",\"remote\": \"On_site\",\"company\": \"Reddit, Inc.\",\"selective_search\": \"true\",\"jobs_to_not_include\": \"4321229455\",\"location_radius\": \"Exact_location\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"keyword,关键词(可选)
此参数指定要抓取的关键词。
请求示例:
"Keyword": "product manager"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"Any_time\",\"experience_level\": \"Internship\",\"job_type\": \"Full_time\",\"remote\": \"On_site\",\"company\": \"Reddit, Inc.\",\"selective_search\": \"true\",\"jobs_to_not_include\": \"4321229455\",\"location_radius\": \"Exact_location\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"time_range,时间范围(可选)
此参数指定抓取的时间范围。
请求示例:
"time_range": "Any_time"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"Any_time\",\"experience_level\": \"Internship\",\"job_type\": \"Full_time\",\"remote\": \"On_site\",\"company\": \"Reddit, Inc.\",\"selective_search\": \"true\",\"jobs_to_not_include\": \"4321229455\",\"location_radius\": \"Exact_location\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"experience_level,经验级别(可选)
此参数用于指定要抓取对经验级别有特定要求的职位。
请求示例:
"experience_level": "Internship"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"Any_time\",\"experience_level\": \"Internship\",\"job_type\": \"Full_time\",\"remote\": \"On_site\",\"company\": \"Reddit, Inc.\",\"selective_search\": \"true\",\"jobs_to_not_include\": \"4321229455\",\"location_radius\": \"Exact_location\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"job_type,职位类型(可选)
此参数指定爬取的时间范围。
请求示例:
"job_type": "Full_time"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"Any_time\",\"experience_level\": \"Internship\",\"job_type\": \"Full_time\",\"remote\": \"On_site\",\"company\": \"Reddit, Inc.\",\"selective_search\": \"true\",\"jobs_to_not_include\": \"4321229455\",\"location_radius\": \"Exact_location\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"remote,远程工作(可选)
该参数用于指定要抓取的职位的远程选项。
请求示例:
"remote": "On_site"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"Any_time\",\"experience_level\": \"Internship\",\"job_type\": \"Full_time\",\"remote\": \"On_site\",\"company\": \"Reddit, Inc.\",\"selective_search\": \"true\",\"jobs_to_not_include\": \"4321229455\",\"location_radius\": \"Exact_location\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"company,公司(可选)
该参数用于指定要抓取的职位所属的特定公司。
请求示例:
"company": "Reddit, Inc."
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"Any_time\",\"experience_level\": \"Internship\",\"job_type\": \"Full_time\",\"remote\": \"On_site\",\"company\": \"Reddit, Inc.\",\"selective_search\": \"true\",\"jobs_to_not_include\": \"4321229455\",\"location_radius\": \"Exact_location\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"selective_search,选择性搜索(可选)
当该参数设置为“true”时,将排除不包含特定关键词的职位。
请求示例:
"selective_search": "true"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"Any_time\",\"experience_level\": \"Internship\",\"job_type\": \"Full_time\",\"remote\": \"On_site\",\"company\": \"Reddit, Inc.\",\"selective_search\": \"true\",\"jobs_to_not_include\": \"4321229455\",\"location_radius\": \"Exact_location\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"jobs_to_not_include,不包括的职位(可选)
该参数用于指定不抓取的职位id。
请求示例:
"jobs_to_not_include": "4321229455"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"Any_time\",\"experience_level\": \"Internship\",\"job_type\": \"Full_time\",\"remote\": \"On_site\",\"company\": \"Reddit, Inc.\",\"selective_search\": \"true\",\"jobs_to_not_include\": \"4321229455\",\"location_radius\": \"Exact_location\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"location_radius,位置半径(可选)
该参数用于指定要抓取的职位的距离。
请求示例:
"location_radius": "Exact_location"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"Any_time\",\"experience_level\": \"Internship\",\"job_type\": \"Full_time\",\"remote\": \"On_site\",\"company\": \"Reddit, Inc.\",\"selective_search\": \"true\",\"jobs_to_not_include\": \"4321229455\",\"location_radius\": \"Exact_location\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"page_turning,翻页(可选)
该参数用于指定翻页的数量。
请求示例:
"page_turning": "1"
curl -X POST "https://scraperapi.thordata.com/builder" ^
-H "Authorization: Bearer Token-ID" ^
-H "Content-Type: application/x-www-form-urlencoded" ^
-d "spider_name=linkedin.com" ^
-d "spider_id=linkedin_job_listings_information_by-keyword" ^
-d "spider_parameters=[{\"location\": \"New York\",\"keyword\": \"product manager\",\"time_range\": \"Any_time\",\"experience_level\": \"Internship\",\"job_type\": \"Full_time\",\"remote\": \"On_site\",\"company\": \"Reddit, Inc.\",\"selective_search\": \"true\",\"jobs_to_not_include\": \"4321229455\",\"location_radius\": \"Exact_location\",\"page_turning\": \"1\"}]" ^
-d "spider_errors=true" ^
-d "file_name={{TasksID}}"
如果您需要进一步的帮助,请通过电子邮件联系 [email protected]。
Last updated
Was this helpful?