YouTube Scraping Parameters

Web Scraper API YouTube scraping parameters

Configure YouTube scraping parameters using Thordata's Web Scraper API, including parameters such as products, Token, scraping URLs, resolution, audio format, subtitles, and subtitle language.

Unique identifiers:

token ,Access Token (Required)

This parameter is used as an API access token to ensure the legitimacy of crawling requests.

Request Example: "Authorization: Bearer ********************************"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer ********************" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=youtube.com" ^
  -d "spider_id=youtube_video_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/watch?v=8RePenzQH80\"}]" ^
  -d "spider_universal={\"resolution\": \"2160p\",\"is_subtitles\": \"true\",\"subtitles_language\": \"en\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

I. Product - Video Downloader

  1. YouTube - Grab video files via URL

spider_id ,Associated Scraping Tool (Required)

It defines the crawler to use.

Request Example: spider_id=youtube_video_by-url

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=youtube.com" ^
  -d "spider_id=youtube_video_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/watch?v=8RePenzQH80\"}]" ^
  -d "spider_universal={\"resolution\": \"2160p\",\"is_subtitles\": \"true\",\"subtitles_language\": \"en\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
resolution ,Resolution (Optional)

This parameter is used to specify the target resolution for crawling the video. If the selected resolution is unavailable or not precisely matched, the system will automatically downgrade to the available option with lower quality. Parameter Values: 360p 480p 720p 1080p 1440p 2160p

Request Example: "resolution": "2160p"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=youtube.com" ^
  -d "spider_id=youtube_video_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/watch?v=8RePenzQH80\"}]" ^
  -d "spider_universal={\"resolution\": \"2160p\",\"is_subtitles\": \"true\",\"subtitles_language\": \"en\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
is_subtitles ,Subtitles (Required)

This parameter controls whether to download subtitle files simultaneously with YouTube audio. Subtitles are in .vvt format. Note that during the download process, the audio and subtitles will be saved as two separate files. Parameter value:true false

Request Example: "is_subtitles":"true"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=youtube.com" ^
  -d "spider_id=youtube_video_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/watch?v=8RePenzQH80\"}]" ^
  -d "spider_universal={\"resolution\": \"2160p\",\"is_subtitles\": \"true\",\"subtitles_language\": \"en\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
subtitles_language ,Subtitles Language (Optional)

This parameter defines the language of the subtitles to be downloaded when crawling YouTube. It should be noted that if the selected language is unavailable, the system will return English subtitles by default.

Abkhazian ab, Afar aa, Afrikaans af, Akan ak, Albanian sq, Amharic am, Arabic ar, Armenian hy, Assamese as, Aymara ay, Azerbaijani az, Bengali bn, Bashkir ba, Basque eu, Belarusian be, Bhojpuri bho, Bosnian bs, Breton br, Bulgarian bg, Burmese my, Catalan ca, Cebuano ceb, Simplified Chinese zh-Hans, Traditional Chinese zh-Hant, Corsican co, Croatian hr, Czech cs, Danish da, Divehi dv, Dutch nl, Dzongkha (Bhutan) dz, English (original) en-orig, English en, Esperanto eo, Estonian et, Ewe ee, Faroese fo, Fijian fj, Filipino fil, Finnish fi, French fr, Ga gaa, Galician gl, Ganda lg, Georgian ka, German de, Greek el, Guarani gn, Gujarati gu, Haitian Creole ht, Hausa ha, Hawaiian haw, Hebrew iw, Hindi hi, Hmong hmn, Hungarian hu, Icelandic is, Igbo ig, Indonesian id, Inuktitut iu, Irish ga, Italian it, Japanese ja, Javanese jv, Greenlandic kl, Kannada kn, Kazakh kk, Khasi kha, Khmer km, Kinyarwanda rw, Korean ko, Krio kri, Kurdish ku, Kyrgyz ky, Lao lo, Latin la, Latvian lv, Lingala ln, Lithuanian lt, Luba-Katanga lua, Luo luo, Luxembourgish lb, Macedonian mk, Malagasy mg, Malay ms, Malayalam ml, Maltese mt, Manx gv, Maori mi, Marathi mr, Mongolian mn, Mauritian Creole mfe, Nepali ne, Newari new, Northern Sotho nso, Norwegian no, Nyanja ny, Occitan oc, Oriya or, Oromo om, Ossetic os, Pampanga pam, Pashto ps, Persian fa, Polish pl, Portuguese (Brazil) pt, Portuguese (Portugal) pt-PT, Punjabi pa, Quechua qu, Romanian ro, Rundi rn, Russian ru, Samoan sm, Sango sg, Sanskrit sa, Scottish Gaelic gd, Serbian sr, Seychelles Creole French crs, Shona sn, Sindhi sd, Sinhala si, Slovak sk, Slovenian sl, Somali so, Southern Sotho st, Spanish es, Sundanese su, Swahili sw

Request Example: "subtitles_language":"en"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=youtube.com" ^
  -d "spider_id=youtube_video_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/watch?v=8RePenzQH80\"}]" ^
  -d "spider_universal={\"resolution\": \"2160p\",\"is_subtitles\": \"true\",\"subtitles_language\": \"en\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

II. Product - Audio Downloader

  1. YouTube - Grab audio and subtitle files via URL

spider_id ,Associated Scraping Tool (Required)

It defines the crawler to use.

Request Example: spider_id=youtube_audio_by-url

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=youtube.com" ^
  -d "spider_id=youtube_audio_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/watch?v=8RePenzQH80\"}]" ^
  -d "spider_universal={\"audio_format\": \"mp3\",\"bitrate\": \"320\",\"is_subtitles\": \"true\",\"subtitles_language\": \"en\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
audio_format ,Audio Format (Required)

This parameter is used to specify the output format of the audio file to be crawled. The default format is .opus, and it can be switched to .mp3 format. Parameter Values:opus mp3

Request Example: "audio_format":"mp3"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=youtube.com" ^
  -d "spider_id=youtube_audio_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/watch?v=8RePenzQH80\"}]" ^
  -d "spider_universal={\"audio_format\": \"mp3\",\"bitrate\": \"320\",\"is_subtitles\": \"true\",\"subtitles_language\": \"en\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
bitrateBit rate (optional)

This parameter sets the bitrate (bitrate) of the audio file, affecting both sound quality and file size. Recommended bitrate values ​​include: 48 Kbps, 64 Kbps, 128 Kbps, 160 Kbps, 256 Kbps, and 320 Kbps. Please note that the captured audio file may not support the selected bitrate, and the system will default to backward compatibility. If none of the selected bitrates are available, the system will automatically download a supported bitrate. Parameter Value: 48Kbps 64Kbps 128Kbps 160Kbps 256Kbps 320Kbps

Request Example:

"bitrate": "320"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=youtube.com" ^
  -d "spider_id=youtube_audio_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/watch?v=8RePenzQH80\"}]" ^
  -d "spider_universal={\"audio_format\": \"mp3\",\"bitrate\": \"320\",\"is_subtitles\": \"true\",\"subtitles_language\": \"en\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
is_subtitles ,Subtitles (Required)

This parameter is used to control whether to synchronously download subtitle files when crawling YouTube audio. The subtitle format is .srt. It should be noted that during the download, the audio and subtitles will be saved as two separate files. Parameter Values:true false

Request Example: "is_subtitles":"true"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=youtube.com" ^
  -d "spider_id=youtube_audio_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/watch?v=8RePenzQH80\"}]" ^
  -d "spider_universal={\"audio_format\": \"mp3\",\"bitrate\": \"320\",\"is_subtitles\": \"true\",\"subtitles_language\": \"en\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
subtitles_language ,Subtitles Language (Optional)

This parameter defines the language of the subtitles to be downloaded when crawling YouTube. It should be noted that if the selected language is unavailable, the system will return English subtitles by default.

Abkhazian ab, Afar aa, Afrikaans af, Akan ak, Albanian sq, Amharic am, Arabic ar, Armenian hy, Assamese as, Aymara ay, Azerbaijani az, Bengali bn, Bashkir ba, Basque eu, Belarusian be, Bhojpuri bho, Bosnian bs, Breton br, Bulgarian bg, Burmese my, Catalan ca, Cebuano ceb, Simplified Chinese zh-Hans, Traditional Chinese zh-Hant, Corsican co, Croatian hr, Czech cs, Danish da, Divehi dv, Dutch nl, Dzongkha (Bhutan) dz, English (original) en-orig, English en, Esperanto eo, Estonian et, Ewe ee, Faroese fo, Fijian fj, Filipino fil, Finnish fi, French fr, Ga gaa, Galician gl, Ganda lg, Georgian ka, German de, Greek el, Guarani gn, Gujarati gu, Haitian Creole ht, Hausa ha, Hawaiian haw, Hebrew iw, Hindi hi, Hmong hmn, Hungarian hu, Icelandic is, Igbo ig, Indonesian id, Inuktitut iu, Irish ga, Italian it, Japanese ja, Javanese jv, Greenlandic kl, Kannada kn, Kazakh kk, Khasi kha, Khmer km, Kinyarwanda rw, Korean ko, Krio kri, Kurdish ku, Kyrgyz ky, Lao lo, Latin la, Latvian lv, Lingala ln, Lithuanian lt, Luba-Katanga lua, Luo luo, Luxembourgish lb, Macedonian mk, Malagasy mg, Malay ms, Malayalam ml, Maltese mt, Manx gv, Maori mi, Marathi mr, Mongolian mn, Mauritian Creole mfe, Nepali ne, Newari new, Northern Sotho nso, Norwegian no, Nyanja ny, Occitan oc, Oriya or, Oromo om, Ossetic os, Pampanga pam, Pashto ps, Persian fa, Polish pl, Portuguese (Brazil) pt, Portuguese (Portugal) pt-PT, Punjabi pa, Quechua qu, Romanian ro, Rundi rn, Russian ru, Samoan sm, Sango sg, Sanskrit sa, Scottish Gaelic gd, Serbian sr, Seychelles Creole French crs, Shona sn, Sindhi sd, Sinhala si, Slovak sk, Slovenian sl, Somali so, Southern Sotho st, Spanish es, Sundanese su, Swahili sw

Request Example: "subtitles_language":"en"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=youtube.com" ^
  -d "spider_id=youtube_audio_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/watch?v=8RePenzQH80\"}]" ^
  -d "spider_universal={\"audio_format\": \"mp3\",\"bitrate\": \"320\",\"is_subtitles\": \"true\",\"subtitles_language\": \"en\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

III. Product - Basic Information Downloader

  1. YouTube - Capture basic video information using a unique ID

spider_id ,Associated Scraping Tool (Required)

It defines the crawler to use.

Request Example:

spider_id=youtube_product_by-id

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=www.youtube.com" ^
  -d "spider_id=youtube_product_by-id" ^
  -d "spider_parameters=[{\"video_id\": \"8RePenzQH80\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{VideoID}}"
video_idVideo unique ID (Required)

This parameter specifies the unique ID of the YouTube video to be crawled.

Request Example:

"video_id": "8RePenzQH80"

curl -X POST "https://scraperapi.thordata.com/builder" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=www.youtube.com" ^
  -d "spider_id=youtube_product_by-id" ^
  -d "spider_parameters=[{\"video_id\": \"8RePenzQH80\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{VideoID}}"

IV. Product - Subtitle Downloader

  1. YouTube - Extract subtitle files from videos using unique IDs

spider_id ,Associated Scraping Tool (Required)

It defines the crawler to use.

Request Example:

spider_id=youtube_transcript_by-id

curl -X POST "https://scraperapi.thordata.com/builde" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=www.youtube.com" ^
  -d "spider_id=youtube_transcript_by-id" ^
  -d "spider_parameters=[{\"video_id\": \"8RePenzQH80\"}]" ^
  -d "spider_universal={\"subtitles_language\": \"en\",\"subtitles_type\": \"auto_generated\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
video_idVideo unique ID (Required)

This parameter specifies the unique ID of the YouTube video to be crawled.

Request Example:

"video_id": "8RePenzQH80"

curl -X POST "https://scraperapi.thordata.com/builde" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=www.youtube.com" ^
  -d "spider_id=youtube_transcript_by-id" ^
  -d "spider_parameters=[{\"video_id\": \"8RePenzQH80\"}]" ^
  -d "spider_universal={\"subtitles_language\": \"en\",\"subtitles_type\": \"auto_generated\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
subtitles_language ,Subtitles Language(Required)

This parameter defines the language of the subtitles to be downloaded when crawling YouTube. It should be noted that if the selected language is unavailable, the system will return English subtitles by default.

Abkhazian ab, Afar aa, Afrikaans af, Akan ak, Albanian sq, Amharic am, Arabic ar, Armenian hy, Assamese as, Aymara ay, Azerbaijani az, Bengali bn, Bashkir ba, Basque eu, Belarusian be, Bhojpuri bho, Bosnian bs, Breton br, Bulgarian bg, Burmese my, Catalan ca, Cebuano ceb, Simplified Chinese zh-Hans, Traditional Chinese zh-Hant, Corsican co, Croatian hr, Czech cs, Danish da, Divehi dv, Dutch nl, Dzongkha (Bhutan) dz, English (original) en-orig, English en, Esperanto eo, Estonian et, Ewe ee, Faroese fo, Fijian fj, Filipino fil, Finnish fi, French fr, Ga gaa, Galician gl, Ganda lg, Georgian ka, German de, Greek el, Guarani gn, Gujarati gu, Haitian Creole ht, Hausa ha, Hawaiian haw, Hebrew iw, Hindi hi, Hmong hmn, Hungarian hu, Icelandic is, Igbo ig, Indonesian id, Inuktitut iu, Irish ga, Italian it, Japanese ja, Javanese jv, Greenlandic kl, Kannada kn, Kazakh kk, Khasi kha, Khmer km, Kinyarwanda rw, Korean ko, Krio kri, Kurdish ku, Kyrgyz ky, Lao lo, Latin la, Latvian lv, Lingala ln, Lithuanian lt, Luba-Katanga lua, Luo luo, Luxembourgish lb, Macedonian mk, Malagasy mg, Malay ms, Malayalam ml, Maltese mt, Manx gv, Maori mi, Marathi mr, Mongolian mn, Mauritian Creole mfe, Nepali ne, Newari new, Northern Sotho nso, Norwegian no, Nyanja ny, Occitan oc, Oriya or, Oromo om, Ossetic os, Pampanga pam, Pashto ps, Persian fa, Polish pl, Portuguese (Brazil) pt, Portuguese (Portugal) pt-PT, Punjabi pa, Quechua qu, Romanian ro, Rundi rn, Russian ru, Samoan sm, Sango sg, Sanskrit sa, Scottish Gaelic gd, Serbian sr, Seychelles Creole French crs, Shona sn, Sindhi sd, Sinhala si, Slovak sk, Slovenian sl, Somali so, Southern Sotho st, Spanish es, Sundanese su, Swahili sw

Request Example: "subtitles_language":"en"

curl -X POST "https://scraperapi.thordata.com/builde" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=www.youtube.com" ^
  -d "spider_id=youtube_transcript_by-id" ^
  -d "spider_parameters=[{\"video_id\": \"8RePenzQH80\"}]" ^
  -d "spider_universal={\"subtitles_language\": \"en\",\"subtitles_type\": \"auto_generated\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
subtitles_type ,Subtitle type(Required)

This parameter specifies whether the captured subtitles are automatically generated or user-uploaded. Parameter value:auto_generated uploader_provided

Request Example:

"subtitles_type": "auto_generated"

curl -X POST "https://scraperapi.thordata.com/builde" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=www.youtube.com" ^
  -d "spider_id=youtube_transcript_by-id" ^
  -d "spider_parameters=[{\"video_id\": \"8RePenzQH80\"}]" ^
  -d "spider_universal={\"subtitles_language\": \"en\",\"subtitles_type\": \"auto_generated\"}" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

V. Product Review Downloader

  1. YouTube - Capture video comment information using unique ID

spider_id ,Associated Scraping Tool (Required)

It defines the crawler to use.

Request Example:

spider_id=youtube_profiles_by-url

curl -X POST "https://scraperapi.thordata.com/builde" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=www.youtube.com" ^
  -d "spider_id=youtube_profiles_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/@mrbeast\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
url ,Channel URL (Required)

This parameter is used to specify the access URL address of the YouTube channel to be crawled.

Request Example:

"url": "https://www.youtube.com/@mrbeast"

curl -X POST "https://scraperapi.thordata.com/builde" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=www.youtube.com" ^
  -d "spider_id=youtube_profiles_by-url" ^
  -d "spider_parameters=[{\"url\": \"https://www.youtube.com/@mrbeast\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

2. YouTube - Fetch profile information via unique ID

spider_id ,Associated Scraping Tool (Required)

It defines the crawler to use.

Request Example:

spider_id=youtube_profiles_by-keyword

curl -X POST "https://scraperapi.thordata.com/builde" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=www.youtube.com" ^
  -d "spider_id=youtube_profiles_by-keyword" ^
  -d "spider_parameters=[{\"keyword\": \"MrBeast\",\"page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
keywordKeywords (Required)

This parameter specifies the keyword to search YouTube channels for.

Request Example:

"keyword": "MrBeast"

curl -X POST "https://scraperapi.thordata.com/builde" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=www.youtube.com" ^
  -d "spider_id=youtube_profiles_by-keyword" ^
  -d "spider_parameters=[{\"keyword\": \"MrBeast\",\"page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"
page_turningNumber of crawled pages (Required)

This parameter sets the number of pages of search results to crawl. Entering a number "n" will crawl all product data from page 1 to page n. Parameter value:1-100000

Request Example:

"page_turning": "1"

curl -X POST "https://scraperapi.thordata.com/builde" ^
  -H "Authorization: Bearer Token-ID" ^
  -H "Content-Type: application/x-www-form-urlencoded" ^
  -d "spider_name=www.youtube.com" ^
  -d "spider_id=youtube_profiles_by-keyword" ^
  -d "spider_parameters=[{\"keyword\": \"MrBeast\",\"page_turning\": \"1\"}]" ^
  -d "spider_errors=true" ^
  -d "file_name={{TasksID}}"

If you need further help, please contact us at [email protected].

Last updated

Was this helpful?