Commit Graph

106 Commits (d59a219b65329fdd97a8aa37895fbae2374d409a)

Author SHA1 Message Date
chio0hai 389896df85
[extractor/txxx] Add extractors ()
Authored by: chio0hai
Closes 
bashonly 8aa0bd5d10
[extractor/generic] Avoid catastrophic backtracking in KVS regex
Authored by: bashonly
bashonly 05997b6e98
[extractor/generic] Decode unicode-escaped embed URLs ()
Authored by: bashonly
Closes 
Simon Sawicki 32a84bcf4e
Update to ytdl-commit-195f22f6
[generic] Improve KVS (etc) extraction
195f22f679

Closes 
Authored by: Grub4k, pukkandan
coletdjnz 3e01ce744a
[extractor/generic] Use `Accept-Encoding: identity` for initial request
The existing comment seems to imply this was the desired behavior from the beginning.

Partial fix for https://github.com/yt-dlp/yt-dlp/issues/5855, https://github.com/yt-dlp/yt-dlp/issues/5851, https://github.com/yt-dlp/yt-dlp/issues/4748
bashonly 3cf50fa8e9
[downloader/ffmpeg] Fix headers for video+audio formats ()
Authored by: bashonly, Grub4K
nixxo 10dc85924a
[extractor/mediaset] Better embed detection and error messages ()
Authored by: nixxo
pukkandan 7991ae57a8
[extractor/sibnet] Separate from VKIE
Fixes bfd973ece3 (commitcomment-91834251)
bashonly 3b021eacef
[extractor/generic] Add `fragment_query` extractor arg for DASH and HLS ()
* `fragment_query`: passthrough any query in generic mpd/m3u8 manifest URLs to their fragments
* Add support for `extra_param_to_segment_url` to DASH downloader
Authored by: bashonly, pukkandan
bashonly ed027fd9d8
[extractor/generic] Fix JSON LD manifest extraction ()
Closes 
Authored by: bashonly, pukkandan
bashonly f5a9e9df0d
[extractor/brightcove] Add `BrightcoveNewBaseIE` and fix embed extraction ()
* Move Brightcove embed extraction and tests into the IEs
* Split `BrightcoveNewBaseIE` from `BrightcoveNewIE`
* Fix bug in ade1fa70cb with the "wrong" spelling of `referrer` being smuggled

Closes 
pukkandan 9f14daf22b
[extractor] Deprecate `_sort_formats`
pukkandan 4de88a6a36
[extractor/generic] Don't report redirect to https
MMM 0d8affc17f
[extractor/rumble] Add HLS formats and extract more metadata ()
Closes ,  
Authored by: flashdagger
pukkandan 62b8dac490
[extractor] Improve `_generic_title`
pukkandan d5d1df8afd
[cleanup Misc
Closes 
Matthew 6dca2aa66d
[extractor/generic:quoted-html] Add extractor ()
Extracts embeds from escaped HTML within `data-html` attribute.
Related: https://github.com/ytdl-org/youtube-dl/issues/21294, https://github.com/yt-dlp/yt-dlp/pull/5121

Authored by: coletdjnz
Co-authored-by: pukkandan <pukkandan.ytdlp@gmail.com>
pukkandan ade1fa70cb
[extractor/generic] Separate embed extraction into own function ()
Simon Sawicki 1d55ebabc9
[extractor/common] Fix `json_ld` type checks ()
Closes , 
Authored by: Grub4K
coletdjnz d3a3d7f0cc
[extractor/JWPlatform] Fix extractor ()
Fix bitrate and filesize extraction and support embeds with unquoted urls.

Related:  

Authored by: coletdjnz
pukkandan 0f60ba6e65
[extractor] Improve json+ld extraction
Related 
coletdjnz 3c757d5ed2
[extractor/wistia] Add support for channels ()
Fixes https://github.com/yt-dlp/yt-dlp/issues/4748
Related: https://github.com/yt-dlp/yt-dlp/issues/4985

Authored by: coletdjnz
bashonly 163281178a
[extractor/wistia] Match IDs in embed URLs ()
Closes 
Authored by: bashonly
pukkandan 2fa669f759
[docs] Misc improvements
Closes , Closes , Closes , Closes , Closes 
pukkandan 3166e6840c
[extractor/generic] Pass through referer from json-ld
Closes 
pukkandan deae7c1711
[cleanup] Misc
pukkandan 4e4982ab5b
[extractor/generic] Don't return JW player without formats
CLoses 
pukkandan 2516cafb28
Fix bug in fe7866d0ed
Lesmiscore 62b58c0936
[docs] Consistent use of `e.g.` ()
Authored by: Lesmiscore
pukkandan bfd973ece3 [extractors] Use new framework for existing embeds ()
`Brightcove` is difficult to migrate because it's subclasses may depend
on the signature of the current functions. So it is left as-is for now

Note: Tests have not been migrated
pukkandan f14a2d8382 [extractor/html5] Separate into own extractor ()
Closes 

Authored by: coletdjnz, pukkandan
pukkandan 5fff2e576f [extractor/camtasia] Separate into own extractor ()
Authored by: coletdjnz
pukkandan f2e8dbcc00 [extractor, test] Basic framework for embed tests ()
and split download tests so they can be more easily run in CI

Authored by: coletdjnz
pukkandan 8f97a15d1c [extractor] Framework for embed detection ()
pukkandan 6154438178
[extractor/generic] Remove HEAD request
pukkandan ae61d108dd
[cleanup] Misc cleanup
crazymoose77756 962ffcf89c
[cleanup] Fix some typos ()
Authored by: crazymoose77756
pukkandan 14f25df2b6
[compat] Remove deprecated functions from core code
pukkandan 7b2c3f47c6
[cleanup] Misc
Elyse 612e31f5ea
[extractor/substack] Add extractor ()
Closes 
Authored by: elyse0
pukkandan f3c0c77304
[extractor] Handle `json_ld` with multiple `@type`s
Closes: 
pukkandan 79e591b59b
[extractor/rumble] Detect JS embed
Closes 
pukkandan 21a73e9f39
[extractor/generic] Revert e6ae51c123
85553414ae made it unnecessary
pukkandan 55baa67c7c
[extractor/jwplatform] Look for `data-video-jw-id`
Closes 
pukkandan b801cd7179
[tiktok] Detect embeds
Closes 
pukkandan 80e8493ee7
[utils] `is_html`: Handle double BOM
Closes 
pukkandan aedaa455d9
[vimeo] Fix extractors
Closes , Closes , Closes , Closes 
May also fix , 
pukkandan d6bf1161db
[generic] Refactor `_extract_rss`
Closes 
pukkandan 2414649192
[cleanup] Misc cleanup
pukkandan 0f06bcd759
[cleanup] Minor fixes (See desc)
* [youtube] Fix `--youtube-skip-dash-manifest`
* [build] Use `$()` in `Makefile`. Closes 
* Fix bug in 385ffb467b
* Fix bug in 43d7f5a5d0
* [cleanup] Remove unnecessary `utf-8` from `str.encode`/`bytes.decode`
* [utils] LazyList: Expose unnecessarily "protected" attributes
and other minor cleanup