1322 Commits (c071dbdc182b8b50cca7b5b6c5903d6fbf0d3fd7)

Author SHA1 Message Date
Simon Sawicki 0b6f829b1d
[utils] `traverse_obj`: Move `is_user_input` into output template ()
Authored by: Grub4K
pukkandan 993edd3f6e
[outtmpl] Support multiplication
Related: 
sepro 1732eccc0a
[core] Parse `release_year` from `release_date` ()
Closes 
Authored by: seproDev
coletdjnz ccfd70f4c2
[rh:websockets] Migrate websockets to networking framework ()
* Adds a basic WebSocket framework
* Introduces new minimum `websockets` version of 12.0
* Deprecates `WebSocketsWrapper`

Fixes https://github.com/yt-dlp/yt-dlp/issues/8439

Authored by: coletdjnz
sepro 45d82be65f
[ie/nebula] Overhaul extractors ()
Closes , Closes , Closes , Closes , Closes 
Authored by: elyse0, pukkandan, seproDev

Co-authored-by: Elyse <26639800+elyse0@users.noreply.github.com>
Co-authored-by: pukkandan <pukkandan.ytdlp@gmail.com>
bashonly f4b95acafc
Remove Python 3.7 support ()
Closes 
Authored by: bashonly
bashonly f04b5bedad
[ie] Do not smuggle `http_headers`
See: https://github.com/yt-dlp/yt-dlp/security/advisories/GHSA-3ch3-jhc6-5r8x

Authored by: coletdjnz
bashonly 87264d4fda
[test:update] Implement simple updater unit tests
Authored by: bashonly
Simon Sawicki 4e38e2ae9d
[rh:requests] Handle both `bytes` and `int` for `IncompleteRead.partial` (Fix 8a8b54523a) ()
Authored by: bashonly, coletdjnz, Grub4K
coletdjnz 8a8b54523a
[rh:requests] Add handler for `requests` HTTP library ()
Adds support for HTTPS proxies and persistent connections (keep-alive)

Closes https://github.com/yt-dlp/yt-dlp/issues/1890
Resolves https://github.com/yt-dlp/yt-dlp/issues/4070
Resolves https://github.com/ytdl-org/youtube-dl/issues/32549
Resolves https://github.com/ytdl-org/youtube-dl/issues/14523
Resolves https://github.com/ytdl-org/youtube-dl/issues/13734

Authored by: coletdjnz, Grub4K, bashonly
Awal Garg 9d7ded6419
[utils] `js_to_json`: Fix `Date` constructor parsing ()
Authored by: awalgarg, Grub4K
Simon Sawicki 088add9567
[cleanup] Misc
Authored by: Grub4K
Simon Sawicki de015e9307
[core] Prevent RCE when using `--exec` with `%q` (CVE-2023-40581)
The shell escape function now uses `""` instead of `\"`. `utils.Popen` has been patched to properly quote commands.

Prior to this fix, using `--exec` together with `%q` on Windows could cause remote code to execute. See https://github.com/yt-dlp/yt-dlp/security/advisories/GHSA-42h4-v29r-42qg for reference.

Authored by: Grub4K
Simon Sawicki 61bdf15fc7
[core] Raise minimum recommended Python version to 3.8 ()
Authored by: Grub4K
bashonly 5ca095cbcd
[cleanup] Misc ()
Closes , Closes 
Authored by: barsnick, sqrtNOT, gamer191, coletdjnz, Grub4K, bashonly
bashonly 661c9a1d02
[test:download] Test for `expected_exception`
Authored by: at-wat

Co-authored-by: Atsushi Watanabe <atsushi.w@ieee.org>
bashonly 52414d64ca
[utils] `js_to_json`: Handle `Array` objects
Authored by: Grub4K, std-move

Co-authored-by: std-move <26625259+std-move@users.noreply.github.com>
Co-authored-by: Simon Sawicki <accounts@grub4k.xyz>
coletdjnz 20fbbd9249
[networking] Fix various socks proxy bugs ()
- Fixed support for IPv6 socks proxies
- Fixed support for IPv6 over socks5
- Fixed --source-address not being obeyed for socks4 and socks5
- Fixed socks4a when the destination address is an IPv4 address

Closes https://github.com/yt-dlp/yt-dlp/issues/7959
Fixes https://github.com/ytdl-org/youtube-dl/issues/15368

Authored by: coletdjnz
Co-authored-by: Simon Sawicki <accounts@grub4k.xyz>
Co-authored-by: bashonly <bashonly@bashonly.com>
coletdjnz fcd6a76adc
[tests] Add tests for socks proxies ()
Authored by: coletdjnz
coletdjnz db7b054a61
[networking] Add request handler preference framework ()
Preference functions that take a request and a request handler instance can be registered to prioritize different request handlers per request.

Authored by: coletdjnz
Co-authored-by: pukkandan <pukkandan.ytdlp@gmail.com>
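
The preference idea described in this commit can be illustrated with a minimal, self-contained sketch. It is not the yt-dlp implementation: the `Request` and `Handler` classes, the `register_preference` decorator, and `order_handlers` are hypothetical names chosen for illustration. It assumes each preference function returns an integer score and that handlers with a higher total score are tried first.

```python
from typing import Callable, List


class Request:
    """Hypothetical stand-in for a request object."""
    def __init__(self, url: str):
        self.url = url


class Handler:
    """Hypothetical stand-in for a request handler instance."""
    def __init__(self, name: str):
        self.name = name


# A preference takes a handler and a request and returns a score.
Preference = Callable[[Handler, Request], int]
preferences: List[Preference] = []


def register_preference(func: Preference) -> Preference:
    """Register a preference function (illustrative decorator)."""
    preferences.append(func)
    return func


@register_preference
def prefer_websockets_handler(handler: Handler, request: Request) -> int:
    # Boost the 'websockets' handler only for ws:// and wss:// URLs.
    if request.url.startswith(('ws://', 'wss://')) and handler.name == 'websockets':
        return 100
    return 0


def order_handlers(handlers: List[Handler], request: Request) -> List[Handler]:
    """Sort handlers by their summed preference score, highest first.

    sorted() is stable, so handlers with equal scores keep their input order.
    """
    return sorted(
        handlers,
        key=lambda handler: sum(pref(handler, request) for pref in preferences),
        reverse=True,
    )
```

Because the score is computed per request, the same set of handlers can be ordered differently for a WebSocket URL than for a plain HTTPS URL.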
pukkandan 6148833f5c
[cleanup] Misc
pukkandan 3f7965105d
[utils] HTTPHeaderDict: Handle byte values
pukkandan de20687ee6
[test] Fix `test_load_certifi`
Closes , 
coletdjnz 4bf912282a
[networking] Remove dot segments during URL normalization ()
This implements `remove_dot_segments` from RFC 3986, section 5.2.4, during the URL normalization process.

Closes , 

Authored by: coletdjnz
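
For reference, the algorithm in RFC 3986 section 5.2.4 can be sketched as below. This is a direct transcription of the RFC's step list, not the code from this commit; the function name simply mirrors the RFC's, and how yt-dlp integrates it into URL normalization is not shown here.

```python
def remove_dot_segments(path: str) -> str:
    """Remove "." and ".." segments from a URL path (RFC 3986, 5.2.4)."""
    output = []  # completed segments, each with its leading "/" if any
    while path:
        # A. Strip a leading "../" or "./"
        if path.startswith('../'):
            path = path[3:]
        elif path.startswith('./'):
            path = path[2:]
        # B. Replace a leading "/./" or a final "/." with "/"
        elif path.startswith('/./'):
            path = path[2:]
        elif path == '/.':
            path = '/'
        # C. Replace a leading "/../" or a final "/.." with "/"
        #    and drop the last completed segment, if there is one
        elif path.startswith('/../'):
            path = path[3:]
            if output:
                output.pop()
        elif path == '/..':
            path = '/'
            if output:
                output.pop()
        # D. A bare "." or ".." is discarded
        elif path in ('.', '..'):
            path = ''
        # E. Move the first segment (up to the next "/") to the output
        else:
            next_slash = path.find('/', 1)
            if next_slash == -1:
                segment, path = path, ''
            else:
                segment, path = path[:next_slash], path[next_slash:]
            output.append(segment)
    return ''.join(output)
```

The two worked examples from the RFC behave as expected with this sketch: `/a/b/c/./../../g` becomes `/a/g`, and `mid/content=5/../6` becomes `mid/6`.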
coletdjnz bbeacff7fc
[networking] Ignore invalid proxies in env ()
Authored by: coletdjnz
coletdjnz 95abea9a03
[test] Fix `httplib_validation_errors` test for old Python versions ()
Fixes https://github.com/yt-dlp/yt-dlp/issues/7674

Authored by: coletdjnz
coletdjnz 86aea0d3a2
[networking] Add strict Request extension checking ()
Authored by: coletdjnz
Co-authored-by: pukkandan <pukkandan.ytdlp@gmail.com>
pukkandan a250b24733
[compat] Ensure submodules are imported correctly
Closes 
pukkandan 62b5c94cad
[cleanup] Misc fixes
Closes 
pukkandan a264433c9f
[outtmpl] Fix replacement for `playlist_index`
bashonly e57eb98222
[fd/external] Fix ffmpeg input from stdin ()
Bugfix for 1ceb657bdd

Authored by: bashonly
bashonly af86873218
[utils] Improve `parse_duration`
Authored by: bashonly
bashonly 75dc8e673b
[networking] Fix `--legacy-server-connect` ()
Bugfix for 227bf1a33b

Authored by: bashonly
bashonly 71baa490eb
[networking] Fix POST requests with zero-length payloads ()
Bugfix for 227bf1a33b

Authored by: bashonly
bashonly 42ded0a429
[fd/external] Fixes to cookie handling
- Fix bug in `axel` Cookie header arg
- Pass cookies to `curl` as strings
- Write session cookies for `aria2c` and `wget`

Closes 
Authored by: bashonly
bashonly 6c5211cebe
[core] Fix HTTP headers and cookie handling
- Remove `Cookie` header from `http_headers` immediately after loading into cookiejar
- Restore compat for `--load-info-json` cookies
- Add more tests
- Fix improper passing of Cookie header by `MailRu` extractor

Closes 
Authored by: bashonly, pukkandan
coletdjnz 3d2623a898
[compat, networking] Deprecate old functions ()
Authored by: coletdjnz, pukkandan
coletdjnz 227bf1a33b
[networking] Rewrite architecture ()
The new networking interface consists of a `RequestDirector` that directs
each `Request` to the appropriate `RequestHandler` and returns the
`Response` or raises `RequestError`. The handlers define adapters to
transform their internal Request/Response/Errors to our interfaces.

User-facing changes:
- Fix issues with per request proxies on redirects for urllib
- Support for `ALL_PROXY` environment variable for proxy setting
- Support for `socks5h` proxy
   - Closes https://github.com/yt-dlp/yt-dlp/issues/6325, https://github.com/ytdl-org/youtube-dl/issues/22618, https://github.com/ytdl-org/youtube-dl/pull/28093
- Raise error when using `https` proxy instead of silently converting it to `http`

Authored by: coletdjnz
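
The director/handler split described in this commit can be sketched roughly as follows. The class names `RequestDirector`, `Request`, `RequestHandler`, `Response` and `RequestError` come from the commit message; everything else (`UnsupportedRequest`, `CannedHTTPHandler`, the `schemes` attribute, and all method names) is an assumption made for illustration and should not be read as the actual yt-dlp API. No network access is performed; the handler returns a canned response.

```python
class RequestError(Exception):
    """Raised when a request cannot be completed."""


class UnsupportedRequest(RequestError):
    """Raised by a handler that cannot deal with a given request (assumed name)."""


class Request:
    def __init__(self, url):
        self.url = url


class Response:
    def __init__(self, url, status, body=b''):
        self.url = url
        self.status = status
        self.body = body


class RequestHandler:
    schemes = ()  # URL schemes this handler accepts (illustrative attribute)

    def validate(self, request):
        scheme = request.url.partition(':')[0].lower()
        if scheme not in self.schemes:
            raise UnsupportedRequest(
                f'{type(self).__name__} does not support {scheme!r} URLs')

    def send(self, request):
        self.validate(request)
        return self._send(request)

    def _send(self, request):
        raise NotImplementedError


class CannedHTTPHandler(RequestHandler):
    """Toy handler: adapts a fake backend result to our Response type."""
    schemes = ('http', 'https')

    def _send(self, request):
        return Response(request.url, 200, b'ok')


class RequestDirector:
    def __init__(self):
        self.handlers = []

    def add_handler(self, handler):
        self.handlers.append(handler)

    def send(self, request):
        # Try each handler in turn; skip the ones that reject the request.
        for handler in self.handlers:
            try:
                return handler.send(request)
            except UnsupportedRequest:
                continue
        raise RequestError(f'no request handler supports {request.url}')
```

In this sketch the caller only ever sees `Response` or `RequestError`, which is the point of the adapter layer: whatever library a handler wraps, its own request, response and error types stay internal to that handler.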
pukkandan c365dba843
[networking] Add module ()
No actual changes - code is only moved around
pukkandan 1b392f905d
[utils] Add temporary shim for logging
Related: , 
Mahmoud Abdel-Fattah 2af4eeb772
[utils] `clean_podcast_url`: Handle more trackers ()
Authored by: mabdelfattah, bashonly
Closes 
Simon Sawicki 3121512228
[core] Change how `Cookie` headers are handled
Cookies are now saved and loaded under `cookies` key in the info dict
instead of `http_headers.Cookie`. Cookies passed in headers are
auto-scoped to the input URLs with a warning.

Ref: https://github.com/yt-dlp/yt-dlp/security/advisories/GHSA-v8mc-9377-rwjj

Authored by: Grub4K
coletdjnz f8b4bcc0a7
[core] Prevent `Cookie` leaks on HTTP redirect
Ref: https://github.com/yt-dlp/yt-dlp/security/advisories/GHSA-v8mc-9377-rwjj

Authored by: coletdjnz
bashonly 1ceb657bdd
[fd/external] Scope cookies
- ffmpeg: Calculate cookies from cookiejar and pass with `-cookies` arg instead of `-headers`
- aria2c, curl, wget: Write cookiejar to file and use external FD built-in cookiejar support
- httpie: Calculate cookies from cookiejar instead of `http_headers`
- axel: Calculate cookies from cookiejar and disable http redirection if cookies are passed
    - May break redirects, but axel simply doesn't have proper cookie support

Ref: https://github.com/yt-dlp/yt-dlp/security/advisories/GHSA-v8mc-9377-rwjj

Authored by: bashonly, coletdjnz
pukkandan 47bcd43724
[outtmpl] Pad `playlist_index` etc even when with internal formatting
Closes 
pukkandan cd810afe2a
[extractor/youtube] Improve nsig function name extraction
pukkandan 42f2d40b47
Update to ytdl-commit-07af47
[YouTube] Improve fix for ae8ba2c
07af47960f
pukkandan 1619ab3e67
Bugfix for ebe1b4e34f
pukkandan ad54c9130e
[cleanup] Misc
Closes , Closes , Closes , Closes , Closes 
Authored by: mikf, freezboltz, pukkandan
pukkandan ebe1b4e34f
[outtmpl] Fix some minor bugs
Closes 
pukkandan eedda5252c
[utils] `FormatSorter`: Improve `size` and `br`
Closes 

Previously, when some formats had an accurate size and some an approximate one,
the ones with accurate size were always prioritized

For formats with known tbr and unknown vbr/abr, we were setting
(vbr=tbr, abr=0) for sorting to work. This is no longer needed.

Authored by: pukkandan, u-spec-png
pukkandan 01aba2519a
[jsinterp] Fix global object extraction
Closes 
bashonly b4a252fba8
[jsinterp] Fix division ()
* Fixes nsig decryption for Youtube JS player `8c7583ff`

Authored by: bashonly
Matt Broadway b38d4c941d
[cookies] Update for chromium changes ()
Authored by: mbway
coletdjnz daafbf49b3
[core] Support decoding multiple content encodings ()
Authored by: coletdjnz
coletdjnz b87e01c123
[cookies] Move `YoutubeDLCookieJar` to cookies module ()
Authored by: coletdjnz
coletdjnz 08916a49c7
[core] Improve HTTP redirect handling ()
Aligns HTTP redirect handling with what browsers commonly do and RFC standards. 

Fixes issues afac4caa7d missed.

Authored by: coletdjnz
pukkandan 7aeda6cc9e
[jsinterp] Do not compile regex
pukkandan 4823ec9f46
Update to ytdl-commit-d1c6c5
[YouTube] [core] Improve platform debug log, based on yt-dlp
d1c6c5c4d6

Except:
    * 6ed34338285f722d0da312ce0af3a15a077a3e2a [jsinterp] Add short-cut evaluation for common expression
        * There was no performance improvement when tested with https://github.com/ytdl-org/youtube-dl/issues/30641
    * e8de54bce50f6f77a4d7e8e80675f7003d5bf630 [core] Handle `/../` sequences in HTTP URLs
        * We plan to implement this differently
pukkandan 6f2287cb18
[cleanup] Misc
Closes , closes 
pukkandan 1d7656184c
[jsinterp] Handle `NaN` in bitwise operators
Closes 
Simon Sawicki b079c26f0a
[utils] `traverse_obj`: More fixes ()
- Fix result when branching with `traverse_string`
- Fix `slice` path on `dict`s
- Fix tests and docstrings from 21b5ec86c2
- Add `is_iterable_like` helper function

Authored by: Grub4K
pukkandan 7cf51f2191
[jsinterp] Handle negative numbers better
Closes 
Simon Sawicki 21b5ec86c2
[utils] `traverse_obj`: Allow iterables in traversal ()
Authored by: Grub4K
pukkandan ec9311c41b
[outtmpl] Support `str.format` syntax inside replacements
Closes 
pukkandan 78fde6e339
[outtmpl] Allow `\n` in replacements and default.
Fixes: https://github.com/yt-dlp/yt-dlp/issues/6808#issuecomment-1510055357
Fixes: https://github.com/yt-dlp/yt-dlp/issues/6808#issuecomment-1510363645
Simon Sawicki 0898c5c8cc
[utils] `js_to_json`: Implement template strings ()
Authored by: Grub4K
pukkandan f68434cc74
[extractor] Extract more metadata from ISM
Fixes 81b6102d20 (r105892531)
pukkandan 9acf1ee25f
[jsinterp] Handle `Date` at epoch 0
Closes 
pukkandan 65f6e80780
[dependencies] Simplify `Cryptodome`
Closes , closes , closes 
pukkandan 45b2ee6f4f
Update to ytdl-commit-2dd6c6e
[YouTube] Avoid crash if uploader_id extraction fails
2dd6c6edd8

Except:
    * 295736c9cba714fb5de7d1c3dd31d86e50091cf8 [jsinterp] Improve parsing
    * 384f632e8a9b61e864a26678d85b2b39933b9bae [ITV] Overhaul ITV extractor
    * 33db85c571304bbd6863e3407ad8d08764c9e53b [feat]: Add support to external downloader aria2p
Simon Sawicki 6839ae1f6d
[utils] `traverse_obj`: Fix more bugs
and cleanup uses of `default=[]`

Continued from b1bde57bef
Simon Sawicki b1bde57bef
[utils] `traverse_obj`: Fix several behavioral problems
See  for further info

Authored by: Grub4K
pukkandan f6a765ceb5
[dependencies] Standardize `Cryptodome` imports
Simon Sawicki 776995bc10
[utils] `traverse_obj`: Various improvements
- Add `set` key for transformations/filters
- Add `re.Match` group names
- Fix behavior for `expected_type` with `dict` key
- Raise for filter function signature mismatch in debug

Authored by: Grub4K
pukkandan 8b008d6254
[jsinterp] Support `if` statements
Closes 
Simon Sawicki 773c272d66
Fix config locations ()
Bug in 8e40b9d1ec
Closes 

Authored by: Grub4K, coletdjnz, pukkandan
Matthew 8e40b9d1ec
Improve plugin architecture ()
to make plugins easier to develop and use:
* Plugins are now loaded as namespace packages.
* Plugins can be loaded in any distribution of yt-dlp (binary, pip, source, etc.).
* Plugin packages can be installed and managed via pip, or dropped into any of the documented locations.
* Users do not need to edit any code files to install plugins.
* Backwards-compatible with previous plugin architecture.

As a side-effect, yt-dlp will now search in a few more locations for config files.

Closes https://github.com/yt-dlp/yt-dlp/issues/1389

Authored by: flashdagger, coletdjnz, pukkandan, Grub4K
Co-authored-by: Marcel <flashdagger@googlemail.com>
Co-authored-by: pukkandan <pukkandan.ytdlp@gmail.com>
Co-authored-by: Simon Sawicki <accounts@grub4k.xyz>
pukkandan fbb7383306
Add `weba` to known extensions
ChillingPepper d5f043d127
[utils] js_to_json: Fix bug in f55523c ()
Authored by: ChillingPepper, pukkandan
pukkandan 9f14daf22b
[extractor] Deprecate `_sort_formats`
pukkandan 6368e2e639
[cleanup] Misc
Closes 
Robert Geislinger 7aaf4cd2a8
[cleanup] Misc
Closes , Closes 

Authored by: pukkandan, Alienmaster
pukkandan 495322b95b
[test] Allow `extract_flat` in download tests
Authored by: coletdjnz, pukkandan
Matthew 86973308cd
[extractor/youtube:tab] Update tab handling for redesign ()
Closes , , 
Authored by: coletdjnz, pukkandan
pukkandan 46d09f8707
[cleanup] Lint and misc cleanup
pukkandan db4678e448
Update to ytdl-commit-de39d128
[extractor/ceskatelevize] Back-port extractor from yt-dlp
de39d1281c

Closes , Closes , Closes 
Alex Karabanov c9bd65185c
[extractor/zenyandex] Fix extractors (, )
Closes 
Authored by: lksj, puc9, pukkandan

Co-authored-by: puc9 <51006296+puc9@users.noreply.github.com>
Ajay Ramachandran 63c547d71c
[SponsorBlock] Support `chapter` category ()
Authored by: ajayyy, pukkandan
Simon Sawicki a71b812f53
[utils] `js_to_json`: Improve escape handling ()
Authored by: Grub4K
Simon Sawicki 36069409ec
[cookies] Improve `LenientSimpleCookie` ()
Closes  
Authored by: Grub4K
pukkandan 0468a3b325
[jsinterp] Improve separating regex
Fixes https://github.com/yt-dlp/yt-dlp/issues/4635#issuecomment-1273974909
Matthew 4c9a1a3ba5
[extractor/wordpress:mb.miniAudioPlayer] Add embed extractor ()
Closes https://github.com/yt-dlp/yt-dlp/issues/4994

Authored by: coletdjnz
Simon Sawicki 7b0127e1e1
[utils] `traverse_obj`: Allow `re.Match` objects ()
Authored by: Grub4K
Simon Sawicki f99bbfc983
[utils] `traverse_obj`: Always return list when branching ()
Fixes 
Authored by: Grub4K
nixxo 81b6102d20
[downloader/ism] Support ec-3 codec ()
Closes 
Authored by: nixxo
Simon Sawicki ab029d7e92
[utils] `traverse_obj`: Rewrite, document and add tests ()
Authored by: Grub4K
Simon Sawicki 8817a80d3a
[cookies] Parse cookies leniently ()
Closes , 
Authored by: Grub4K
Elyse 7657ec7ed6
[utils] `base_url`: URL paths can contain `&` ()
Authored by: elyse0
Closes 
pukkandan 07a1250e0e
[outtmpl] Curly braces to filter keys