1001 Commits (8ff961d10faed848009f9e2ec03fa390b486694d)

Author SHA1 Message Date
dirkf 73e1ab6125 [test:download] Only extract enough videos for playlist_mincount
dirkf 7a497f1405 Rework 2c2c2bd with an actual Mix page and realistic playlist size
From 2c2c2bd348 (commitcomment-65953545)
dirkf 5add3f4373 Merge branch 'pukkandan-yt-searchurl' into yt-dl-master
Closes 
dirkf 1e677567cd
[YouTube] Fix n-sig for player e06dea74 ()
From yt-dl commit 48416bc
dirkf 9d142109f4 Back-port test_youtube_signature.py from yt-dlp and fix JSInterp accordingly
df e1eae16b56 Handle default in switch better
Add a1fc7ca074
Thanks coletdjnz
df 96f87aaa3b Back-port JS interpreter upgrade from yt-dlp PR
df 39ca35e765 Fix test_youtube_flat_playlist_extraction
df d76d59d99d Remove obsolete non-working test_youtube_toptracks
df 2c2c2bd348 Fix test_youtube_mix
df 46e0a729b2 Remove obsolete test_youtube_course
df 57044eaceb Fix test_youtube_playlist_noplaylist
pukkandan a3373da70c
Merge branch 'UP/youtube-dl' into dl/YoutubeSearchURLIE
pukkandan ed99d68bdd
Add back `YoutubeSearchURLIE`
Sergey M․ c4a451bcdd
[test_execution] Add test for lazy extractors (refs )
Sergey M․ 5ad69d3d0e
[test_youtube_misc] Move YoutubeIE.extract_id test into separate module
PrinceOfPuppers 70baa7bfae
[test_youtube_lists] Actualize youtube flat playlist test (closes )
Remita Amine 99c68db0a8 [youtube] add support for phone/tablet JS player (closes )
Remita Amine b46483a6ec [youtube/test_youtube_signature] fix test
Remita Amine 9c724601ba [youtube] remove description chapters tests
video descriptions no longer contain the yt.www.watch.player.seekTo
function
Sergey M․ 142c584063
Introduce --output-na-placeholder (closes )
Sergey M․ d81a213cfb
[YoutubeDL] Raise syntax error for format selection expressions with multiple + operators (closes )
nixxo 3a61e6d360
[rai] improve subtitles extraction ()
closes 
Remita Amine e88c9ef62a [utils] add a function to clean podcast URLs
Remita Amine 9dd674e1d2 [utils] accept only supported protocols in url_or_none
Sergey M․ af1312bfc3
[youtube:tab] Extend _VALID_URL (closes )
Sergey M․ 03d3af9768
[test_InfoExtractor] PEP 8
Sergey M․ 1727541315
[extractor/common] Improve JSON-LD interaction statistic extraction (refs )
Sergey M․ 5a1fbbf8b7
[extractor/common] Fix inline HTML5 media tags processing and add test (closes )
Sergey M․ 191286265d
[youtube:tab] Fix feeds extraction (closes , closes )
Josh Soref 71ddc222ad
Fix typos ()
* spelling: authorization

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: brightcove

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: creation

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: exceeded

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: exception

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: extension

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: extracting

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: extraction

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: frontline

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: improve

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: length

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: listsubtitles

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: multimedia

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: obfuscated

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: partitioning

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: playlist

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: playlists

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: restriction

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: services

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: split

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: srmediathek

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: support

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: thumbnail

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: verification

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>

* spelling: whitespaces

Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
Sergey M․ ab0eda99e1
[YoutubeDL] Fix --ignore-errors for playlists with generator-based entries of url_transparent (closes )
Sergey M․ 2864179293
[youtube] Improve extraction
+ Add support for --no-playlist (closes )
* Improve playlist and mix extraction (closes , closes , closes , closes )
+ Extract playlist uploader data
* Update tests
Sergey M․ fe07e788bf
[utils] Skip ! prefixed code in js_to_json
Sergey M․ 2de2ca6659
[youtube] Rework extractors
WIP
Kevin O'Connor 4eda10499e
[utils] Don't attempt to coerce JS strings to numbers in js_to_json ()
The current logic in `js_to_json` tries to rewrite octal/hex numbers to
decimal. However, by the time that logic runs, the enclosing `"` or `'`
have already been trimmed off. As a result, values that were originally
strings but happen to look like octal/hex numbers get rewritten to decimal
and returned as numbers rather than strings.

In practice, something like:

```js
{
  "0x40": "foo",
  "040": "bar",
}
```

would get rewritten as:

```json
{
  64: "foo",
  32: "bar"
}
```

This is a problem because the result is not valid JSON: object keys
must be strings.
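
The idea behind the fix can be sketched in a few lines of Python. This is a minimal illustration, not the actual `js_to_json` code: the names `TOKEN_RE`, `convert_token` and `normalize_numbers` are hypothetical, and the sketch assumes double-quoted strings only. Bare hex/octal literals are converted to decimal, while quoted strings are matched first and passed through unchanged.

```python
import json
import re

# Hypothetical token pattern (not the real js_to_json regex):
# a double-quoted string, a bare hex literal, or a bare octal literal.
TOKEN_RE = re.compile(r'"(?:[^"\\]|\\.)*"|\b0[xX][0-9a-fA-F]+\b|\b0[0-7]+\b')


def convert_token(match):
    token = match.group(0)
    if token.startswith('"'):
        # Quoted strings are left alone, even if they look like numbers.
        return token
    base = 16 if token[:2].lower() == '0x' else 8
    return str(int(token, base))


def normalize_numbers(code):
    """Rewrite bare hex/octal literals to decimal; keep strings intact."""
    return TOKEN_RE.sub(convert_token, code)


if __name__ == '__main__':
    source = '{"0x40": "foo", "040": "bar", "n": 0x10, "m": 010}'
    print(json.loads(normalize_numbers(source)))
```

Because the string alternative comes first in the pattern, a key such as `"0x40"` is consumed as a whole string before the numeric alternatives get a chance to match inside it.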
Sergey M․ 1d9bf655e6
[utils] Recognize wav mimetype (closes )
Sergey M․ 84213ea8d4
[youtube] Extract chapters from JSON (closes )
Sergey M․ c380cc28c4
[utils] Improve cookie files support
+ Add support for UTF-8 in cookie files
* Skip malformed cookie file entries instead of crashing (invalid entry len, invalid expires at)
Sergey M․ e40c758c2a
[youtube] Improve player id extraction and add tests
Sergey M․ 042b664933
Revert "[utils] Add support for cookies with spaces used instead of tabs"
According to [1] TABs must be used as separators between fields.
Files produced by some tools with spaces as separators are considered
malformed.

1. https://curl.haxx.se/docs/http-cookies.html

This reverts commit cff99c91d1.
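
As a rough illustration of the TAB-only rule, the following sketch parses one line of a Netscape-format cookie file and returns `None` for malformed entries instead of raising. It is not the code from `utils`; `parse_cookie_line`, `FIELDS` and `HTTPONLY_PREFIX` are hypothetical names, and the seven-field layout follows the curl documentation referenced above.

```python
# Field order of a Netscape-format cookie line (seven TAB-separated fields).
FIELDS = ('domain', 'include_subdomains', 'path', 'secure',
          'expires', 'name', 'value')
HTTPONLY_PREFIX = '#HttpOnly_'


def parse_cookie_line(line):
    """Return a dict for a well-formed entry, or None if it should be skipped."""
    line = line.rstrip('\r\n')
    if line.startswith(HTTPONLY_PREFIX):
        # curl marks HttpOnly cookies with this prefix; the rest is a normal entry.
        line = line[len(HTTPONLY_PREFIX):]
    elif not line.strip() or line.startswith('#'):
        return None  # blank line or comment
    parts = line.split('\t')
    if len(parts) != len(FIELDS):
        return None  # wrong field count, e.g. spaces used instead of TABs
    entry = dict(zip(FIELDS, parts))
    if not entry['expires'].isdigit():
        return None  # invalid expiry timestamp
    entry['expires'] = int(entry['expires'])
    return entry


if __name__ == '__main__':
    good = '.example.com\tTRUE\t/\tFALSE\t1700000000\tsid\tabc'
    bad = '.example.com TRUE / FALSE 1700000000 sid abc'
    print(parse_cookie_line(good))
    print(parse_cookie_line(bad))
```

Splitting strictly on `'\t'` means a space-separated line collapses into a single field and is skipped rather than guessed at.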
Sergey M․ cff99c91d1
[utils] Add support for cookies with spaces used instead of tabs
Sergey M․ ea17979d83
[test_subtitles] Remove obsolete test
Sergey M․ 4e9e1e240d
[test_YoutubeDL] Add tests for (closes )
Sergey M․ e0abaab293
[test_YoutubeDL] Fix get_ids
Sergey M․ 42db58ec73
[utils] Improve str_to_int
Remita Amine 348c6bf1c1 [utils] handle int values passed to str_to_int
Sergey M․ 1ced222120
[utils] Add generic caesar cipher and rot47
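
For readers unfamiliar with the technique, a generic Caesar cipher over an arbitrary alphabet, with ROT47 as a special case, might look like the sketch below. The names `caesar_shift` and `rot47_sketch` are hypothetical and the signatures are assumptions; they are not taken from the commit.

```python
def caesar_shift(text, alphabet, shift):
    """Shift every character found in `alphabet` by `shift` positions, wrapping around.

    Characters outside the alphabet are returned unchanged.
    """
    if shift == 0:
        return text
    size = len(alphabet)
    return ''.join(
        alphabet[(alphabet.index(c) + shift) % size] if c in alphabet else c
        for c in text)


# ROT47 is a Caesar shift by 47 over the 94 printable ASCII characters '!'..'~'.
ROT47_ALPHABET = ''.join(chr(i) for i in range(33, 127))


def rot47_sketch(text):
    return caesar_shift(text, ROT47_ALPHABET, 47)


if __name__ == '__main__':
    encoded = rot47_sketch('Hello, World!')
    print(encoded)
    print(rot47_sketch(encoded))  # applying ROT47 twice restores the input
```

Since 47 is half of 94, ROT47 is its own inverse, which is why the same function serves for both encoding and decoding.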
InfernalUnderling 9d30c2132a [utils] Handle rd-suffixed day parts in unified_strdate ()
Remita Amine 237513e801 [yahoo] restore support for cbs suffixed URLs