dirkf
a25e9f3c84
[compat] Use `compat_open()`
1 year ago
dirkf
b2ba24bb02
[InfoExtractor] Add `_match_valid_url()` class method and refactor
...
* API compatible with yt-dlp
* also support Sequence of patterns in _VALID_URL
* one place to compile _VALID_URL
* TODO: remove existing extractor shims
1 year ago
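A minimal sketch of the idea described in the commit above, assuming a class method that compiles `_VALID_URL` in one place and accepts either a single pattern or a sequence of patterns. The class name `SketchIE`, the cache attribute `_valid_url_res`, and the example.com patterns are hypothetical and are not taken from the actual implementation.

```python
import re


class SketchIE(object):
    # Hypothetical extractor: _VALID_URL may be one pattern or a sequence.
    _VALID_URL = (
        r'https?://example\.com/watch/(?P<id>\d+)',
        r'https?://example\.com/v/(?P<id>\d+)',
    )

    @classmethod
    def _match_valid_url(cls, url):
        # Compile once per class and cache the result on that class, so a
        # subclass with its own _VALID_URL gets its own cache.
        if '_valid_url_res' not in cls.__dict__:
            patterns = cls._VALID_URL
            if isinstance(patterns, str):
                patterns = (patterns,)
            cls._valid_url_res = tuple(re.compile(p) for p in patterns)
        # Return the first successful match object, or None.
        for regex in cls._valid_url_res:
            mobj = regex.match(url)
            if mobj:
                return mobj
        return None


mobj = SketchIE._match_valid_url('https://example.com/v/42')
video_id = mobj.group('id') if mobj else None
```

Returning the match object rather than a boolean lets callers read named groups such as `id` directly.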
dirkf
b2741f2654
[InfoExtractor] Add search methods for Next/Nuxt.js from yt-dlp
...
* add _search_nextjs_data(), from https://github.com/yt-dlp/yt-dlp/pull/1386, thanks selfisekai
* add _search_nuxt_data(), from https://github.com/yt-dlp/yt-dlp/pull/1921, thanks Lesmiscore, pukkandan
* add tests for the above
* also fix HTML5 type recognition and tests, from 222a230871, thanks Lesmiscore
* update extractors in PR using above, fix tests.
1 year ago
dirkf
8465222041
[Clipchamp] Add new extractor back-ported from yt-dlp
1 year ago
dirkf
4339910df3
[DLF] Add site extractors back-ported from yt-dlp
...
* from https://github.com/yt-dlp/yt-dlp/pull/6697, thanks nick-cd
1 year ago
dirkf
eaaf4c6736
[Whyp] Add extractor back-ported from yt-dlp
...
* from https://github.com/yt-dlp/yt-dlp/pull/6803, thanks CoryTibbettsDev
1 year ago
dirkf
4566e6e53e
[GlobalPlayer] Add site extractors back-ported from yt-dlp
...
* from https://github.com/yt-dlp/yt-dlp/pull/6903, thanks garret1317
1 year ago
dirkf
1e8ccdd2eb
[InfoExtractor] Support groups in `_search_regex()`, etc
1 year ago
dirkf
fa7f0effbe
[YouTube] Avoid crash in author extraction
2 years ago
pukkandan
9112e668a5
[YouTube] Improve nsig function name extraction
...
Fixes player b7910ca8, using `,` vs `;`
See https://github.com/ytdl-org/youtube-dl/issues/32292#issuecomment-1602231170
Co-authored-by: dirkf
2 years ago
dirkf
07af47960f
[YouTube] Improve fix for ae8ba2c
...
Thx: https://github.com/yt-dlp/yt-dlp/commit/01aba25
2 years ago
dirkf
ae8ba2c319
[YouTube] Fix `KeyError QV` in signature extraction failed
...
* temporarily force missing global definition into sig JS
* improve test: thanks https://github.com/yt-dlp/yt-dlp/issues/7327#issuecomment-1595274615
* resolves #32314
2 years ago
dirkf
ee731f3d00
[ITV] Fix UA capitalisation in 384f632
2 years ago
dirkf
64d6dd64c8
[YouTube] Support Releases tab
2 years ago
dirkf
2da3fa04a6
[YouTube] Simplify signature patterns
2 years ago
pukkandan
3f6d2bd76f
[extractor/youtube] Bypass throttling for `-f17`
...
and related cleanup
Thanks @AudricV for the finding
Ref: yt-dlp/yt-dlp/commit/c9abebb
2 years ago
pukkandan
88f28f620b
[extractor/youtube] Construct fragment list lazily
...
Ref: yt-dlp/yt-dlp/commit/e389d17
See: yt-dlp/yt-dlp#6517
2 years ago
dirkf
6fece0a96b
[AENetworksBaseIE] Report missing show data instead of crash
2 years ago
pukkandan
3da17834a4
[Youtube] Construct dash formats with `range` query
...
See yt-dlp/yt_dlp#6369
2 years ago
dirkf
f7ce98a21e
[YouTube] Support @owner format in uploader_id etc
...
* implement https://github.com/ytdl-org/youtube-dl/issues/31530#issuecomment-1435734719
* update affected tests
* misc clean-ups
2 years ago
pukkandan
1d3751c3fe
Escape URLs in `sanitized_Request`, not `sanitize_url`
...
d2558234cf5dd12d6896eed5427b7dcdb3ab7b5a added escaping of URLs while sanitizing. However, `sanitize_url` may not always receive an actual URL. E.g., when using `youtube-dl "search query" --default-search ytsearch`, `search query` gets escaped to `search%20query` before being prefixed with `ytsearch:`, which is not the intended behavior. So the escaping is moved to `sanitized_Request` instead.
2 years ago
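The problem described in the commit above can be sketched as follows. This is an illustration only: `escape_url()`, `build_search_old()`, `build_search_new()` and `make_request_url()` are hypothetical stand-ins, not the project's real helpers, and the escaping is approximated with `urllib.parse.quote` on Python 3.

```python
from urllib.parse import quote


def escape_url(url):
    # Hypothetical, simplified escaping step: percent-encode unsafe
    # characters but leave common URL delimiters and existing escapes alone.
    return quote(url, safe="%/:=&?~#+!$,;'@()*[]")


def build_search_old(query):
    # Old order (sketch): the input is escaped while being sanitized,
    # before the default-search prefix is added.
    return 'ytsearch:' + escape_url(query)


def build_search_new(query):
    # New order (sketch): sanitizing no longer escapes, so the query
    # reaches the search prefix unchanged.
    return 'ytsearch:' + query


def make_request_url(url):
    # Escaping is deferred to the point where a real URL is requested.
    return escape_url(url)


old = build_search_old('search query')
new = build_search_new('search query')
req = make_request_url('https://example.com/a b?q=1')
```

Under these assumptions `old` carries the unwanted `%20`, while `new` keeps the query intact and only `req`, a genuine URL, is escaped.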
df
6067451e43
[Vimeo] Fix e19ec52 for tween-age Pythons
...
* a check in older Pythons (2.7 and earlier, and the 3.3 and 3.4 series) caused "sre_constants.error: nothing to repeat"
* satisfy the check by avoiding nested qualifiers that can match empty string
Resolves #31597
2 years ago
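The rewriting idea in the commit above can be illustrated with a small sketch. The two patterns below are hypothetical and are not the Vimeo extractor's actual regex; they only show how a repeated group whose body can match the empty string may be replaced by an equivalent pattern whose body always consumes at least one character. Current Pythons compile both forms, so the sketch compares what they match rather than reproducing the error.

```python
import re

# Repeated group whose body ([a-z]*/?) can match the empty string:
# the kind of nested qualifier the commit says older Pythons rejected.
NESTED = re.compile(r'(?:[a-z]*/?)*')

# Equivalent rewrite: every alternative in the repeated group consumes
# at least one character, so the group body can never be empty.
REWRITTEN = re.compile(r'(?:[a-z]+/?|/)*')

samples = ['', 'abc', 'a/b/', '//', 'a//b', 'A']

# For each sample, record whether each pattern matches the whole string.
results = [
    (s, bool(NESTED.fullmatch(s)), bool(REWRITTEN.fullmatch(s)))
    for s in samples
]
```

Both patterns accept exactly the strings made of lowercase letters and slashes, so the rewrite does not change which of these inputs are accepted.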
dirkf
2dd6c6edd8
[YouTube] Avoid crash if uploader_id extraction fails
...
See #31530.
2 years ago
dirkf
42b098dd79
[InfoExtractor] Handle unquoted values in OpenGraph searches
2 years ago
fonkap
6f8c2635a5
[StreamsbIE] Add extractor for streamsb.com (viewsb.com) (#31517)
...
* Add extractor for streamsb.com (viewsb.com)
* make data url using app.js version
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2 years ago
fonkap
de48105dd8
[KommunetvIE] Add extractor for kommunetv.no (#31516)
...
* Add extractor for kommunetv.no
* Using utils.update_url instead of regex
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2 years ago
fonkap
822f19f05d
[FileMoonIE] Add extractor for filemoon.sx (#31515)
...
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2 years ago
Valentin Metz
f33923cba7
[rbgtum] Add new extractor (#31305)
...
* [rbgtum] Add new extractor
* Small update, force CI
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2 years ago
dirkf
e8198c517b
[YouTube] Fix tests
2 years ago
dirkf
bafb6dec72
[YouTube] Refresh compat/utils usage
...
* import parse_qs()
* import parse_qs in lazy_extractors (clears old TODO)
* clean up old compiled lazy_extractors for Py2
* use update_url()
2 years ago
dirkf
30e986b834
[YouTube] Add `signatureTimestamp` for age-gate bypass
2 years ago
dirkf
58988c1421
[YouTube] Bypass age-gating for certain restricted videos
...
* Use TVHTML5_SIMPLY_EMBEDDED_PLAYER client
* Also add and fix tests
* Introduce and use new utility function `update_url()`
2 years ago
dirkf
e19ec52322
[Vimeo] Support /user{video_id}/{slug} URL format
2 years ago
dirkf
f2f90887ca
[Vimeo] Fix `Unable to extract info section` redux
...
* as reported in yt-dlp/yt-dlp#6149
* also allow newline in target JSON object
2 years ago
dirkf
d947ffe8e3
[IGN] Overhaul extractor to avoid URL redirection loop
...
Consequently/also:
* centralise video data extraction
* detect 404 and 503 expected errors
* handle the test video in IGNVideo
* handle two additional page formats for the tests in IGNArticle
2 years ago
dirkf
384f632e8a
[ITV] Overhaul ITV extractor (#30266)
...
* support ITVX URLs (thanks Vangelis66)
* support legacy ITV Hub URLs
* include extraction fix 4c57dd2 from sleaux-meaux 3 May 2021
* include extraction fix 6fbcc16, fix by staubichsauger & pukkandan
* work-around duration parsing pending fix to utils.parse_duration
* apply default vanilla UA for pages and media to avoid site blocking
* also detect and report `Episode not found` instead of generic 404
* rework ITVBTCCIE with geo-block detection, best effort geo-restriction handling, news article support
* fix tests
2 years ago
dirkf
9d17948b5a
[myvideoge] Add new extractor (#31360)
...
NB download tests on CI servers blocked
Co-authored-by: Alfonso Solbes <fonk666@gmail.com>
2 years ago
afterdelight
f316f5d4e3
[xhamster] add support for new domain xhvid.com (#31370)
2 years ago
dirkf
bc6f94e459
[FIFA] Back-port extractor from yt-dlp (#31385)
2 years ago
Epsilonator
be3392a0d4
[Blerp] Add new extractor (#31398)
...
Co-authored-by: dirkf <fieldhouse@gmx.net>
2 years ago
zhangeric-15
6d829d8119
[YouTube] Fix not finding videos listed under a channel's "shorts" subpage. (#31409)
...
Resolves #31336
Co-authored-by: Jouni Järvinen <rautamiekka@users.noreply.github.com>
Co-authored-by: dirkf <fieldhouse@gmx.net>
2 years ago
Ruowang Sun
98b0cf1cd0
[Callin] Add new extractor (#31414)
...
Co-authored-by: dirkf <fieldhouse@gmx.net>
2 years ago
Leon Etienne
e9611a2a36
[pr0gramm] implement InfoExtractor, Resolves #31433 (#31434)
...
* [pr0gramm] implement infoextractor
* [pr0gramm] remove misplaced comment, uncapture regex-group
* [pr0gramm]: specify utf-8 coding
* [pr0gramm]: add trailing comma to lists for maintainability
* [pr0gramm]: ie only sets upload_date attribute
* [pr0gramm]: add video_id to title
* [pr0gramm]: more forgiving _valid_url regex
* [pr0gramm]: add uploader to title, if set
* Discriminate URL pattern
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2 years ago
JChris246
807e593a32
[cammodels] fix and improve extractor (#31453)
...
Co-authored-by: dirkf <fieldhouse@gmx.net>
2 years ago
Brian Marks
37cbdfa0e7
[americastestkitchen] Add support for downloading entire series (#31493)
...
Also
* support new sites and URL patterns
* back-port from yt-dlp
Co-authored-by: dirkf <fieldhouse@gmx.net>
2 years ago
dirkf
195f22f679
[generic] Improve KVS (etc) extraction
2 years ago
dirkf
fc2beab0e7
[generic] Improve KVS (etc) extraction
...
* detect kt_player('kt_player', 'https://.../kt_player.swf?v=5 ...
* detect age limit if 18 USC 2257 is mentioned
* test with shooshtime.com
Partially resolves #31332.
2 years ago
FraFraFra-LongD
1a4fbe8462
Added ThisVid.com support (#29187)
...
* add ThisVidIE, ThisVidMemberIE, ThisVidPlaylistIE
* redirect embed to main page for more metadata
* use KVS extraction newly added to GenericIE and remove duplicate tests
* also add MrDeepFake etc compat to GenericIE
(closes #22390)
Co-authored-by: dirkf <fieldhouse@gmx.net>
2 years ago
dirkf
c2f9be3e63
[generic] Add KVS player extraction
2 years ago
dirkf
604762a9f8
[common:jwplayer] Improve jwplayer extraction and parsing (#31000)
...
* don't crash parser if jwplayer_data is invalid (empty, or no formats)
* use `label` in `sources[n]` as `format_id`
* relax `jwplayer().setup(...)` RE (also rework PR #27274 enhancement)
* detect more manifest formats in _parse_jwplayer_formats() (from PR #29596)
* improve metadata extraction (from PR #25433)
* remember URLs in a set
* use parse_resolution() in format
* extract filesize in format (from yt-dlp)
Co-authored-by: kikuyan <kikuyan@users.noreply.github.com>
Co-authored-by: martin54 <martin54@users.noreply.github.com>
2 years ago