Commit Graph

21061 Commits (80eb0bd9b94106df9e1e5ac288def6e239937329)
 

Author SHA1 Message Date
pukkandan 80185155a1
[ukcolumn] Add Extractor
Closes #287
4 years ago
pukkandan c755f1901f
[CBS] Improve `_VALID_URL` to support movies
Closes #290
Tested by: BeeMuffins
4 years ago
pukkandan 68b91dc905
[youtube] Add `oembed` to reserved names 4 years ago
pukkandan 88f06afc0c
[rmcdecouverte] Improve `_VALID_URL`
Closes #291
4 years ago
CXwudi 40078a55e2
[niconico] Fix bug in thumbnail extraction #289
Bug from: 6b1d8c1e30
Authored by: CXwudi
4 years ago
pukkandan d2558234cf
[utils] Escape URL while sanitizing
Closes #263

While this fixes the issue in question, it does not try to address the root-cause of the problem
Refer: 915f911e36, f5fa042c82
4 years ago
pukkandan f5fa042c82
Revert "[utils] Encode URLs in `YoutubeDLCookieProcessor`"
This reverts commit 915f911e36.

When the request is copied, `unredirected_hdrs` are not copied, which causes issues elsewhere
Reopens #263
4 years ago
pukkandan 07e4a40a9a
[crackle] Improve extraction (See desc)
Closes #282

* Refactor authorization as an extension to `_download_json`
* Better error messages and warnings
* Respect `--ignore-no-formats-error`
* Extract subtitles from manifests
* Try with crackle's geo-location service if all hard-coded countries fail
4 years ago
pukkandan e28f1c0ae8
[cleanup] Fix linter and some typos
* Also remove inconsistent use of `"` in setup.py
4 years ago
pukkandan ef39f8600a
[curiositystream] Fix collections
Closes #277

* A bug with authentication was reported in <https://github.com/yt-dlp/yt-dlp/issues/277#issuecomment-828254721> but cannot be tested without an account
4 years ago
pukkandan 2291dbce2a
[niconico] Fix HLS formats
Closes #171

* The structure of the API JSON was changed
* Smile Video seems to be no longer available. So remove the warning
* Move ping to downloader
* Change heartbeat interval to 40sec
* Remove unnecessary API headers

Authored-by: CXwudi, tsukumijima, nao20010128nao, pukkandan
Tested by: tsukumijima
4 years ago
pukkandan 58f197b76c
Revert "[core] be able to hand over id and title using url_result"
This reverts commit 0704d2224b.

This is a commit from `youtube-dlc`. It is not clear what the original purpose of this was. It seems to be a way for extractors to pass `title` and `id` through when the entry is processed by another extractor

* But `title` can already be passed through using `url_transparent`
* `id` is never supposed to be passed through since it could cause issues with archiving
4 years ago
pukkandan 895b0931e5
[youtube:tab] Detect playlists inside community posts 4 years ago
pukkandan 1ad047d0f7
[nebula] Move to nebula.app
Closes #272
Tested by: Lamieur
4 years ago
pukkandan be6202f12b
Subtitle extraction from streaming media manifests #247
Authored by fstirlitz
Modified from: https://github.com/ytdl-org/youtube-dl/pull/6144

Closes: #73
Fixes:
https://github.com/ytdl-org/youtube-dl/issues/6106
https://github.com/ytdl-org/youtube-dl/issues/14977
https://github.com/ytdl-org/youtube-dl/issues/21438
https://github.com/ytdl-org/youtube-dl/issues/23609
https://github.com/ytdl-org/youtube-dl/issues/28132

Might also fix (untested):
https://github.com/ytdl-org/youtube-dl/issues/15424
https://github.com/ytdl-org/youtube-dl/issues/18267
https://github.com/ytdl-org/youtube-dl/issues/23899
https://github.com/ytdl-org/youtube-dl/issues/24375
https://github.com/ytdl-org/youtube-dl/issues/24595
https://github.com/ytdl-org/youtube-dl/issues/27899

Related:
https://github.com/ytdl-org/youtube-dl/issues/22379
https://github.com/ytdl-org/youtube-dl/pull/24517
https://github.com/ytdl-org/youtube-dl/pull/24886
https://github.com/ytdl-org/youtube-dl/pull/27215

Notes:
* The functions `extractor.common._extract_..._formats` are still kept for compatibility
* Only some extractors have currently been moved to using `_extract_..._formats_and_subtitles`
* Direct subtitle manifests (without a master) are not supported and are wrongly identified as containing video formats
* AES support is untested
* The fragmented TTML subtitles extracted from DASH/ISM are valid, but are unsupported by `ffmpeg` and most video players
    * Their XML fragments can be dumped using `ffmpeg -i in.mp4 -f data -map 0 -c copy out.ttml`.
        Once the unnecessary headers are stripped out of this, it becomes a valid self-contained ttml file
    * The ttml subs downloaded from DASH manifests can also be directly opened with <https://github.com/SubtitleEdit>
* Fragmented WebVTT files extracted from DASH/ISM are also unsupported by most tools
    * Unlike the ttml files, the XML fragments of these cannot be dumped using `ffmpeg`
    * The webtt subs extracted from DASH can be parsed by <https://github.com/gpac/gpac>
    * But validity of the those extracted from ISM are untested
4 years ago
Felix S e8f834cd8d [threeqsdn] Extract subtitles from streaming manifests 4 years ago
Felix S e0e624ca7f [canvas] Extract subtitles from streaming manifests 4 years ago
Felix S ec4f374c05 [wat] Extract subtitles from streaming manifests 4 years ago
Felix S c811e8d8bd [atresplayer] Extract subtitles from streaming manifests 4 years ago
Felix S b2cd5da460 [francetv] Extract subtitles from the HLS manifest 4 years ago
Felix S 2de3b21e05 [uplynk] Extract subtitles from HLS manifests 4 years ago
Felix S 4bed436371 [twitter] Extract subtitles from HLS manifests 4 years ago
Felix S efe9dba595 [srgssr] Extract subtitles from HLS manifests 4 years ago
Felix S 47f4203dd3 [nytimes] Extract subtitles from HLS manifests 4 years ago
Felix S 015c10aeec [roosterteeth] Use common code for subtitle extraction 4 years ago
Felix S a00d781b73 [elonet] Use common code for subtitle extraction 4 years ago
Felix S 0c541b563f [tv4] Extract subtitles from streaming manifests 4 years ago
Felix S 64a5cf7929 [byutv] Extract subtitles from streaming manifests 4 years ago
Felix S 7a450a3b1c [generic] Extract subtitles from direct SSTR manifest links 4 years ago
Felix S 7de27caf16 [generic] Extract subtitles from direct DASH manifest links 4 years ago
Felix S c26326c1be [generic] Extract subtitles from direct HLS manifest links 4 years ago
Felix S 66a1b8643a [downloader/ism] Support muxing TTML subtitles 4 years ago
Felix S 15828bcf25 [downloader/hls] Handle MPEG-2 PES timestamp overflow 4 years ago
Felix S 333217f43e [downloader/hls] Remove duplicate cues using a sliding window of candidates 4 years ago
Felix S 4a2f19abbd [downloader/hls] Assemble single-file WebVTT subtitles from HLS segments 4 years ago
Felix S 5fbcebed8c [test] Test SSTR manifest parsing 4 years ago
Felix S becdc7f82c [test] Test subtitle extraction from DASH manifests 4 years ago
Felix S 73b9088a1c [test] Test subtitle extraction from HLS manifests 4 years ago
Felix S f6a1d69a87 [extractor/common] Extend _extract_akamai_formats to also extract subtitle tracks 4 years ago
Felix S fd76a14259 [extractor/common, downloader/ism] Extract SSTR subtitle tracks
_parse_ism_formats was extended into _parse_ism_formats_and_subtitles;
all direct users were updated, though _extract_ism_formats was left
as a compatibility wrapper.

The SSTR downloader was also modified in order to prepare for muxing
subtitle streams, although no support for any subtitle codecs was
added in this commit.
4 years ago
Felix S 171e59edd4 [extractor/common] Extract DASH subtitle tracks
_extract_mpd_formats and _parse_mpd_formats were extended into
_…_formats_and_subtitles; wrappers with old names are provided
for compatibility.
4 years ago
Felix S a0c3b2d5cf [extractor/common] Extract HLS subtitle tracks
_extract_m3u8_formats is renamed to _extract_m3u8_formats_and_subtitles
and extended to handle subtitle tracks instead of skipping them;
a wrapper with the old name is provided for compatibility.

_parse_m3u8_formats is likewise renamed and extended, but without adding
the compatibility wrapper; the test suite is adjusted to test the enhanced
method instead.
4 years ago
Felix S 19bb39202d [extractor/common] Generalise _merge_subtitles
This allows modifying a subtitles dictionary in-place.
4 years ago
Felix S d4553567d2 [downloader/ism] Prevent writing the header again when resuming an interrupted download 4 years ago
Felix S 4d49884c58 [downloader/fragment] Allow persisting extra state when a download is interrupted 4 years ago
Felix S 5873d4ccdd [utils] Improve bug_report_message
Add an optional argument specifying the text that should go before
the message.
4 years ago
Hadi0609 db9a564b6a
[zee5] Fix extraction for some URLs (#279)
Closes: #278
4 years ago
Felix S c72967d5de
[mediasite] Generalize URL pattern (#275)
Authored by: fstirlitz
4 years ago
pukkandan 598d185db1
Fix case sensitivity of format selector
Bug introduced in f8d4ad9ab0
4 years ago
pukkandan b982cbdd0e
[limelight] Obey `allow_unplayable_formats` 4 years ago