Commit Graph

222 Commits (3bf8c316a663741ab806f48cf9121209aba88b72)

Author SHA1 Message Date
Sergey M․ 74fe23ec35 [extractor/common] Style 10 years ago
Yen Chi Hsuan a38436e889 [extractor/common] Add 'transform_source' parameter to _extract_f4m_formats() 10 years ago
Sergey M․ 31c746e5dc [extractor/common] Keep going in some media_url is missing 10 years ago
Sergey M․ 70f0f5a8ca [extractor/common] Recursively extract child f4m manifests 10 years ago
Sergey M․ cc357c4db8 [extractor/common] Properly handle full URLs 10 years ago
Sergey M․ 97f4aecfc1 [extractor/common] Handle malformed f4m manifests 10 years ago
Sergey M․ cf61d96df0 [extractor/common] Add _form_hidden_inputs 10 years ago
Sergey M․ f8da79f828 [extractor/common] Improve _form_hidden_inputs and rename to _hidden_inputs 10 years ago
Sergey M․ 27713812a0 [extractor/common] Add method for extracting form hidden input fields as dict 10 years ago
Yen Chi Hsuan 13af92fdc4 [common] Add 'fatal' to _extract_m3u8_formats 10 years ago
Sergey M․ 5414623791 [extractor/common] Remove superfluous line 10 years ago
Sergey M․ c342041fba [extractor/common] Use NO_DEFAULT from utils 10 years ago
Yen Chi Hsuan 621ed9f5f4 [common] Add note and errnote field for _extract_m3u8_formats 10 years ago
Sergey M․ baa43cbaf0 [extractor/common] Relax valid url check verbosity 10 years ago
Yen Chi Hsuan c1c924abfe [utils,common] Merge format_srt_time and _subtitles_timecode
format_srt_time uses a comma as the delimiter between seconds and
milliseconds while _subtitles_timecode uses a dot. All .srt examples I
found on the Internet uses a comma, so I use a comma in the merged
version. See http://matroska.org/technical/specs/subtitles/srt.html and
http://devel.aegisub.org/wiki/SubtitleFormats/SRT
10 years ago
Yen Chi Hsuan 05d5392cda [common] Ignore subtitles in m3u8 10 years ago
Sergey M․ 74f728249f [extractor/common] Fallback to empty string for (yet) missing `format_id` in `_sort_formats` (Closes #5624) 10 years ago
Jaime Marquínez Ferrándiz 2ddcd88129 Remove code that was only used by the Grooveshark extractor 10 years ago
zouhair cf0649f8b7 Typo: twice "the the" to "the" 10 years ago
Sergey M․ 3ded7bac16 [extractor/common] Add ability to specify custom field preference for `_sort_formats` 10 years ago
Jaime Marquínez Ferrándiz 08f2a92c9c InfoExtractor._search_regex: Suggest updating when the regex is not found (suggested in #5442)
Reuse the same message from ExtractorError
10 years ago
Yen Chi Hsuan c9a779695d [extractor/common] Add the encoding parameter
The QQMusic info extractor need forced encoding for correct working.
10 years ago
Sergey M․ 830d53bfae [utils] Add `video_title` for `url_result` 10 years ago
Sergey M․ e21a55abcc [extractor/common] Remove f4m section
It's now provided by `f4m_id`
10 years ago
Sergey M․ 4a34f69ea6 [extractor/common] Add subtitles timecode formatter 10 years ago
Sergey M․ f207019ce5 [extractor/common] Remove 'm3u8' from quality selection URL 10 years ago
Sergey M․ 8dc9d361c2 [extractor/common] Fix format_id when `last_media` is None and always include `m3u8_id` if present
The rationale behind `m3u8_id` was to resolve duplicates when processing several m3u8 playlists within the same media that give equal resulting `format_id`'s,
e.g. `youtube-dl http://www.rts.ch/play/tv/passe-moi-les-jumelles/video/la-fee-des-bois-mustang-les-chemins-du-vent?id=3854925 -F`
10 years ago
Philipp Hagemeister a0bb7c5593 [extractor/common] Improve m3u format IDs (#5143) 10 years ago
Sergey M․ 2f0f6578c3 [extractor/common] Assume non HTTP(S) URLs valid 10 years ago
Philipp Hagemeister 72a406e7aa [extractor/common] Pass in video_id (#5057) 10 years ago
Antti Ajanki 6f4ba54079 [extractor/common] Extract HTTP (possibly f4m) URLs from a .smil file 10 years ago
Antti Ajanki 637570326b [extractor/common] Extract the first of a seq of videos in a .smil file 10 years ago
Jaime Marquínez Ferrándiz bfc993cc91 Merge branch 'subtitles-rework'
(Closes PR #4964)
10 years ago
Sergey M․ 9fe6ef7ab2 [extractor/common] Fix preference for m3u8 quality selection URL 10 years ago
Philipp Hagemeister 8fb3ac3649 PEP8: W503 10 years ago
Philipp Hagemeister 77b2986b5b [extractor/common] Recognize Indian censorship (#5021) 10 years ago
Jaime Marquínez Ferrándiz 9868ea4936 [extractor/common] Simplify subtitles handling methods
Initially I was going to use a single method for handling both subtitles and automatic captions, that's why I used the 'list_subtitles' and the 'subtitles' variables.
10 years ago
Philipp Hagemeister fa15607773 PEP8 fixes 10 years ago
Jaime Marquínez Ferrándiz 4cd95bcbc3 [twitch:stream] Prefer the 'source' format (fixes #4972) 10 years ago
Sergey M? 4069766c52 [extractor/common] Test URLs with GET 10 years ago
Jaime Marquínez Ferrándiz 360e1ca5cc [youtube] Convert to new subtitles system
The automatic captions are stored in the 'automactic_captions' field, which is used if no normal subtitles are found for an specific language.
10 years ago
Jaime Marquínez Ferrándiz c84dd8a90d [YoutubeDL] store the subtitles to download in the 'requested_subtitles' field
We need to keep the orginal subtitles information, so that the '--load-info' option can be used to list or select the subtitles again.
We'll also be able to have a separate field for storing the automatic captions info.
10 years ago
Jaime Marquínez Ferrándiz a504ced097 Improve subtitles support
For each language the extractor builds a list with the available formats sorted (like for video formats), then YoutubeDL selects one of them using the '--sub-format' option which now allows giving the format preferences (for example 'ass/srt/best').
For each format the 'url' field can be set so that we only download the contents if needed, or if the contents needs to be processed (like in crunchyroll) the 'data' field can be used.

The reasons for this change are:
* We weren't checking that the format given with '--sub-format' was available, checking it in each extractor would be repetitive.
* It allows to easily support giving a format preference.
* The subtitles were automatically downloaded in the extractor, but I think that if you use for example the '--dump-json' option you want to finish as fast as possible.

Currently only the ted extractor has been updated, but the old system still works.
10 years ago
Philipp Hagemeister 03cd72b007 [extractor/common] Move up filesize
filesize and tbr should correlate, so it doesn't make sense to treat them differently.
10 years ago
Jaime Marquínez Ferrándiz 6ca7732d5e [extractor/common] Fix link to external documentation 10 years ago
Jaime Marquínez Ferrándiz 2d30521ab9 [youtube] Extract average rating (closes #2362) 10 years ago
Philipp Hagemeister 9650885be9 [escapist] Filter video differently (Fixes #4919) 10 years ago
Philipp Hagemeister 7e5db8c930 [options] Add --no-color 10 years ago
Philipp Hagemeister 3a5bcd0326 [extractor/common] Wrap extractor errors (Fixes #1194)
For now, we just wrap some common errors. More may follow. We do not want to catch actual programming errors in the extractors, such as 1 // 0.
10 years ago
Naglis Jonaitis 69319969de [extractor/common] Add new helper method _family_friendly_search 10 years ago