Yen Chi Hsuan
fac2af3c51
[common] Fix m3u8 extraction in f4m manifests
9 years ago
Yen Chi Hsuan
448bb5f333
[common] Fix non-bootstrapped support in f4m
9 years ago
Yen Chi Hsuan
240b60453e
[common] Support m3u8 in f4m manifests
Related: #9531
9 years ago
Yen Chi Hsuan
0a5685b26f
[common] Support non-bootstrapped streams in f4m manifests
Related: #9531
9 years ago
Yen Chi Hsuan
7b2fcbfd4e
[common] Skip TYPE=CLOSED-CAPTIONS lines in m3u8 manifests
According to [1], valid values for TYPE are AUDIO, VIDEO, SUBTITLES
and CLOSED-CAPTIONS. Such a value is found in Anvato master playlists,
though I don't use _extract_m3u8_formats() in the end.
Part of #9522.
[1] https://tools.ietf.org/html/draft-pantos-http-live-streaming-19#section-4.3.4.1
9 years ago
Yen Chi Hsuan
16da9bbc29
[common] Add _m3u8_meta_format() template
For extractors that handle m3u8 manifests by themselves (e.g., AnvatoIE).
Part of #9522
9 years ago
Yen Chi Hsuan
ad96b4c8f5
[common] Extract audio formats in SMIL
Found in http://www.cbc.ca/player/play/2657631896
Closes #5156
9 years ago
Sergey M․
ed56f26039
[extractor/common] Improve name extraction for m3u8 formats
9 years ago
Sergey M․
8a92e51c60
[extractor/common] Relax wording for creator metafield
9 years ago
Yen Chi Hsuan
e9c6cdf4a1
[common] Fix format_id construction for HLS
9 years ago
Kagami Hiiragi
b24d6336a7
[vlive] Add support for live videos
9 years ago
Yen Chi Hsuan
d6712378e7
Merge branch 'akamai_pv' of https://github.com/remitamine/youtube-dl into remitamine-akamai_pv
9 years ago
remitamine
fb72ec58ae
[extractor/common] do not process f4m manifest that contain akamai playerVerificationChallenge
9 years ago
Yen Chi Hsuan
2c0d9c6217
[extractor/common] Allow empty post data
9 years ago
Sergey M․
49caf3307f
[extractor/common] Remove irrelevant comment
9 years ago
Sergey M․
bacec0397f
[extractor/common] Relax _hidden_inputs
9 years ago
Sergey M․
fb38aa8b53
[extractor/common] Support arbitrary format strings for template based identifiers in mpd manifests (Closes #9119, closes #9120)
9 years ago
Sergey M․
7a93ab5f3f
[extractor/common] Introduce music album metafields
9 years ago
Sergey M․
b507cc925b
[extractor/common] Carry long line
9 years ago
Sergey M․
db8ee7ec05
[extractor/common] Fix numeric identifiers conversion in DASH URL templates
9 years ago
remitamine
df634be2ed
[common] prefer using mime type over ext for smil subtitle extraction
The subtitle ext for http://www.cnet.com/videos/download-amazon-prime-movies-and-tv/
is adb_xml, while using the mime type it gets tt (application/smptett+xml).
9 years ago
Sergey M․
41d06b0424
[extractor/common] Improve _request_webpage
* Do not ignore data, headers and query for Requests
* Default values for headers and query switched to dicts since these are used by urllib itself
9 years ago
Sergey M․
b22ca76204
[extractor/common] Filter out unsupported encrypted media for f4m formats (Closes #8573)
9 years ago
Sergey M․
19dbaeece3
Remove _sort_formats from _extract_*_formats methods
Now _sort_formats should be called explicitly.
_sort_formats has been added to all the necessary places in code.
Closes #8051
9 years ago
Sergey M․
15707c7e02
[compat] Add compat_urllib_parse_urlencode and eliminate encode_dict
encode_dict functionality has been improved and moved directly into compat_urllib_parse_urlencode
All occurrences of compat_urllib_parse.urlencode throughout the codebase have been replaced by compat_urllib_parse_urlencode
Closes #8974
9 years ago
remitamine
49dea4913b
Merge pull request #8513 from remitamine/dash-sort
[extractor/common] fix dash formats sorting
9 years ago
Sergey M․
0fdbb3322b
[extractor/common] Add _parse_f4m_formats routine
9 years ago
remitamine
09f572fbc0
[extractor/common] add transform_source to _download_smil and _extract_smil_formats
9 years ago
remitamine
15bf934de5
Merge pull request #8819 from remitamine/simple-webpage-requests
[extractor/common] simplify using data, headers and query params with _download_* methods
9 years ago
remitamine
cdfee16818
[extractor/common] add data, headers and query params to _request_webpage
9 years ago
Yen Chi Hsuan
a6c8b75904
[common] Use mimeType to determine file extensions (#8766)
9 years ago
Yen Chi Hsuan
64f08d4ff2
Merge pull request #8766 from yan12125/dash-detect-ext
Detect file extensions of DASH formats from their codecs
9 years ago
Yen Chi Hsuan
af7d5a63b2
[common] Document protocol http_dash_segments
9 years ago
Yen Chi Hsuan
2def60c5f3
[common] Use codec2ext for DASH formats (#8764)
9 years ago
Yen Chi Hsuan
e9c0cdd389
[jython] Introduce compat_os_name
os.name is always 'java' on Jython
9 years ago
Sergey M․
7bcd2830dd
[extractor/common] Document uploader_url
9 years ago
Sergey M․
2bc0c46f98
[extractor/common] Document license metafield
9 years ago
Sergey M․
d77ab8e255
Add --mark-watched feature (Closes #5054)
9 years ago
Sergey M․
9cdffeeb3f
[extractor/common] Clarify rationale on media playlist detection
9 years ago
Sergey M․
fbb6edd298
[extractor/common] Properly extract audio only formats in master m3u8 playlists
9 years ago
Sergey M․
f5bdb44443
[extractor/common] Add _remove_duplicate_formats
9 years ago
remitamine
cafcf657a4
add more subtitles mime types to mimetype2ext and fix the platform subtitle extraction
9 years ago
Sergey M․
611c1dd96e
[refactor] Single quotes consistency
9 years ago
Sergey M․
d800609c62
[refactor] Do not specify redundant None as second argument in dict.get()
9 years ago
Sergey M․
bb20526b64
[extractor/common] Improve base url construction
9 years ago
remitamine
c349456ef6
[extractor/common] strip http urls in smil manifest
9 years ago
remitamine
81e1c4e2fc
[extractor/common] remove duplicate rtmp formats in smil manifest
9 years ago
remitamine
dd86780596
[extractor/common] fix dash formats sorting
9 years ago
remitamine
154c209e2d
[extractor/common] improve dash format ids
9 years ago
remitamine
51e9094f4a
[extractor/common] extract youtube dash formats filesize (fixes #8480)
9 years ago
remitamine
d413095f7e
[extractor/common] remove duplicated formats and subtitles in smil manifests
9 years ago
remitamine
6a3828fddd
[common] use float conversion instead of using division from __future__
9 years ago
remitamine
91cb6b5065
rename _parse_mpd to _parse_mpd_formats and add default value for mpd namespace
9 years ago
remitamine
0826a0b555
[common] sort dash formats
9 years ago
remitamine
255732f0d3
[common] fix segment duration calculation
9 years ago
remitamine
53c269c6fd
[common] fix media_template string formatting
9 years ago
remitamine
675d001633
[common] skip drm protected dash formats
9 years ago
remitamine
d577c79632
[common] ignore ISO 639-2 generic codes
9 years ago
remitamine
f14be22816
[common] remove duplicate reference to namespace
9 years ago
remitamine
9c74423510
[common] fix media template regex
9 years ago
remitamine
1bac34556f
[common] add a generic support for mpd manifests
9 years ago
Yen Chi Hsuan
2d2fa82d17
[common] Add _extract_dash_manifest_formats
9 years ago
Yen Chi Hsuan
c94678957f
[common] Remove unused arguments
9 years ago
Yen Chi Hsuan
16f38a699f
[common] Rename to namespace
For consistency with _parse_smil_*
9 years ago
Yen Chi Hsuan
df374b5222
[common] Prefer the manifest over formats_dict in determining codecs
9 years ago
Yen Chi Hsuan
5ea1eb78f5
[common] Fix for youtube
9 years ago
Yen Chi Hsuan
b323e1707d
[common] Modify _parse_dash_manifest for use in Facebook
9 years ago
Yen Chi Hsuan
17b598d30c
[common] _parse_dash_manifest() from youtube.py
9 years ago
Sergey M․
350cf045d8
[extractor/common] Restrict checks when auto calculating tbr
9 years ago
remitamine
a9d5f12fec
Merge pull request #8328 from remitamine/hls-master-detect
[extractor/common] detect media playlist in _extract_m3u8_formats
9 years ago
remitamine
7f32e5dc35
[extractor/common] detect media playlist in _extract_m3u8_formats
9 years ago
Sergey M․
b0d21deda9
[extractor/common] Auto calculate tbr when missing
9 years ago
Yen Chi Hsuan
77f785076f
[common] Keep full codec name from m3u8 manifests
See #8293. This is for consistency between YouTube and HLS formats.
9 years ago
Yen Chi Hsuan
0b26ba3fc8
[extractor/common] Allow passing more parameters to _search_json_ld
9 years ago
Sergey M․
4ca2a3cf3c
[extractor/common] Add initial support for JSON-LD metadata extraction into info_dict
9 years ago
Jakub Wilk
dfb1b1468c
Fix typos
Closes #8200.
9 years ago
Sergey M
3f3343cd3e
Merge pull request #8061 from dstftw/introduce-chapter-and-series-fields
Introduce chapter and series fields
9 years ago
Sergey M․
27bfd4e526
[extractor/common] Introduce number fields for chapters and series
9 years ago
Philipp Hagemeister
32f9036447
[ccc] Add language information to formats
9 years ago
Sergey M․
7109903e61
[extractor/common] Document chapter and series fields
9 years ago
Sergey M․
7e5edcfd33
Simplify formats accumulation for f4m/m3u8/smil formats
Now all _extract_*_formats routines return a list
9 years ago
remitamine
39d60b715a
Merge pull request #7769 from remitamine/sort
[common] lower (m3u8,rtmp,rtsp) format preference only if required program is not available
9 years ago
remitamine
d497a201ca
[common] use specific variable for protocol preference in _sort_formats
9 years ago
remitamine
8d29e47f54
[common] simplify the use of _extract_m3u8_formats and _extract_f4m_formats
9 years ago
Sergey M․
9b9c5355e4
Rename error_to_str to error_to_compat_str
9 years ago
Sergey M․
7f8b271465
Properly convert errors to strings
9 years ago
Sergey M․
dd85e4d707
[extractor/common] Properly decode error string on python 2 (Closes #1354, closes #3957, closes #4037, closes #6449)
9 years ago
Sergey M․
62d231c004
[extractor/common] Clarify duration can be float
9 years ago
Sergey M․
5c2266df4b
Switch codebase to use sanitized_Request instead of compat_urllib_request.Request
[downloader/dash] Use sanitized_Request
[downloader/http] Use sanitized_Request
[atresplayer] Use sanitized_Request
[bambuser] Use sanitized_Request
[bliptv] Use sanitized_Request
[brightcove] Use sanitized_Request
[cbs] Use sanitized_Request
[ceskatelevize] Use sanitized_Request
[collegerama] Use sanitized_Request
[extractor/common] Use sanitized_Request
[crunchyroll] Use sanitized_Request
[dailymotion] Use sanitized_Request
[dcn] Use sanitized_Request
[dramafever] Use sanitized_Request
[dumpert] Use sanitized_Request
[eitb] Use sanitized_Request
[escapist] Use sanitized_Request
[everyonesmixtape] Use sanitized_Request
[extremetube] Use sanitized_Request
[facebook] Use sanitized_Request
[fc2] Use sanitized_Request
[flickr] Use sanitized_Request
[4tube] Use sanitized_Request
[gdcvault] Use sanitized_Request
[extractor/generic] Use sanitized_Request
[hearthisat] Use sanitized_Request
[hotnewhiphop] Use sanitized_Request
[hypem] Use sanitized_Request
[iprima] Use sanitized_Request
[ivi] Use sanitized_Request
[keezmovies] Use sanitized_Request
[letv] Use sanitized_Request
[lynda] Use sanitized_Request
[metacafe] Use sanitized_Request
[minhateca] Use sanitized_Request
[miomio] Use sanitized_Request
[meovideo] Use sanitized_Request
[mofosex] Use sanitized_Request
[moniker] Use sanitized_Request
[mooshare] Use sanitized_Request
[movieclips] Use sanitized_Request
[mtv] Use sanitized_Request
[myvideo] Use sanitized_Request
[neteasemusic] Use sanitized_Request
[nfb] Use sanitized_Request
[niconico] Use sanitized_Request
[noco] Use sanitized_Request
[nosvideo] Use sanitized_Request
[novamov] Use sanitized_Request
[nowness] Use sanitized_Request
[nuvid] Use sanitized_Request
[played] Use sanitized_Request
[pluralsight] Use sanitized_Request
[pornhub] Use sanitized_Request
[pornotube] Use sanitized_Request
[primesharetv] Use sanitized_Request
[promptfile] Use sanitized_Request
[qqmusic] Use sanitized_Request
[rtve] Use sanitized_Request
[safari] Use sanitized_Request
[sandia] Use sanitized_Request
[shared] Use sanitized_Request
[sharesix] Use sanitized_Request
[sina] Use sanitized_Request
[smotri] Use sanitized_Request
[sohu] Use sanitized_Request
[spankwire] Use sanitized_Request
[sportdeutschland] Use sanitized_Request
[streamcloud] Use sanitized_Request
[streamcz] Use sanitized_Request
[tapely] Use sanitized_Request
[tube8] Use sanitized_Request
[tubitv] Use sanitized_Request
[twitch] Use sanitized_Request
[twitter] Use sanitized_Request
[udemy] Use sanitized_Request
[vbox7] Use sanitized_Request
[veoh] Use sanitized_Request
[vessel] Use sanitized_Request
[vevo] Use sanitized_Request
[viddler] Use sanitized_Request
[videomega] Use sanitized_Request
[viewster] Use sanitized_Request
[viki] Use sanitized_Request
[vk] Use sanitized_Request
[vodlocker] Use sanitized_Request
[voicerepublic] Use sanitized_Request
[wistia] Use sanitized_Request
[xfileshare] Use sanitized_Request
[xtube] Use sanitized_Request
[xvideos] Use sanitized_Request
[yandexmusic] Use sanitized_Request
[youku] Use sanitized_Request
[youporn] Use sanitized_Request
[youtube] Use sanitized_Request
[patreon] Use sanitized_Request
[extractor/common] Remove unused import
[nfb] PEP 8
9 years ago
Sergey M․
019839faaa
[extractor/common] Use baseURL from f4m manifest for recursive manifest extraction
9 years ago
Sergey M
30eecc6a04
Merge pull request #7296 from jaimeMF/xml_attrib_unicode
Use a wrapper around xml.etree.ElementTree.fromstring in python 2.x (…
9 years ago
Sergey M․
dbd82a1d4f
[extractor/common] Fix m3u8 extraction on failure
9 years ago
Sergey M․
dc519b5421
[extractor/common] Make ie_key and IE_NAME return unicode string
9 years ago
Jaime Marquínez Ferrándiz
36e6f62cd0
Use a wrapper around xml.etree.ElementTree.fromstring in python 2.x (#7178)
Attributes aren't unicode objects, so they couldn't be directly used in info_dict fields (for example '--write-description' doesn't work with bytes).
9 years ago
remitamine
3711304510
[extractor/common] get the redirected m3u8_url in _extract_m3u8_formats
9 years ago
Jaime Marquínez Ferrándiz
865d1fbafc
[extractor/common] Remove unused import
9 years ago
Sergey M․
943a1e24b8
[extractor/common] Use more generic URLError in _is_valid_url
9 years ago
Sergey M․
02835c6bf4
[extractor/common] Document repost_count
9 years ago
Sergey M․
448ef1f31c
[extractor/common] Allow angle brackets in attributes in _og_regexes (#7215)
9 years ago
Sergey M․
7a6d76a64d
[extractor/common] Require closing quote in _og_regexes (Closes #7174)
E.g. do not match `property='og:video:type'` when `og:video` is requested.
9 years ago
Sergey M․
4180a3d8b7
[extractor/common] Allow quoteless content attribute in og regexes (Closes #7115)
9 years ago
Yen Chi Hsuan
57935b2564
[extractor/common] Allow HTML5 unquoted attribute values
Fixes #7108
HTML5 allows unquoted attribute values. See the "Unquoted attribute value
syntax" section [1] for more information
[1] http://www.w3.org/TR/html5/syntax.html
9 years ago
Sergey M․
4bba371644
[YoutubeDL] Autocalculate ext for subtitles when missing
9 years ago
Sergey M․
e5851b963a
[extractor/common] Make f4m extraction for SMIL non fatal
9 years ago
Sergey M․
4de6131090
[extractor/common] Add fatal to _extract_f4m_formats
9 years ago
Sergey M․
3a1341a7bc
[extractor/common] Make m3u8 extraction for SMIL non fatal
9 years ago
Sergey M․
c78e48177c
[extractor/common] Check validity of direct URLs
9 years ago
Sergey M․
647eab4541
[extractor/common] Extract upload date from SMIL
9 years ago
Sergey M․
1e5bcdec02
[extractor/common] Extract images from SMIL
9 years ago
Sergey M․
e7d8e98a9f
[extractor/common] Allow float bitrates
9 years ago
Sergey M․
8aab976bbd
[extractor/common] Document release_date field
9 years ago
Sergey M․
c430802e32
[extractor/common] Add raise_geo_restricted
9 years ago
Sergey M․
586f1cc532
[extractor/common] Skip html comment tags (Closes #6822)
9 years ago
Sergey M․
73eb13dfc7
[extractor/common] Case insensitive inputs extraction
9 years ago
Sergey M․
be0e5dbd83
[extractor/common] Extract submit inputs
9 years ago
Sergey M․
43e7d3c945
[extractor/common] Add raise_login_required
9 years ago
Jaime Marquínez Ferrándiz
8c97f81943
[common] Follow convention of using 'cls' in classmethods
9 years ago
Yen Chi Hsuan
f738dd7b7c
[common] Remove debugging code
10 years ago
Yen Chi Hsuan
912e0b7e46
[common] Add _merge_subtitles()
10 years ago
Yen Chi Hsuan
03bc7237ad
[common] _parse_smil_subtitles: accept `lang` as the subtitle language
10 years ago
Sergey M․
5cdefc4625
[extractor/common] Add more subtitle mime types for guess when ext is missing
10 years ago
Sergey M․
ce00af8767
[extractor/common] Add default subtitles lang
10 years ago
Yen Chi Hsuan
f877c6ae5a
[theplatform] Use InfoExtractor._parse_smil_formats()
10 years ago
Sergey M․
e64b756943
[extractor/common] Interactive TFA code input
10 years ago
Sergey M․
201ea3ee8e
[extractor/common] Improve _hidden_inputs
10 years ago
Sergey M․
8b9848ac56
[extractor/common] Expand meta regex
10 years ago
Sergey M․
942acef594
[extractor/common] Extract _parse_xspf
10 years ago
Sergey M․
98044462b1
[extractor/common] Use playlist id as default title
10 years ago
Sergey M․
e0b9d78fab
[extractor/common] Clarify playlists can have description field
10 years ago
Sergey M․
8d6765cf48
[extractor/generic] Add generic support for xspf playlist extraction
10 years ago
Sergey M.
d5d7bdaeb5
Merge pull request #6428 from dstftw/improve-generic-smil-support
Improve generic SMIL support
10 years ago
Sergey M․
5b0c40da24
[extractor/common] Expand meta regex
10 years ago
Sergey M․
17712eeb19
[extractor/common] Extract namespace parse routine
10 years ago
Sergey M․
41c3a5a7be
[extractor/common] Fix python 3
10 years ago
Sergey M․
a107193e4b
[extractor/common] Extract f4m and m3u8 formats, subtitles and info
10 years ago
remitamine
799207e838
[viewster] extract the api auth token
Closes #6406.
10 years ago
Sergey M․
864f24bd2c
[extractor/common] Add _meta_regex and clarify tags field
10 years ago
Purdea Andrei
5316bf7487
Documented tags as a possible dict key
10 years ago
Sergey M․
10952eb2cf
[extractor/common] Consistent URL spelling
10 years ago
Jaime Marquínez Ferrándiz
297a564bee
[youtube] Extract end_time
10 years ago
Jaime Marquínez Ferrándiz
7c80519cbf
[youtube] Extract start_time
From the 't=*' in the url.
Currently youtube-dl doesn't use the value, but it was requested for the mpv plugin.
10 years ago
Sergey M․
74fe23ec35
[extractor/common] Style
10 years ago
Yen Chi Hsuan
a38436e889
[extractor/common] Add 'transform_source' parameter to _extract_f4m_formats()
10 years ago
Sergey M․
31c746e5dc
[extractor/common] Keep going if some media_url is missing
10 years ago
Sergey M․
70f0f5a8ca
[extractor/common] Recursively extract child f4m manifests
10 years ago
Sergey M․
cc357c4db8
[extractor/common] Properly handle full URLs
10 years ago
Sergey M․
97f4aecfc1
[extractor/common] Handle malformed f4m manifests
10 years ago
Sergey M․
cf61d96df0
[extractor/common] Add _form_hidden_inputs
10 years ago
Sergey M․
f8da79f828
[extractor/common] Improve _form_hidden_inputs and rename to _hidden_inputs
10 years ago
Sergey M․
27713812a0
[extractor/common] Add method for extracting form hidden input fields as dict
10 years ago