Commit Graph

8711 Commits (1b5a1ae257c6d64fd69ea91eaf69328d4b6d17df)
 

Author SHA1 Message Date
Sergey M․ af9cdee9cb [brightcove] Improve and generalize brightcove URL extraction from JS
fnord b4e1576aee Brightcove extractor: support customBC.createVideo(...); method
found in http://www.americanbar.org/groups/family_law.html and
http://america.aljazeera.com/watch/shows/america-tonight/2015/6/exclusive-hunting-isil-with-the-pkk.html
Sergey M․ 78e2b74bb9 [tumblr] Add support for pornhub embeds (Closes )
Sergey M․ 65d161c480 [extractor/generic] Add support for pornhub embeds
Sergey M․ 9fcbd5db2a [pornhub] Add support for embeds
Sergey M․ 4f3bf679f5 [vk] Fix authentication for non-ASCII login/password
Sergey M․ 0072afca8e [YoutubeDL] Remove force_generic_extractor arg from process_ie_result
Sergey M․ 61aa5ba36e [YoutubeDL] Remove global state for force_generic_extractor flag in favor of passing argument
Sergey M․ 9f4323252a [YoutubeDL] Fix for multiple URLs
Sergey M․ 8b6c896c4b [prosiebensat1] Add title regex
Sergey M․ 185dbc4974 [prosiebensat1] Fix rtmp extraction (Closes )
Sergey M․ 3d535e0471 [tvc] Fix embed regex
Sergey M․ d22dec74ff Add `--force-generic-extractor`
For some extractors that are hard to workout a good _VALID_URL we use very vague and unrestrictive ones,
e.g. just allowing anything after hostname and capturing part of URL as id.
If some of these extractors happen to have an video embed of some different hoster or platform
and this scenario was not handled in extractor itself we end up with inability to download this embed
until extractor is fixed to support embed of this kind.
Forcing downloader to use the generic extractor can be a neat temporary solution for this problem.
Example: FiveTV extractor with Tvigle embed - http://www.5-tv.ru/rabota/broadcasts/48/
Sergey M․ 9872d3110c [extractor/generic] Add support for tvigle embeds
Sergey M․ b859971873 [extractor/generic] Rename tvc embed url variable
Sergey M․ e5095f1198 Merge branch 'hlintala-5tv'
Sergey M․ 499a077761 [5tv] Improve
Sergey M․ 5da7177729 Merge branch '5tv' of https://github.com/hlintala/youtube-dl into hlintala-5tv
Sergey M․ 3507766bd0 Merge branch 'hlintala-tvc'
Sergey M․ f37bdbe537 [extractor/generic] Add test for tvc embed
Sergey M․ 2da09ff8b0 [extractor/generic] Fix tvc ie_key
Sergey M․ 5ccddb7ecf [tvc] Fix ie_key
Sergey M․ 954c1d0529 [tvc] Refactor extractor names
Sergey M․ 494f20cbdc [extractor/generic] Add support for tvc embeds
Sergey M․ 29902c8ec0 [tvc:embed] Add embed extraction routine
Sergey M․ 9f15bdabc8 [tvc] Separate embed extractor
Sergey M․ fff3455f58 Merge branch 'tvc' of https://github.com/hlintala/youtube-dl into hlintala-tvc
Hannu Lintala 87446dc618 [tvc] Add extractor (Closes )
Hannu Lintala 99ac0390f5 [fivetv] Add extractor (Closes )
Sergey M․ ff0f0b9172 [tube8] Fix extraction (Closes )
Sergey M․ 97b570a94c [generic] Improve rtl.nl embeds detection (Closes )
Sergey M․ a9d56c6843 [rtlnl] Improve _VALID_URL ()
Sergey M․ f98470df69 [bilibili] Fix FutureWarning
Jaime Marquínez Ferrándiz eb8be1fe76 [rtbf] Extract all formats (closes )
Yen Chi Hsuan 7ebd5376fe [nfl] Relax _VALID_URL (fixes )
Jaime Marquínez Ferrándiz 70219b0f43 [youtube:playlist] Use an iterator for the entries (closes )
So that '--playlist-end' downloads only the required pages.
Sergey M․ bd5bc0cd5a [theplatform] Check for /select/media URLs first ()
Sergey M․ 6e054aacca [theplatform] Take care of /select/media URLs (Closes )
Sergey M․ 9d581f3d52 [cbs] Extract display_id
Sergey M․ 9bf99891d0 [cbs] Add support for colbertlateshow (Closes )
Sergey M․ d9cf48e81e [spiegeltv] Extract all formats and prefer hls (Closes )
Yen Chi Hsuan e1b9322b09 [youtube] Restricter DASH signature pattern
A problematic DASH url is:
https://manifest.googlevideo.com/api/manifest/dash/mm/35/key/yt5/ip/140.112.247.145/ms/pm/mv/s/mt/1433794435/id/o-AD2Od_dsOlAUYPu03ZsVWKSbGEbCJJrMp9vnXGhnyRhd/mn/sn-aigllm7r/sparams/as%2Chfr%2Cid%2Cip%2Cipbits%2Citag%2Cmm%2Cmn%2Cms%2Cmv%2Cnh%2Cpl%2Cplayback_host%2Crequiressl%2Csource%2Cexpire/fexp/9406009%2C9406821%2C9407575%2C9408142%2C9408420%2C9408710%2C9409121%2C9409208%2C9412514%2C9412780%2C9413208%2C9413426%2C9413476%2C9413503%2C9415304%2C9415753/upn/viDQrs8SnmE/as/fmp4_audio_clear%2Cwebm_audio_clear%2Cfmp4_sd_hd_clear%2Cwebm_sd_hd_clear%2Cwebm2_sd_hd_clear/playback_host/r4---sn-aigllm7r.googlevideo.com/ipbits/0/requiressl/yes/pl/20/itag/0/source/youtube/expire/1433824806/nh/EAQ/signature/81ABE6391E351BA495F5B041B00FF1257A353318.1A6E48ABB74E8F4AE73CA2CB1F963FC34E33DEE7/sver/3/hfr/1
Yen Chi Hsuan 627b964825 [kickstarted] Extract thumbnails in embedded videos ()
Sergey M․ a55e36f48d [YoutubeDL] Handle out-of-range timestamps ()
Yen Chi Hsuan 01e21b89ee [noco] Skip invalid timestamps (closes )
Yen Chi Hsuan 788be3313d [cnet] Fix theplatform vid extraction (fixes )
Yen Chi Hsuan e1ec93304d [instagram:user] Truncate title to 80 characters ()
This is a workaround. Currently YoutubeDL.process_info() truncates
info_dict['title'] to 200 characters, but the implementation can't
handle wide characters.
Yen Chi Hsuan edb99d4c18 [instagram] Handling null values (fixes )
I didn't add the test case here because it takes too much time. (7
minutes on my machine)
Yen Chi Hsuan 68477c3dab [tlc] Fix test failure due to DiscoveryIE changes
Yen Chi Hsuan 65ba8b23f4 [discovery] Rewrite DiscoveryIE (fixes )
Discovery.com now uses a completely different approach for serving
videos. At least in both test cases brightcove are involved. However,
AMF support is necessary for these brightcove videos. As a result, I
try to extract videos from the info page ('?flat=1'). The downloaded
file can be different from the one in browsers.