Commit Graph

10317 Commits (8b38f2ac40f30743b87fcf92c7570d669923a796)

Author SHA1 Message Date
Yen Chi Hsuan c05025fdd7 [internetvideoarchive] Fix extraction and support json URLs 9 years ago
Philip Huppert bfe96d7bea [presstv] Added extractor PressTV.
Fixes #7060
9 years ago
Yen Chi Hsuan ab481b48e5 [funnyordie] Relax M3U8 URL matching
Also, m3u8_url extraction should be fatal as all formats depends
directly or indirectly on it.

This change fixes test_Generic_26 and TestFunnyOrDieSubtitles
9 years ago
Sergey M․ 92c7f3157a [aol] Add coding cookie 9 years ago
Yen Chi Hsuan cacd996662 [utils] Don't touch URLs if not necessary
Fix test_Generic_15 (Google redirect)
9 years ago
remitamine bffb245a48 [aol] add support for videos with vidible IDs(closes #9124) 9 years ago
Jaime Marquínez Ferrándiz e0986e31cf lazy extractors: Output if it's enabled in the verbose log 9 years ago
Jaime Marquínez Ferrándiz 779822d945 Add experimental support for lazy loading the info extractors
'make lazy-extractors' creates the youtube_dl/extractor/lazy_extractors.py (imported by youtube_dl/extractor/__init__.py), which contains simplified classes that only have the 'suitable' class method and that load the appropiate class with the '__new__' method when a instance is created.
9 years ago
Jaime Marquínez Ferrándiz 1b3d5e05a8 Move the extreactors import to youtube_dl/extractor/extractors.py 9 years ago
Jaime Marquínez Ferrándiz e52d7f85f2 Delay initialization of InfoExtractors until they are needed 9 years ago
Sergey M․ 568d2f78d6 [tnaflix] Fix metadata extraction 9 years ago
Sergey M․ 2f2fcf1a33 [tnaflix] Fix extraction (Closes #9074) 9 years ago
Sergey M․ bacec0397f [extractor/common] Relax _hidden_inputs 9 years ago
Sergey M․ 3c6c7e7d7e [gdcvault] Fix extraction (Closes #9107, closes #9114) 9 years ago
Sergey M․ fb38aa8b53 [extractor/common] Support arbitrary format strings for template based identifiers in mpd manifests (Closes #9119, closes #9120) 9 years ago
Sergey M․ 18da24634c [democracynow] Improve extraction 9 years ago
Sergey M․ a134426d61 [democracynow] Fix tests 9 years ago
Sergey M․ a64c0c9b06 [democracynow] Make description optional (Closes #9115) 9 years ago
Sergey M․ 56019444cb [novamov] Improve _VALID_URL template (Closes #9116) 9 years ago
remitamine a1ff3cd5f9 [acast] fix channel extraction(closes #9117) 9 years ago
remitamine 9a32e80477 [acast] fix extraction(#9117) 9 years ago
Sergey M․ 536a55dabd [YoutubeDL] Sanitize single thumbnail URL 9 years ago
Sergey M․ ed6fb8b804 [vrt] Add support for direct hls playlists and YouTube (Closes #9108) 9 years ago
Sergey M․ 3afef2e3fc [beeg] Improve extraction 9 years ago
Sergey M․ e90d175436 [yandexmusic] Extract music album metafields (Closes #7354) 9 years ago
Sergey M․ 7a93ab5f3f [extractor/common] Introduce music album metafields 9 years ago
Philipp Hagemeister c41cf65d4a release 2016.04.06 9 years ago
Yen Chi Hsuan 92d5477d84 [compat] Handle tuples properly in urlencode()
Fixes #9055
9 years ago
Yen Chi Hsuan 8790249c68 [iqiyi] Improve error detection for VIP-only videos
Closes #9071
9 years ago
Philipp Hagemeister 416930d450 release 2016.04.05 9 years ago
Sergey M․ 65150b41bb [deezer] Fix extraction (Closes #9086) 9 years ago
Sergey M․ e42f413716 [rte] Improve thumbnail extraction (Closes #9085) 9 years ago
Sergey M․ 40a056d85d [extractor/__init__] Remove novamov extractor and sort novamov based extractors alphabetically 9 years ago
Sergey M․ e7d77efb9d [auroravid] Add extractor (Closes #9070) 9 years ago
Sergey M․ 995cf05c96 [novamov] Make title fatal 9 years ago
Jaime Marquínez Ferrándiz 5bf28d7864 [utils] dfxp2srt: add additional namespace
Used by the ZDF subtitles (#9081).
9 years ago
Jaime Marquínez Ferrándiz 8c7d6e8e22 [zdf] Extract subtitles (closes #9081) 9 years ago
Sergey M․ 6d4fc66bfc [youtube] Add support for zwearz (Closes #9062) 9 years ago
remitamine 23576edbfc [brightcove:legacy] skip None value for uploader_id 9 years ago
remitamine 4d4cd35f48 [brightcove:legacy] extract uploader_id as a string 9 years ago
remitamine 3aac9b2fb1 [nowness] update tests 9 years ago
remitamine e47d19e991 [brightcove:new] extract subtitles and strip video title 9 years ago
remitamine 41f5492fbc [brightcove:legacy] improve format extraction and extract uploader_id, duration and timestamp 9 years ago
Jaime Marquínez Ferrándiz 2defa7d75a [instagram:user] Fix extraction (fixes #9059)
The URL for the next page was incorrect and we always got the same page, therefore it got trapped in an infinite loop.
9 years ago
Sergey M․ bbc26c8a01 [bbc] Set vcodec to none for audio formats 9 years ago
Sergey M․ b507cc925b [extractor/common] Carry long line 9 years ago
Sergey M․ db8ee7ec05 [extractor/common] Fix numeric identifiers conversion in DASH URL templates 9 years ago
remitamine 08136dc138 [brightcove] fix format sorting 9 years ago
remitamine fe7ef95e91 [cbsinteractive] Add support for ZDNet videos 9 years ago
remitamine 5f705baf5e [cnet] extract more formats 9 years ago
remitamine 0750b2491f [ffmpeg] try to convert tt subtitles usng dfxp2srt 9 years ago
remitamine df634be2ed [common] prefer using mime type over ext for smil subtitle extraction
the subtitle ext for http://www.cnet.com/videos/download-amazon-prime-movies-and-tv/
is adb_xml while using the mime type it get tt(application/smptett+xml)
9 years ago
Jaime Marquínez Ferrándiz 6d628fafca [camwithher] Remove extra blank line 9 years ago
Jaime Marquínez Ferrándiz 0f28777f58 [cbsnews] Remove unused import 9 years ago
Jaime Marquínez Ferrándiz 329c1eae54 [aenetworks] Make pep8 happy 9 years ago
Sergey M․ 9aaaf8e8e8 [camwithher] Improve extraction (Closes #8989) 9 years ago
theGeekPirate 04819db58e [camwithher] Add extractor
Corrected unnecessary test

Sane variable naming

RTMP all .flv & url_id for _download_webpage()

Corrected all outstanding issues, next up is a squash!
9 years ago
remitamine 79ba9140dc [theplatform] extract timestamp and uploader 9 years ago
Sergey M․ 75d572e9fb [screencast] Improve title regexes (Closes #9025) 9 years ago
Martin Trigaux 791d6aaecc screencast.com: fallback on page title
When determining the title of the page, use the <title> tag of the page
9 years ago
Sergey M․ 81de73e5b4 [screencast] Add test 9 years ago
Martin Trigaux 83cedc1cf2 screencast.com: support missing www
The "www." part of the URL is not mandatory
9 years ago
Sergey M․ 244cd04237 [pluralsight] Remove unnecessary login/password encode 9 years ago
Sergey M․ fbdaced256 [lynda] Remove unnecessary login/password encode 9 years ago
Sergey M․ a3373823e1 [udemy] Remove unnecessary login/password encode
This is now covered by compat_urllib_parse_urlencode
9 years ago
Sergey M․ 03caa463e7 [udemy:course] Skip non-video lectures 9 years ago
remitamine 3f64379eda [movieclips] fix extraction 9 years ago
remitamine 3e0c3d14d9 [cbs] add base extractor 9 years ago
remitamine d8873d4def [aenetworks] improve format extraction 9 years ago
remitamine db1c969da5 [theplatform] sign https urls 9 years ago
Philipp Hagemeister 1e02bc7ba2 release 2016.04.01 9 years ago
remitamine 63c55e9f22 [cbs] improve extraction(closes #6321) 9 years ago
remitamine f9b1529af8 [generic] remove sbnation test(handled by VoxMediaIE) 9 years ago
remitamine 961fc024d2 [voxmedia] improve sbnation support 9 years ago
Sergey M․ b53a06e3b9 [udemy:course] Use new URL format 9 years ago
remitamine 4ecc1fc638 [howstuffworks] improve extraction 9 years ago
Yen Chi Hsuan 5b012dfce8 [tudou] Improve error handling (closes #8988) 9 years ago
remitamine 8369942773 [voxmedia] Add new extractor(closes #3182) 9 years ago
Sergey M․ 86f3b66cec [udemy] Remove unused import 9 years ago
Sergey M․ 6bb4600717 [udemy:course] Simplify course curriculum downloading 9 years ago
Sergey M․ 41d06b0424 [extractor/common] Improve _request_webpage
* Do not ignore data, headers and query for Requests
* Default values for headers and query switched to dicts since these are used by urllib itself
9 years ago
Sergey M․ 15d260ebaa [utils] Use update_Request in http_request 9 years ago
Sergey M․ ed0291d153 [utils] Add update_Request 9 years ago
Sergey M․ 81da8cbc45 [udemy] Switch to api 2.0 (Closes #9035) 9 years ago
Sergey M․ 5299bc3f91 [beeg] Switch to api v6 (Closes #9036) 9 years ago
remitamine c9c39c22c5 [nationalgeographic] add support for channel.nationalgeographic.com urls 9 years ago
remitamine d84b48e3f1 [nationalgeographic] improve extraction 9 years ago
remitamine dd17041c82 [tenplay] remove extractor(fixes #6927) 9 years ago
remitamine fea7295b14 [brightcove] relax embed_in_page regex 9 years ago
remitamine 9cf01f7f30 [nbc] add new extractor for csnne.com(#5432) 9 years ago
remitamine ce548296fe [cnbc] fix test 9 years ago
remitamine c02ec7d430 [cnbc] Add new extractor(closes #8012) 9 years ago
remitamine 6b820a2376 [myspace] improve extraction 9 years ago
Yen Chi Hsuan e621a344e6 [kwuo] Port to new API and enable --cn-verification-proxy 9 years ago
Yen Chi Hsuan 3ae6f8fec1 [kwuo] Remove _sort_formats() from KuwoBaseIE._get_formats()
Following the idea proposed in 19dbaeece3
9 years ago
Yen Chi Hsuan 597d52fadb [kuwo:song] Correct song ID extraction (fixes #9033)
Bug introduced in daef04a4e7.
9 years ago
Sergey M․ afca767d19 [tumblr] Improve _VALID_URL (Closes #9027) 9 years ago
remitamine 6e359a1534 [comcarcoff] don not depend on crackle extractor(closes #8995)
previously extraction has been delegated to crackle to extract more info
and subtitles #6106 but some of the episodes can't be extracted using
crackle #8995.
9 years ago
Sergey M․ 33f3040a3e [YoutubeDL] Fix sanitizing subtitles' url 9 years ago
Sergey M․ 03442072c0 [pornhub] Fix typo (Closes #9008) 9 years ago
Sergey M․ c8b13fec02 [foxnews] Restore upload time fields in test 9 years ago
Sergey M․ 87d105ac6c [amp] Fix upload timestamp extraction (Closes #9007) 9 years ago
Sergey M․ 3454139576 [pornhub:uservideos] Add support for multipage videos (Closes #9006) 9 years ago
Sergey M․ 3a23bae9cc [pornhub:playlistbase] Do not include videos not from playlist 9 years ago
Sergey M․ 8f9a477e7f [pornhub:playlistbase] Use orderedSet 9 years ago
Sergey M․ a1cf3e38a3 [bbc] Extend vpid regex (Closes #9003) 9 years ago
Philipp Hagemeister a122e7080b release 2016.03.27 9 years ago
Sergey M․ b22ca76204 [extractor/common] Filter out unsupported encrypted media for f4m formats (Closes #8573) 9 years ago
Sergey M․ f7df343b4a [downloader/f4m] Extract routine for removing unsupported encrypted media 9 years ago
Sergey M․ 19dbaeece3 Remove _sort_formats from _extract_*_formats methods
Now _sort_formats should be called explicitly.
_sort_formats has been added to all the necessary places in code.

Closes #8051
9 years ago
Yen Chi Hsuan 395fd4b08a [twitter] Handle another form of embedded Vine
Fixes #8996
9 years ago
Sergey M․ 8018028d0f [pluralsight] Extract chapter metadata (Closes #8993) 9 years ago
Sergey M․ 00322ad4fd [lynda] Extract chapter metadata (#8993) 9 years ago
Sergey M․ 4cf3489c6e [vevo] Update videoservice API URL (Closes #8900) 9 years ago
Sergey M․ b24ab3e341 [udemy] Improve paid course detection 9 years ago
Sergey M․ af4116f4f0 [udemy] Improve format_id 9 years ago
Sergey M․ f973e5d54e [udemy] Drop outputs' formats
Always results in 403
9 years ago
Sergey M․ 62f55aa68a [udemy] Add outputs metadata to view_html formats 9 years ago
Sergey M․ 02d7634d24 [udemy] Fix outputs' formats format_id 9 years ago
Sergey M․ 48dce58ca9 [udemy] Use custom sorting 9 years ago
Sergey M․ efcba804f6 [udemy] Extract formats from view_html (Closes #8979) 9 years ago
Sergey M․ 6dee688e6d [youtube:playlistsbase] Restrict playlist regex (Closes #8986) 9 years ago
Sergey M․ eedb7ba536 [YoutubeDL] Sort imports 9 years ago
Sergey M․ dcf77cf1a7 [YoutubeDL] Sanitize final URLs (Closes #8991) 9 years ago
Sergey M․ 17bcc626bf [utils] Extract sanitize_url routine 9 years ago
Sergey M․ b5a5bbf376 [mailru] Extend _VALID_URL (Closes #8990) 9 years ago
Yen Chi Hsuan e68d3a010f [twitter] Fix extraction (closes #8966)
HLS and DASH formats are no longer appeared in test cases. I keep them
for fear of triggering new errors.
9 years ago
Yen Chi Hsuan d10fe8358c [generic] Add a test case for brightcove embed
Closes #8862
9 years ago
Yen Chi Hsuan d6c340cae5 [brightcove] Extract more formats (#8862) 9 years ago
Yen Chi Hsuan 5964b598ff [brightcove] Support alternative BrightcoveExperience layout
The full URL lays in the `data` attribute of <object> (#8862)
9 years ago
Philipp Hagemeister 62cdb96f51 release 2016.03.26 9 years ago
Sergey M․ 6e6bc8dae5 Use urlencode_postdata across the codebase 9 years ago
Sergey M․ 15707c7e02 [compat] Add compat_urllib_parse_urlencode and eliminate encode_dict
encode_dict functionality has been improved and moved directly into compat_urllib_parse_urlencode
All occurrences of compat_urllib_parse.urlencode throughout the codebase have been replaced by compat_urllib_parse_urlencode

Closes #8974
9 years ago
Sergey M․ 2156f16ca7 [thescene] Fix extraction and improve style (Closes #8978) 9 years ago
Sergey M․ 4db441de72 [once] Relax _VALID_URL (Closes #8976) 9 years ago
Philipp Hagemeister 0be8314dc8 release 2016.03.25 9 years ago
Yen Chi Hsuan d7f62b049a [iqiyi] Update enc_key 9 years ago
Yen Chi Hsuan 3bb3356812 [douyutv] Extend _VALID_URL 9 years ago
Sergey M․ 98e68806fb [mnet] Improve (Closes #8958) 9 years ago
Kagami Hiiragi e031768666 [mnet] Add new extractor 9 years ago
Sergey M․ 5eb7db4ee9 [udemy] Add support for new URL schema 9 years ago
Sergey M․ f0e83681d9 [udemy] Extract formats from outputs 9 years ago
Sergey M․ ff9d5d0938 [udemy] Improve course enrolling 9 years ago
Sergey M․ d041a73674 [extractor/__init__] Add youtube:live and sort youtube extractors alphabetically 9 years ago
Sergey M․ f07e276a04 [youtube:live] Add extractor (Closes #8959) 9 years ago
Sergey M․ 993271da0a [nytimes] Tolerate missing metadata (Closes #8952) 9 years ago
Sergey M․ 369e7e3ff0 [iprima] Fix extraction (Closes #8953) 9 years ago
Sergey M․ 5767b4eeae [mtv] Fix description extraction (Closes #8962) 9 years ago
Yen Chi Hsuan 622d19160b [utils] Clarify Python versions affected by buggy struct module 9 years ago
Yen Chi Hsuan 32d88410eb [tumblr] Add a test with Instagram embed
Closes #8817
9 years ago
Yen Chi Hsuan 5a51775a58 [generic] Extract Instagram embeds (#8817) 9 years ago
Yen Chi Hsuan 87696e78d7 [instagram] Unescape description (#8817) 9 years ago
Yen Chi Hsuan c4096e8aea [instagram] Extract embed videos (#8817) 9 years ago
Yen Chi Hsuan fc27ea9464 [tumblr] Support Vine embeds (#8817) 9 years ago
Yen Chi Hsuan 088e1aac59 [generic] Support Vine embeds (#8817) 9 years ago
Sergey M 4333d56494 Merge pull request #8898 from dstftw/fragment-retries
Add --fragment-retries option (Fixes #8466)
9 years ago
Sergey M․ 882c699296 [tunein] Fix stream data extraction (Closes #8899, closes #8924) 9 years ago
Yen Chi Hsuan efbed08dc2 [utils] Encode hostnames before passing to urllib
With IDN (Internationalized Domain Name) and a proxy, non-ascii URLs
are passed down to urllib/urllib2, causing UnicodeEncodeError

Fixes #8890
9 years ago
Jaime Marquínez Ferrándiz 7da2c87119 Add extractor for thescene.com (closes #8929) 9 years ago
Sergey M․ c6ca11f1b3 [once] Prevent ads from embedding into m3u8 playlists (Closes #8893) 9 years ago
Sergey M․ 2beeb286e1 [laola1tv] Add support for livestreams (Closes #8934) 9 years ago
Sergey M․ cc7397b04d [ceskatelevize] Make m3u8 formats extraction non fatal (Closes #8933) 9 years ago
Sergey M․ bc5d16b302 [animeondemand] Skip dash for now 9 years ago
Sergey M․ 85c637b737 [animeondemand] Extract teaser when no full episode available (#8923) 9 years ago
Sergey M․ 5c69f7a479 [animeondemand] Respect startvideo (Closes #8923) 9 years ago
Sergey M․ ff5873b72d [motherless] Detect friends only videos 9 years ago
Sergey M․ 065c4b27bf [xhamster:embed] Extract vars (Closes #8912) 9 years ago
Sergey M․ 1600ed1ff9 [rutv] Improve flash version pattern (Closes #8911) 9 years ago
Sergey M․ 5886b38d73 Add support for https for all extractors as preventive and future-proof measure 9 years ago
Sergey M․ 0cef27ad25 Add missing r prefix for _VALID_URLs 9 years ago
Sergey M․ 12af4beb3e [mailru] Add support for https (Closes #8920) 9 years ago
Sergey M․ 9016d76f71 [YoutubeDL] Improve _format_note 9 years ago
Sergey M․ 3c5d183c19 [animeondemand] Extract all formats (Closes #8906) 9 years ago
Sergey M․ 3e8bb9a972 [animeondemand] Detect geo restriction 9 years ago
Yen Chi Hsuan daef04a4e7 [kwuo] Fix KuwoChartIE and KuwoSingerIE and accept new URL forms 9 years ago
Yen Chi Hsuan 2648918c81 [vlive] Fix creator extraction (closes #8814) 9 years ago
Yen Chi Hsuan 9e3c2f1d74 [openload] Misc improvements
* Add thumbnail
* Detect errors (#6469)
* Match more (#6469, #8489)
9 years ago
Yen Chi Hsuan 2bfeee69b9 [openload] Add new extractor (closes #8489) 9 years ago
Yen Chi Hsuan 664bcd80b9 [tudou] Use InAdvancePagedList (closes #8884) 9 years ago
Sergey M․ 3c20208eff [francetv] Improve formats extraction 9 years ago
Sergey M․ db264e3cc3 [francetvinfo] Add support for france3-regions and strip title (Closes #7673) 9 years ago
Sergey M․ 96a9f22d98 [discovery] Relax _VALID_URL (Closes #8903) 9 years ago
Sergey M․ 40025ee2a3 [postprocessort/ffmpeg] Allow embedding webvtt into webm (Closes #8874) 9 years ago
Sergey M․ 298c04b464 [91porn] Use common messages' wording 9 years ago
Sergey M․ d95114dd83 [91porn] Unquote final URL (Closes #8881) 9 years ago
Sergey M․ fa023ccb2c [biobiochiletv] Fix extraction, extract m3u8 formats and overall improve (Closes #7314) 9 years ago
jjatria e36f4aa72b [biobiotv] Add extractor 9 years ago
Sergey M․ f1ced6df51 [cda] Improve and simplify (Closes #8805) 9 years ago
Kacper Michajłow 8b0d7a66ef [cda] Add new extractor for cda.pl
Fixes #8760
9 years ago
Sergey M․ 3aec71766d [safari:api] Separate extractor (Closes #8871) 9 years ago
Sergey M․ 16a8b7986b [downloader/fragment] Document fragment_retries 9 years ago
Sergey M․ 617e58d850 [downloader/{common,fragment}] Fix total retries reporting on python 2.6 9 years ago
Sergey M․ e33baba0dd [downloader/dash] Add fragment retry capability
YouTube may often return 404 HTTP error for a fragment causing the
whole download to fail. However if the same fragment is immediately
retried with the same request data this usually succeeds (1-2 attemps
is usually enough) thus allowing to download the whole file successfully.
So, we will retry all fragments that fail with 404 HTTP error for now.
9 years ago
Sergey M․ 721f26b821 [downloader/fragment] Add report_retry_fragment 9 years ago
Sergey M․ 52bb437e41 [options] Add --fragment-retries option 9 years ago
Jaime Marquínez Ferrándiz 782b1b5bd1 [utils] lookup_unit_table: Match word boundary instead of end of string 9 years ago
Sergey M․ 0d769bcb78 [extractor/generic] Fix missing byte literal prefix 9 years ago
remitamine 4cd70099ea [hbo] Add new extractor 9 years ago
Jaime Marquínez Ferrándiz 09fc33198a utils: lookup_unit_table: Use a stricter regex
In parse_count multiple units start with the same letter, so it would match different units depending on the order they were sorted when iterating over them.
9 years ago
John Peel d5aacf9a90 Added format_id to the filers on -f. 9 years ago