Commit Graph

411 Commits (c1084ddb0c87dac450d2b7c1b1cfef386d6f4481)

Author SHA1 Message Date
Sergey M. 3c3b4176bd Merge pull request #5961 from dstftw/force-generic-extractor
Add --force-generic-extractor
10 years ago
Sergey M․ 2fece970b8 [extractor/generic] Clarify generic extraction warning 10 years ago
Yen Chi Hsuan a5158f38a3 [generic/adobetv] Support AdobeTVVideo embeds (#6039) 10 years ago
Sergey M․ c76799c555 [extractor/generic] Add test for xhamster embed 10 years ago
Sergey M․ 2bb5b6d0a1 [generic] Add support for xhamster embeds 10 years ago
Yen Chi Hsuan 756f574e4e [dailymotion/generic] Add DailymotionCloudIE 10 years ago
Yen Chi Hsuan b407e173e4 [vimeo/generic] Move detection logic from GenericIE to VimeoIE 10 years ago
Sergey M․ 78e2b74bb9 [tumblr] Add support for pornhub embeds (Closes #5963) 10 years ago
Sergey M․ 65d161c480 [extractor/generic] Add support for pornhub embeds 10 years ago
Sergey M․ d22dec74ff Add `--force-generic-extractor`
For some extractors that are hard to workout a good _VALID_URL we use very vague and unrestrictive ones,
e.g. just allowing anything after hostname and capturing part of URL as id.
If some of these extractors happen to have an video embed of some different hoster or platform
and this scenario was not handled in extractor itself we end up with inability to download this embed
until extractor is fixed to support embed of this kind.
Forcing downloader to use the generic extractor can be a neat temporary solution for this problem.
Example: FiveTV extractor with Tvigle embed - http://www.5-tv.ru/rabota/broadcasts/48/
10 years ago
Sergey M․ 9872d3110c [extractor/generic] Add support for tvigle embeds 10 years ago
Sergey M․ b859971873 [extractor/generic] Rename tvc embed url variable 10 years ago
Sergey M․ f37bdbe537 [extractor/generic] Add test for tvc embed 10 years ago
Sergey M․ 2da09ff8b0 [extractor/generic] Fix tvc ie_key 10 years ago
Sergey M․ 954c1d0529 [tvc] Refactor extractor names 10 years ago
Sergey M․ 494f20cbdc [extractor/generic] Add support for tvc embeds 10 years ago
Sergey M․ 97b570a94c [generic] Improve rtl.nl embeds detection (Closes #5950) 10 years ago
Yen Chi Hsuan b26733ba7f [brightcove] Allow single quotes in Brightcove URLs (fixes #5901) 10 years ago
Sergey M․ c5138a7ce4 [extractor/generic] Clarify test comment 10 years ago
Sergey M․ c5fa81fe81 [extractor/generic] Put all direct link tests near to each other for better navigation 10 years ago
Sergey M․ a074e92296 [extractor/generic] Add test for large compressed media 10 years ago
Sergey M․ 1ddb9456c4 [extractor/generic] Use compat_urllib_parse_unquote for unquoting video_id and title from URL 10 years ago
Sergey M․ 58bde34a23 [extractor/generic] Force Accept-Encoding to any for extraction pass 10 years ago
Sergey M․ 339516072b [extractor/generic] Unescape video_id and title extracted from URL 10 years ago
Sergey M․ 25c3a7348f [generic] Fix typo 10 years ago
Sergey M․ b827a6015c [generic] Add test for sportbox embeds 10 years ago
Sergey M․ d40a3b5b55 [generic] Add support for sportbox embeds 10 years ago
Jaime Marquínez Ferrándiz 3b5f65a64c [mlb] Fix extraction of articles
And move test from generic, since it's directly handled by MLBIE
10 years ago
Sergey M․ dc455a5f88 [extractor/generic] Add test for svt embed 10 years ago
Sergey M․ bab19a8e91 [extractor/generic] Add support for svt embeds (Closes #5622) 10 years ago
Sergey M․ 406224be52 [extractor/generic] Fix following incomplete redirects (#5640) 10 years ago
Yen Chi Hsuan 8001607e90 [generic] Detect more MLB videos (fixes #5443) 10 years ago
Yen Chi Hsuan 0954cd8aa4 [Cinemassacre] Add detection for videos from blip.tv 10 years ago
Yen Chi Hsuan 2fe1b5bd2a [CSpan] Add detection for Senate ISVP. Closes #5302 10 years ago
Sergey M․ 6225984681 [generic] Update pladform embed test 10 years ago
Yen Chi Hsuan d6fd958c5f [generic] Extract videos from SMIL manifests (closes #5145 and fixes #5135) 10 years ago
Sergey M․ c798f15b98 [generic] Add test for playwire embed (#5430) 10 years ago
Sergey M․ 2dcc114f84 [generic] Add support for playwire embeds (Closes #5430) 10 years ago
Yen Chi Hsuan 01c58f8473 [generic] Fix test generic_51
The website replaced the original video with a new one
10 years ago
Yen Chi Hsuan edfcf7abe2 [generic] Support another type of Ooyala embedded video 10 years ago
Sergey M․ a4257017ef [generic] Add tests for Crooks and Liars embeds 10 years ago
Sergey M․ 18153f1b32 [generic] Add support for Crooks and Liars embeds 10 years ago
Yen Chi Hsuan 0a1603634b [utils] Remove url_infer_protocol 10 years ago
Yen Chi Hsuan 418c5cc3fc [udn] Add new extractor 10 years ago
Sergey M․ facecb84a1 [generic] Add working NBC Sports vplayer test 10 years ago
Yen Chi Hsuan a2edf2e7ff [NBC/ThePlatform/Generic] Add a generic detector for NBCSportsVPlayer and enhance error detection in ThePlatformIE 10 years ago
Naglis Jonaitis ad320e9b83 [generic] Add support for 5min embeds (#5310) 10 years ago
Sergey M․ 2051acdeb2 [extractor/generic] Add test for Libsyn embed 10 years ago
Sergey M․ cefdf970cc [extractor/generic] Support Libsyn embeds 10 years ago
Sergey M․ 3378d67a18 [generic] Add support for nytimes embeds (Closes #5234) 10 years ago
Sergey M․ ed9a25dd61 [generic] Generalize redirect regex 10 years ago
Sergey M․ 84f8101606 [generic] Follow redirects specified by `Refresh` HTTP header 10 years ago
Sergey M․ f838875726 [pladform] Add support for embeds 10 years ago
Sergey M․ d47ae7f620 [eagleplatform] Add support for ClipYou embeds 10 years ago
Sergey M․ 135c9c42bf [eagleplatform] Add support for embeds 10 years ago
Philipp Hagemeister 76c73715fb [generic] Parse RSS enclosure URLs (Fixes #5091) 10 years ago
Naglis Jonaitis e3216b82bf [generic] Support dynamic Kaltura embeds (#5016) (#5073) 10 years ago
Jaime Marquínez Ferrándiz 13d8fbef30 [generic] Don't set the 'title' if it's not defined in the entry (closes #5061)
Some of them may be an 'url' result, which in general don't have the 'title' field.
10 years ago
Sergey M․ 255fca5eea [generic] Add support for Zapiks embeds (#5014) 10 years ago
Philipp Hagemeister f37e3f99f0 [generic] Correct test case
Video has been reuploaded / edited
10 years ago
Philipp Hagemeister 11e611a7fa Extend various playlist tests 10 years ago
Philipp Hagemeister 59b8ab5834 [rtlnl|generic] Add support for rtl.nl embeds (Fixes #4959) 10 years ago
Philipp Hagemeister e98b8e79ea [generic] Improve SBS detection (Fixes #4899) 10 years ago
Philipp Hagemeister a0f7198544 [generic] Add support for jwPlayer YouTube videos
This makes nationalarchives.gov.uk work (Fixes #4907, fixes #4876)
10 years ago
naglis f076b63821 [generic/ooyala] Add support for Ooyala embeds on SBN network websites (Fixes #4859) 10 years ago
codesparkle 52585fd6dc The opening curly brace `{` is a regex reserved [control character](http://stackoverflow.com/a/400316/1106367), so it needs to be escaped. 10 years ago
Sergey M․ cb454b333d [generic] Improve some regexes 10 years ago
Shaya G 796df3c631 fixed viddler support - needed a Referer header; also added a viddler
generic extractor
10 years ago
Philipp Hagemeister b1b0b1ca30 [generic] Improve description testcase in rss test 10 years ago
Philipp Hagemeister 501f13fbf3 [generic] Add support for Cinerama player (Fixes #4752) 10 years ago
Philipp Hagemeister 61ca9a80b3 [generic] Add support for BOMs (Fixes #4753) 10 years ago
Philipp Hagemeister d7cc31b63e [generic] PEP8 10 years ago
Jaime Marquínez Ferrándiz cd791a5ea0 [ted] Add support for embed-ssl.ted.com embedded videos 10 years ago
Jaime Marquínez Ferrándiz 87830900a9 [generic] Update some tests 10 years ago
Sergey M․ faa4ea68c0 [generic] Add BBC iPlayer playlist test 10 years ago
Sergey M․ 476eae0c2a [generic] Generalize BBC iPlayer playlist extraction 10 years ago
Sergey M․ 8399267671 [generic] Make getter None by default 10 years ago
Sergey M․ db546cf87f [generic] Add support for BBC iPlayer embeds (Closes #4619) 10 years ago
Philipp Hagemeister 416c7fcbce Add documentation about supported sites (Fixes #4503) 10 years ago
Philipp Hagemeister 42bdd9d051 [cinchcast] Add new extractor (Fixes #4428) 10 years ago
Philipp Hagemeister 4e262a8838 [generic] Detect direct video links (Fixes #4149, #4313) 10 years ago
Philipp Hagemeister 65f3a228b1 [generic] Add support for LazyYT embeds (Fixes #4306) 10 years ago
Philipp Hagemeister b74e86f48a Fix all PEP8 issues except E501 10 years ago
Jouke Waleson 9e1a5b8455 PEP8: applied even more rules 10 years ago
Jouke Waleson 2514d2635e PEP8: E225,E227 10 years ago
Jouke Waleson 8bcc875676 PEP8: more applied 10 years ago
Jouke Waleson 5f6a1245ff PEP8 applied 10 years ago
Philipp Hagemeister 9b32eca3ce [generic] Add support for single quotes in HTML5 videos (Fixes #4265) 10 years ago
Philipp Hagemeister af63fed7d8 [generic] Add support for livestream embeds (Fixes #4185) 10 years ago
Jaime Marquínez Ferrándiz 9908e03528 Merge pull request #4076 from ghedo/direct_type
[generic] indicate when a direct video has been extracted
10 years ago
Naglis Jonaitis 37aab27808 [brightcove] Extract m3u8 formats (#3541) 10 years ago
Naglis Jonaitis bdf9701729 [generic/brightcove] Add a new test case for kijk.nl (#3541) 10 years ago
Philipp Hagemeister 8c25f81bee [util] Move compatibility functions out of util
utils is large enough without these compatibility functions.

Everything that is present in newer versions of Python (i.e. with dev Python it's just an import) goes into compat.py .
Everything else (i.e. youtube-dl-specific helpers) goes into utils.py .
10 years ago
Alessandro Ghedini ccdd0ffb80 [generic] indicate when a direct video has been extracted
Fixes #4052.
10 years ago
Sergey M․ 5263cdfcf9 [generic] Improve MLB iframe regex 10 years ago
Philipp Hagemeister ac645ac7d0 [generic] Allow soundcloud embeds with additional attributes 10 years ago
Philipp Hagemeister 48a24ab746 [generic] Fix HTML5 video regexp 10 years ago
Philipp Hagemeister d6d9186f0d [generic] Fix test title 10 years ago
Philipp Hagemeister 37d66e7f1e [generic] Correct call to _webpage_read_full_content 10 years ago
Philipp Hagemeister 70b7e3fbb6 [generic] Add a test case for direct links with broken HEAD (#4032) 10 years ago
Philipp Hagemeister 23be51d8ce [generic] Handle audio streams that do not implement HEAD (Fixes #4032) 10 years ago
Philipp Hagemeister a7e97f6db1 [generic] Allow new SWFObject()-style imports
This embed style is used on http://www.bitburger-open.de/ , but that is not included as a test case since the format is likely to be temporary.
10 years ago
Philipp Hagemeister f2f2c0c2c6 [generic] Allow --default-search without colon 10 years ago
Sergey M․ 281d3f1d68 [generic/wistia] Improve regex 10 years ago
dinesh 85d7b76586 [thoughtworks] wistia regex modified 10 years ago
dinesh 52cffcb186 [thoughtworks] wistia support added 10 years ago
Naglis Jonaitis 9471c44405 [generic] Make sure Wistia embed URLs contain the protocol (Closes #3977)
Also, improve detection (Addresses #3662)
10 years ago
Philipp Hagemeister 09a42738fc [generic] Correct handling of embedded vimeo players (#3955) 10 years ago
Sergey M․ 1419fafd36 [condenast] Add support for embedded videos (Closes #3929) 10 years ago
Sergey M․ ced659bb4d [generic] Ignore some non-video file extensions during generic extraction (Closes #3900) 10 years ago
Naglis Jonaitis 8489578df4 [generic] Support embedded Dailymotion playlists (fixes #3822) 10 years ago
Philipp Hagemeister 07e764439a [generic] Delete test case
The page is not available any more.
10 years ago
Philipp Hagemeister 761e1645e0 [generic] Remove unstable test checksum 10 years ago
Philipp Hagemeister a8eb5a8e61 [generic] Fix testcases 10 years ago
Philipp Hagemeister d6e6a42256 [vimeo:likes] Add new extractor (Fixes #3835) 10 years ago
Philipp Hagemeister c6e90caaa6 Merge remote-tracking branch 'naglis/wistia' 10 years ago
Philipp Hagemeister 6b08cdf626 [youtube] Support for embedded /p players (Fixes #3821) 10 years ago
Philipp Hagemeister 3b2f933b01 [generic] Allow embedded YoutubePlaylists (Fixes #3821) 10 years ago
Naglis Jonaitis 746c67d72f [wistia] Use API and make more generic 10 years ago
Jaime Marquínez Ferrándiz a04aa7a9e6 [generic] The url in the <meta redirect> doesn't need to be enclosed in single quotes (fixes #3770)
See the examples in https://en.wikipedia.org/wiki/Meta_refresh or the shortened urls from https://t.co/.
10 years ago
Sergey M․ 1bf5423e82 [generic/youtube] Recognize youtube nocookie embeds (Closes #3713) 10 years ago
Sergey M․ 1a94ff6865 [mlb] Add support for embedded videos (Closes #3653) 10 years ago
Philipp Hagemeister b759a0d4d2 fix up imports 10 years ago
Philipp Hagemeister 22a6f15061 Move playlist tests to extractors.
From now on, test_download will run these tests. That means we benefit not only from the networking setup in there, but also from the other tests (for example test_all_urls to find problems with _VALID_URLs).
10 years ago
Philipp Hagemeister aff216edf4 [generic] Prevent <video> search from skipping over empty sources (#3546) 10 years ago
Philipp Hagemeister 1cb6dcdbbe [generic] Do not download images as videos by accident 10 years ago
Philipp Hagemeister f971dcbba0 Merge branch 'master' of github.com:rg3/youtube-dl 10 years ago
Philipp Hagemeister 0990305d2a [generic] Fix rss under Python 2.x and move test to extractor 10 years ago
Jaime Marquínez Ferrándiz bcc069a937 [generic] Remove debug statement 10 years ago
Philipp Hagemeister 5f90042bd6 [generic] remove unused imports 10 years ago
Philipp Hagemeister ebab4520ff [generic] Use default opener for HEAD request (Fixes #3528) 10 years ago
Philipp Hagemeister 4d805e063c [generic] Automatic detection of flow player and age_limit (Fixes #3576) 10 years ago
Philipp Hagemeister 4d54ef20a2 [ministrygrid] Add extractor (Fixes #2900) 10 years ago
Philipp Hagemeister 1f8b6af773 [bip.tv] Allow underscore in lookup ids (Fixes #3573) 10 years ago
Philipp Hagemeister 10eaeb20c5 [generic] Require og:video URLs to contain a dot 10 years ago
Philipp Hagemeister fa8deaf38b [generic] Prevent from downloading a .swf as a video
We're seeing quite a number of people who do not put a video file in the og:video field, but the player URL. Try to detect some of these and filter them out.
10 years ago
Philipp Hagemeister c8e9a235d9 [generic] Add support for camtasia videos (Fixes #3574) 10 years ago
Philipp Hagemeister 2ef6fcb5d8 [sbs] Add new extractor (Fixes #3566) 10 years ago
Philipp Hagemeister ed2d6a1960 [generic] Simplify playlist support (#2948) 10 years ago
Philipp Hagemeister c71dfccc98 Merge remote-tracking branch 'anovicecodemonkey/generic-data-video-url'
Conflicts:
	youtube_dl/extractor/generic.py
10 years ago
Sergey M․ 49807b4ac6 [yahoo] Add support for embedded videos (Closes #3525) 10 years ago
Philipp Hagemeister 1f7ccb9014 [generic] Add --default-search fixup_error
This restores the ability to enter URLs without a scheme (and default to http), but still fail if the input is a search term.
10 years ago
Anthony J. Bentley eef4a7a304 Fix typo: “ytseach” → “ytsearch” 10 years ago
Philipp Hagemeister 7571c02c8a [generic] Set default-search to error
This prevents users from submitting bug reports where they mistyped a URL, and prevents me from getting a weird video when holding shift and thus searching for :Tds
11 years ago
Sergey M․ 0364fa8b65 [generic] Add support for ivi.ru embedded player 11 years ago
Jaime Marquínez Ferrándiz c5cd249e41 [generic] Extract mtvservices embedded videos 11 years ago
Philipp Hagemeister 826ec77fb2 [Vulture] Add support for vulture.com 11 years ago
Philipp Hagemeister 2b88feedf7 [generic] Add support for <embed YouTube 11 years ago
anovicecodemonkey 37e3cbe22e Move duplicate check to generic.py 11 years ago
anovicecodemonkey 610134730a Add a _TEST_ 11 years ago
Philipp Hagemeister 0e67ab0d8e [generic] Abort if user passes in URL "url" (#2942) 11 years ago
anovicecodemonkey 3442b30ab2 [generic] Support data-video-url for YouTube embeds (Fixes #2862) 11 years ago
Sergey M․ 8dfa187b8a [generic] Support pagespeed_iframe for NovaMov embeds 11 years ago
Sergey M․ a5a45015ba [generic] Fix redirect 11 years ago
Philipp Hagemeister 2099125333 [soundcloud/generic] Add support for playlists 11 years ago
Sergey M․ 669f0e7cda [generic] Fix wrong entries index 11 years ago
Philipp Hagemeister b30b8698ea [generic] Allow multiple matches for generic hits (Fixes #2818) 11 years ago
Philipp Hagemeister ebd3c7b370 [generic] Add support for protocol-independent URLs (Fixes #2810) 11 years ago
Philipp Hagemeister c0292e8ab7 [generic] Improve jwplayer detection (Fixes #2731) 11 years ago
Philipp Hagemeister 48099643cc [generic] Be more relaxed when looking for aparat embeds (Fixes #2784) 11 years ago
Philipp Hagemeister fa35cdad02 [condenast|generic] Add support for condenast embeds (Fixes #2783) 11 years ago
Sergey M․ fd50bf623c [generic] Modernize tests 11 years ago
Sergey M․ d360a14678 [generic] Update test 11 years ago
Sergey M․ 15c0e8e7b2 [generic] Generalize novamov based embeds 11 years ago
Sergey M․ 7cd3bc5f99 [nowvideo] Support more domains 11 years ago
Sergey M․ 50f566076f [generic] Add support for videoweed embeds 11 years ago
anovicecodemonkey ca6aada48e Fix _TEST for Ustream embed URLs 11 years ago
anovicecodemonkey 5c38625259 [UstreamIE] [generic] Added support for Ustream embed URLs (Fixes #2694) 11 years ago
Philipp Hagemeister 9ad400f75e [generic] Remove test case that has become a 404 11 years ago
Sergey M․ 0479c625a4 [brightcove] Encode object_str with utf-8 11 years ago
Philipp Hagemeister 9c1fc022ae [generic] Warn before fallback to automatic search 11 years ago
Sergey M․ cb3ac1c610 [smotri] Modernize and add support for emdebbed videos (Closes #2585) 11 years ago
Philipp Hagemeister 893f8832b5 [arte] Add support for embedded videos (Fixes #2620) 11 years ago
Jaime Marquínez Ferrándiz 7e2ede9891 [generic] Run TED detection before JW Player detection
Otherwise it overwrittes the `mobj` variable.
11 years ago
Jaime Marquínez Ferrándiz 0f2a2ba14b Merge remote-tracking branch 'dstftw/generic-webpage-unescape'
Conflicts:
	youtube_dl/extractor/generic.py
11 years ago
Jaime Marquínez Ferrándiz 750f9020ae [generic] Recognize more Ooyala embedded videos (#2569) 11 years ago
Philipp Hagemeister aab74fa106 [ted] Simplify embed code (#2587) 11 years ago
anovicecodemonkey 8d7c0cca13 [generic] Add support for embeded TED videos 11 years ago
Jaime Marquínez Ferrándiz ecbe1ad207 [generic] Fix access to removed function in python 3.4
The `Request.get_origin_req_host` method was deprecated in 3.3, use the
 `origin_req_host` property if it's not available, see http://docs.python.org/3.3/library/urllib.request.html#urllib.request.Request.get_origin_req_host.
11 years ago
Sergey M․ 93d020dd65 [generic] Add support for embedded rutv player 11 years ago
Sergey M․ 15fd51b37c [generic] More generic support for embedded vimeo player (#1602) 11 years ago
Sergey M․ d95e35d659 [generic] Add nowvideo test hidden behind percent encoding 11 years ago
Sergey M․ 1439073049 [generic] Add comment for unescaping webpage contents 11 years ago
Sergey M 1f7659dbe9 [generic] Unescape webpage contents 11 years ago
Philipp Hagemeister 60cc4dc4b4 [generic/funnyordie] Add support for funnyordie embeds (Fixes #2546) 11 years ago
Philipp Hagemeister bcf89ce62c [generic] Suppress warning about doctypes in RSS parser 11 years ago
Philipp Hagemeister 0a5bce566f [generic] Add all test attributes for embedly (#2447)
In the future, we may want to not only print something, but throw an error for untested properties.
11 years ago
Philipp Hagemeister 8012bd2424 [generic] Get a better ID 11 years ago
Philipp Hagemeister f55a1f0a88 Merge remote-tracking branch 'rzhxeo/embedly'
Conflicts:
	youtube_dl/extractor/generic.py
11 years ago
Sergey M. ca97a56e4b [vk] Add support for embedded videos (Closes #2473) 11 years ago
rzhxeo 1b86cc41cf Add support for embed.ly 11 years ago
Philipp Hagemeister 89ef304bed [generic] Add support for <meta redirect>
Fixes #413
11 years ago
Sergey M. b1c6c32f78 [generic] Add support for nowvideo embedded videos 11 years ago
Jaime Marquínez Ferrándiz f7300c5c90 [generic] Fix on python 2.6
`ParseError` is not available, it raises `xml.parsers.expat.ExpatError`.
The webpage needs to be encoded.
11 years ago
Philipp Hagemeister 4fc946b546 [generic] Add support for RSS feeds (Fixes #667) 11 years ago
Philipp Hagemeister 99877772d0 [generic] Add support for multiple brightcove URLs (Fixes #2283) 11 years ago
Philipp Hagemeister ab083b08ab [generic] remove testcase
The video seems to have been removed from the site.
11 years ago
Philipp Hagemeister 9f8928d032 [generic] Match JWPlayerOptions
This adds support for The Guardian, among others
Closes #2271, fixes #2267
11 years ago
dst c3f51436bf Improve some regexes for embedded players 11 years ago
Philipp Hagemeister 17ab4d3b5e [brightcove] Move test to generic 11 years ago