Commit Graph

7783 Commits (7d08f6073d42c8623031615c4b7e0bb26136c128)

Author SHA1 Message Date
Yen Chi Hsuan ec59d657e7
[dispeak] Add new extractor
Both GDCVault and GPUTechConf uses the service of DigitalSpeaking.
9 years ago
Yen Chi Hsuan 99ef96f84c
[gdcvault] Fix for videos with hard-coded hostnames
Fixes #9248
9 years ago
Yen Chi Hsuan 4dccea8ad0
[streetvoice] Fix extraction
The old API results in URLs with HTTP 403 from time to time.

Hopefully fixes #9219.
9 years ago
Yen Chi Hsuan 2c0d9c6217
[extractor/common] Allow empty post data 9 years ago
Sergey M․ 12a5134596
[tvigle] Fix extraction (Closes #9259) 9 years ago
Sergey M․ 16e633a5d7
[quickvid] Remove extractor (Closes #9258) 9 years ago
Sergey M․ 494ab6db73
[youtube] Capture and output login error message 9 years ago
Sergey M․ 107701fcfc
[people] Remove bogus comment 9 years ago
Sergey M․ f77970765a
[people] Add extractor 9 years ago
Sergey M․ 241a318f27
[vimeo] Improve _VALID_URL (Closes #9229) 9 years ago
Sergey M․ 4fdf082375
[theonion] Remove extractor (Closes #9220)
It now uses generic onionstudios embed
9 years ago
Jaime Marquínez Ferrándiz 1b6182d8f7 [youtube:playlist] Fetch all the videos in a mix (fixes #3837)
Since there doesn't seem to be any indication, it stops when there aren't new videos in the webpage.
9 years ago
remitamine 7bab22a402 [vice] remove unused import and variable 9 years ago
Yen Chi Hsuan 0f97fb4d00
[musicplayon] Relax _VALID_URL and improve metadata extraction
In r'pl=\d+&play=\d+' pages, several metadata items are missing

Closes #9222.
9 years ago
Yen Chi Hsuan b1cf58f48f
[musicplayon] Fix extraction (closes #9222) 9 years ago
remitamine bbb3f730bb [onionstudios] extract m3u8 formats 9 years ago
Yen Chi Hsuan 21525bb8ca
[kuwo:category] Update the test
Now the webpage says there are 24 songs.
9 years ago
Sergey M․ d8f103159f
[nerdist] Remove extractor
It now uses brightcove
9 years ago
remitamine 663ee5f0a9 [vice] extract youtube embed 9 years ago
Sergey M․ b6b950bf58
[cbs] Remove unused import 9 years ago
Sergey M․ 11e60fcad8
[extractor/generic] Improve instagram embeds (Closes #9213) 9 years ago
Sergey M․ c23533a100
[instagram] Add support for iframe embeds 9 years ago
Sergey M․ 0dafea02e6
[instagram] Add support for embed URLs 9 years ago
Sergey M․ 5d6360c3b7
[mooshare] Remove extractor 9 years ago
Yen Chi Hsuan 5e5c30c3fd
[mdr] Fix extraction and update tests
It's strange that the date is changed. Anyway, new data matches what the
webpage says.
9 years ago
Yen Chi Hsuan 9154c87fc4
[huffpost] Fix a typo 9 years ago
Yen Chi Hsuan ef0e4e7bc0
[generic] Fix test_Generic_2
Now a HEAD request returns 400 Bad Request
9 years ago
Yen Chi Hsuan 67d46a3f90
[ustream] Fix /embed/ URLs and add a test 9 years ago
Yen Chi Hsuan bec47a0748
[tudou] Improve error detection (closes #9175) 9 years ago
Yen Chi Hsuan 36b7d9dbfa
[twitter] Don't check /cards/ URLs
Fixes #9181

In this tweet, there are two cards:
1. https://twitter.com/i/cards/tfw/v1/719944006306701313
   This shows #TeamCap vs. #TeamIronMan
2. https://twitter.com/i/videos/tweet/719944021058060289
   This is the real video and can be handled by TwitterCardIE

In all current test_Twitter* tests, /videos/tweet/ approach works fine.
9 years ago
Yen Chi Hsuan 8c65e4a527
[bbc] Fix a test 9 years ago
Yen Chi Hsuan 6ad2ef8b7c
[audiomack] Update the test
The original test raises 404
9 years ago
Yen Chi Hsuan 00b426d66d
[varzesh3] Add md5 to the test 9 years ago
Yen Chi Hsuan 0de968b584
[newgrounds] Support videos (closes #9138) 9 years ago
remitamine 0841d5013c [cbs] do not catch Exceptions raised by by _extract_theplatform_smil 9 years ago
remitamine a71fca8577 [theplatform] remove _sort_formats from _extract_theplatform_smil 9 years ago
Yen Chi Hsuan ee94e7e66d
[varzesh3] Fix metadata extraction (closes #9197) 9 years ago
Yen Chi Hsuan 759e37c9e6
[gazeta] Relax _VALID_URL and update tests
Closes #9196
9 years ago
Yen Chi Hsuan ae65567102
[eagleplatform] Fix error handling 9 years ago
Yen Chi Hsuan c394b4f4cb
[puls4] Fix error detection (#9194) 9 years ago
Yen Chi Hsuan 260c7036ba
[sportbox] Fix SportBoxEmbedIE
Also fixes test_Generic_29 (http://www.vestifinance.ru/articles/25753)
9 years ago
remitamine f74197a074 [cbs] extract rtmp formats 9 years ago
remitamine f3a58d46bf [youtube:user] check if the url didn't match only the other youtube extractors 9 years ago
Sergey M․ b6612c9b11
[karaoketv] Fix extraction 9 years ago
Yen Chi Hsuan 7e176effb2
[iqiyi] Also suuport pps.tv URLs
PPS is acquired by Baidu and merged with iQiyi in 2013 [1]. Now they
have the same page layouts.

[1] http://www.chinanews.com/it/2013/05-07/4792526.shtml
9 years ago
Yen Chi Hsuan 4a252cc2d2
[karaoketv] Update and mark as not _WORKING 9 years ago
Yen Chi Hsuan f0ec61b525
[huffpost] Fix extraction 9 years ago
Yen Chi Hsuan 66d40ae3a5 Merge pull request #9041 from kasper93/master
[generic] Add support for LiveLeak embeds
9 years ago
Yen Chi Hsuan e6da9240d4
[mixcloud:stream] Add new extractor
Closes #7633
9 years ago
Yen Chi Hsuan dd91dfcd67
[mixcloud] Fix extraction by decrypting play info
Fixes #7521
9 years ago
Yen Chi Hsuan c773082692
Merge branch 'Phaeilo-mixcloud' 9 years ago
Yen Chi Hsuan 9c250931f5
[mixcloud] Improve and simplify mixcloud:user and mixcloud:playlist 9 years ago
Yen Chi Hsuan 56f1750049
[tdslifeway] Use the new Brightcove API
Thanks for @remitamine's suggestion.
9 years ago
Yen Chi Hsuan f2159c9815
[wayofthemaster] Remove extractor
Now it's using YouTube embeds.
9 years ago
Yen Chi Hsuan b0cf2e7c1b
[ubu] Remove extractor
1. Videos on ubu.com are now hosted on Vimeo
2. The duration is far from correct, and may not exist on other videos
   (For example http://ubu.com/film/hammons_king.html)
9 years ago
Yen Chi Hsuan 74b47d00c3
[xboxclips] Use http:// URL
xboxclips has misconfigured certificates
9 years ago
Yen Chi Hsuan 8cb57bab8e
[ministrygrid] Fix extraction and modernize 9 years ago
Yen Chi Hsuan e1bf277e19
[tdslifeway] Add TDSLifewayIE
Used by MinistryGridIE
9 years ago
Sergey M․ 9e28538726
[arte:creative] Improve _VALID_URL 9 years ago
Sergey M․ 404284132c
[arte:info] Add extractor (Closes #9182) 9 years ago
remitamine 5565be9dd9 [aol] relex _VALID_URL regex 9 years ago
Yen Chi Hsuan b3a9474ad1 Merge branch 'mixcloud' of https://github.com/Phaeilo/youtube-dl into Phaeilo-mixcloud 9 years ago
Yen Chi Hsuan 86475d59b1
[metacritic] Add a new valid test case 9 years ago
Yen Chi Hsuan 73d93f948e
[lecture2go] Fix extraction
RTSP stream fails to download. Seems it's a mpv bug as direct playback
works well:

$ mpv --ytdl-format rtsp https://lecture2go.uni-hamburg.de/veranstaltungen/-/v/17473
9 years ago
Yen Chi Hsuan d1c4e4ba15
[laola1tv] Improve error detection and skip an invalid test 9 years ago
Yen Chi Hsuan f141fefab7
[karrierevideos] Fix extraction
The server serves malformed header "Content Type: text/xml" for the XML
request (it should be Content-Type but not Content Type). Python 3.x,
which uses email.feedparser rejects such headers. As a result,
Content-Encoding header is not parsed, so the returned content is kept
not decompressed, and thus XML parsing error.
9 years ago
aystroganov@gmail.com 8334637f4a Make tbr field 'int' rather than 'tuple'
Closes #9180.
9 years ago
Kacper Michajłow b8f67449ec [generic] Add support for LiveLeak embeds 9 years ago
Yen Chi Hsuan 75af5d59ae
[netease] Skip all tests: completely georestricted 9 years ago
Philip Huppert 6d67169509 [mixcloud] improved extraction of user description 9 years ago
Philip Huppert dcaf00fb3e [mixcloud] support older urllib versions 9 years ago
Philip Huppert f896e1ccef [mixcloud] fixed some tests 9 years ago
Philip Huppert c96eca426b [mixcloud] Added support for user uploads, playlists, favorites and listens.
Fixes #3750 and #5272
9 years ago
Sergey M․ 466a614537
[youtube:playlist] Recognize popular uploads playlist as mix (Closes #9170) 9 years ago
Sergey M․ ffa2cecf72
[ard] Change subtitles extension to ttml (Closes #9169)
ttml is now served instead of srt
9 years ago
Yen Chi Hsuan a837416025
[jadorecettepub] Remove extractor: website gone 9 years ago
Yen Chi Hsuan c9d448876f
[izlesene] Fix extraction
description may be absent
9 years ago
Yen Chi Hsuan 8865b8abfd
[howstuffworks] Skip a broken test case 9 years ago
Yen Chi Hsuan c77a0c01cb
[groupon] Fix extraction 9 years ago
Yen Chi Hsuan 12355ac473
[goshgay] Fix extraction
isFamilyFriendly no longer exists in the webpage and I can't find
another indicator.
9 years ago
Sergey M․ 49f523ca50
[mixcloud] Capture error message (#9156) 9 years ago
remitamine 4a903b93a9 Revert "[openclassroom] Add new extractor(closes #9147)"
This reverts commit 13267a2be3.
9 years ago
remitamine 13267a2be3 [openclassroom] Add new extractor(closes #9147) 9 years ago
Yen Chi Hsuan 134c207e3f
[arte.tv:embed] Extended support (#2620) 9 years ago
Yen Chi Hsuan 0f56bd2178
Merge branch 'Phaeilo-presstv' 9 years ago
Yen Chi Hsuan dfbc7f7f3f
[presstv] Improve and simplify 9 years ago
Yen Chi Hsuan 7d58ea7c5b Merge branch 'presstv' of https://github.com/Phaeilo/youtube-dl into Phaeilo-presstv 9 years ago
Sergey M․ 452908b257
[telebruxelles] Fix extraction (Closes #9142) 9 years ago
Sergey M․ 5899e988d5
[glide] Improve extraction and extract upload info 9 years ago
Sergey M․ 4a121d29bb
[glide] Fix extraction (Closes #9141) 9 years ago
Sergey M․ 7ebc36900d
[jwplatform:base] Improve subtitles extraction 9 years ago
Sergey M․ d7eb052fa2
[screencastomatic] Add duration to test 9 years ago
Sergey M․ a6d6722c8f
[jwplatform:base] Extract duration 9 years ago
Sergey M․ 66fa495868
[screencastomatic] Fix extraction (Closes #9136) 9 years ago
Sergey M․ 443285aabe
[ebaumsworlds] Update _VALID_URL (Closes #9135) 9 years ago
Philip Huppert de728757ad [presstv] Refactored extractor. 9 years ago
Sergey M․ f44c276842
[extractor/extractors] Remove non-existant imports 9 years ago
Sergey M․ a1fa60a934
[cliprs] Add extractor (Closes #9099) 9 years ago
Sergey M․ 49caf3307f
[extractor/common] Remove irrelevant comment 9 years ago
Sergey M․ 61dd350a04
[1tv] Fix extraction (Closes #9103) 9 years ago
Philip Huppert 95153a960d [presstv] updated extractor and tests to work with current PressTV website 9 years ago
Yen Chi Hsuan c991106706 [videodetective] Adapt to InternetVideoArchiveIE 9 years ago
Yen Chi Hsuan dae2a058de [rottentomatoes] Adapt to InternetVideoArchiveIE 9 years ago
Yen Chi Hsuan c05025fdd7 [internetvideoarchive] Fix extraction and support json URLs 9 years ago
Philip Huppert bfe96d7bea [presstv] Added extractor PressTV.
Fixes #7060
9 years ago
Yen Chi Hsuan ab481b48e5 [funnyordie] Relax M3U8 URL matching
Also, m3u8_url extraction should be fatal as all formats depends
directly or indirectly on it.

This change fixes test_Generic_26 and TestFunnyOrDieSubtitles
9 years ago
Sergey M․ 92c7f3157a [aol] Add coding cookie 9 years ago
remitamine bffb245a48 [aol] add support for videos with vidible IDs(closes #9124) 9 years ago
Jaime Marquínez Ferrándiz e0986e31cf lazy extractors: Output if it's enabled in the verbose log 9 years ago
Jaime Marquínez Ferrándiz 779822d945 Add experimental support for lazy loading the info extractors
'make lazy-extractors' creates the youtube_dl/extractor/lazy_extractors.py (imported by youtube_dl/extractor/__init__.py), which contains simplified classes that only have the 'suitable' class method and that load the appropiate class with the '__new__' method when a instance is created.
9 years ago
Jaime Marquínez Ferrándiz 1b3d5e05a8 Move the extreactors import to youtube_dl/extractor/extractors.py 9 years ago
Jaime Marquínez Ferrándiz e52d7f85f2 Delay initialization of InfoExtractors until they are needed 9 years ago
Sergey M․ 568d2f78d6 [tnaflix] Fix metadata extraction 9 years ago
Sergey M․ 2f2fcf1a33 [tnaflix] Fix extraction (Closes #9074) 9 years ago
Sergey M․ bacec0397f [extractor/common] Relax _hidden_inputs 9 years ago
Sergey M․ 3c6c7e7d7e [gdcvault] Fix extraction (Closes #9107, closes #9114) 9 years ago
Sergey M․ fb38aa8b53 [extractor/common] Support arbitrary format strings for template based identifiers in mpd manifests (Closes #9119, closes #9120) 9 years ago
Sergey M․ 18da24634c [democracynow] Improve extraction 9 years ago
Sergey M․ a134426d61 [democracynow] Fix tests 9 years ago
Sergey M․ a64c0c9b06 [democracynow] Make description optional (Closes #9115) 9 years ago
Sergey M․ 56019444cb [novamov] Improve _VALID_URL template (Closes #9116) 9 years ago
remitamine a1ff3cd5f9 [acast] fix channel extraction(closes #9117) 9 years ago
remitamine 9a32e80477 [acast] fix extraction(#9117) 9 years ago
Sergey M․ ed6fb8b804 [vrt] Add support for direct hls playlists and YouTube (Closes #9108) 9 years ago
Sergey M․ 3afef2e3fc [beeg] Improve extraction 9 years ago
Sergey M․ e90d175436 [yandexmusic] Extract music album metafields (Closes #7354) 9 years ago
Sergey M․ 7a93ab5f3f [extractor/common] Introduce music album metafields 9 years ago
Yen Chi Hsuan 8790249c68 [iqiyi] Improve error detection for VIP-only videos
Closes #9071
9 years ago
Sergey M․ 65150b41bb [deezer] Fix extraction (Closes #9086) 9 years ago
Sergey M․ e42f413716 [rte] Improve thumbnail extraction (Closes #9085) 9 years ago
Sergey M․ 40a056d85d [extractor/__init__] Remove novamov extractor and sort novamov based extractors alphabetically 9 years ago
Sergey M․ e7d77efb9d [auroravid] Add extractor (Closes #9070) 9 years ago
Sergey M․ 995cf05c96 [novamov] Make title fatal 9 years ago
Jaime Marquínez Ferrándiz 8c7d6e8e22 [zdf] Extract subtitles (closes #9081) 9 years ago
Sergey M․ 6d4fc66bfc [youtube] Add support for zwearz (Closes #9062) 9 years ago
remitamine 23576edbfc [brightcove:legacy] skip None value for uploader_id 9 years ago
remitamine 4d4cd35f48 [brightcove:legacy] extract uploader_id as a string 9 years ago
remitamine 3aac9b2fb1 [nowness] update tests 9 years ago
remitamine e47d19e991 [brightcove:new] extract subtitles and strip video title 9 years ago
remitamine 41f5492fbc [brightcove:legacy] improve format extraction and extract uploader_id, duration and timestamp 9 years ago
Jaime Marquínez Ferrándiz 2defa7d75a [instagram:user] Fix extraction (fixes #9059)
The URL for the next page was incorrect and we always got the same page, therefore it got trapped in an infinite loop.
9 years ago
Sergey M․ bbc26c8a01 [bbc] Set vcodec to none for audio formats 9 years ago
Sergey M․ b507cc925b [extractor/common] Carry long line 9 years ago
Sergey M․ db8ee7ec05 [extractor/common] Fix numeric identifiers conversion in DASH URL templates 9 years ago
remitamine 08136dc138 [brightcove] fix format sorting 9 years ago
remitamine fe7ef95e91 [cbsinteractive] Add support for ZDNet videos 9 years ago
remitamine 5f705baf5e [cnet] extract more formats 9 years ago
remitamine df634be2ed [common] prefer using mime type over ext for smil subtitle extraction
the subtitle ext for http://www.cnet.com/videos/download-amazon-prime-movies-and-tv/
is adb_xml while using the mime type it get tt(application/smptett+xml)
9 years ago
Jaime Marquínez Ferrándiz 6d628fafca [camwithher] Remove extra blank line 9 years ago
Jaime Marquínez Ferrándiz 0f28777f58 [cbsnews] Remove unused import 9 years ago
Jaime Marquínez Ferrándiz 329c1eae54 [aenetworks] Make pep8 happy 9 years ago
Sergey M․ 9aaaf8e8e8 [camwithher] Improve extraction (Closes #8989) 9 years ago
theGeekPirate 04819db58e [camwithher] Add extractor
Corrected unnecessary test

Sane variable naming

RTMP all .flv & url_id for _download_webpage()

Corrected all outstanding issues, next up is a squash!
9 years ago
remitamine 79ba9140dc [theplatform] extract timestamp and uploader 9 years ago
Sergey M․ 75d572e9fb [screencast] Improve title regexes (Closes #9025) 9 years ago
Martin Trigaux 791d6aaecc screencast.com: fallback on page title
When determining the title of the page, use the <title> tag of the page
9 years ago
Sergey M․ 81de73e5b4 [screencast] Add test 9 years ago
Martin Trigaux 83cedc1cf2 screencast.com: support missing www
The "www." part of the URL is not mandatory
9 years ago
Sergey M․ 244cd04237 [pluralsight] Remove unnecessary login/password encode 9 years ago
Sergey M․ fbdaced256 [lynda] Remove unnecessary login/password encode 9 years ago
Sergey M․ a3373823e1 [udemy] Remove unnecessary login/password encode
This is now covered by compat_urllib_parse_urlencode
9 years ago
Sergey M․ 03caa463e7 [udemy:course] Skip non-video lectures 9 years ago
remitamine 3f64379eda [movieclips] fix extraction 9 years ago
remitamine 3e0c3d14d9 [cbs] add base extractor 9 years ago
remitamine d8873d4def [aenetworks] improve format extraction 9 years ago
remitamine db1c969da5 [theplatform] sign https urls 9 years ago
remitamine 63c55e9f22 [cbs] improve extraction(closes #6321) 9 years ago
remitamine f9b1529af8 [generic] remove sbnation test(handled by VoxMediaIE) 9 years ago
remitamine 961fc024d2 [voxmedia] improve sbnation support 9 years ago
Sergey M․ b53a06e3b9 [udemy:course] Use new URL format 9 years ago
remitamine 4ecc1fc638 [howstuffworks] improve extraction 9 years ago
Yen Chi Hsuan 5b012dfce8 [tudou] Improve error handling (closes #8988) 9 years ago
remitamine 8369942773 [voxmedia] Add new extractor(closes #3182) 9 years ago
Sergey M․ 86f3b66cec [udemy] Remove unused import 9 years ago
Sergey M․ 6bb4600717 [udemy:course] Simplify course curriculum downloading 9 years ago
Sergey M․ 41d06b0424 [extractor/common] Improve _request_webpage
* Do not ignore data, headers and query for Requests
* Default values for headers and query switched to dicts since these are used by urllib itself
9 years ago
Sergey M․ 81da8cbc45 [udemy] Switch to api 2.0 (Closes #9035) 9 years ago
Sergey M․ 5299bc3f91 [beeg] Switch to api v6 (Closes #9036) 9 years ago
remitamine c9c39c22c5 [nationalgeographic] add support for channel.nationalgeographic.com urls 9 years ago
remitamine d84b48e3f1 [nationalgeographic] improve extraction 9 years ago
remitamine dd17041c82 [tenplay] remove extractor(fixes #6927) 9 years ago
remitamine fea7295b14 [brightcove] relax embed_in_page regex 9 years ago
remitamine 9cf01f7f30 [nbc] add new extractor for csnne.com(#5432) 9 years ago
remitamine ce548296fe [cnbc] fix test 9 years ago
remitamine c02ec7d430 [cnbc] Add new extractor(closes #8012) 9 years ago
remitamine 6b820a2376 [myspace] improve extraction 9 years ago
Yen Chi Hsuan e621a344e6 [kwuo] Port to new API and enable --cn-verification-proxy 9 years ago
Yen Chi Hsuan 3ae6f8fec1 [kwuo] Remove _sort_formats() from KuwoBaseIE._get_formats()
Following the idea proposed in 19dbaeece3
9 years ago
Yen Chi Hsuan 597d52fadb [kuwo:song] Correct song ID extraction (fixes #9033)
Bug introduced in daef04a4e7.
9 years ago
Sergey M․ afca767d19 [tumblr] Improve _VALID_URL (Closes #9027) 9 years ago
remitamine 6e359a1534 [comcarcoff] don not depend on crackle extractor(closes #8995)
previously extraction has been delegated to crackle to extract more info
and subtitles #6106 but some of the episodes can't be extracted using
crackle #8995.
9 years ago
Sergey M․ 03442072c0 [pornhub] Fix typo (Closes #9008) 9 years ago
Sergey M․ c8b13fec02 [foxnews] Restore upload time fields in test 9 years ago
Sergey M․ 87d105ac6c [amp] Fix upload timestamp extraction (Closes #9007) 9 years ago
Sergey M․ 3454139576 [pornhub:uservideos] Add support for multipage videos (Closes #9006) 9 years ago
Sergey M․ 3a23bae9cc [pornhub:playlistbase] Do not include videos not from playlist 9 years ago
Sergey M․ 8f9a477e7f [pornhub:playlistbase] Use orderedSet 9 years ago
Sergey M․ a1cf3e38a3 [bbc] Extend vpid regex (Closes #9003) 9 years ago
Sergey M․ b22ca76204 [extractor/common] Filter out unsupported encrypted media for f4m formats (Closes #8573) 9 years ago
Sergey M․ 19dbaeece3 Remove _sort_formats from _extract_*_formats methods
Now _sort_formats should be called explicitly.
_sort_formats has been added to all the necessary places in code.

Closes #8051
9 years ago