Yen Chi Hsuan
c05025fdd7
[internetvideoarchive] Fix extraction and support json URLs
9 years ago
Philip Huppert
bfe96d7bea
[presstv] Added extractor PressTV.
...
Fixes #7060
9 years ago
Yen Chi Hsuan
ab481b48e5
[funnyordie] Relax M3U8 URL matching
...
Also, m3u8_url extraction should be fatal as all formats depend
directly or indirectly on it.
This change fixes test_Generic_26 and TestFunnyOrDieSubtitles
9 years ago
Sergey M․
92c7f3157a
[aol] Add coding cookie
9 years ago
Yen Chi Hsuan
cacd996662
[utils] Don't touch URLs if not necessary
...
Fix test_Generic_15 (Google redirect)
9 years ago
remitamine
bffb245a48
[aol] add support for videos with vidible IDs( closes #9124 )
9 years ago
Jaime Marquínez Ferrándiz
e0986e31cf
lazy extractors: Output if it's enabled in the verbose log
9 years ago
Jaime Marquínez Ferrándiz
779822d945
Add experimental support for lazy loading the info extractors
...
'make lazy-extractors' creates the youtube_dl/extractor/lazy_extractors.py (imported by youtube_dl/extractor/__init__.py), which contains simplified classes that only have the 'suitable' class method and that load the appropriate class with the '__new__' method when an instance is created.
9 years ago
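The lazy-loading idea described in this commit can be illustrated with a minimal sketch. It is not the generated lazy_extractors.py: the names `RealExampleIE`, `ExampleIE`, `LazyLoadExtractor`, `_REAL_CLASSES` and `_real_name` are hypothetical, and the registry dict stands in for whatever import mechanism the real file uses.

```python
import re


class RealExampleIE:
    """Stand-in for a full extractor class (hypothetical)."""
    _VALID_URL = r'https?://example\.com/video/(?P<id>\d+)'

    def extract_id(self, url):
        return re.match(self._VALID_URL, url).group('id')


# Hypothetical registry; assumed here in place of an import by module name.
_REAL_CLASSES = {'RealExampleIE': RealExampleIE}


class LazyLoadExtractor:
    """Simplified class: only URL matching works without the real class."""
    _real_name = None
    _VALID_URL = None

    @classmethod
    def suitable(cls, url):
        # Cheap check that needs nothing beyond the regex.
        return re.match(cls._VALID_URL, url) is not None

    def __new__(cls, *args, **kwargs):
        # Instantiating the lazy class returns an instance of the real one.
        real_cls = _REAL_CLASSES[cls._real_name]
        return real_cls(*args, **kwargs)


class ExampleIE(LazyLoadExtractor):
    _real_name = 'RealExampleIE'
    _VALID_URL = r'https?://example\.com/video/(?P<id>\d+)'
```

In this sketch `ExampleIE.suitable(url)` never touches the real class, while `ExampleIE()` hands back a `RealExampleIE` instance.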
Jaime Marquínez Ferrándiz
1b3d5e05a8
Move the extractors import to youtube_dl/extractor/extractors.py
9 years ago
Jaime Marquínez Ferrándiz
e52d7f85f2
Delay initialization of InfoExtractors until they are needed
9 years ago
Sergey M․
568d2f78d6
[tnaflix] Fix metadata extraction
9 years ago
Sergey M․
2f2fcf1a33
[tnaflix] Fix extraction ( Closes #9074 )
9 years ago
Sergey M․
bacec0397f
[extractor/common] Relax _hidden_inputs
9 years ago
Sergey M․
3c6c7e7d7e
[gdcvault] Fix extraction ( Closes #9107 , closes #9114 )
9 years ago
Sergey M․
fb38aa8b53
[extractor/common] Support arbitrary format strings for template based identifiers in mpd manifests ( Closes #9119 , closes #9120 )
9 years ago
Sergey M․
18da24634c
[democracynow] Improve extraction
9 years ago
Sergey M․
a134426d61
[democracynow] Fix tests
9 years ago
Sergey M․
a64c0c9b06
[democracynow] Make description optional ( Closes #9115 )
9 years ago
Sergey M․
56019444cb
[novamov] Improve _VALID_URL template ( Closes #9116 )
9 years ago
remitamine
a1ff3cd5f9
[acast] fix channel extraction( closes #9117 )
9 years ago
remitamine
9a32e80477
[acast] fix extraction( #9117 )
9 years ago
Sergey M․
536a55dabd
[YoutubeDL] Sanitize single thumbnail URL
9 years ago
Sergey M․
ed6fb8b804
[vrt] Add support for direct hls playlists and YouTube ( Closes #9108 )
9 years ago
Sergey M․
3afef2e3fc
[beeg] Improve extraction
9 years ago
Sergey M․
e90d175436
[yandexmusic] Extract music album metafields ( Closes #7354 )
9 years ago
Sergey M․
7a93ab5f3f
[extractor/common] Introduce music album metafields
9 years ago
Philipp Hagemeister
c41cf65d4a
release 2016.04.06
9 years ago
Yen Chi Hsuan
92d5477d84
[compat] Handle tuples properly in urlencode()
...
Fixes #9055
9 years ago
Yen Chi Hsuan
8790249c68
[iqiyi] Improve error detection for VIP-only videos
...
Closes #9071
9 years ago
Philipp Hagemeister
416930d450
release 2016.04.05
9 years ago
Sergey M․
65150b41bb
[deezer] Fix extraction ( Closes #9086 )
9 years ago
Sergey M․
e42f413716
[rte] Improve thumbnail extraction ( Closes #9085 )
9 years ago
Sergey M․
40a056d85d
[extractor/__init__] Remove novamov extractor and sort novamov based extractors alphabetically
9 years ago
Sergey M․
e7d77efb9d
[auroravid] Add extractor ( Closes #9070 )
9 years ago
Sergey M․
995cf05c96
[novamov] Make title fatal
9 years ago
Jaime Marquínez Ferrándiz
5bf28d7864
[utils] dfxp2srt: add additional namespace
...
Used by the ZDF subtitles (#9081 ).
9 years ago
Jaime Marquínez Ferrándiz
8c7d6e8e22
[zdf] Extract subtitles ( closes #9081 )
9 years ago
Sergey M․
6d4fc66bfc
[youtube] Add support for zwearz ( Closes #9062 )
9 years ago
remitamine
23576edbfc
[brightcove:legacy] skip None value for uploader_id
9 years ago
remitamine
4d4cd35f48
[brightcove:legacy] extract uploader_id as a string
9 years ago
remitamine
3aac9b2fb1
[nowness] update tests
9 years ago
remitamine
e47d19e991
[brightcove:new] extract subtitles and strip video title
9 years ago
remitamine
41f5492fbc
[brightcove:legacy] improve format extraction and extract uploader_id, duration and timestamp
9 years ago
Jaime Marquínez Ferrándiz
2defa7d75a
[instagram:user] Fix extraction ( fixes #9059 )
...
The URL for the next page was incorrect and we always got the same page, therefore it got trapped in an infinite loop.
9 years ago
Sergey M․
bbc26c8a01
[bbc] Set vcodec to none for audio formats
9 years ago
Sergey M․
b507cc925b
[extractor/common] Carry long line
9 years ago
Sergey M․
db8ee7ec05
[extractor/common] Fix numeric identifiers conversion in DASH URL templates
9 years ago
remitamine
08136dc138
[brightcove] fix format sorting
9 years ago
remitamine
fe7ef95e91
[cbsinteractive] Add support for ZDNet videos
9 years ago
remitamine
5f705baf5e
[cnet] extract more formats
9 years ago
remitamine
0750b2491f
[ffmpeg] try to convert tt subtitles using dfxp2srt
9 years ago
remitamine
df634be2ed
[common] prefer using mime type over ext for smil subtitle extraction
...
The subtitle ext for http://www.cnet.com/videos/download-amazon-prime-movies-and-tv/
is adb_xml, whereas using the mime type yields tt (application/smptett+xml)
9 years ago
Jaime Marquínez Ferrándiz
6d628fafca
[camwithher] Remove extra blank line
9 years ago
Jaime Marquínez Ferrándiz
0f28777f58
[cbsnews] Remove unused import
9 years ago
Jaime Marquínez Ferrándiz
329c1eae54
[aenetworks] Make pep8 happy
9 years ago
Sergey M․
9aaaf8e8e8
[camwithher] Improve extraction ( Closes #8989 )
9 years ago
theGeekPirate
04819db58e
[camwithher] Add extractor
...
Corrected unnecessary test
Sane variable naming
RTMP all .flv & url_id for _download_webpage()
Corrected all outstanding issues, next up is a squash!
9 years ago
remitamine
79ba9140dc
[theplatform] extract timestamp and uploader
9 years ago
Sergey M․
75d572e9fb
[screencast] Improve title regexes ( Closes #9025 )
9 years ago
Martin Trigaux
791d6aaecc
screencast.com: fallback on page title
...
When determining the title of the page, use the <title> tag of the page
9 years ago
Sergey M․
81de73e5b4
[screencast] Add test
9 years ago
Martin Trigaux
83cedc1cf2
screencast.com: support missing www
...
The "www." part of the URL is not mandatory
9 years ago
Sergey M․
244cd04237
[pluralsight] Remove unnecessary login/password encode
9 years ago
Sergey M․
fbdaced256
[lynda] Remove unnecessary login/password encode
9 years ago
Sergey M․
a3373823e1
[udemy] Remove unnecessary login/password encode
...
This is now covered by compat_urllib_parse_urlencode
9 years ago
Sergey M․
03caa463e7
[udemy:course] Skip non-video lectures
9 years ago
remitamine
3f64379eda
[movieclips] fix extraction
9 years ago
remitamine
3e0c3d14d9
[cbs] add base extractor
9 years ago
remitamine
d8873d4def
[aenetworks] improve format extraction
9 years ago
remitamine
db1c969da5
[theplatform] sign https urls
9 years ago
Philipp Hagemeister
1e02bc7ba2
release 2016.04.01
9 years ago
remitamine
63c55e9f22
[cbs] improve extraction( closes #6321 )
9 years ago
remitamine
f9b1529af8
[generic] remove sbnation test(handled by VoxMediaIE)
9 years ago
remitamine
961fc024d2
[voxmedia] improve sbnation support
9 years ago
Sergey M․
b53a06e3b9
[udemy:course] Use new URL format
9 years ago
remitamine
4ecc1fc638
[howstuffworks] improve extraction
9 years ago
Yen Chi Hsuan
5b012dfce8
[tudou] Improve error handling ( closes #8988 )
9 years ago
remitamine
8369942773
[voxmedia] Add new extractor( closes #3182 )
9 years ago
Sergey M․
86f3b66cec
[udemy] Remove unused import
9 years ago
Sergey M․
6bb4600717
[udemy:course] Simplify course curriculum downloading
9 years ago
Sergey M․
41d06b0424
[extractor/common] Improve _request_webpage
...
* Do not ignore data, headers and query for Requests
* Default values for headers and query switched to dicts since these are used by urllib itself
9 years ago
Sergey M․
15d260ebaa
[utils] Use update_Request in http_request
9 years ago
Sergey M․
ed0291d153
[utils] Add update_Request
9 years ago
Sergey M․
81da8cbc45
[udemy] Switch to api 2.0 ( Closes #9035 )
9 years ago
Sergey M․
5299bc3f91
[beeg] Switch to api v6 ( Closes #9036 )
9 years ago
remitamine
c9c39c22c5
[nationalgeographic] add support for channel.nationalgeographic.com urls
9 years ago
remitamine
d84b48e3f1
[nationalgeographic] improve extraction
9 years ago
remitamine
dd17041c82
[tenplay] remove extractor( fixes #6927 )
9 years ago
remitamine
fea7295b14
[brightcove] relax embed_in_page regex
9 years ago
remitamine
9cf01f7f30
[nbc] add new extractor for csnne.com( #5432 )
9 years ago
remitamine
ce548296fe
[cnbc] fix test
9 years ago
remitamine
c02ec7d430
[cnbc] Add new extractor( closes #8012 )
9 years ago
remitamine
6b820a2376
[myspace] improve extraction
9 years ago
Yen Chi Hsuan
e621a344e6
[kuwo] Port to new API and enable --cn-verification-proxy
9 years ago
Yen Chi Hsuan
3ae6f8fec1
[kuwo] Remove _sort_formats() from KuwoBaseIE._get_formats()
...
Following the idea proposed in 19dbaeece3
9 years ago
Yen Chi Hsuan
597d52fadb
[kuwo:song] Correct song ID extraction ( fixes #9033 )
...
Bug introduced in daef04a4e7.
9 years ago
Sergey M․
afca767d19
[tumblr] Improve _VALID_URL ( Closes #9027 )
9 years ago
remitamine
6e359a1534
[comcarcoff] do not depend on crackle extractor( closes #8995 )
...
Previously, extraction was delegated to crackle to extract more info
and subtitles (#6106), but some of the episodes can't be extracted using
crackle (#8995).
9 years ago
Sergey M․
33f3040a3e
[YoutubeDL] Fix sanitizing subtitles' url
9 years ago
Sergey M․
03442072c0
[pornhub] Fix typo ( Closes #9008 )
9 years ago
Sergey M․
c8b13fec02
[foxnews] Restore upload time fields in test
9 years ago
Sergey M․
87d105ac6c
[amp] Fix upload timestamp extraction ( Closes #9007 )
9 years ago
Sergey M․
3454139576
[pornhub:uservideos] Add support for multipage videos ( Closes #9006 )
9 years ago
Sergey M․
3a23bae9cc
[pornhub:playlistbase] Do not include videos not from playlist
9 years ago
Sergey M․
8f9a477e7f
[pornhub:playlistbase] Use orderedSet
9 years ago
Sergey M․
a1cf3e38a3
[bbc] Extend vpid regex ( Closes #9003 )
9 years ago
Philipp Hagemeister
a122e7080b
release 2016.03.27
9 years ago
Sergey M․
b22ca76204
[extractor/common] Filter out unsupported encrypted media for f4m formats ( Closes #8573 )
9 years ago
Sergey M․
f7df343b4a
[downloader/f4m] Extract routine for removing unsupported encrypted media
9 years ago
Sergey M․
19dbaeece3
Remove _sort_formats from _extract_*_formats methods
...
Now _sort_formats should be called explicitly.
_sort_formats has been added to all the necessary places in code.
Closes #8051
9 years ago
Yen Chi Hsuan
395fd4b08a
[twitter] Handle another form of embedded Vine
...
Fixes #8996
9 years ago
Sergey M․
8018028d0f
[pluralsight] Extract chapter metadata ( Closes #8993 )
9 years ago
Sergey M․
00322ad4fd
[lynda] Extract chapter metadata ( #8993 )
9 years ago
Sergey M․
4cf3489c6e
[vevo] Update videoservice API URL ( Closes #8900 )
9 years ago
Sergey M․
b24ab3e341
[udemy] Improve paid course detection
9 years ago
Sergey M․
af4116f4f0
[udemy] Improve format_id
9 years ago
Sergey M․
f973e5d54e
[udemy] Drop outputs' formats
...
Always results in 403
9 years ago
Sergey M․
62f55aa68a
[udemy] Add outputs metadata to view_html formats
9 years ago
Sergey M․
02d7634d24
[udemy] Fix outputs' formats format_id
9 years ago
Sergey M․
48dce58ca9
[udemy] Use custom sorting
9 years ago
Sergey M․
efcba804f6
[udemy] Extract formats from view_html ( Closes #8979 )
9 years ago
Sergey M․
6dee688e6d
[youtube:playlistsbase] Restrict playlist regex ( Closes #8986 )
9 years ago
Sergey M․
eedb7ba536
[YoutubeDL] Sort imports
9 years ago
Sergey M․
dcf77cf1a7
[YoutubeDL] Sanitize final URLs ( Closes #8991 )
9 years ago
Sergey M․
17bcc626bf
[utils] Extract sanitize_url routine
9 years ago
Sergey M․
b5a5bbf376
[mailru] Extend _VALID_URL ( Closes #8990 )
9 years ago
Yen Chi Hsuan
e68d3a010f
[twitter] Fix extraction ( closes #8966 )
...
HLS and DASH formats no longer appear in test cases. I keep them
for fear of triggering new errors.
9 years ago
Yen Chi Hsuan
d10fe8358c
[generic] Add a test case for brightcove embed
...
Closes #8862
9 years ago
Yen Chi Hsuan
d6c340cae5
[brightcove] Extract more formats ( #8862 )
9 years ago
Yen Chi Hsuan
5964b598ff
[brightcove] Support alternative BrightcoveExperience layout
...
The full URL lays in the `data` attribute of <object> (#8862 )
9 years ago
Philipp Hagemeister
62cdb96f51
release 2016.03.26
9 years ago
Sergey M․
6e6bc8dae5
Use urlencode_postdata across the codebase
9 years ago
Sergey M․
15707c7e02
[compat] Add compat_urllib_parse_urlencode and eliminate encode_dict
...
encode_dict functionality has been improved and moved directly into compat_urllib_parse_urlencode
All occurrences of compat_urllib_parse.urlencode throughout the codebase have been replaced by compat_urllib_parse_urlencode
Closes #8974
9 years ago
Sergey M․
2156f16ca7
[thescene] Fix extraction and improve style ( Closes #8978 )
9 years ago
Sergey M․
4db441de72
[once] Relax _VALID_URL ( Closes #8976 )
9 years ago
Philipp Hagemeister
0be8314dc8
release 2016.03.25
9 years ago
Yen Chi Hsuan
d7f62b049a
[iqiyi] Update enc_key
9 years ago
Yen Chi Hsuan
3bb3356812
[douyutv] Extend _VALID_URL
9 years ago
Sergey M․
98e68806fb
[mnet] Improve ( Closes #8958 )
9 years ago
Kagami Hiiragi
e031768666
[mnet] Add new extractor
9 years ago
Sergey M․
5eb7db4ee9
[udemy] Add support for new URL schema
9 years ago
Sergey M․
f0e83681d9
[udemy] Extract formats from outputs
9 years ago
Sergey M․
ff9d5d0938
[udemy] Improve course enrolling
9 years ago
Sergey M․
d041a73674
[extractor/__init__] Add youtube:live and sort youtube extractors alphabetically
9 years ago
Sergey M․
f07e276a04
[youtube:live] Add extractor ( Closes #8959 )
9 years ago
Sergey M․
993271da0a
[nytimes] Tolerate missing metadata ( Closes #8952 )
9 years ago
Sergey M․
369e7e3ff0
[iprima] Fix extraction ( Closes #8953 )
9 years ago
Sergey M․
5767b4eeae
[mtv] Fix description extraction ( Closes #8962 )
9 years ago
Yen Chi Hsuan
622d19160b
[utils] Clarify Python versions affected by buggy struct module
9 years ago
Yen Chi Hsuan
32d88410eb
[tumblr] Add a test with Instagram embed
...
Closes #8817
9 years ago
Yen Chi Hsuan
5a51775a58
[generic] Extract Instagram embeds ( #8817 )
9 years ago
Yen Chi Hsuan
87696e78d7
[instagram] Unescape description ( #8817 )
9 years ago
Yen Chi Hsuan
c4096e8aea
[instagram] Extract embed videos ( #8817 )
9 years ago
Yen Chi Hsuan
fc27ea9464
[tumblr] Support Vine embeds ( #8817 )
9 years ago
Yen Chi Hsuan
088e1aac59
[generic] Support Vine embeds ( #8817 )
9 years ago
Sergey M
4333d56494
Merge pull request #8898 from dstftw/fragment-retries
...
Add --fragment-retries option (Fixes #8466 )
9 years ago
Sergey M․
882c699296
[tunein] Fix stream data extraction ( Closes #8899 , closes #8924 )
9 years ago
Yen Chi Hsuan
efbed08dc2
[utils] Encode hostnames before passing to urllib
...
With IDN (Internationalized Domain Name) and a proxy, non-ascii URLs
are passed down to urllib/urllib2, causing UnicodeEncodeError
Fixes #8890
9 years ago
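The fix described in this commit amounts to IDNA-encoding the hostname before the URL reaches urllib. A minimal Python 3 sketch follows, assuming the hypothetical helper name `encode_hostname`; it is not the code from this commit, and it does not handle bracketed IPv6 literals.

```python
from urllib.parse import urlsplit, urlunsplit


def encode_hostname(url):
    """Return url with a non-ASCII hostname converted to its IDNA form."""
    parts = urlsplit(url)
    host = parts.hostname
    if host is None:
        return url
    # 'idna' is a built-in codec; it yields the ASCII (punycode) form.
    ascii_host = host.encode('idna').decode('ascii')
    # Keep any userinfo that preceded the host untouched.
    userinfo, sep, _ = parts.netloc.rpartition('@')
    netloc = userinfo + sep + ascii_host
    if parts.port is not None:
        netloc += ':%d' % parts.port
    return urlunsplit(
        (parts.scheme, netloc, parts.path, parts.query, parts.fragment))
```

URLs whose hostname is already ASCII pass through unchanged, which matches the spirit of the later "[utils] Don't touch URLs if not necessary" commit.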
Jaime Marquínez Ferrándiz
7da2c87119
Add extractor for thescene.com ( closes #8929 )
9 years ago
Sergey M․
c6ca11f1b3
[once] Prevent ads from embedding into m3u8 playlists ( Closes #8893 )
9 years ago
Sergey M․
2beeb286e1
[laola1tv] Add support for livestreams ( Closes #8934 )
9 years ago
Sergey M․
cc7397b04d
[ceskatelevize] Make m3u8 formats extraction non fatal ( Closes #8933 )
9 years ago
Sergey M․
bc5d16b302
[animeondemand] Skip dash for now
9 years ago
Sergey M․
85c637b737
[animeondemand] Extract teaser when no full episode available ( #8923 )
9 years ago
Sergey M․
5c69f7a479
[animeondemand] Respect startvideo ( Closes #8923 )
9 years ago
Sergey M․
ff5873b72d
[motherless] Detect friends only videos
9 years ago
Sergey M․
065c4b27bf
[xhamster:embed] Extract vars ( Closes #8912 )
9 years ago
Sergey M․
1600ed1ff9
[rutv] Improve flash version pattern ( Closes #8911 )
9 years ago
Sergey M․
5886b38d73
Add support for https for all extractors as preventive and future-proof measure
9 years ago
Sergey M․
0cef27ad25
Add missing r prefix for _VALID_URLs
9 years ago
Sergey M․
12af4beb3e
[mailru] Add support for https ( Closes #8920 )
9 years ago
Sergey M․
9016d76f71
[YoutubeDL] Improve _format_note
9 years ago
Sergey M․
3c5d183c19
[animeondemand] Extract all formats ( Closes #8906 )
9 years ago
Sergey M․
3e8bb9a972
[animeondemand] Detect geo restriction
9 years ago
Yen Chi Hsuan
daef04a4e7
[kuwo] Fix KuwoChartIE and KuwoSingerIE and accept new URL forms
9 years ago
Yen Chi Hsuan
2648918c81
[vlive] Fix creator extraction ( closes #8814 )
9 years ago
Yen Chi Hsuan
9e3c2f1d74
[openload] Misc improvements
...
* Add thumbnail
* Detect errors (#6469 )
* Match more (#6469 , #8489 )
9 years ago
Yen Chi Hsuan
2bfeee69b9
[openload] Add new extractor ( closes #8489 )
9 years ago
Yen Chi Hsuan
664bcd80b9
[tudou] Use InAdvancePagedList ( closes #8884 )
9 years ago
Sergey M․
3c20208eff
[francetv] Improve formats extraction
9 years ago
Sergey M․
db264e3cc3
[francetvinfo] Add support for france3-regions and strip title ( Closes #7673 )
9 years ago
Sergey M․
96a9f22d98
[discovery] Relax _VALID_URL ( Closes #8903 )
9 years ago
Sergey M․
40025ee2a3
[postprocessor/ffmpeg] Allow embedding webvtt into webm ( Closes #8874 )
9 years ago
Sergey M․
298c04b464
[91porn] Use common messages' wording
9 years ago
Sergey M․
d95114dd83
[91porn] Unquote final URL ( Closes #8881 )
9 years ago
Sergey M․
fa023ccb2c
[biobiochiletv] Fix extraction, extract m3u8 formats and overall improve ( Closes #7314 )
9 years ago
jjatria
e36f4aa72b
[biobiotv] Add extractor
9 years ago
Sergey M․
f1ced6df51
[cda] Improve and simplify ( Closes #8805 )
9 years ago
Kacper Michajłow
8b0d7a66ef
[cda] Add new extractor for cda.pl
...
Fixes #8760
9 years ago
Sergey M․
3aec71766d
[safari:api] Separate extractor ( Closes #8871 )
9 years ago
Sergey M․
16a8b7986b
[downloader/fragment] Document fragment_retries
9 years ago
Sergey M․
617e58d850
[downloader/{common,fragment}] Fix total retries reporting on python 2.6
9 years ago
Sergey M․
e33baba0dd
[downloader/dash] Add fragment retry capability
...
YouTube may often return 404 HTTP error for a fragment causing the
whole download to fail. However if the same fragment is immediately
retried with the same request data this usually succeeds (1-2 attempts
are usually enough), thus allowing the whole file to be downloaded successfully.
So, we will retry all fragments that fail with 404 HTTP error for now.
9 years ago
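The retry policy described in this commit (retry only fragments that fail with HTTP 404, up to a limit) might look roughly like the sketch below. The function names and the `fetch` callable are hypothetical, the parameter name merely mirrors the `--fragment-retries` option, and the default value is illustrative; this is not the downloader's actual code.

```python
import urllib.error


def fetch_with_fragment_retries(fetch, fragment_retries=10):
    """Call fetch() and retry it when it fails with HTTP 404."""
    attempt = 0
    while True:
        try:
            return fetch()
        except urllib.error.HTTPError as err:
            # Only 404 is retried; other errors and exhausted retries propagate.
            if err.code != 404 or attempt >= fragment_retries:
                raise
            attempt += 1


def make_flaky_fetch(failures):
    """Hypothetical test double: fails with 404 `failures` times, then succeeds."""
    state = {'left': failures}

    def fetch():
        if state['left'] > 0:
            state['left'] -= 1
            raise urllib.error.HTTPError(
                'http://example.invalid/frag1', 404, 'Not Found', None, None)
        return b'fragment-bytes'

    return fetch
```

A fragment that fails twice and then succeeds is returned normally as long as at least two retries are allowed.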
Sergey M․
721f26b821
[downloader/fragment] Add report_retry_fragment
9 years ago
Sergey M․
52bb437e41
[options] Add --fragment-retries option
9 years ago
Jaime Marquínez Ferrándiz
782b1b5bd1
[utils] lookup_unit_table: Match word boundary instead of end of string
9 years ago
Sergey M․
0d769bcb78
[extractor/generic] Fix missing byte literal prefix
9 years ago
remitamine
4cd70099ea
[hbo] Add new extractor
9 years ago
Jaime Marquínez Ferrándiz
09fc33198a
utils: lookup_unit_table: Use a stricter regex
...
In parse_count multiple units start with the same letter, so it would match different units depending on the order they were sorted when iterating over them.
9 years ago
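The ordering problem described here, together with the later word-boundary change, can be seen in a small sketch. The function below is a simplified reimplementation under assumptions, not the exact code in utils, and the unit table is illustrative.

```python
import re


def lookup_unit_table(unit_table, s):
    """Parse strings such as '2k' or '1.5kk' using a unit -> multiplier table."""
    units_re = '|'.join(re.escape(u) for u in unit_table)
    # The trailing \b forces the whole unit to match, so 'k' cannot be
    # picked for '1.5kk' regardless of the order of the alternatives.
    m = re.match(
        r'(?P<num>[0-9]+(?:[,.][0-9]*)?)\s*(?P<unit>%s)\b' % units_re, s)
    if not m:
        return None
    num = float(m.group('num').replace(',', '.'))
    return int(num * unit_table[m.group('unit')])


# Illustrative table in which several units share a first letter.
COUNT_UNITS = {'k': 1000, 'K': 1000, 'kk': 10 ** 6, 'm': 10 ** 6, 'M': 10 ** 6}
```

Without the boundary, `'1.5kk'` could be read as 1.5 times `k` or 1.5 times `kk` depending on which alternative the regex tried first.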
John Peel
d5aacf9a90
Added format_id to the filters on -f.
9 years ago