Commit Graph

1978 Commits (9d93e7da6c23151272a91fbc2889c098e3b5f139)

Author SHA1 Message Date
Philipp Hagemeister 8f05351984 [anitube] Minor fixes ()
Philipp Hagemeister 71791f414c Merge remote-tracking branch 'diffycat/master'
Philipp Hagemeister f3682997d7 Clean up unused imports and other minor mistakes
Philipp Hagemeister cc13cc0251 [teamcoco] Correct error
Philipp Hagemeister 86bd5f2ca9 Merge remote-tracking branch 'dz0ny/patch-1'
Philipp Hagemeister 8694c60000 import json for --dump-json
Mohamedh Fazal 9d1538182f Add an option to dump json information
Philipp Hagemeister 5904088811 Add support for tou.tv (Fixes )
Jaime Marquínez Ferrándiz 69545c2aff [d8] inherit from CanalplusIE
it reuses the same extraction process
Jaime Marquínez Ferrándiz 495da337ae Merge pull request from migbac/master
Add support for d8.tv
Philipp Hagemeister 34b3afc7be release 2013.11.19
Philipp Hagemeister 00373a4c5d Merge pull request from rg3/console-title
Correctly write and restore the console title on the stack (fixes )
Philipp Hagemeister cb7dfeeac4 [youtube] only allow domain name to be upper-case ()
Jaime Marquínez Ferrándiz efd6c574a2 Correctly write and restore the console title on the stack (fixes )
Philipp Hagemeister 4113e6ab56 [auengine] Do not return unnecessary ext
Philipp Hagemeister 9a942a4671 release 2013.11.18.1
Philipp Hagemeister 9906d397a0 [auengine] Simplify
Philipp Hagemeister ae8f787141 Remove iPhone from user agent. This breaks a lot of extractors
In the future, it might be worth investigating whether we get better content when we claime to be an iPhone.
Philipp Hagemeister a81b4d5c8f release 2013.11.18
Philipp Hagemeister 887c6acdf2 Support multiple embedded YouTube URLs (Fixes )
Philipp Hagemeister 83aa529330 Support protocol-independent URLs ()
Philipp Hagemeister 96b31b6533 Add iPhone to UA ()
Philipp Hagemeister fccd377198 Suppor embed-only videos (Fixes )
rzhxeo 2b35c9ef74 Merge branch 'master' into rtmpdump
Conflicts:
	youtube_dl/FileDownloader.py

Merge
Philipp Hagemeister 73c566695f release 2013.11.17
Philipp Hagemeister 63b7b7224a [MTVIE] Try with RTMP URL if download fails
This fixes youtube-dl http://www.southpark.de/clips/155251/cartman-vs-the-dog-whisperer
Philipp Hagemeister ce80c8b8ee Merge pull request from rzhxeo/southpark
Add support for southpark.de
Philipp Hagemeister 749febf4d1 Allow --console-title when --quiet is given (Fixes )
Philipp Hagemeister bdde425cbe Save and restore console title (Fixes )
rzhxeo 746f491f82 Add support for southpark.de
rzhxeo 1672647ade [SouthParkStudiosIE] Move from _TEST to _TESTS
rzhxeo 90b6bbc38c [SouthParkStudiosIE] Also detect urls without http:// or www
Philipp Hagemeister ce02ed60f2 Remove * imports
Philipp Hagemeister 1e5b9a95fd Move console_title to YoutubeDL
Philipp Hagemeister 1d699755e0 [youtube] Add view_count (Fixes )
Philipp Hagemeister ddf49c6344 [arte] remove two typos
Anton Larionov ba3881dffd Add support for anitube.se ()
Philipp Hagemeister d1c252048b [redtube] Do not test md5, seems to vary
Philipp Hagemeister eab2724138 [gamekings] Do not test md5 sum, precise file changes regularly
Philipp Hagemeister 21ea3e06c9 [gamekings] remove unnecessary import
Philipp Hagemeister 52d703d3d1 [tvp] Skip tests
Philipp Hagemeister ce152341a1 [bambuser] Do not test for MD5, seems to be flaky
Philipp Hagemeister f058e34011 [dailymotion] Fix playlists
Philipp Hagemeister b5349e8721 Fix indentation of (best) and (worst) in --list-formats
Philipp Hagemeister 7150858d49 [spiegel] Implement format selection
Philipp Hagemeister 91c7271aab Add automatic generation of format note based on bitrate and codecs
Philipp Hagemeister aa13b2dffd release 2013.11.15.1
Philipp Hagemeister fc2ef392be [ted] Fix playlists (Fixes )
Philipp Hagemeister 463a908705 [ted] simplify
Jaime Marquínez Ferrándiz d24ffe1cfa [rtlnow] Remove the test for nitro
The videos expire.
Jaime Marquínez Ferrándiz 78fb87b283 Don't accept '>' inside the content attribute in OpenGraph regexes
Jaime Marquínez Ferrándiz ab2d524780 Improve the OpenGraph regex
* Do not accept '>' between the property and content attributes.
* Recognize the properties if the content attribute is before the property attribute using two regexes (fixes the extraction of the description for SlideshareIE).
Jaime Marquínez Ferrándiz 85d61685f1 [tvp] Update the title and the description of the test video
Jaime Marquínez Ferrándiz b9643eed7c [youtube:channel] Fix the extraction of autogenerated channels
The ajax pages are empty, now it looks directly in the channel's /videos page
Jaime Marquínez Ferrándiz feee2ecfa9 Pass the 'download' argument to 'process_video_result' (fixes )
Philipp Hagemeister a25a5cfeec release 2013.11.15
Philipp Hagemeister 0e145dd541 Merge branch 'master' of github.com:rg3/youtube-dl
Philipp Hagemeister 9f9be844fc [youtube] Fix protocol-independent URLs (Fixes )
Jaime Marquínez Ferrándiz e3b9ab5e18 [soundlcoud] Set the correct extension for the tracks (fixes )
Some tracks are not in mp3 format, they can be wav files.
Jaime Marquínez Ferrándiz c66d2baa9c [livestream] Add an extractor for the original version of livestream (closes )
The two versions use different systems.
Janez Troha 9771cceb2c Fix filename extension leaking to json filename
Makes writeinfojson behaving exactly as writethumbnail in case where filename contains mediafile extension.

Case:

video.mp4 converted to music.mp3 would yield music.mp4.info.json instead music.mp3.info.json or music.info.json
Jaime Marquínez Ferrándiz ca715127a2 Don't assume the 'subtitlesformat' is set in the params dict (fixes )
Jaime Marquínez Ferrándiz ea7a7af1d4 [gamekings] Fix the test video checksum
Jaime Marquínez Ferrándiz 880e1c529d [youtube:playlist] Login into youtube if requested (fixes )
Allows to download private playlists
Jaime Marquínez Ferrándiz dcbb45803f [youtube:playlist] Don't use the gdata api (closes )
Parse the playlist pages instead
Philipp Hagemeister 80b9bbce86 release 2013.11.13
Philipp Hagemeister d37936386f Credit @saper for tvp IE ()
Philipp Hagemeister c3a3028f9f [tvp] Minor improvements ()
Philipp Hagemeister 6c5ad80cdc Merge remote-tracking branch 'saper/tvp'
Philipp Hagemeister b5bdc2699a Credit @jelly for gamekings extractor ()
Philipp Hagemeister 384b98cd8f [gamekings] Minor fixes ()
Jelle van der Waa eb9b5bffef Add extractor for gamekings.tv
migbac 0bd59f3723 Add support for d8.tv
Jaime Marquínez Ferrándiz 8b8cbd8f6d [vine] Fix uploader extraction
Jaime Marquínez Ferrándiz 72b18c5d34 FFmpegMetadataPP: don't enclose the values with " (fixes )
Philipp Hagemeister eb0a839866 [common] Simplify og_search_property
Philipp Hagemeister 1777d5a952 release 2013.11.11
Philipp Hagemeister d4b7da84c3 Clarify -c. Do not pass it in if you don't know what you're doing
Suggested in 
Jaime Marquínez Ferrándiz 801dbbdffd Use avconv for downloading with m3u8 manifests if it's available (fixes )
Jaime Marquínez Ferrándiz 0ed05a1d2d Use the 'rtmp_live' field for the live parameter of rtmpdump
Jaime Marquínez Ferrándiz 1008bebade Merge remote-tracking branch 'rzhxeo/rtmpdump_live'
Jaime Marquínez Ferrándiz be6dfd1b49 [ted] Return a single info_dict for talks urls
It failed with the --list-subs option
Jaime Marquínez Ferrándiz 231516b6c9 Merge pull request from iemejia/master
[ted] support for subtitles
Jaime Marquínez Ferrándiz fb53d58dcf Merge pull request from saper/escaped
Fix AssertionError when og property not found
Jaime Marquínez Ferrándiz 2a9e9b210b Fix the documentation of '--autonumber-size' ()
it's '--auto-number' not '--autonumber'
Jaime Marquínez Ferrándiz 897d6cc43a Improve format listing for long format ids
Now arte.tv videos have quite long ids.
Jaime Marquínez Ferrándiz f470c6c812 [arte] Improve the format sorting
Also use the bitrate.
Prefer normal version and sourds/mal version over original version with subtitles.
Jaime Marquínez Ferrándiz 566d4e0425 [arte] Make sure the format_id is unique (closes )
Include the bitrate and use the height instead of the quality field.
Jaime Marquínez Ferrándiz 81be02d2f9 [cnn] Accept www.cnn.com urls (fixes )
Jaime Marquínez Ferrándiz c2b6a482d5 [brightcove] the format function requires to specify the index in python2.6
Jaime Marquínez Ferrándiz 12c167c881 [soundcloud] Allow to download tracks marked as not 'streamable'
They use the rtmp protocol but if the are marked as 'downloadable' it can use the direct download link.
Jaime Marquínez Ferrándiz 20aafee7fa [kankan] Fix the video url
It now requires two additional parameters, one is a timestamp we get from the getCdnresource_flv page and the other is a key we have to build.
Jaime Marquínez Ferrándiz be07375b66 Don't recode the video with m3u8 downloads (fixes )
rzhxeo 4894fe8c5b Report download progress of rtmpdump
Jaime Marquínez Ferrándiz dd5bcdc4c9 [brightcove] Set the 'Referer' header if the url has the 'linkBaseUrl' parameter (fixes )
Philipp Hagemeister 6161d17579 release 2013.11.07
Jaime Marquínez Ferrándiz 4ac5306ae7 Fix the report progress when file_size is unknown ()
The report_progress function will accept eta and percent with None value and will set the message to 'Unknow ETA' or 'Unknown %'.
Otherwise the values must be numbers.
Jaime Marquínez Ferrándiz b1a80ec1a9 [xnxx] Accept urls that start with 'www' (fixes )
Philipp Hagemeister 672fe94dcb release 2013.11.06.1
Jaime Marquínez Ferrándiz 51040b72ed [brightcove] Support redirected urls from bcove.me (fixes )
'bctid' needs to be changed to '@videoPlayer', and 'bckey' to 'playerKey'.
Jaime Marquínez Ferrándiz 4f045eef8f [youtube:channel] Fix the extraction
The page don't include the 'load more' button anymore, now we directly get the 'c4_browse_ajax' pages.
Jaime Marquínez Ferrándiz 5d7b253ea0 Add an extractor for eitb.tv (fixes )
The BrighcoveExperience object doesn't contain the video id, the extractor adds it and passes the url to BrightcoveIE.
Jaime Marquínez Ferrándiz b0759f0c19 [brightcove] Extract all the available formats
Jaime Marquínez Ferrándiz 065472936a Add an extractor for space.com (fixes )
It uses Brightcove, but requires some special process for getting a url with the playerKey field in some videos
Jaime Marquínez Ferrándiz fc4a0c2aec [brightcove] Change the 'videoId' or 'videoID' field to '@videoPlayer' (fixes )
It seems to be needed when using the htmlFederated page
Jaime Marquínez Ferrándiz eeb165e674 [brightcove] Add the extraction of the url from generic
Jaime Marquínez Ferrándiz 9ee2b5f6f2 tests: don't run the test if any of the extractors listed in the 'add_ie' field is marked as not working
Philipp Hagemeister da54be877a release 2013.11.06
Philipp Hagemeister 50a886b7ab Fix reporting when file size is unkown (Fixes )
Philipp Hagemeister 76e67c2cb6 Clean up imports
Marcin Cieślak 5137ebac0b [tvp] Telewizja Polska: new extractor for tvp.pl, fixes
Thanks-To: mplonski

https://github.com/mplonski/linux/blob/master/tvp-dl.py
Marcin Cieślak a8eeb0597b Fix AssertionError when og property not found
On tvp.pl some webpages contain OpenGraph
metadata and some don't.

If og property is not found, _og_search_description
fails with

WARNING: unable to extract OpenGraph description; please report this issue on http://yt-dl.org/bug
Traceback (most recent call last):
  File "/usr/home/saper/bin/youtube-dl", line 18, in <module>
    youtube_dl.main()
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/__init__.py", line 766, in main
    _real_main(argv)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/__init__.py", line 719, in _real_main
    retcode = ydl.download(all_urls)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/YoutubeDL.py", line 715, in download
    videos = self.extract_info(url)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/YoutubeDL.py", line 348, in extract_info
    ie_result = ie.extract(url)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 125, in extract
    return self._real_extract(url)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/tvp.py", line 56, in _real_extract
    info['description'] = self._og_search_description(webpage)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 331, in _og_search_description
    return self._og_search_property('description', html, fatal=False, **kargs)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/extractor/common.py", line 325, in _og_search_property
    return unescapeHTML(escaped)
  File "/usr/home/saper/sw/youtube-dl/youtube_dl/utils.py", line 494, in unescapeHTML
    assert type(s) == type(u'')
AssertionError

The patch allows me to use:

  try:
    info['description'] = self._og_search_description(webpage)
    info['thumbnail'] = self._og_search_thumbnail(webpage)
  except RegexNotFoundError:
    pass
Ismaël Mejía 4ed3e51080 [ted] fixed error in case of no subtitles present
I created a test, but I leave it commented since TED videos get
new subtitles frequently.
rzhxeo 2dcf7d8f99 [GenericIE] Also detect youtube if src url of iframe is embedded in ' instaed of "
Jaime Marquínez Ferrándiz 19b0668251 [canal2c] Accept more urls (fixes )
The url only needs to have the 'idVideo' field in the query, in any position.
We have to set the 'void=oui' in the webpage url, so that we get the file name.
Jaime Marquínez Ferrándiz e7e6b54d8a [teamcoco] Parse the xml file and extract all the formats
Jaime Marquínez Ferrándiz 2a1a8ffe41 Merge pull request from alexvh/teamcoco_fix
[teamcoco] Fix video url extraction for some videos
Philipp Hagemeister 08fb86c49b [youtube] Add description for YoutubeSearchDateIE ()
Philipp Hagemeister 3633d77c0f Merge remote-tracking branch 'CBGoodBuddy/ytsearchtime'
Philipp Hagemeister 165e179764 release 2013.11.03
Philipp Hagemeister 12ebdd1506 [viddler] Support non-digit IDs (Fixes )
Jaime Marquínez Ferrándiz 1baf9a5938 Merge pull request from rzhxeo/cinemassacre
[CinemassacreIE] Support more embed urls
Jaime Marquínez Ferrándiz a56f9de156 Style fixes for extractors: remove spaces around (,),{ and }
Jaime Marquínez Ferrándiz fa5d47af4b Merge pull request from rzhxeo/mofosex
Add support for http://www.mofosex.com
Jaime Marquínez Ferrándiz d607038753 Merge pull request from rzhxeo/xtube
Add support for http://www.xtube.com
Jaime Marquínez Ferrándiz 9ac6a01aaf Merge pull request from rzhxeo/extremetube
Add support for http://www.extremetube.com
Jaime Marquínez Ferrándiz be97abc247 Set the 'extractor_key' field in the info_dict
It's the string returned by the class method 'ie_key', which allows to retrieve the extractor with 'get_info_extractor'
Jaime Marquínez Ferrándiz 9103bbc5cd Add the 'webpage_url' field to info_dict
The url for the video page, it must allow to reproduce the result.
It's automatically set by YoutubeDL if it's missing.
Jaime Marquínez Ferrándiz b6c45014ae Set the extra_info inside YoutubeDL.process_ie_result and set only if the keys are missing
Craig Markwardt a3dd924871 Add YoutubeSearchDateIE extractor to youtube.py & __init__.py, which searches by publication date.
rzhxeo 137bbb3e37 [XTubeIE] Add description to TEST
rzhxeo 86ad94bb2e [ExtremeTubeIE] Set age_limit to 18 and fix uploader extraction
Jaime Marquínez Ferrándiz 3e56add7c9 Merge pull request from rzhxeo/keezmovies
[KeezMoviesIE] Detect URLs with numbers in the SEO part correct
Jaime Marquínez Ferrándiz f52f01b5d2 [brightcove] Don't set the extension
If the video only has the 'FLVFullLengthURL' key, it can still be an mp4 file.
Jaime Marquínez Ferrándiz 98d7efb537 [exfm] skip tests
The site is down too often.
Jaime Marquínez Ferrándiz cf51923545 [youtube] Remove vevo test
The video is no longer available and it seems that vevo video don't use encrypted signatures anymore.
Ismaël Mejía 38fcd4597a Merge remote-tracking branch 'iemejia/master'
Jaime Marquínez Ferrándiz 165e3bb67a [bambuser] Add an extractor for channels (closes )
Ismaël Mejía 38db46794f Merge branch 'ted_subtitles'
Ismaël Mejía a9a3876d55 [ted] Added support for subtitle download
Ismaël Mejía 1f343eaabb [subtitles] refactor to support websites with subtitle information the
webpage.

I added the parameter webpage, so now it's similar to the way automatic
captions are handled. This is an improvement needed for websites like
TED.
Jaime Marquínez Ferrándiz 72a5b4f702 Add an extractor for bambuser.com ()
rzhxeo 0a43ddf320 [CinemassacreIE] Add live paramter to extracted info as a workaround
rzhxeo 31366066bd Add support for live parameter to rtmpdump
Philipp Hagemeister aa2484e390 release 2013.11.02
Philipp Hagemeister 8eddf3e91d [youtube] Encode subtitle track name in request (Fixes )
Jaime Marquínez Ferrándiz 60d142aa8d Add an extractor for vk.com (closes )
Jaime Marquínez Ferrándiz 66cf3ac342 [metacafe] Fix support for age-restricted videos (fixes )
The 'Content-Type' header must be set for disabling the family filter.
The 'flashversion' cookie  is only needed for AnyClip videos.
Added tests for standard metacafe videos and for age-restricted videos.
Also set the 'age_limit' field.
rzhxeo ab4e151347 [CinemassacreIE] Support more embed urls
Alex Van't Hof ac2547f5ff [teamcoco] Fix video url extraction for some videos
Video url extraction failed for some videos,
e.g. http://teamcoco.com/video/old-time-baseball

The url extracted was also occasionally suboptimal quality,
e.g. http://teamcoco.com/video/louis-ck-interview-george-w-bush