Commit Graph

449 Commits (8e7d0048886f374a58f0fe8ba021644d7074d02c)

Author SHA1 Message Date
Sergey M․ 666a9a2b95 [YoutubeDL] Improve audio/video-only file naming
Sergey M․ 38c6902b90 [YoutubeDL] Ensure correct extension is always present for a merged file (Closes )
Sergey M․ cb202fd286 [YoutubeDL] Filter requested info fields on `--load-info` as well
In order to properly handle JSON info files generated by youtube-dl versions prior to 4070b458ec
Jaime Marquínez Ferrándiz df8301fef5 [YoutubeDL] pep8: use 'k not in' instead of 'not k in'
Sergey M․ 4070b458ec [YoutubeDL] Do not write requested info in info JSON file (Closes , closes )
Sergey M․ e01c56f9e1 [YoutubeDL] Generalize best/worst format match behavior
Jaime Marquínez Ferrándiz 8dd5418803 Make 'best' format only match non-DASH formats (closes )
Otherwise it's impossible to only download non-DASH formats, for example `best[height=?480]/best` would download a DASH video if it's the only one with height=480, instead for falling back to the second format specifier.
For audio only urls (soundcloud, bandcamp ...), the best audio will be downloaded as before.
zouhair 5456d78f0c Typo "incompatible" instead of "uncompatible"
Jaime Marquínez Ferrándiz 9f3fa89f7c Remove the --max-quality option
It doesn't work well with 'bestvideo' and 'bestaudio' because they are usually before the max quality.
Format filters should be used instead, they are more flexible and don't require the requested quality to exist for each video.
Yen Chi Hsuan 8be2bdfabd [YoutubeDL] Remove the redundant assignment to old_filename
Caused by commmit 592e97e855
Sergey M․ 6728187ac0 [YoutubeDL] mp3 is compatible with mp4
Sergey M․ 17c8675853 [YoutubeDL] Allow bestvideo+bestaudio/best strategy for ted extractor
Sergey M․ c0dea0a782 [YoutubeDL] Respect explicit `--merge-format-output` for uncompatible formats as well
Sergey M․ 81cd954a51 [YoutubeDL] Merge incompatible formats into mkv ()
Sergey M․ feccf29c87 [YoutubeDL] Make `bestvideo+bestaudio/best` default format when merger is available
Jaime Marquínez Ferrándiz 5b5fbc0867 Detect already merged videos
Without the '--keep-video' option the two files would be downloaded again and even using the option, ffmpeg would be run again, which for some videos can take a long time.
We use a temporary file with ffmpeg so that the final file only exists if it success
Jaime Marquínez Ferrándiz d47aeb2252 FFmpegMergerPP: use the new system for specifying which files can be delete
Jaime Marquínez Ferrándiz 592e97e855 Postprocessors: use a list for the files that can be deleted
We could only know if we had to delete the original file, but this system allows to specify us more files (like subtitles).
Jaime Marquínez Ferrándiz ecc6bd1341 YoutubeDL.post_process: simplify keep_video handling
Since keep_video started as None we always set it to keep_video_wish unless it was None, so in the end keep_video == keep_video_wish. This should have been changed in f3ff1a3696, but I didn't notice it.
Sergey M․ c36a959549 [YoutubeDL] Try to download worst audio+video served by a single file first (Closes )
Jaime Marquínez Ferrándiz 8e4b83b96b Remove check for ssl certs
When it uses a capath instead of a cafile, 'get_ca_certs' or 'cert_store_stats' only returns certificates already used in a connection.
(see )
Sergey M․ a3c7019e06 [YoutubeDL] Check for `get_ca_certs` availability
`get_ca_certs` is not available in python <3.4
Jaime Marquínez Ferrándiz 1a944d8a2a Print a warning if no ssl certificates are loaded
Jaime Marquínez Ferrándiz 70a1165b32 Don't use bare 'except:'
They catch any exception, including KeyboardInterrupt, we don't want to catch it.
Jaime Marquínez Ferrándiz 179d6678b1 Remove the 'stitle' field
A warning has been printed for more than 2 years (since 97cd3afc75)
Kang Hyojun 613b2d9dc6 Fix mistyped docstring indent
Philipp Hagemeister 486dd09e0b [YoutubeDL] Check for bytes instead of unicode output templates ()
Also adapt the embedding examples for those poor souls still using 2.x.
Jaime Marquínez Ferrándiz 3a77719c5a Don't accept '-1' as format, 'all' is clearer
Sergey M․ e5a11a2293 [YoutubeDL] Sanitize path before creating non-existent paths (Closes )
Sergey M․ 1bb5c511a5 [YoutubeDL] Sanitize outtmpl as path
Sergey M․ 7dcad95d4f [YoutubeDL] Sanitize outtmpl as it may contain forbidden characters
Philipp Hagemeister 2461f79d2a [utils] Correct per-request proxy handling
Philipp Hagemeister 91410c9bfa [letv] Add --cn-verification-proxy (Closes )
Philipp Hagemeister c75f0b361a [downloader/external] Add support for custom options (Fixes , closes )
Jaime Marquínez Ferrándiz 31bd39256b --load-info: Use the fileinput module
It automatically handles the '-' filename as stdin
Jaime Marquínez Ferrándiz 003c69a84b Use shutil.get_terminal_size for getting the terminal width if it's available (python >= 3.3)
Jaime Marquínez Ferrándiz 0f2c0d335b [YoutubeDL] Use the InfoExtractor._download_webpage method for getting the subtitles
It handles encodings better, for example for 'http://www.npo.nl/nos-journaal/14-02-2015/POW_00942207'
Jaime Marquínez Ferrándiz bfc993cc91 Merge branch 'subtitles-rework'
(Closes PR )
Jaime Marquínez Ferrándiz b531cfc019 [YoutubeDL] remove compatiblity with the old subtitles system
Jaime Marquínez Ferrándiz 98c70d6fc7 [YoutubeDL] only add normal subtitles to the 'requested_subtitles' field if 'writesubtitles' is True
Jaime Marquínez Ferrándiz 4d17184817 [YoutubeDL] don't set the 'requested_subtitles' without writesubtitles or writeautomaticsub
Philipp Hagemeister 8fb3ac3649 PEP8: W503
Philipp Hagemeister b81a359eb6 [YoutubeDL] Use render_table for format listing
Jaime Marquínez Ferrándiz edab9dbf4d [YoutubeDL] use the 'render_table' function for listing the subtitles
Philipp Hagemeister 5cda4eda72 [YoutubeDL] Use a progress hook for progress reporting
Instead of every downloader calling two helper functions, let our progress report be an ordinary progress hook like everyone else's.
Closes .
Philipp Hagemeister f9216ed6ad Merge remote-tracking branch 'origin/master'
Philipp Hagemeister f740fae2a4 [ffmpeg] Make available a property
Jaime Marquínez Ferrándiz 662435f728 [YoutubeDL] Use a Request object for getting the cookies (fixes )
So that we don't have to implement all the methods used by the cookiejar.
Philipp Hagemeister 85729c51af [downloader] Add --hls-prefer-native to use the native HLS downloader ()
Jaime Marquínez Ferrándiz 360e1ca5cc [youtube] Convert to new subtitles system
The automatic captions are stored in the 'automactic_captions' field, which is used if no normal subtitles are found for an specific language.
Jaime Marquínez Ferrándiz c84dd8a90d [YoutubeDL] store the subtitles to download in the 'requested_subtitles' field
We need to keep the orginal subtitles information, so that the '--load-info' option can be used to list or select the subtitles again.
We'll also be able to have a separate field for storing the automatic captions info.
Jaime Marquínez Ferrándiz a504ced097 Improve subtitles support
For each language the extractor builds a list with the available formats sorted (like for video formats), then YoutubeDL selects one of them using the '--sub-format' option which now allows giving the format preferences (for example 'ass/srt/best').
For each format the 'url' field can be set so that we only download the contents if needed, or if the contents needs to be processed (like in crunchyroll) the 'data' field can be used.

The reasons for this change are:
* We weren't checking that the format given with '--sub-format' was available, checking it in each extractor would be repetitive.
* It allows to easily support giving a format preference.
* The subtitles were automatically downloaded in the extractor, but I think that if you use for example the '--dump-json' option you want to finish as fast as possible.

Currently only the ted extractor has been updated, but the old system still works.
Philipp Hagemeister 73fac4e911 [ffmpeg] Add --ffmpeg-location
Philipp Hagemeister 442c37b7a9 [YoutubeDL] Do not perform filter matching on partial results (Fixes )
Philipp Hagemeister 7e5db8c930 [options] Add --no-color
Philipp Hagemeister 347de4931c [YoutubeDL] Add generic video filtering (Fixes )
This functionality is intended to eventually encompass the current format filtering.
Philipp Hagemeister 43f244b6d5 [YoutubeDL] Do not show worst in --list-formats output
Nobody wants to know what the worst possible format is. And if they do, they can still provide -f worst.
Sergey M․ 2ec19e9558 [YoutubeDL] Allow filtering by audio sampling rate
Sergey M․ 9ddb6925bf [YoutubeDL] Allow filtering by string properties ()
Jaime Marquínez Ferrándiz b3a286d69d [YoutubeDL] _calc_cookies: add get_header method to _PseudoRequest ()
Sergey M․ 300753a069 [YoutubeDL] Fix video+audio format field (Closes )
Philipp Hagemeister f6c24009be [YoutubeDL] Calculate thumbnail IDs automatically
Philipp Hagemeister 8f9312c387 Appease pyflakes8-3
Sergey M․ 6d593c3276 [YoutubeDL] Fix video+audio format_id (Closes )
codesparkle 5468ff4d91 Remove duplicate dictionary keys
Jaime Marquínez Ferrándiz a7a14d9586 [YoutubeDL] set the 'thumbnails' field if the info_dict has the 'thumbnails' field
Since the '--write-thumbnail' uses the 'thumbnails' field and we didn't updated the info_dict, it wouldn't detect the thumbnail. (fixes )
Sergey M․ dcf53d4408 [YoutubeDL] Set format_id for video+audio (Closes )
Sergey M․ 3a0d2f520a [YoutubeDL] Temporary fix for subprocess encoding issues on python2 @ Windows (Closes )
For now filenames will be encoded with preferrefencoding before written to disk
Philipp Hagemeister 9f0df77ab1 [YoutubeDL] Allow format filtering by fps
Jaime Marquínez Ferrándiz e72c7e4123 [YoutubeDL] Always set the '_filename' field in the info_dict (reported in )
It's also useful when you use the '--write-info-json' option.
Philipp Hagemeister ee69b99af6 [YoutubeDL] clarify hook documentation
Philipp Hagemeister 881e6a1f5c Add --xattr-set-filesize option (Fixes )
Philipp Hagemeister c14e88f0f5 [YoutubeDL] Add --playlist-items option (Fixes )
Philipp Hagemeister ec82d85acd [YoutubeDL] Implement --write-all-thumbnails (Closes )
Philipp Hagemeister cfb56d1af3 Add --list-thumbnails
Jaime Marquínez Ferrándiz 1070711d60 [YoutubeDL._calc_cookies] Restore the 'is_unverifiable'
I should have check everything was copied before commiting 4b405cfc6e.
Jaime Marquínez Ferrándiz 4b405cfc6e [YoutubeDL._calc_cookies] Restore the 'has_header' method
I didn't copied it from downloader/external
Jaime Marquínez Ferrándiz e5660ee6ae [YoutubeDL] Fill the info dict 'http_headers' field with all the headers available
Useful for external tools using the json output.

The methods '_calc_headers' and '_calc_cookies' have been copied from the downloader/external, now they just use "info_dict['http_headers']".
Philipp Hagemeister 222516d97d [downloader] Lay groundwork for external downloaders.
This comes with a very simply implementation for wget; the real work is in setting up the infrastructure.
Philipp Hagemeister a055469faf [downloader] Improve downloader selection
Philipp Hagemeister 62cd676c74 [youtube] Fixup DASH m4a headers
This fixes , , , , , .
Philipp Hagemeister 5f0d813d93 Merge remote-tracking branch 'rupertbaxter2/master'
Conflicts:
	youtube_dl/__init__.py
	youtube_dl/downloader/common.py
Philipp Hagemeister 083c9df93b [YoutubeDL] Allow filtering by properties (Fixes )
Pierre 4340decad2 check for overwriting files in the downloader (fixes , closes )
Jaime Marquínez Ferrándiz f3ff1a3696 YoutubeDL: Make the decision about removing the original file after each postprocessor is run (fixes )
If one of the processors said the file should be kept, it wouldn't pay
attention to the response from the following processors. This was wrong if the
'keep_video' option was False, if the first extractor modifies the original file
and then we extract its audio we don't want to keep the original video file.
Philipp Hagemeister 8bfa75451b [options] Add --no-call-home
While we're at it, also drop "experimental" moniker for --call-home - should work fine.
Philipp Hagemeister 58b1f00d19 [YoutubeDL] Add new --call-home option for debugging
Philipp Hagemeister be4a824d74 Add new option --source-address
Closes , fixes , fixes , fixes , closes .
Philipp Hagemeister 6271f1cad9 [youtube|ffmpeg] Automatically correct video with non-square pixels (Fixes )
Philipp Hagemeister 45598f1578 Merge remote-tracking branch 'aft90/merge-output-format'
Conflicts:
	youtube_dl/YoutubeDL.py
Andrei Troie d02115f837 Use the option in preparing the merge output filename
Philipp Hagemeister f90ad27375 [YoutubeDL] Copy over format metadata when merging (Fixes )
Philipp Hagemeister 0217c78377 [YoutubeDL] Allow selection by more extensions
Philipp Hagemeister 0590062925 Respect age_limit when listing extractors (Fixes )
Philipp Hagemeister e65566a9cc [youtube] Correct handling when DASH manifest is not necessary to find all formats
Jaime Marquínez Ferrándiz f00fd51dae Don't write the description file if info_dict['description'] is None ()
Jaime Marquínez Ferrándiz 0e4cb4f406 YoutubeDL: style fix
AndroKev cd58dc3e56 Update YoutubeDL.py
AndroKev c33bcf2051 only add video-id to archive, when successful
Example:
no space left--> youtube-dl adds the id to archive, but the video isn't fully downloaded
Philipp Hagemeister 734ea11e3c Drop hash character in downloader output ()
Philipp Hagemeister d86007873e [YoutubeDL] Document where details for format can be found
Philipp Hagemeister 71b640cc5b [YoutubeDL] Add declarative version of progress hooks
Philipp Hagemeister 4f026fafbc [YoutubeDL] Make postprocessors declarative
Instead of having to configure PPs in code, this allows us and embedding programs not to worry about imports or finer details, similarly to how we handle IEs.
Mark Schreiber ff815fe65a Download playlist items in reverse order
Series of videos are typically uploaded to YouTube playlists in
chronological order.  By default, these videos are downloaded
latest-to-earliest; this is great for seeing the latest videos in a
series, but prevents streaming video in the order that the videos were
produced.  Add an option to download videos in reverse order,
earliest-to-latest.

Conflicts:
	youtube_dl/YoutubeDL.py
	youtube_dl/__init__.py
Philipp Hagemeister 412c617d0f [cnet] Update to new theplatform infrastructure (Fixes )
Philipp Hagemeister f17e4c9c28 [screenwavemedia] Simplify ()
cryptonaut 16ae61f655 Handle --get-url with merged formats (fixes )
Outputs one URL per line
Philipp Hagemeister b82f815f37 Allow iterators for playlist result entries
Sergey M․ 706d7d4ee7 [YoutubeDL] Avoid negative timestamps on Windows
Philipp Hagemeister 4aae94f9d0 [YoutubeDL] Remove incorrect documentation
Philipp Hagemeister acda92f6bc Clarify --no-playlist documentation (Closes )
Jouke Waleson 9e1a5b8455 PEP8: applied even more rules
Jouke Waleson 8bcc875676 PEP8: more applied
Jouke Waleson 5f6a1245ff PEP8 applied
Philipp Hagemeister 7d4111ed14 Provide guidance when called with a YouTube ID starting with a dash.
Reported at https://news.ycombinator.com/item?id=8648121
Philipp Hagemeister 42e12102a9 [YoutubeDL] Fix multi_video check
Philipp Hagemeister dfd5313afd [YoutubeDL] Support new _type multi_video
Philipp Hagemeister c9bf41145f [YoutubeDL] Warn if an extractor returns compat_list
Jaime Marquínez Ferrándiz 92120217eb [cache] Fix writing to paths with unicode characters
* Use "compat_getenv"
* "write_json_file" now expects the filename to be a string
Philipp Hagemeister c6afed48ff [YoutubeDL] guard against strange sys.stdouts
Jaime Marquínez Ferrándiz c295490830 [YoutubeDL] Fix bug in the detection of formats that don't contain video (fixes )
If the format requested was not available, we called the method '.get' in None.
Philipp Hagemeister a1cf99d03a [YoutubeDL] Add playlist_id and playlist_title fields (Fixes )
Jaime Marquínez Ferrándiz f866e474f3 [YoutubeDL] Don't dowload formats for merging if the first doesn't contain the video ()
Philipp Hagemeister 8c25f81bee [util] Move compatibility functions out of util
utils is large enough without these compatibility functions.

Everything that is present in newer versions of Python (i.e. with dev Python it's just an import) goes into compat.py .
Everything else (i.e. youtube-dl-specific helpers) goes into utils.py .
Philipp Hagemeister 4c83c96795 [YoutubeDL] Include rtmpdump in exe versions -v output
Philipp Hagemeister fbb21cf528 [youtube] Add formats 298, 299 (Fixes )
Philipp Hagemeister 3511266bc3 [YoutubeDL] Simplify API of YoutubeDL
Calling add_default_extractors twice should be harmless since the first set of extractors will match.
Philipp Hagemeister 488447455d [ffmpeg] Warn if ffmpeg/avconv version is too old (Fixes )
Philipp Hagemeister d28b517154 [YoutubeDL] Output avconv/ffmpeg versions if -v is given
Sergey M․ f889cea109 Merge branch 'compat-getenv-and-expanduser' of https://github.com/dstftw/youtube-dl into dstftw-compat-getenv-and-expanduser
Conflicts:
	test/test_utils.py
	youtube_dl/__init__.py
Philipp Hagemeister 63e0be3415 New option --dump-single-json ()
Philipp Hagemeister e82c1e9a6e [YoutubeDL] Do not apply playlist info to videos when extract_flat is set ()
Philipp Hagemeister 057a5206cc Add --flat-playlist option (Closes )
George Boyle 53d9009bdb KeyError on initialising YoutubeDL in python3
George Boyle 1b725173a5 Fixed typo
Sergey M․ 4644ac5527 [core] Decode environment variables with filesystem encoding (Fixes , Fixes , Fixes )
Introduces compat versions of os.getenv and os.path.expanduser
Sergey M․ ee0d90707a [YoutubeDL] Fix string check for python3
Sergey M․ 68b0973046 [YoutubeDL] Expect all kind of strings in urlopen
Now it doesn't fail if req is python2's str
Philipp Hagemeister 1de33fafd9 [YoutubeDL] Allow downloading multiple formats with ,
Philipp Hagemeister e2e5dae64d Add -f m4a
Sergey M․ d05cfe0600 [YoutubeDL/utils] Clarify rationale for URL escaping in comment, move escape routines to utils and add some tests
Sergey M․ 37419b4f99 [YoutubeDL] Escape non-ASCII characters in URLs
urllib chokes on URLs with non-ASCII characters (see http://bugs.python.org/issue3991)
Working around by replacing request's original URL with escaped one
Philipp Hagemeister a0e07d3161 [youtube] Move cache into its own module
Philipp Hagemeister 241f7a8ade Merge remote-tracking branch 'JGjorgji/fix-leading-zeroes'
Philipp Hagemeister 8d31fa3cce [execafterdownload] Simplify ()
Gjorgji Jankovski c6b4132a0a renamed for consistency
Gjorgji Jankovski ad260c90ab Filenames are padded according to the playlist length
Philipp Hagemeister be843678b1 [YouTubeDL] Correct handling of age_limit = None in result
Philipp Hagemeister e8ee972c6e Allow playlist test definitions in test_download.
This moves playlist tests where they belong, i.e. to the extractors themselves.
Additionally, all our network interaction configuration for tests in test_download now applies to playlist tests as well.
Philipp Hagemeister 11b85ce62e [YouTubeDL] Best practices (Closes )