Commit Graph

447 Commits (7caae128a72596e22cdfb538272c38a37d095db2)

Author SHA1 Message Date
Naglis Jonaitis a69801e2c6 [utils] Add additional format to unified_strdate
Johannes Knoedtel 3d5f7a3947 [utils] Prevent override of custom headers.
The dict of headers of request objects in urllib has its keys always
capitalized.

This causes the lookup to fail and overwrite the header. If for example
a Extractor tries to add a "User-Agent" header the internal
representation in the request object is "User-agent". The header is
therefore clobbered by the "User-Agent" in std_headers, because the
strings are not equal.
Sergey M․ 227d4822ff [utils] Disallow non string types in parse_duration (Closes )
Philipp Hagemeister 2aeb06d6dc [utils] Improve colon handling (Fixes )
Philipp Hagemeister c5a59d9391 [utils] Fix call to _create_http_connection
Avoid confusion over args/kwargs.
Philipp Hagemeister be4a824d74 Add new option --source-address
Closes , fixes , fixes , fixes , closes .
Sergey M․ 776dc3992a [utils] Clarify more day-month-first ambiguous formats
Philipp Hagemeister dd622d7c4e [netzkino] Add new extractor (Fixes )
Philipp Hagemeister 0590062925 Respect age_limit when listing extractors (Fixes )
Jaime Marquínez Ferrándiz 1d2d0e3ff2 utils: Remove blank line at the end of file
Philipp Hagemeister 2ccd1b10e5 [soulanime] Fix under Python 3
Philipp Hagemeister 416c7fcbce Add documentation about supported sites (Fixes )
Philipp Hagemeister 7af808a5ef Improve code style
Philipp Hagemeister cae97f6521 Improve and test ffmpeg version detection
Philipp Hagemeister 2128b696b8 [utils] Do not make an exception for SSLv3
SSLv3 is terminally vulnerable to POODLE; web browsers are currently deprecating/removing it.
Closes , fixes 
Philipp Hagemeister a23669220a [utils] Make ssl work on Python 2.7.8
Jaime Marquínez Ferrándiz 293f0f39ce [utils] make_HTTPS_handler: Remove try/except block that would always raise an exception
This code is only run for Python < 3.4, where context.load_default_certs doesn't exist
Jaime Marquínez Ferrándiz 0db261ba56 [utils] make_HTTPS_handler: Use ssl.create_default_context in Python 2.7.9
The new features in the ssl module have been backported from 3.4, see https://docs.python.org/dev/whatsnew/2.7.html#pep-466-network-security-enhancements-for-python-2-7
Philipp Hagemeister f8780e6d11 Merge remote-tracking branch 'grompe/patch-1'
Philipp Hagemeister e2f89ec7aa Revert "[utils] Work around PyPy stupidity with Windows DLLs (Fixes )"
This reverts commit 16040f46d6.
Philipp Hagemeister 16040f46d6 [utils] Work around PyPy stupidity with Windows DLLs (Fixes )
Philipp Hagemeister 42bdd9d051 [cinchcast] Add new extractor (Fixes )
Philipp Hagemeister f8795e102b [utils] Add "yesterday" as a date keyword
Jaime Marquínez Ferrándiz 83e865a370 Fix PEP8 issue E713
Grom PE 6ac4e8065a Fix utils.py for PyPy on Windows
The line
```python
from __future__ import unicode_literals
```
introduced in commit [ecc0c5ee01](ecc0c5ee01) broke youtube-dl for PyPy on Windows, making it unable to locate WinAPI functions.
Error: "TypeError: function name must be a string or integer"

Adding "b" prefix to strings with WinAPI function names fixes it.
Philipp Hagemeister 5f9b83944d [ffmpeg] Improve version check and call it from hls (Fixes )
Philipp Hagemeister e8df5cee12 [minhateca] Fix duration parsing
Philipp Hagemeister 4349c07dd7 [minhateca] Add extractor (Fixes )
Philipp Hagemeister ab4ee31eb1 [utils] remove useless u prefix
Philipp Hagemeister be64b5b098 [xminus] Simplify and extend ()
Philipp Hagemeister b74e86f48a Fix all PEP8 issues except E501
Jouke Waleson 5f6a1245ff PEP8 applied
Philipp Hagemeister 7d4111ed14 Provide guidance when called with a YouTube ID starting with a dash.
Reported at https://news.ycombinator.com/item?id=8648121
Philipp Hagemeister 732ea2f09b [utils] Improve update on error message somewhat
We still may want to implement a bulletproof check for the current version, and a better place to add this message so that it works for all kind of other errors too.
Ivan Kozik 1394646a0a Fix "ERROR: Cannot write metadata to JSON file" on Windows
Fixes 
Ivan Kozik 61ee5aeb73 Fix UnicodeEncodeError with --write-info-json on Python 2.7 + Windows
Fixes 
Jaime Marquínez Ferrándiz 92120217eb [cache] Fix writing to paths with unicode characters
* Use "compat_getenv"
* "write_json_file" now expects the filename to be a string
Philipp Hagemeister 28e614de5c [utils] Remove stray u'
Philipp Hagemeister 6a68bb574a [eporner] Fix duration (Closes )
Jaime Marquínez Ferrándiz ec5f601670 [utils] Fix "write_json_file" for unicode names in python 2.x (fixes )
Philipp Hagemeister eb4157fd17 [utils] Fix struct.pack call on very old Python versions ()
Philipp Hagemeister 609a61e3e6 [npo] Improve npo.nl (Fixes )
Philipp Hagemeister 4c0924bb24 [utils] Fix intlist_to_bytes in Python 2 ()
Philipp Hagemeister 4164f0117e [utils] Remove unused import
Philipp Hagemeister 3828505646 [utils] Use a regexp instead of HTMLParser for get_element_by_attribute
Philipp Hagemeister 12ea2f30cf [utils] Remove unused get_meta_content function
Jaime Marquínez Ferrándiz 4f195f55f0 Do not override stdlib html parser 'locatestarttagend' regex (fixes )
'<a href="foo" ><img src="bar" / ></a>' wouldn't be parsed right (the problem is '/ >', '/>' worked fine).
We need to change it in python 2.6 (for example the description of youtube videos wouldn't be extracted).
Philipp Hagemeister ecc0c5ee01 [utils] Modernize
Philipp Hagemeister baa708036c [compat] Fix imports
Philipp Hagemeister 8c25f81bee [util] Move compatibility functions out of util
utils is large enough without these compatibility functions.

Everything that is present in newer versions of Python (i.e. with dev Python it's just an import) goes into compat.py .
Everything else (i.e. youtube-dl-specific helpers) goes into utils.py .
Philipp Hagemeister 9580711841 [ffmpeg] Move version detection to utils
Philipp Hagemeister 6ad4013d40 [drtv] Allow fractional timestamps (Fixes )
Philipp Hagemeister 488447455d [ffmpeg] Warn if ffmpeg/avconv version is too old (Fixes )
Sergey M․ f889cea109 Merge branch 'compat-getenv-and-expanduser' of https://github.com/dstftw/youtube-dl into dstftw-compat-getenv-and-expanduser
Conflicts:
	test/test_utils.py
	youtube_dl/__init__.py
Naglis Jonaitis e9be9a6acd [utils] Add additional format to unified_strdate
Philipp Hagemeister d838b1bd4a [utils] Default age_limit to None
If we can't parse it, it means we don't have any information, not that the content is unrestricted.
Sergey M․ 146c80e256 [utils] Add parse_age_limit
Sergey M․ fc66e4a0d5 [utils] Add posix expanduser implementation and clarify the original source
Philipp Hagemeister e7b6d12254 [utils] Improve and test js_to_json
Philipp Hagemeister 410f3e73ab [utils] Fix js_to_json
Sergey M․ 4644ac5527 [core] Decode environment variables with filesystem encoding (Fixes , Fixes , Fixes )
Introduces compat versions of os.getenv and os.path.expanduser
Philipp Hagemeister 99b67fecc5 [arte] Fix upload date extraction
Philipp Hagemeister 989b4b2b86 [utils:YoutubeDLHandler] Work around brain-dead Python 2.6 httplib
In 2.6, the httplib sends fragments! Remove those (fixes generic_26 on 2.6).
Philipp Hagemeister 9c44d2429b [vimeo:likes] Support large like lists (Fixes )
Philipp Hagemeister 0003a5c416 Merge remote-tracking branch 'dstftw/escape-non-ascii-in-urls'
Conflicts:
	test/test_utils.py
Philipp Hagemeister a020a0dc20 [facebook] Fix support for untitled videos (Fixes )
Sergey M․ 984e8e14ea [utils] Remove debug garbage
Sergey M․ d05cfe0600 [YoutubeDL/utils] Clarify rationale for URL escaping in comment, move escape routines to utils and add some tests
Philipp Hagemeister d74bebd502 [utils] Apply 2.6 xpath craziness
This fixes ARD on 2.6
Philipp Hagemeister bf0ff93277 [ard] Make more robust against missing thumbnails
I cannot reproduce this error, it's from travis.
Philipp Hagemeister 4eefbfdbfd [utils] Fix find_xpath_attr on 2.6
Philipp Hagemeister aa37e3d486 [utils] Default SSL to TLS. (Fixes )
On 2.x, we now try TLS first, and fall back to the compat 23 (basically anything) afterwards.
On 3.4+, we now use the proper function so that we get all the latest security configurations.
We allow SSLv3 though for the time being, since a lot of older pages use that.
On 3.3, we default to SSLv23 (basically "anything, including TLS") because that has the widest compatibility.
Philipp Hagemeister a0e07d3161 [youtube] Move cache into its own module
Sergey M․ f164038b79 [utils] Make parse_duration case insensitive
Sergey M․ 884ae74785 [tvigle] Adapt to the new API
Philipp Hagemeister 2582bebe06 [utils] Use flock instead of lockf (Fixes )
Philipp Hagemeister 48d4681efc [eporner] Simplify and correct ()
Sergey M․ ca7b3246b6 [utils] Improve parse_duration
Philipp Hagemeister 4e408e479b [utils] Modernize tests
Philipp Hagemeister 33ac271ba7 [utils] Let request headers override standard headers
What was I thinking when writing this?
Philipp Hagemeister 0990305d2a [generic] Fix rss under Python 2.x and move test to extractor
Philipp Hagemeister 7adcbe7594 [rtlnl] Extract duration
Philipp Hagemeister 8d31fa3cce [execafterdownload] Simplify ()
Philipp Hagemeister 423817c468 [expotv] Add new extractor (Fixes )
Philipp Hagemeister 2b9faf5542 [rtve] Add support for live stream
At the moment, only RTVE-1 seems to work flawlessly.
-2 seems geoblocked right now.
-TDP doesn't seem to be available outside of Spain.
Philipp Hagemeister e05f693942 [patreon] Simplify ()
Sergey M․ 73159f99cc [utils] Add missing mode and encoding arguments
Philipp Hagemeister 181c8655c7 [utils] Make JSON file writes atomic (Fixes )
Sergey M․ fe556f1b0c [jove] Simplify, extract full description and add test for video that requires subscription
Philipp Hagemeister 9572013de9 [appletrailers] Support height-less videos
Jaime Marquínez Ferrándiz 40a90862f4 [reverbnation] The 'uploader_id' field must be a string
Sergey M․ f4776371ae [izlesene] Minor changes
Philipp Hagemeister cbf915f3f6 [livestream] Parse SMIL ()
Philipp Hagemeister f1f725c6a0 [dropbox] Fix title encoding on Python 2
Philipp Hagemeister 9732d77ed2 [snotr] PEP8 and minor fixes ()
Sergey M․ c45a6caa95 [utils] Add None check in str_to_int
Filippo Valsorda 816930c485 Fix utils.strip_jsonp
Filippo Valsorda 36cb99f958 [ReverbNation] Add new IE - closes
Philipp Hagemeister fa19dfccf9 Merge remote-tracking branch 'tobidope/gameone'
Petr Půlpán 78ff59d052 [Motherless] simplify
Tobias Bell 305d068362 [gameone] Added timestamp extraction
Philipp Hagemeister f07b74fc18 [ffmpeg] Correct argument encoding on Windows with Python 2.x
Fixes 
Philipp Hagemeister 0a871f6880 Provide compatibility check_output for 2.6 (Fixes )
Philipp Hagemeister a383a98af6 [utils/_windows_write_string] Be defensive about fileno (Fixes )
Philipp Hagemeister acd69589a5 [YoutubeDL] Do not require default output template to be set
Philipp Hagemeister d11271dd29 [youtube] Include video Id in common error message (Fixes )
Philipp Hagemeister 28746fbd59 [bilibili] Add preliminary support ()
The URL http://www.bilibili.tv/video/av636603/index_2.html does not work yet.
Philipp Hagemeister 478c2c6193 [clubic] Add extractor (Fixes )
Philipp Hagemeister d1b9c912a4 [utils] Fix _windows_write_string (Fixes )
It turns out that the function did not work for outputs longer than 1024 UCS-2 tokens.
Write non-BMP characters one by one to ensure that we count correctly.
Philipp Hagemeister 89938c719e Fix Windows output for non-BMP unicode characters
Philipp Hagemeister b58ddb32ba [utils] Completely rewrite Windows output (Fixes )
Philipp Hagemeister 104aa7388a Use our own encoding when writing strings
Philipp Hagemeister 734f90bb41 Use --encoding when outputting
Philipp Hagemeister 676eb3f2dd Fix unicode_escape (Fixes )
Philipp Hagemeister 9271bc8355 [cnet] Add new extractor (Fixes )
Philipp Hagemeister a612753db9 [utils] Correct decoding of large unicode codepoints in uppercase_escape (Fixes )
Philipp Hagemeister 62fec3b2ff Add new --encoding option (Fixes )
Philipp Hagemeister 43f775e4ca [comedycentral] Duration can now be a float (Fixes )
Philipp Hagemeister fac55558ad [washingtonpost] Add extractor (Fixes )
Philipp Hagemeister ba40a74666 [clipfish] Modernize
Philipp Hagemeister 912b38b428 [instagram] Fix info_dict key name
Philipp Hagemeister 6eefe53329 [utils] Simplify setproctitle
Philipp Hagemeister 64e7ad6045 [videolectures] (New extractor)
Philipp Hagemeister a1a530b067 [pbs] Add support for video ratings
Philipp Hagemeister 4e6f9aeca1 Fix typo
Philipp Hagemeister e68301af21 Fix getpass on Windows (Fixes )
Sergey M․ b047de6f6e Add format to unified_strdate
Philipp Hagemeister bcf89ce62c [generic] Suppress warning about doctypes in RSS parser
Jaime Marquínez Ferrándiz b74fa8cd2c [facebook] Fix login process
It was broken and didn't work in python 3.
And use `_download_webpage` instead of `compat_urllib_request.urlopen`.
Sergey M. 4cf9654693 Add one more format to unified_strdate
Philipp Hagemeister 62e609ab77 Ignore BOM in batch files (Fixes )
Sergey M. 7ff5d5c2e2 Add one more format to unified_strdate
Jaime Marquínez Ferrándiz f7300c5c90 [generic] Fix on python 2.6
`ParseError` is not available, it raises `xml.parsers.expat.ExpatError`.
The webpage needs to be encoded.
Sergey M. 0f99566c01 Add one more format in unified_strdate
Sergey M. 2db806b4aa Improve parse_duration
Philipp Hagemeister b53466e168 Fix f4m downloading on Python 2.6
Philipp Hagemeister 026fcc0495 Fix (date parsing with dashes)
Philipp Hagemeister 81c2f20b53 [youtube] Correct invalid JSON (Fixes )
Philipp Hagemeister 5de90176d9 [elpais] Add extractor
Philipp Hagemeister dd26ced164 Add __len__ to PagedLists
Philipp Hagemeister 65697b3bf3 Merge branch 'paged-lists'
Conflicts:
	test/test_utils.py
	youtube_dl/extractor/youtube.py
Philipp Hagemeister 50317b111d Merge branch 'youtube-dash-manifest'
Conflicts:
	youtube_dl/extractor/youtube.py
Philipp Hagemeister af1588c05f [mtv] Update tests and xpath function for new title extraction
Philipp Hagemeister 00122de6a9 [gametrailers/mtv] Fix pre-3.x compatibility function for find_xpath_attr
Fixes 
Philipp Hagemeister 5aafe895fc Correct XML ampersand fixup
Philipp Hagemeister b7ab059084 Add infrastructure for paged lists
This commit allows to download pages in playlists as needed instead of all at once.
Before this commit,
    youtube-dl http://www.youtube.com/user/ANNnewsCH/videos --playlist-end 2 --skip-download
took quite some time - now it's almost instantaneous.
As an example, the youtube:user extractor has been converted.
Fixes 
Philipp Hagemeister dd27fd1739 [youtube] Download DASH manifest
If given, download and parse the DASH manifest file, in order to get ultra-HQ formats.
Fixes 
Philipp Hagemeister d70ad093af Move check_executable into a helper ufnction
Philipp Hagemeister 5f263296ea Merge remote-tracking branch 'epitron/metadata-pp'
Conflicts:
	youtube_dl/PostProcessor.py
Philipp Hagemeister 79f8295303 Use original Referer URL in Brightcove requests (Fixes )
Philipp Hagemeister 5d73273f6f [orf] Use new extraction method (Fixes )
Philipp Hagemeister 8bf48f237d Fix/work around Windows encoding issues (Fixes )
Jaime Marquínez Ferrándiz 91d7d0b333 FFmpegMetadataPP; Write temporary file to `something.temp.{ext}` (fixes )
ffmpeg correctly recognize the formats of extensions like m4a, but it doesn’t works if it’s passed with the `—format` option.
epitron e63fc1bed4 Added '--xattrs' option which writes metadata to the file's extended attributes using a youtube-dl postprocessor.
Works on Linux, OSX, and Windows.
Philipp Hagemeister a0ddb8a2fa Add new --print-traffic option
Philipp Hagemeister 608d11f515 [cnn] Add multiple formats, duration, and upload_date
Philipp Hagemeister 7217e148fb [yahoo] Use centralized sorting, and add tbr field
Philipp Hagemeister aa94a6d315 [aparat] Add support (Fixes )
Philipp Hagemeister e029b8bd43 [utils] Remove duplicated line
This line was added by accident in 42393ce234
Jaime Marquínez Ferrándiz 9b8aaeed85 Simplify url_basename
Use urlparse from the standard library.
Philipp Hagemeister 42393ce234 Add support for direct links to a video ()
Philipp Hagemeister d6c7a367e8 [utils] Fix url_basename
Philipp Hagemeister 29eb517403 Add webpage_url_basename info_dict field (Fixes )
Philipp Hagemeister e64eaaa97d Fix execution under Python 3
Philipp Hagemeister d7dda16888 [blinkx] Add extractor (Fixes )
Philipp Hagemeister e3946f989e Set process title to youtube-dl
This allows killing all youtube-dl processes with killall youtube-dl, and shows up nicer in some programs.
Philipp Hagemeister 2a203a6cda Merge pull request from dstftw/master
Fix typo in month name
dst dadb8184e4 Fix typo in month name
Jaime Marquínez Ferrándiz 1825836235 Use `_download_xml` in more extractors
Philipp Hagemeister caefb1de87 [ndtv] Add extractor (Fixes )
Philipp Hagemeister 1c088fa89d Improve --bidi-workaround support
Philipp Hagemeister ac79fa02b8 Restore Python 2.6.<6 compatibility (Fixes )
Philipp Hagemeister 303b479e0a Automatically load SSL certs on Windows
Jaime Marquínez Ferrándiz f53c966a73 [dailymotion] Extract view count ()
Philipp Hagemeister a1ef7e85d6 Remove unused imports
Philipp Hagemeister 2a275ab007 [zdf] Use _download_xml
Philipp Hagemeister de79c46c8f [viki] Fix subtitle extraction
Philipp Hagemeister 79d09f47c2 Merge branch 'opener-to-ydl'
Philipp Hagemeister 02dbf93f0e [zdf/common] Use API in ZDF extractor.
This also comes with a lot of extra format fields
Fixes 
Philipp Hagemeister e03db0a077 Merge branch 'master' into opener-to-ydl
Philipp Hagemeister 13ebea791f [niconico] Simplify and make work with old Python versions
The website requires SSLv3, otherwise it just times out during SSL negotiation.
Philipp Hagemeister dca0872056 Move the opener to the YoutubeDL object.
This is the first step towards being able to just import youtube_dl and start using it.
Apart from removing global state, this would fix problems like .
Jaime Marquínez Ferrándiz a6a173c2fd utils.shell_quote: Convert the args to unicode strings
The youtube test video failed with `UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 34: ordinal not in range(128)`, the problem was with the filenames being encoded.
Philipp Hagemeister 5904088811 Add support for tou.tv (Fixes )
Philipp Hagemeister ae8f787141 Remove iPhone from user agent. This breaks a lot of extractors
In the future, it might be worth investigating whether we get better content when we claime to be an iPhone.
Philipp Hagemeister 96b31b6533 Add iPhone to UA ()
Jaime Marquínez Ferrándiz 55b3e45bba [vimeo] Fix pro videos and player.vimeo.com urls
The old process can still be used for those videos.
Added RegexNotFoundError, which is raised by _search_regex if it can't extract the info.
Philipp Hagemeister f4d96df0f1 Extend with --max-quality support
Philipp Hagemeister 9d4660cab1 [generic] Support embedded vimeo videos ()
Jaime Marquínez Ferrándiz d7e66d39a0 Add an extractor for internetvideoarchive.com videos
It's used by videodetective.com
Jaime Marquínez Ferrándiz 4eb7f1d12e FFmpegPostProcessor: print the command line used if the --verbose option is given
Jaime Marquínez Ferrándiz 2e1fa03bf5 Add an extractor for video.nhl.com (closes )
Philipp Hagemeister b24f347190 Merge branch 'download-archive'
Conflicts:
	youtube_dl/YoutubeDL.py
	youtube_dl/__init__.py
Philipp Hagemeister 41e8bca4d0 [viddler] Add basic support (Fixes )
Philipp Hagemeister c1c9a79c49 Add basic --download-archive option
Often, users want to be able to download only videos they haven't seen before, despite the video files having been deleted or moved in the mean time.
When --download-archive FILE is given, the extractor and ID of every download is recorded in the specified file. If it is already present, the video in question is skipped.
Philipp Hagemeister c38b1e776d [youtube] Simplify cache_dir code ()
Philipp Hagemeister e0df6211cc Restore accidentally deleted commits
That's what happens if you let Windows machines write :(
Philipp Hagemeister 7459e3a290 Always correct encoding when writing to sys.stderr (Fixes )
Jaime Marquínez Ferrándiz 19e1d35989 [mixcloud] Rewrite extractor (fixes )
Jaime Marquínez Ferrándiz a921f40799 [ustream] Simplify channel extraction
the ChannelParser has been moved to a new function in utils get_meta_content
Instead of the SocialStreamParser now it uses a regex