Commit Graph

10576 Commits (b95779be21929c1aed1c0edbc3f9a5d11826973f)
 

Author SHA1 Message Date
Sergey M․ d7011316d0 [facebook] Use compat_urllib_parse_unquote
Sergey M․ d3671b344f [ehow] Use compat_urllib_parse_unquote
Sergey M․ a60cccbf9f [crunchyroll] Use compat_urllib_parse_unquote
Sergey M․ 3e72f5f10e [ceskatelevize] Use compat_urllib_parse_unquote
Sergey M․ b94b78971c [bet] Use compat_urllib_parse_unquote
Sergey M․ 4d08161ac2 [compat] Mention unquote_plus
Sergey M․ 8954e48140 [test_compat] Add tests for compat_urllib_parse_unquote_plus
Sergey M․ aa99aa4e85 [compat] Add compat_urllib_parse_unquote_plus
Sergey M․ d79febcd06 [test_compat] Remove redundant test
Sergey M․ 13fc7f3a05 Merge branch 'atomicdryad-pr-crashfix_compat_urllib_unquote'
Sergey M․ 14309e1ddc [test_compat] Make tests more idiomatic
Sergey M․ 5513967926 [compat] Simplify and use latest cpython 3 code
Sergey M. eacd875f3b Merge pull request from remitamine/howstuffworks
[howstuffwoks] fix _VALID_URL regex
remitamine c4fe07c7af match only the test url
remitamine 1186e3f91a do not capture a group
remitamine f354385bf5 Add test for urls without a number
remitamine cabe001590 [howstuffwoks] fix _VALID_URL regex
Sergey M․ 89f691e141 Merge branch 'pr-crashfix_compat_urllib_unquote' of https://github.com/atomicdryad/youtube-dl into atomicdryad-pr-crashfix_compat_urllib_unquote
fnord 4a63291144 Add tests for compat_urllib_parse_unquote
fnord 593b77064c Don't forget trailing '%'
fnord 9fefc88656 fix TestCompat test_all_present
fnord eb08081330 democracynow: correct syntax
fnord a3bfddfa5e bbc.py: correct syntax
fnord 36da48798a handle titles and captions set to ''
fnord a0f28f90fa remove kebab
fnord 851229a01f remove debugprint
fnord c9c854cea7 replace old compat_urllib_parse_unquote with backport from python3's function
* required unquote_to_bytes function ported as well
    (uses .decode('hex') instead of dynamically populated _hextobyte global)
  * required implicit conversion to bytes and/or unicode in places due to
    differing type assumptions in p3
Yen Chi Hsuan a38436e889 [extractor/common] Add 'transform_source' parameter to _extract_f4m_formats()
Sergey M․ 23fc384f2c [npo] Compound title
Sergey M․ 1540119723 [npo] Remove unused imports
Sergey M․ 574f42d79a [vpro] Improve playlist extraction
Sergey M․ 536b0700b0 [npo] Allow missing description
Sergey M․ 5ba761eb85 [npo] Prefer aflevering_titel over titel
Sergey M․ 611ac379bb [vpro] Fix extraction and add support for vpro playlists
Sergey M․ 03f32a7ead [wnl] Add extractor for omroepwnl playlists
Sergey M․ 50ea2bb20d [npo] Update test
Sergey M․ 525daedd5a [npo] Add support for omroepwnl fragments
Sergey M․ e118031ef8 [npo] Extend _VALID_URL to support ntr.nl (Closes )
fnord 45eedbe58c Generic: use compat_urllib_parse_unquote to prevent utf8 mangling
of the entire page in python 2.

-requires- fixed compat_urllib_parse_unquote

example - the following will save with a mangled playlist title,
 instead of the kanji for 'tsunami'. This affects all utf8encoded
 urls as well

youtube-dl -f18 -o '%(playlist_title)s-%(title)s.%(ext)s' \
  61c14c1e3a/tsunami.html
fnord e37c932fca compat_urllib_parse_unquote: crash fix: only decode valid hex
on python 2 the following has a { "crash_rate": "100%" } of the time
as it tries to parse '" ' as hex.
Sergey M. 5eb778bf4d Merge pull request from dstftw/f4m-improvements
[f4m] Improvements
Sergey M․ ab9b890b52 [prosiebensat1] Clarify test purpose
Sergey M․ 31c746e5dc [extractor/common] Keep going in some media_url is missing
Sergey M․ f01f731107 [prosiebensat1] Use generic f4m manifest extraction
Sergey M․ 70f0f5a8ca [extractor/common] Recursively extract child f4m manifests
Sergey M․ cc357c4db8 [extractor/common] Properly handle full URLs
Sergey M․ 97f4aecfc1 [extractor/common] Handle malformed f4m manifests
Sergey M․ 2af0f87c8b [prosiebensat1] Fix extraction (Closes )
Sergey M. b062d94eef [README.md] Clarify authentication with .netrc file
Sergey M․ 6c1b0c0ed2 [dfb] Extract formats