Commit Graph

11 Commits (51350db5a395c45cb181b1813a71b5e1aff86993)

Author SHA1 Message Date
Yen Chi Hsuan e4e50f60b1
[googledrive] Fix extraction on Python 3.6
Since Python 3.6, invalid escape sequences are deprecated. It's likely
that there are invalid escape sequences somewhere on the webpage, so
instead of unescaping the whole webpage, just unescape the URL.

See https://bugs.python.org/issue27364. That change was designed for
string literals, while it affects the 'unicode_escape' encoding as well.
The code path is:

str.decode('unicode_escape')
    codecs.unicode_escape_decode()
        PyUnicode_DecodeUnicodeEscape()
Sergey M․ 58e6d097d8 [googledrive] Relax _VALID_URL (Closes )
Sergey M․ d69abbd3f0 [googledrive] Make thumbnail optional (Closes )
remitamine 5b251628e9 [googledrive] Modernize
remitamine 8e92d21ebf [googledrive] raise ExtractorError instead of warning
remitamine 36dbca8784 fix recursive error
remitamine d1cc05e17e remove unnecessary regex group names
remitamine 3b3d531965 fix embed regex
remitamine 3e5f3df172 move the embed to a separate class
remitamine f120a7ab5e change the _TEST info
remitamine 984e4d4875 [googledrive] Add new extractor