Philipp Hagemeister
dffe658bac
Remove exclamation mark in --restrict-filenames mode
12 years ago
Philipp Hagemeister
46cbda0be4
Minor filename encoding improvement in a common case
12 years ago
Philipp Hagemeister
56781d3d2e
Switch back to underline for invalid characters, and make restricted ASCII-only
12 years ago
Philipp Hagemeister
240089e5df
remove accidental remnants
12 years ago
Philipp Hagemeister
1c469a9480
New option --restrict-filenames
12 years ago
Philipp Hagemeister
92b91c1878
Use character instead of byte strings
12 years ago
Philipp Hagemeister
51937c0869
Add some parentheses around print for #180
12 years ago
Filippo Valsorda
5d40a470a2
quiet the HTMLParser debug info - closes #517
12 years ago
Filippo Valsorda
42cb53fcfa
modified filename escaping to a "smarter" one
12 years ago
Philipp Hagemeister
38612b4edc
update default UA string ( Closes #390 )
12 years ago
Filippo Valsorda
9bb8dc8e42
Python 2.6 compatibility fix. Thanks @Jamesc359 - closes #400
13 years ago
Arvydas Sidorenko
bae611f216
Simplified preferredencoding()
Not sure what the point of using yield to return the encoding is, so
this simplifies the whole function.
Signed-off-by: Arvydas Sidorenko <asido4@gmail.com>
13 years ago
chocolateboy
6ab92c8b62
don't corrupt stdout (-o -) in verbose mode
13 years ago
Filippo Valsorda
2c288bda42
reorganized the titles sanitizing: now title is the untouched title
and stitle is created in process_info() and is cross-filesystem sanitized by sanitize_filename();
closes #164
13 years ago
Filippo Valsorda
0b8c922da9
Introduced Trouble(Exception) for more elegant non-fatal errors handling
13 years ago
Filippo Valsorda
921a145592
dropped the support for Python 2.5
let's elaborate on the decision: Python 2.5 is a 6-year-old release
and "under the current release policy, no security issues in Python
2.5 will be fixed anymore" (!!); also, it doesn't support the new
zipfile distribution format.
13 years ago
Filippo Valsorda
9beb5af82e
some HTMLParser bugfixes
13 years ago
Filippo Valsorda
9e6dd23876
merged unescapeHTML branch; removed lxml dependency
13 years ago
Filippo Valsorda - Campagna
d11d05d07a
better naming for the sub-modules
13 years ago