Commit Graph

527 Commits (b7acc83550598b9facf1b31d3d9a1c2823b29cbe)

Author SHA1 Message Date
Xiao Di Guan 95e42d7336 [extractor/common] Ensure response handle is not prematurely closed before it can be read if it matches expected_status (resolves , closes , resolves )
Sergey M․ bebef10909
[extractor/common] Add validation for JSON-LD URLs
Sergey M․ 0e7b8d3eac
[extractor/common] Fix typos
Sergey M․ 6f1f59f39c
[extractor/common] Introduce channel meta fields
Remita Amine bd21ead2a2 [extractor/common] add support for DASH and MSS formats extraction in SMIL manifests
Sergey M․ 0685d9727b
[utils] Share JSON-LD regex
Sergey M․ eca1f0d115
[extractor/common] Properly escape % in MPD templates (closes )
Sergey M․ 5e8e2fa51f
[extractor/common] Use source URL as Referer for HTML5 entries (closes )
Sergey M․ d391b7e23d
[extractor/common] Introduce expected_status for convenient accept of failed HTTP requests
Useful when some non-success (2xx) HTTP status codes should be considered normal. Previously this required to manually catch corresponding exceptions and read the response.
Sergey M․ 504f20dd30
Remove experimental mark for some options
Sergey M․ 5f95927a62
Improve geo bypass mechanism
* Introduce geo bypass context
* Add ability to bypass based on IP blocks in CIDR notation
* Introduce --geo-bypass-ip-block
Sergey M․ e7e4a6e0f9
[extractor/common] Extract interaction statistic
Sergey M․ 0fe7783ece
[extractor/common] Add _download_json_handle
aeph6Ee0 66b686727b [extractor/common] Relax JSON-LD context check (closes )
Sergey M․ 6780154e6b
[extractor/common] Improve thumbnail extraction for HTML5 entries
Sergey M․ 47a5cb7734
Generalize XML manifest processing code and improve XSPF parsing (closes )
Sergey M․ e0d198c18d
[extractor/common] Add _download_xml_handle
Ricardo Constantino 96b8b9abae
[extractor/generic] Support relative URIs in _parse_xspf
<location> can have relative URIs, not just absolute.
Sergey M․ f4b7427279
[extractor/common] Improve jwplayer subtitles extraction (closes )
Sergey M․ 240f26229d
[extractor/common] Respect secure schemes in _extract_wowza_formats
Sergey M․ 00c97e3e7a
[downloader/http] Add ability to pass downloader options via info dict
Sergey M․ 3931b84597
[extractor/common] Improve _json_ld for articles
Sergey M․ 0d9c48de4f
[extractor/common] Improve DASH formats extraction for jwplayer (, )
Ondřej Caletka 126f225bcf
[extractor/common] Add container meta field for formats extracted in _parse_mpd_formats
felix 2501d41ef4
[common] use AACL as the default fourcc when AudioTag is 255
Sergey M․ 9d6ac71c27
[extractor/common] Fix extraction of DASH formats with the same representation id (closes )
Sergey M․ 2132edaa03
[extractor/common] Move X-Forwarded-For setup code into _request_webpage
Sergey M․ c10c93238e
[extractor/common] Introduce uploader, uploader_id and uploader_url meta fields for playlists (, )
Sergey M․ 78593e294c
Add references for
Sergey M․ 603fc4e0ea
[extractor/common] Add durations for DASH fragments with bare SegmentURLs
Petr Novak 41bf647e89
[extractor/common] Add support for DASH manifests with SegmentLists with bare SegmentURLs
Sergey M․ f610dbb05f
[extractor/common] Use final URL when dumping request (closes )
Remita Amine ea2295842f [common] skip Apple FairPlay m3u8 manifests(closes )
Sergey M․ 187ee66c94
[extractor/common] Add protocol for f4m formats
Sergey M․ 48107c198b
[f4m] Prefer baseURL for relative URLs (closes )
Sergey M․ 044eeb1455
[extractor/common] Respect URL query in _extract_wowza_formats (closes )
Sergey M․ 9211e3319e
[extractor/common] Prefix format id for audio only HLS formats
Remita Amine 50d808f5c9 [common] add support for jwplayer youtube embeds
M.K c110944fa2 [extractor/common] Fix typo in _parse_mpd_formats
Yen Chi Hsuan 4ed2d7b7d1 Fix flake8 issues after
Yen Chi Hsuan a88d461dff Merge pull request from Tithen-Firion/openload-phantomjs-method
Openload phantomjs method
Sergey M․ 1ed4549942
[extractor/common] Extract format id from label attribute of source tag for HTML5 videos ()
Sergey M․ dd121cc1ca
[extractor/common] Extract height from res attribute of source tag for HTML5 videos (closes )
Sergey M․ e01c3d2ef7
[extractor/common] Introduce _parse_xml
Sergey M․ b359e977b9
[extractor/common] Make HLS and DASH extraction non fatal in _parse_html5_media_entries (closes )
Sergey M․ 4850478543
[extractor/common] Add support for float durations in _parse_mpd_formats (closes )
Sergey M․ 868f79db41
[extractor/common] Fix _media_formats
Sergey M․ ac8491fcca
[extractor/common] Make _family_friendly_search optional
Sergey M․ 82889d4ae5
[extractor/common] Respect source's type attribute for HTML5 media (closes )
Sergey M․ 1141e9104b
Use relative paths for DASH fragments (closes )
10x reduced JSON size
refs