Commit Graph

115 Commits (6c894ea191583d628ce0dd7dd26a84d5291eb3c2)

Author SHA1 Message Date
Unknown 6c894ea191 [youtube/ytsearch] fix yt search feed + version update. 4 years ago
Unknown dd2d55f10d COMPLAINFREE 4 years ago
Unknown 3a6a581d94 Merge remote-tracking branch 'origin/master' 4 years ago
Unknown 07bafb4a90 [reddit] best format hotfix based on resolution. 4 years ago
Tom-Oliver Heidel 7eff09d332
Merge pull request #196 from blackjack4494/twitter_shortener
Twitter shortener
4 years ago
Unknown 957c523eea [youtube] cookie update reminder 4 years ago
Unknown a537ab1a09 [twitter/t.co] update supportedsites, failover replace, tco:id feature 4 years ago
Unknown 9e20a9c447 [twitter/t.co] implemented. 4 years ago
Unknown 51707d9a7a [MTV/Nick] universal mgid extractor + fix nick.de feed 4 years ago
Unknown f33b7b5eb4 [Twitter/t.co] showcase expanded how to use generic 4 years ago
Unknown 86b868c6a5 [Twitter/t.co] simple extractor added. modification needed. 4 years ago
Tom-Oliver Heidel d8f97cc1d3
Merge pull request #188 from blackjack4494/SouthparkDE_MTV
[SouthparkDE/MTV] another mgid extraction (mtv_base) feed url updated
4 years ago
Tom-Oliver Heidel 573c752256 Merge branch 'la7-fix' of https://github.com/iamleot/youtube-dl into iamleot-la7-fix 4 years ago
Unknown bc887cdd01 [SouthparkDE] regex and tests 4 years ago
Unknown 320724f964 [SouthparkDE/MTV] another mgid extraction (mtv_base) feed url updated 4 years ago
Tom-Oliver Heidel 60ecb525b2 Merge branch 'fixYTSearch' of https://github.com/xarantolus/youtube-dl into xarantolus-fixYTSearch 4 years ago
Tom-Oliver Heidel cfd7f14bb3
Merge pull request #176 from blackjack4494/mtv_updated_extractor_logic
[Mtv] updated extractor logic & more
4 years ago
Tom-Oliver Heidel b492464bf1
Merge pull request #171 from blackjack4494/yt_only_age_gate
[youtube] fix yt-only playback when age restricted/gated - requires cookies
4 years ago
Unknown cf7cb94287 [mtvn] update mtv network related extractors 4 years ago
Unknown b6e0c7d2e3 [mtv] fix mtv.com and more(?) 4 years ago
Unknown 962cc3ef87 merge bandcamp 4 years ago
Unknown b777004649 Merge branch 'ytdl-org-master' 4 years ago
Tom-Oliver Heidel 044ecf795d Merge branch 'feature_subscriber_count' of https://github.com/RedpointsBots/youtube-dl into RedpointsBots-feature_subscriber_count 4 years ago
Tom-Oliver Heidel a87a873d24 Merge branch 'bugfix_youtube_like_extraction' of https://github.com/RedpointsBots/youtube-dl into RedpointsBots-bugfix_youtube_like_extraction 4 years ago
Unknown c73baf23e0 fix to support python 2.6 4 years ago
Unknown 4bb9c8802e flake8 4 years ago
Unknown 9d9314cb66 [youtube] only playable on yt and age gated 4 years ago
Unknown 3d6a47d35f [skip travis] version 4 years ago
Unknown bdc3fd2f35 [core] add option to trim file name length with integer
https://github.com/blackjack4494/youtube-dlc/issues/85
4 years ago
Unknown 6923b5381f [hotstar] several api changes and payloads/queries 4 years ago
Unknown 3a379e5e83 [Bandcamp] update - fix regexp for JSON matching 4 years ago
Unknown 0c9df79e17 [core] no sleep affected subtitles only with enforced flag 4 years ago
Unknown 88bdacf33c Merge remote-tracking branch 'origin/master' 4 years ago
Unknown 8219ef6427 [tiktok] add referer - required to download from cdn 4 years ago
stephen 61e4c6ed45 Added regex for ABC.com site. 4 years ago
Unknown b33c48f269 [skip travis] version bump 4 years ago
Tom-Oliver Heidel 04b61c6572 Merge branch 'naver' of https://github.com/SeonjaeHyeon/youtube-dl into SeonjaeHyeon-naver 4 years ago
Unknown 915f2a92ac update workflow, semi fix integrated updater 4 years ago
Unknown 1b3f7c9a7e merge youtube-dl master 22.09.2020 4 years ago
Jody Bruchon a45e861918 Switch from binary search tree to Python sets
Signed-off-by: Jody Bruchon <jody@jodybruchon.com>
4 years ago
Jody Bruchon fd87f42378 Randomize the ArchiveTree the proper Python way
Signed-off-by: Jody Bruchon <jody@jodybruchon.com>
4 years ago
Jody Bruchon 2459b6e1cf Style revisions 4 years ago
Jody Bruchon 4f0150dcec Merge remote-tracking branch 'upstream/master' 4 years ago
Unknown 35d3b674c7 [hotstar] regex the second. 4 years ago
Jody Bruchon a4d834fb3e Fix wrong variable in position swap corrupting archive list
It's always a simple error in the end, you know?

Signed-off-by: Jody Bruchon <jody@jodybruchon.com>
4 years ago
Jody Bruchon fda63a4e87 Randomize archive order before populating search tree
This doesn't result in an elegant, perfectly balanced search tree,
but it's absolutely good enough. This commit completely mitigates
the worst-case scenario where the archive file is sorted.

Signed-off-by: Jody Bruchon <jody@jodybruchon.com>
4 years ago
Jody Bruchon 1d74d8d9f6 Try to mitigate the problem of loading a fully sorted archive
Sorted archives turn the binary tree into a linked list and make
things horribly slow. This is an incomplete mitigation for this
issue.
4 years ago
Jody Bruchon 1de7ea76f8 Remove recursion in at_insert() 4 years ago
Jody Bruchon a5029645ae Remove debugging print statements 4 years ago
Jody Bruchon ecdec1913f Keep download archive in memory for better performance
The old behavior was to open and scan the entire archive file for
every single video download. This resulted in horrible performance
for archives of any remotely large size, especially since all new
video IDs are appended to the end of the archive. For anyone who
uses the archive feature to maintain archives of entire video
playlists or channels, this meant that all such lists with newer
downloads would have to scan close to the end of the archive file
before the potential download was rejected. For archives with tens
of thousands of lines, this easily resulted in millions of line
reads and checks over the course of scanning a single channel or
playlist that had been seen previously.

The new behavior in this commit is to preload the archive file
into a binary search tree and scan the tree instead of constantly
scanning the file on disk for every file. When a new download is
appended to the archive file, it is also added to this tree. The
performance is massively better using this strategy over the more
"naive" line-by-line archive file parsing strategy.

The only negative consequence of this change is that the archive
in memory will not be synchronized with the archive file on disk.
Running multiple instances of the program at the same time that
all use the same archive file may result in duplicate archive
entries or duplicated downloads. This is unlikely to be a serious
issue for the vast majority of users. If the instances are not
likely to try to download identical video IDs then this should
not be a problem anyway; for example, having two instances pull
two completely different YouTube channels at once should be fine.

Signed-off-by: Jody Bruchon <jody@jodybruchon.com>
4 years ago