wubloader

Commit Graph

Author	SHA1	Message	Date
Mike Lang	62bd6539ea	Unpin gevent as that was a workaround for a py2 issue	3 years ago
Mike Lang	21856c68aa	Fix all instances of file.write() for py3 In python 3, file.write() may do a partial write and returns the number of characters written. In order to not lose data, we need to wrap every instance of file.write() with our new common.writeall() wrapper that loops until the data is actually written.	3 years ago
Mike Lang	a56f6859bb	more py3 fixes	3 years ago
Mike Lang	f2a8007bf7	Fix build dependency issues	3 years ago
Mike Lang	50231a5154	py3 fixes for cutter	3 years ago
HubbeKing	6d790a1b36	Do a first naive pass for py3 compatibility Check that open() calls for reading and writing use binary modes Use alpine version with py3-pip package Use python3 in Dockerfile CMD Remove sys.setdefaultencoding() "hack" Simplify ensure_directory() in common.common package	3 years ago
Mike Lang	f0546e2ee3	Pin gevent to 1.5a2 to avoid https://github.com/gevent/gevent/issues/1711	3 years ago
Mike Lang	b53fcd65a0	Add dependencies required to install psycopg2 from source We can't install the binaries as they don't support musl	4 years ago
HubbeKing	86f7823348	Replace calls to gevent.signal() with gevent.signal_handler() gevent.signal() was removed in gevent 1.5a4, see http://www.gevent.org/api/gevent.signal.html Removed on Feb 5th, see https://github.com/gevent/gevent/pull/1530	4 years ago
Mike Lang	a53786dc2d	Add file and make as build dependencies gevent now requires these to build. I'm not sure when this changed.	4 years ago
Mike Lang	b9cd76b1a2	Add non-static implict tags in sheetsync In order for the upcoming playlist manager to be able to use the DB `tags` column to know what tags a video has, all the tags it needs need to be present. Previously, this was a problem because the day and category tags only get added at the cutter and so wouldn't be listed. This moves them so they are added when parsing the row in sheetsync. It also adds the poster moment tag if poster moment is checked. Note that fully static tags that go on all videos are still only added in cutter, but the playlist manager doesn't need to care about those (since by definition they will match every video).	4 years ago
Mike Lang	a30d595fee	cutter: Use video's custom tags when uploading	4 years ago
Mike Lang	4a2fe7a6ed	cutter: Explicitly set mime type of uploads correctly	5 years ago
Mike Lang	fde2758275	cutter: Fix bug where uploader was cleared on non-retryable error Instead of on retryable error	5 years ago
Mike Lang	a002619c4c	youtube upload: Explicitly set mime type	5 years ago
Mike Lang	a438d86f80	cutter: Fix multiple problems with logging errors	5 years ago
Mike Lang	4bbcc8bc06	Revert "Merge pull request #155 from ekimekim/mike/manual-uploads" This reverts commit `99de586353`, reversing changes made to `4b04f70b6f`. We don't need this feature and it complicates things and adds bugs.	5 years ago
Mike Lang	c580671da2	Create upload pending state	5 years ago
Mike Lang	605fedeb52	cutter: Log when upload errors occur This got lost in the shuffle at some point.	5 years ago
Mike Lang	e4706232ad	cutter: Fix bug in handling cancellation before finalize it would always think it's finalized	5 years ago
Mike Lang	4c0e20983c	Local upload backend: Always use .ts extension Now that we're doing that for both fast and full cuts.	5 years ago
Mike Lang	1cb4d41334	cutter: Set upload time when marking video as done without transcoding	5 years ago
Mike Lang	a7ac264201	cutter: Fix local upload backend, which was returning video id as a uuid instead of a string	5 years ago
Mike Lang	da58021c6f	cutter: Fix typo where encoding settings had wrong name	5 years ago
Mike Lang	1ef15780df	cutter: Fix lack of info in error handling	5 years ago
Mike Lang	c1cb7966af	cutter: Default to full cuts, not fast	5 years ago
Mike Lang	b2a07ef114	Merge pull request #140 from ekimekim/mike/build-improvements Refactor dockerfiles for more shared layers	5 years ago
Mike Lang	731ef9e2d0	Refactor dockerfiles for more shared layers By carefully ensuring most of our dockerfiles are identical in their first few layers, we only need to build those layers once instead of every time. In particular, we move installing gevent to before installing common, so that even when common changes gevent doesn't need to be reinstalled. This is important because gevent takes ages to install. Also fixes segment_coverage, which wasn't being installed.	5 years ago
Mike Lang	3dfe0e8722	Allow thrimshim to safely cancel a job while it is cutting This differs from the existing reset row by only suceeding if the upload is not in finalizing. We also make some changes to cutter to handle this situation gracefully.	5 years ago
Mike Lang	4d52b18b04	cutter,restreamer: Set stream=True for full cuts when appropriate And also default to a new ffmpeg encoding setting for high-quality mpegts (ie. still streamable) that is encoded very quickly.	5 years ago
Mike Lang	f9b48bc70e	cutter: Add more metrics	5 years ago
Mike Lang	52e6c4ad41	sheetsync, cutter: Collect metrics on http calls In particular, to google apis.	5 years ago
Mike Lang	17af1c4e89	cutter, sheetsync: Wait for DB to connect on startup This is a nicer error than crashing in the depths of some error handler (which is what will happen if the DB goes unavailable while they're running), and it's a far more common case (eg. the DB is misconfigured) than having it fail halfway through. Neither of these services can do anything meaningful without the DB, so crashing without it is acceptable behaviour.	5 years ago
Mike Lang	ba746ff6e6	Add title and description header/footer in thrimshim instead of cutter This accomplishes two things: 1. It allows thrimshim to properly validate length restrictions (not implemented yet) 2. It means that the database has a record of the values actually written for each of these rows, instead of that information depending on how the cutter was configured at the time.	5 years ago
Mike Lang	596cd92644	cutter: Add more specific error handling to upload backends For youtube, know that 4xx's are safe even if finalizing was set. For local, make all disk errors retryable since it doesn't matter.	5 years ago
Mike Lang	736040435c	Refactor error handling in uploads Instead of handling each error condition seperately, we raise an UploadError which includes whether it's retryable. The advantage of this is that upload backends can also raise an UploadError to indicate two conditions it currently cannot: That an error is unretryable That an error is retryable, even if the row was already in finalizing Under this scheme, errors while cutting become unretryable UploadErrors, and unhandled exceptions in uploading become retryable UploadErrors if the row is not yet finalizing only.	5 years ago
Mike Lang	40458d9d7f	local backend: Use original version of title in write_info instead of safe version	5 years ago
Mike Lang	3fbbe59b00	cutter Local backend: Fix typo and file extension when full cutting	5 years ago
Mike Lang	b0a71fd9f1	cutter: Fix mistake in check_candidate error handling	5 years ago
Mike Lang	ba53172cbc	cutter: Get video link from backend instead of hard-coding youtube	5 years ago
Mike Lang	ed2395d08e	cutter: Fix typo that prevented backends from being configured	5 years ago
Mike Lang	1218b694dd	cutter: Allow fast or full cuts per location On full cut, use encoding settings determined by the backend.	5 years ago
Mike Lang	d3e1d6b4fc	Resurrect non-experimental cut, now dubbed "full" (vs "fast") cut In a fast cut, we edit the first and last segments then concatenate them all. However, this leads to some tiny but perciptible artifacting around the border of the first and second (and second-last and last) segments. A full cut is much slower, but re-encodes the video into the desired format and is more reliable. We want both options to be available. With this commit, we only add the option, we don't use it in restreamer or cutter.	5 years ago
Mike Lang	40c4baef0f	youtube upload: Set category and language settings configured on a per-location basis.	5 years ago
Mike Lang	5e0d5b9ddc	cutter: Add category and sheet_name as video tags This gives us tags on categories, and on which day's sheet the event came from. We use these to make automatic playlists.	5 years ago
Mike Lang	cbd0ef3d9e	cutter: Add title header, description footer and static tags These are pre-canned parts of the video metadata that we want to be configurable.	5 years ago
Mike Lang	1159a518f0	Merge pull request #100 from ekimekim/mike/cutter/multiple-locations cutter: Allow multiple upload locations	5 years ago
Mike Lang	1a177931a6	cutter: Simplify upload backend interface Only support iterable of string, not file-like or string. This is a minor usability loss but we only call this from one place anyway and it's always an iterable of string.	5 years ago
Mike Lang	f8039028c4	local upload backend: Add a testing option to show info upload_video() was called with This is useful because it lets us test uploads and tag logic without involving youtube.	5 years ago
Mike Lang	d6ed138db8	cutter: Set an error message to warn humans when it discards a candidate due to holes This prevents videos being stuck in EDITED with no visible problem when they contain holes, but is likely to false positive sometimes. This is fine though, as it's just a human-readable warning and it will be cleared as soon as any node accepts the row to be cut.	5 years ago
Mike Lang	75acd978de	cutter: Implement a Local upload location This cuts to local disk, useful for self-hosting and testing.	5 years ago
Mike Lang	8e6c40610a	cutter: Allow config to manually disable transcode checking for location This deals with the problem where multiple youtube locations that refer to the same actual account (but with different settings) will all try to check for when videos are done transcoding, when only one is needed.	5 years ago
Mike Lang	ecf58dfbfc	cutter: Add ability to configure multiple upload locations Cutter now takes a 'config' arg which is a json blob with detail on each upload location. This is a bit nasty if you're trying to run it manually but was the easiest way to transfer the config data from docker-compose.jsonnet to the actual application.	5 years ago
Mike Lang	32d352e2ba	cutter: Set video directly to DONE if upload backend doesn't need transcoding This will be the case for probably all non-youtube backends.	5 years ago
Mike Lang	39f006fdab	cutter: Replace youtube client with generic upload backend and specific youtube implementaton Still to go: Actually constructing the correct backend based on given config, switching behaviour based on needs_transcode.	5 years ago
Mike Lang	15f879682d	cutter: Allow job to specify upload location This lays the groundwork for being able to cut to many upload locations. Right now, only a single location can be configured, and only youtube is supported.	5 years ago
Christopher Usher	9ed8facce8	fixed bug in error handling	5 years ago
Christopher Usher	a63e18c0a9	added state to upload_errors	5 years ago
Christopher Usher	228468749c	changes based on ekim's suggestions	5 years ago
Christopher Usher	5c08540c0a	stats for the cutter	5 years ago
Christopher Usher	989ef80d30	starting on adding some stats to the cutter	5 years ago
Christopher Usher	fec8cff185	Merge branch 'master' into chrusher-database-replication	5 years ago
Christopher Usher	027c2900e2	fixes in response to ekim's comments	5 years ago
Christopher Usher	23e3cfce20	Added editor, edit_time and upload_time to thrimshim and cutter updates of the database	5 years ago
Christopher Usher	1dbe585837	retry database connection if it fails	5 years ago
Mike Lang	e4d3e418c8	transcode checker: longer retry while waiting for videos to finish but still check db often. This prevents us from using too much api quota on these checks, while still letting us spot new videos quickly.	5 years ago
Mike Lang	1f15900b6f	cutter: At least for now, don't auto-retry errors This leads to rapidly exhausting our upload limit since even a fast failed request costs the same amount of usage quota as a 1-hour long video.	5 years ago
Mike Lang	fbef4725d7	cutter: Handle case where we are told to stop while looking for candidates Previously, it would return None and things would break. Now the None is handled correctly, and is documented.	5 years ago
Mike Lang	5cec6ec96e	cutter: Reconnect after any error that might be a database error After certain kinds of DB error (eg. lost conn), we need to make a new conn to have things work again. To be safe, we just do it after every error where it might be a problem.	5 years ago
Mike Lang	fea9ff6c1d	cutter: Fix dockerfile, which was missing ffmpeg dependency	5 years ago
Mike Lang	e048db0d94	cutter: Fix a failure mode where we never recover from a DB conn failure in TranscodeChecker Since we never got a new conn after failure, we would just keep erroring with "connection already closed" errors. This isn't applicable to the main cutter loops since a DB failure there will restart the process.	5 years ago
Mike Lang	85c110ccb4	cutter: Fix typo from when we moved to the client model instead of auth headers	5 years ago
Mike Lang	3647d091f8	Move common google api auth functionality into common So we can reuse it for google sheets	5 years ago
Mike Lang	5b2a1ef6b7	cutter: Implement actual cut methods Each method is fairly complicated, but is self-contained and can be examined independently. cut_jobs in particular contains several extra helpers and directs control flow via some iterators. This is unfortunately nessecary due to the requests interface.	6 years ago
Mike Lang	ae809c696c	cutter: Outline of how main cutter run loop works This commit only lays out the main loop, showing the high-level flow and defining shared utilities. This is for clarity. The actual methods that do the work will be implemented seperately.	6 years ago
Mike Lang	80c1a66aa0	cutter: Implement TranscodeChecker It runs on an interval, fetching all videos in TRANSCODING from the DB, checking them against youtube, and then updating any that are done. It should be noted that youtube somewhat lies about what being "done" means, but this is a better approximation than nothing.	6 years ago
Mike Lang	3ce8360a1e	cutter: Add database manager and connections One connection each for transcode checker and cutter. We don't need more than one each since both workers only ever do one thing at once.	6 years ago
Mike Lang	fdd245a6d9	cutter: Add lightweight youtube client Provides basic youtube api calls, and gets passed into both transcode checker and cutter. The official youtube client library is many orders of magnitude larger and more complicated, and can't actually do what we want (stream an upload of unknown size).	6 years ago
Mike Lang	e4b6110fd7	cutter: Add initial outline The cutter has two jobs: * To cut videos, taking them through states EDITED -> TRANSCODING * To monitor TRANSCODING videos for when they're complete We run these as separate greenlets with their own DB connections, and if either dies we gracefully shut down the other.	6 years ago

1 2 3

129 Commits (f0fb2fb281578d99584c5b5a685da3e86038d059)