Commit Graph

565 Commits (d76f38bf20523ba074d5c8d375da2216c664a352)
 

Author SHA1 Message Date
Mike Lang d76f38bf20 prometheus: include url as a label
for coverage maps
5 years ago
Mike Lang 9a1369cf98 overview: Fix job -> service 5 years ago
Mike Lang e1993c6a79 overview dashboard: Look up services by 'service' label, not job
Job can't be repeated across scrape jobs, service can
5 years ago
Mike Lang ac98d67853 overview dashboard: Hide UNEDITED and DONE states so the others are visible 5 years ago
Mike Lang 8a65d18f74 prometheus config: Support mixed http and https scraping 5 years ago
Mike Lang 94d81d708f Downloader: Change access_token call to match website
It stopped working, these changes bring it back in line with the website
so it works.
5 years ago
Mike Lang aa3ca60b73
make video description slightly narrower
so that with the 1px border it's not too wide
5 years ago
Mike Lang eba5fc498a Remove flask response size tracking
Despite our best efforts, this was causing chunked responses to be fully
buffered into memory as a side effect.

This is really bad because responses can be VERY large.
5 years ago
Mike Lang 4bbcc8bc06 Revert "Merge pull request #155 from ekimekim/mike/manual-uploads"
This reverts commit 99de586353, reversing
changes made to 4b04f70b6f.

We don't need this feature and it complicates things and adds bugs.
5 years ago
Christopher Usher 845744cbf6 use a UTC timestamp 5 years ago
Christopher Usher 6b51734bbf added the ability to change the filename prefix 5 years ago
Christopher Usher 1325ccf280 added a read only user to database setup script 5 years ago
Mike Lang dc7f093ba0 Disable mp4 option for restreamer cuts
It caused our RSS to explode and i'm not sure why
5 years ago
Mike Lang 59d0fa3e40 sheetsync: Don't mis-parse blank as bad time 5 years ago
Mike Lang 99de586353
Merge pull request #155 from ekimekim/mike/manual-uploads
manual upload
5 years ago
Mike Lang 9ccc7e4e8d thrimshim: Allow manual_link to set things from UPLOAD_PENDING to TRANSCODING 5 years ago
Mike Lang c580671da2 Create upload pending state 5 years ago
Mike Lang 4b04f70b6f overview dashboard: Add system-level metrics 5 years ago
Mike Lang 967ac7b856 segment_coverage: Reduce "no hours" warning to info
This is too noisy at warning level, and comes up for non-main channels.
5 years ago
Mike Lang ab157afe20 sheetsync: Clear event counts before each update
Otherwise, no count of 0 ever gets set, and things are left showing
values when they shouldn't.
5 years ago
Mike Lang 47c8ebf11f nginx: SSL server should have same options as non-SSL 5 years ago
Mike Lang b936b9ab1c
Merge pull request #153 from ekimekim/mike/cache-builds
Add ability to explicitly pull and re-use layers from other commits when building
5 years ago
Mike Lang d231078048 Add ability to explicitly pull and re-use layers from other commits when building
This is intended mainly for travis CI, because by default it doesn't cache any layers
between builds.

By pulling likely-reusable builds (all parents of the current commit),
we take a fixed cost slowdown but in many cases should see a dramatic speed increase
overall, since we won't need to re-build anything that hasn't changed.

This isn't needed for local builds, where docker will do this on its own
with any previously-built images.
5 years ago
Mike Lang 0ab15672ae
Merge pull request #152 from ekimekim/mike/nginx/ssl
Add SSL to nginx if certs are given
5 years ago
Mike Lang 64766bcf35 Add SSL to nginx if certs are given 5 years ago
Mike Lang cff5c38691 Add new dashboard 5 years ago
Mike Lang 2efe1d6218 Fix a bad logging line when handling errors 5 years ago
Mike Lang 59ee5cf5c0 Only log at INFO about multiple versions of a segment
Since these tend to happen around stream endings, etc,
we don't want them to be crazy noisy and cause us to disregard real problems.

We can use the segment coverage to see in metrics if there are overlaps.
5 years ago
Mike Lang 4be8faf82e
Merge pull request #151 from ekimekim/mike/sheetsync/track-row-stats
sheetsync: Record counts of rows in the DB, segmented by various columns
5 years ago
Mike Lang 89a9e5554c sheetsync: Record counts of rows in the DB, segmented by various columns
This lets us view a number of useful graphs in dashboards, eg. rows by state,
errored rows, rows by day, rows by category, meltdowns per day, fraction of
events that are poster moments by category.

Sheetsync was the natural place to do this since it was already periodically scanning
the entire events table.
5 years ago
Mike Lang 72172024be overview dashboard: Stop reporting stream delay after stream stops
It just goes up forever and isn't helpful.
5 years ago
Mike Lang 77f23d775a overview dashboard: Show offending instance in error log rate graph 5 years ago
Mike Lang 249e32583b get_best_segments: Don't error if the only segments that exist for time are temp 5 years ago
Mike Lang 605fedeb52 cutter: Log when upload errors occur
This got lost in the shuffle at some point.
5 years ago
Mike Lang 0f4f04a998
Merge pull request #150 from ekimekim/mike/disable-flamegraph
Allow disabling of stacksampling with an env var
5 years ago
Mike Lang 3eb0ed2350 thrimbletrimmer: Fix bug where "Draft Saved" is shown after a failed submission 5 years ago
Mike Lang 8bcc38d386 docker-compose: Add generic option to pass env vars to all services 5 years ago
Mike Lang 6b602592f5 Allow disabling of stacksampling with an env var
This gives an easy way to do so across all services without adding new options.

Reasons to do so might be to avoid overheads or because your prometheus metrics grow too large.
5 years ago
Mike Lang 2accfa9080
Merge pull request #149 from ekimekim/mike/monitoring/stuff
monitoring fixes and improvements
5 years ago
Mike Lang e5a7c8adfa monitoring: Add "role" concept
This lets us know if a service is MEANT to be running or not.
5 years ago
Mike Lang 21a46a66bb monitoring: Set instance to friendly name for each node we're monitoring
So that you get eg. "charm" instead of "IP:PORT"
5 years ago
Mike Lang 51adeeab19 monitoring: Fix problems with the prometheus container 5 years ago
Mike Lang b84d4de085 Add segment_coverage service to be monitored 5 years ago
Mike Lang 6f0eff0258
Merge pull request #148 from ekimekim/mike/nginx/prevent-dns-caching
nginx: Prevent nginx from caching dns results
5 years ago
Mike Lang 8ae600f2e5 nginx: Prevent nginx from caching dns results
Otherwise if the containers get restarted and change ip, nginx hits the wrong ip.

We do this via a hack where we make all references indirect through a variable.
Since nginx only resolves this at request time, it always does a dns request.
5 years ago
Mike Lang 5ed8cf0a33
Merge pull request #147 from ekimekim/chrusher-downloader-label-fix
fixed outdated "stream", "variant" in metric
5 years ago
Mike Lang 7b6f691d63
Merge pull request #146 from ekimekim/mike/unicode/hack
Automatically set default encoding to utf-8 when common is imported
5 years ago
Christopher Usher abb9193705 fixed outdated "stream", "variant" in metric 5 years ago
Mike Lang 4d3aa94a71 Automatically set default encoding to utf-8 when common is imported
To be clear, this is an awful hack.

It means that any implicit str/unicode coersion will use the utf-8 encoding,
which is basically always what you want.

However, it is possible that some badly-written libraries might be relying
on the default encoding being ascii, and will do weird things as a result.

Finally, it's especially hacky to be doing this as part of importing a library.
Normally you're meant to do this as part of a sitecustomize.py in your python system directory,
and the function is deleted before passing control to normal code (this is why we need
to reload() to get it back).
5 years ago
Mike Lang f452aa3c32
Merge pull request #143 from ekimekim/mike/sheetsync/api-usage
sheetsync improvements
5 years ago