zydronium/google-nomulus

mirror of https://github.com/google/nomulus.git synced 2025-07-01 08:43:34 +02:00

Author	SHA1	Message	Date
mcilwain	001ce9cd52	Increase number of frontend/backend instances on prod/sandbox to 100 The higher the number the better for serious launches. These used to be 100 but had been detuned because instances weren't dying correctly when no longer needed, thus contributing to higher costs than necessary. That problem was fixed when we migrated to the Java 8 runtime, however, so there's no reason not to use the higher number. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=184742738	2018-02-20 15:18:54 -05:00
guyben	8beb10c2a3	Update sandbox / alpha cron.xml to be in line with production There are 2 types of changed done here: - reorder the existing cron jobs to be in the same order as production (for easier diffing) - add missing cron-jobs to either alpha or sandbox ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=183232936	2018-02-01 21:57:39 -05:00
mcilwain	46aa638b74	Rationalize prod/sandbox instance numbers to 50/5/50 That's 50 each for frontend and backend and 5 for tools. Since the MetricExporter bug has been fixed for awhile now, we aren't gaining anything by artificially keeping the instance number low, whereas we might benefit from higher instance counts, e.g. for load-testing. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=179432038	2017-12-27 11:13:42 -05:00
guyben	0e3d050dae	Temporarily disable deleteProberData cron job in sandbox for load-testing Loadtesting data is identified as "prober data" by this job (it removes anything under ".test", not only prober data) ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=177309096	2017-12-01 22:14:06 -05:00
mmuller	0ffd3553c3	Increase max number of sandbox frontend instances to 8 This mirrors production in hopes of triggering b/67508570 to test the fix. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=175295742	2017-11-21 18:24:32 -05:00
mcilwain	c24d5b8a88	Increase the frontend service idle timeout from 10 to 30 minutes This should help reduce the occurrence of requests taking a long time to process because a new instance is being spun up. We might consider increasing this further to 60 minutes in the future if necessary. This also increases the number of frontend instances on production to 8 from 6, since it appears like the issue we were attempting to mitigate with that change is now fixed. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=173440059	2017-11-07 17:07:10 -05:00
mcilwain	8908686f23	Add java8 runtime option to all non-production appengine-web*.xml files Java 8 is go! https://cloudplatform.googleblog.com/2017/09/Java-8-on-App-Engine-Standard-environment-is-now-generally-available.html We will add this option to the production files next week. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=170101056	2017-10-04 16:16:45 -04:00
mcilwain	7dc1940cdb	Move ResaveAllEppResources mapreduce from tools service to backend It makes sense for all mapreduces to run in backend, especially onces that are scheduled regularly to run in cron like this one now. We don't have many instances configured for the tools service anymore on some of our environments, so backend is the friendliest place for a mapreduce to run. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=168882122	2017-09-20 10:27:17 -04:00
guyben	d7214b58fc	Re-enable DeleteOldCommitLogs cron job Also adds a "resave all epp" cron job that's needed for the delete to work correctly. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=168879965	2017-09-20 10:27:17 -04:00
mcilwain	6e3a99a16b	Tune max # of frontend instances down to 4 in sandbox and alpha My continuing investigations into necessary resources for running our environments seems to indicate that four instances should be sufficient for our purposes. If it's not, we can always revert. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=155607688	2017-05-17 11:55:30 -04:00
mcilwain	2569d62b0a	Reduce maximum number of service instances for sandbox and alpha We want to lower the maximum number of service instances as much as possible without affecting service reliability so that we can make stronger statements about what the maximum cost of running a typical Nomulus environment might be. This first step likely won't affect the frontend and tools modules in practice because they aren't typically running even this many instances, but it will clamp down on the number of backend instances (which should be fine; it just means the mapreduces will take longer). Alpha is tuned down the same as sandbox for consistency reasons. This also standardizes on the B4 size (which has 512 MiB RAM) for all instances. Most instances were already using this, and the deviations from it were seemingly at random. Crucially, backend, which is likely most sensitive to this because it uses the mapreduce library, is already on the smaller memory size. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=154537995	2017-05-03 10:55:49 -04:00
mcilwain	cdadb54acd	Refer to Datastore everywhere correctly by its capitalized form ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=147479683	2017-02-17 12:12:12 -05:00
mcilwain	fb10005cba	Remove unneeded WEB-INF/*.yaml resource paths The YAML configuration files are now being built directly into the JAR, and not stored in the WEB-INF/ directory, so this is unnecessary. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=146815937	2017-02-08 17:42:49 -05:00
shikhman	d3fe6be385	Remove ECatcher from GAE code It's superceded by Stackdriver Trace. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=145440930	2017-01-25 12:25:41 -05:00
mcilwain	b5cf58bf2c	Add initial implementation of YAML config file This implements the basic framework that allows global YAML configuration, per-environment custom configuration, and unit- test-specific configuration. TESTED=I deployed to alpha, ran some EPP commands through the nomulus tool, and verified no errors. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=145422680	2017-01-25 12:25:05 -05:00
ctingue	f30b87d40e	Add ExpandRecurringBillingEventsAction cron job Note that this merely starts this MR on a daily schedule -- the billing queries that ultimately consume the synthetic OneTime events are filtering out the events at this time, so we're still relying on query-time expansion of Recurrings. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=144450565	2017-01-18 11:03:32 -05:00
mountford	890354938c	Switch TMCH SMDRL fetch job to start at 00:15 and 12:15 The job was starting at midnight and noon, which is exactly when the files are changing. This resulted in intermittent failures, as the files are temporarily missing during the changeover. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=139081163	2016-11-15 11:02:42 -05:00
mountford	5f691d6329	Enable built-in session cleanup in alpha and sandbox App Engine provides a servlet which deletes up to 100 expired _ah_SESSION entities from DataStore. This CL adds a cron job to call the servlet every 15 minutes in both alpha and sandbox. Assuming all goes well, we will turn it on in production. I originally learned about this servlet here: http://www.radomirml.com/blog/2011/03/26/cleaning-up-expired-sessions-from-app-engine-datastore/ But it appears that we do not need a servlet definition, just a cron entry. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=137533532	2016-11-02 15:19:34 -04:00
mcilwain	6636e02d57	Clean up and consolidate some unused queues ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=136170276	2016-10-14 17:37:21 -04:00
mcilwain	e192c11adf	Add a batched [] for DNS refreshing on host renames This will replace the existing DnsRefreshForHostRenameAction. This is stage one of a three stage migration process. It adds the new queue and [] but doesn't call them yet. Stage two will cut over to using the new functionality, and stage three will remove the old functionality. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=134793963	2016-10-03 16:11:00 -04:00
mcilwain	7f0cb4eae5	Move prober deletion action to backend module Also creates a new package named 'batch' to house it. TESTED=I deployed it to alpha, sent a POST request to the task URL, and it successfully ran the []. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=134332999	2016-10-03 15:41:38 -04:00
mcilwain	00ea99960a	Improve efficiency of async contact and host deletion with batching This allows handling of N asynchronous deletion requests simultaneously instead of just 1. An accumulation pull queue is used for deletion requests, and the async deletion [] is now fired off whenever that pull queue isn't empty, and processes many tasks at once. This doesn't particularly take more time, because the bulk of the cost of the async delete operation is simply iterating over all DomainBases (which has to happen regardless of how many contacts and hosts are being deleted). ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=133169336	2016-09-19 11:47:55 -04:00
mcilwain	c06133435b	Consolidate some cron groups and use fanout action for more retrying This is the first step in consolidating our task queues down into a smaller number. We have lots of tasks that run quite infrequently, and they can all run in the same queue to get retry semantics without needing a new queue for each cron entry. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=131990472	2016-09-02 13:56:23 -04:00
Justine Tunney	7cc7dc4af2	Use zip_rule() for EAR/WAR deploy archives These build rules allow Bazel to generate the .ear and .war files which appcfg.sh (a tool that comes with the App Engine SDK) can then use to perform a deployment. Included in this CL are configurations for five separate production environments: production, sandbox, alpha, crash, and local. ------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=129163010	2016-08-02 21:00:39 -04:00

24 commits