* Add NotLoggedInException tests to flows and flow docs
This wasn't included in flows.md before because the test existed in
ResourceFlowTestCase. So even though the exception could be thrown, and
even though this was tested, it never appeared in the documentation,
because the documentation is generated from the corresponding concrete
test class.
* Validate SQL with Datastore being primary
Validates the data asynchronously replicated from Datastore to SQL.
This is a short-term tool optimized for the current production database.
Tested in production.
We want to keep the read-only-mode exception as an unchecked exception,
so we introduce a temporary check in the EppController that provides a
specific error message for this situation (rather than letting it fall
through to the generic "command failed" messaging).
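A minimal sketch of the shape of that check, with invented names (ReadOnlyModeException here stands in for whatever the real unchecked exception is called):

```java
/** Illustrative only; everything except the EppController name is an assumption. */
class ReadOnlyModeException extends RuntimeException {}

class EppController {
  String handleCommand(Runnable flow) {
    try {
      flow.run();
      return "1000 Command completed successfully";
    } catch (ReadOnlyModeException e) {
      // Temporary: map the unchecked exception to a targeted message instead
      // of letting it fall through to the generic "command failed" response.
      return "2400 Registry is in read-only mode; please try again later";
    }
  }
}
```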
* Replace with stringify() and VKey.create(string) (see the sketch after this list)
* Convert implicit cases of VKey.fromWebsafeKey(string)
* Convert from Key to VKey to use stringify()
* Modify existing code to show correct string representation of a key
* Use VKey.create(websafeKey) to get ofy key in ResaveEntitiesCommand
* Add TODO note in CommitLogMutation and determine if key string should be modified
* Revert from stringify() to getOfyKey().getString()
* Add bug ids to TODOs
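Taken together, the intended round trip looks roughly like this (a sketch assuming the VKey methods named in the commits above; signatures unverified):

```java
// stringify() replaces getOfyKey().getString(); VKey.create(String) replaces
// the implicit VKey.fromWebsafeKey(String) conversions.
VKey<DomainBase> key = domain.createVKey();
String flattened = key.stringify();
VKey<DomainBase> restored = VKey.create(flattened);
assert restored.equals(key);
```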
* Ignore read-only mode in SQL->DS replication process
We need to be able to save indices and save data about the replication
even when we're in read-only mode.
We can handle it the same way that we handle UpdateAutoTimestamp, where
we simply populate it in SQL if it doesn't exist. This has the following
benefits:
1. The converter is unnecessary code.
2. We get non-null column definitions for free (overridden in
EppResource to allow null creation times so that legacy *History objects
can contain null in that field).
3. More importantly, this allows for proper SQL->DS replay. If the
field is filled out using a converter (as before this PR), then the field
is only actually filled out on transaction commit (rather than when the
write occurs within the transaction). This means that when we serialize
the Transaction object during the transaction (the data that gets
replayed to Datastore), we are crucially missing the creation time.
If the creation time is written on commit, we have to start a new
transaction to write the Transaction object, and it's an absolute
necessity that the record of the transaction be included in the
transaction itself so as to avoid situations where the transaction
succeeds but the record fails.
If the field is filled out in a @PrePersist method, crucially that
occurs on the object write itself (before transaction commit).
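A hedged sketch of the @PrePersist approach (standard JPA; the field and class names are illustrative, not the actual UpdateAutoTimestamp-style code):

```java
import java.time.Instant;
import javax.persistence.Column;
import javax.persistence.MappedSuperclass;
import javax.persistence.PrePersist;

@MappedSuperclass
abstract class CreateTimestampedEntity {
  // Set on the object write itself, before transaction commit, so the value
  // is present when the Transaction object is serialized for SQL->DS replay.
  @Column(nullable = false)
  Instant creationTime;

  @PrePersist
  void fillCreationTime() {
    if (creationTime == null) {
      creationTime = Instant.now();
    }
  }
}
```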
The original RDE pipeline was a direct translation of the App Engine
MapReduce logic. It turned out to be too slow (taking more than a day to
run) due to the way it finds the most recent history entry.
This PR overhauled the pipeline by using embedded EPP resource entities
inside history entries (only available in SQL) and finding the most
recent entries using the SQL engine. It cuts the run time down to ~2h.
Note that there are quota limits on the CPU cores and external IP
addresses for a given GCP region inside a project, which will need to
accommodate the resource requirements for the pipeline. More details are
provided in comments.
Also merged the update cursor stage and enqueue next action stage in
RdeIO so that they can be done within a transaction, same as how
MapReduce handles them.
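Finding the most recent history entry per domain in the SQL engine might look something like this (a sketch; entity and column names are not the actual Nomulus schema):

```java
List<DomainHistory> latest =
    entityManager
        .createQuery(
            "FROM DomainHistory h "
                + "WHERE h.modificationTime = "
                + "  (SELECT MAX(h2.modificationTime) FROM DomainHistory h2 "
                + "   WHERE h2.domainRepoId = h.domainRepoId "
                + "   AND h2.modificationTime <= :watermark)",
            DomainHistory.class)
        .setParameter("watermark", watermark)
        .getResultList();
```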
* Change TaskOptions to Task in CommitLogFanoutAction
* Add a createTask method that takes clock and jitterSeconds
* Change createTask parameter type and improve test cases
* Improve comments and test cases
* Improve test cases that handle jitterSeconds (see the sketch below)
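A hedged sketch of such a createTask helper using the Cloud Tasks v2 client (the real signature in the codebase may differ):

```java
import com.google.cloud.tasks.v2.HttpMethod;
import com.google.cloud.tasks.v2.HttpRequest;
import com.google.cloud.tasks.v2.Task;
import com.google.protobuf.Timestamp;
import java.time.Clock;
import java.time.Instant;
import java.util.Random;

static Task createTask(
    String url, HttpMethod method, Clock clock, int jitterSeconds, Random random) {
  // Spread the schedule time by up to jitterSeconds to avoid thundering herds.
  Instant when =
      Instant.now(clock).plusSeconds(jitterSeconds > 0 ? random.nextInt(jitterSeconds) : 0);
  return Task.newBuilder()
      .setHttpRequest(HttpRequest.newBuilder().setUrl(url).setHttpMethod(method).build())
      .setScheduleTime(Timestamp.newBuilder().setSeconds(when.getEpochSecond()).build())
      .build();
}
```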
* Grandfather in old data for one-time billing event requirement
We have data from 2018 and earlier where we didn't consistently set periodYears
for OneTime BillingEvents with certain reasons. This grandfathers in that old
data so that we can successfully move it over to Cloud SQL for now, then we can
later run a query that will backfill it, after which we can then tighten up the
requirement again. Note that the requirement is still being enforced for all
billing events from 2019 onwards.
This also improves the handling of validation by adding a private field to the
Reason enum rather than creating a throwaway inline ImmutableSet in the
Builder.
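A sketch of what that enum-driven validation might look like (names and the cutoff representation are illustrative):

```java
import java.time.Instant;

enum Reason {
  CREATE(true),
  RENEW(true),
  TRANSFER(true),
  SERVER_STATUS(false);

  private static final Instant CUTOFF = Instant.parse("2019-01-01T00:00:00Z");

  private final boolean requiresPeriodYears;

  Reason(boolean requiresPeriodYears) {
    this.requiresPeriodYears = requiresPeriodYears;
  }

  // Called from the Builder; pre-2019 events are grandfathered in.
  void checkPeriodYears(Integer periodYears, Instant eventTime) {
    if (requiresPeriodYears && periodYears == null && !eventTime.isBefore(CUTOFF)) {
      throw new IllegalStateException("periodYears is required for reason " + name());
    }
  }
}
```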
BSD sed requires a parameter to -i to indicate the backup suffix. By
adding a blank suffix the sed command works on both Linux and macOS.
* Make TaskMatcher default to POST methods
TaskOptions.Builder.withUrl() defaults to the POST method. Therefore, it seems
reasonable to verify that task queue methods are using the POST method,
especially given that the method must now be identified explicitly when using
CloudTaskUtils. This check would have guarded against the bug fixed by #1413;
see the sketch below.
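Sketched test-side effect, assuming a TaskMatcher-style fluent API (not the exact helper code):

```java
// Before: the method had to be stated explicitly in every assertion.
assertTasksEnqueued("async-actions",
    new TaskMatcher().url("/_dr/task/resaveEntity").method("POST"));

// After: POST is the default, so an accidentally-GET task now fails the match.
assertTasksEnqueued("async-actions",
    new TaskMatcher().url("/_dr/task/resaveEntity"));
```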
* Elaborate on comment
* Further improved the comment
* Remove the ineffective SQL injection check
Remove the ineffective SQL-injection attack check in go/r3pr/954. It is
quite restrictive, causing a long exemption list, and it doesn't protect
queries made through helpers such as QueryComposer.
We will start from scratch on a new solution.
* Add the Cloud SQL queries for transaction reports
* Add the remaining queries
* Some query fixes
* Fix comments
* Fix indentation in total_nameservers
* Fix indentation on other Case condition
* Fix InitSqlPipeline regarding synthesized history
There are a few bad domains in Datastore that we hardcoded to ignore
during SQL population. They had no history, so we didn't need to filter
when writing history entries.
Recently we created synthesized history for domains, including the bad
ones, so now we need to filter History entries as well; see the sketch below.
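The added filtering could be as simple as this Beam sketch (identifiers and the accessor are assumptions):

```java
import java.util.Set;
import org.apache.beam.sdk.transforms.Filter;
import org.apache.beam.sdk.values.PCollection;

static PCollection<HistoryEntry> removeIgnoredDomains(
    PCollection<HistoryEntry> histories, Set<String> ignoredRepoIds) {
  // Drop history (including the newly synthesized entries) for the
  // hardcoded bad domains.
  return histories.apply(
      "FilterIgnoredDomainHistory",
      Filter.by(h -> !ignoredRepoIds.contains(h.getDomainRepoId())));
}
```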
* Support shared database snapshot
Allow multiple workers to share a CONSISTENT database snapshot. The
motivating use case is SQL database snapshot loading, where it is too
slow to depend on one worker to load everything.
This currently is postgresql-specific, but will be improved to be
vendor-independent.
Also made sure AppEngineEnvironment.java clears the cached environment
in all cases when tearing down.
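The underlying postgresql mechanism (pg_export_snapshot() plus SET TRANSACTION SNAPSHOT, both standard postgresql features) looks like this over JDBC; the surrounding Nomulus plumbing is omitted:

```java
import java.sql.Connection;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

// One worker opens a transaction and exports a snapshot id...
static String exportSnapshot(Connection conn) throws SQLException {
  conn.setAutoCommit(false);
  try (Statement stmt = conn.createStatement()) {
    stmt.execute("SET TRANSACTION ISOLATION LEVEL REPEATABLE READ");
    try (ResultSet rs = stmt.executeQuery("SELECT pg_export_snapshot()")) {
      rs.next();
      // Valid only while the exporting transaction stays open.
      return rs.getString(1);
    }
  }
}

// ...and every other worker attaches to the same snapshot.
static void attachToSnapshot(Connection conn, String snapshotId) throws SQLException {
  conn.setAutoCommit(false);
  try (Statement stmt = conn.createStatement()) {
    stmt.execute("SET TRANSACTION ISOLATION LEVEL REPEATABLE READ");
    stmt.execute("SET TRANSACTION SNAPSHOT '" + snapshotId + "'");
  }
}
```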
* Update terraform files and instructions
Update proxy terraform files based on current best practices and allow
exclusion of forwarding rules for HTTP endpoints. Specifically:
- Add a "public_web_whois" input to allow disabling the public HTTP
whois forwarding.
- Add "description" fields to all variables.
- Move outputs of the top-level module into "outputs.tf".
- Auto-reformat using hclfmt.
* Make entities serializable for DB validation
Make entities that are asynchronously replicated between Datastore and
Cloud SQL serializable so that they may be used in the BEAM-pipeline-based
comparison tool.
Introduced an UnsafeSerializable interface (extending Serializable) and
added to relevant classes. Implementing classes are allowed some
shortcuts as explained in the interface's Javadoc. Post migration we
will decide whether to revert this change or properly implement
serialization.
Verified with production data.
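At its core the interface is just a marker (a sketch; the real Javadoc documents the allowed shortcuts in detail):

```java
import java.io.Serializable;

/**
 * Marker for entities that are serializable only for the purposes of the
 * Datastore/SQL comparison pipeline; implementations may take shortcuts
 * that would be unsafe for general-purpose serialization.
 */
public interface UnsafeSerializable extends Serializable {}
```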
This is used for the replay locks so that Beam pipelines (which will be
used for database comparison) can acquire / release locks as necessary
to avoid database contention. If we're comparing contents of Datastore
and SQL databases, we shouldn't have replay actively running during the
comparison, so the pipeline will grab the locks.
Beam doesn't always play nicely with loading from / saving to Datastore,
so we need to make sure that we store the replay locks in SQL at all
times, even when Datastore is the primary DB.
* Re-enable replay tests for most environments
This enables the replay tests except in environments where
the NOMULUS_DISABLE_REPLAY_TESTS environment variable is set to "true".
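With stock JUnit 5 this kind of gating can be expressed as below; whether the actual tests use this annotation or a custom extension is an assumption:

```java
import org.junit.jupiter.api.Test;
import org.junit.jupiter.api.condition.DisabledIfEnvironmentVariable;

@DisabledIfEnvironmentVariable(named = "NOMULUS_DISABLE_REPLAY_TESTS", matches = "true")
class ReplayExtensionTest {
  @Test
  void commitLogsAreReplayed() {
    // ...
  }
}
```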
* Add a check for null
* Alt entity model for fast JPA bulk query
Defined an alternative JPA entity model that allows fast bulk loading of
multi-level entities, DomainBase and DomainHistory. The idea is to bulk
load the base table as well as the child tables separately, and assemble them
into the target entity in memory in a pipeline.
For DomainBase:
- Defined a DomainBaseLite class that models the "Domain" table only.
- Defined a DomainHost class that models the "DomainHost" table
(nsHosts field).
- Exposed ID fields in GracePeriod so that they can be mapped to domains
after being loaded into memory.
For DomainHistory:
- Defined a DomainHistoryLite class that models the "DomainHistory"
table only.
- Defined a DomainHistoryHost class that models its namesake table.
- Exposed ID fields in GracePeriodHistory and DomainDsDataHistory
classes so that they can be mapped to DomainHistory after being
loaded into memory.
In PersistenceModule, provisioned a JpaTransactionManager that uses
the alternative entity model.
Also added a pipeline option that specifies which JpaTransactionManager
to use in a pipeline.
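The in-memory assembly step might look roughly like this (accessor and factory names are assumptions):

```java
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

// Bulk load each table separately, then join by repo id in memory.
static List<DomainBase> assemble(List<DomainBaseLite> bases, List<DomainHost> hosts) {
  Map<String, List<DomainHost>> hostsByDomain =
      hosts.stream().collect(Collectors.groupingBy(DomainHost::getDomainRepoId));
  return bases.stream()
      .map(b -> b.toDomainBase(hostsByDomain.getOrDefault(b.getRepoId(), List.of())))
      .collect(Collectors.toList());
}
```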
I observed an instance in which a couple queries from this action were,
for whatever reason, hanging around as idle for >30 minutes. Assuming
the behavior that we saw before where "an open idle serializable
transaction means all pg read-locks stick around forever" still holds,
that's the reason why the number of read locks in use spirals out of
control.
I'm not sure why those queries aren't timing out, but that's a separate
issue.
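For reference, such transactions show up in the standard pg_stat_activity view, e.g. via JDBC (threshold arbitrary):

```java
import java.sql.Connection;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

static void logIdleTransactions(Connection conn) throws SQLException {
  String sql =
      "SELECT pid, now() - xact_start AS open_for, query "
          + "FROM pg_stat_activity "
          + "WHERE state = 'idle in transaction' "
          + "AND now() - xact_start > interval '30 minutes'";
  try (Statement stmt = conn.createStatement();
      ResultSet rs = stmt.executeQuery(sql)) {
    while (rs.next()) {
      System.out.printf("pid=%d open_for=%s%n", rs.getInt("pid"), rs.getString("open_for"));
    }
  }
}
```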
* Fix problems with the format tasks
The format check uses python2, and if "python" doesn't exist on the path
(or isn't python 2, or there is any other error in the python code or in the
shell script...) the format check just silently succeeds.
This change:
- Refactors out the gradle code that finds a python3 executable and uses it
to get the python executable for the format check.
- Upgrades google-java-format-diff.py to python3 and removes the #! line.
- Fixes the shell script to ensure that failures are propagated.
- Suppresses error output when checking for python commands.
Tested:
- verified that python errors cause the build to fail
- verified that introducing a bad format diff causes check to fail
- verified that javaIncrementalFormatDryRun shows the diffs that would be
introduced.
- verified that javaIncrementalFormatApply reformats a file.
- verified that well formatted code passes the format check.
- verified that an invalid or missing PYTHON env var causes
google-java-format-git-diff.sh to fail with the appropriate error.
* Fix presubmit issues
Omit the format presubmit when not in a git repo and remove unused "string"
import.
* Add a beam pipeline to create synthetic history entries in SQL
The logic is mostly lifted from CreateSyntheticHistoryEntriesAction. We
do not need to test for the existence of an embedded EPP resource in the
history entry before creating a synthetic one, because after
InitSqlPipeline runs it is guaranteed that no embedded resource exists.
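The heart of the pipeline is a per-resource DoFn along these lines (a sketch; DomainHistory.createSyntheticFrom is an invented stand-in for the real synthesis logic, and jpaTm() stands for the JPA transaction manager):

```java
import org.apache.beam.sdk.transforms.DoFn;

class CreateSyntheticHistoryEntryFn extends DoFn<DomainBase, Void> {
  @ProcessElement
  public void processElement(@Element DomainBase domain) {
    // No need to check for an existing embedded EPP resource: after
    // InitSqlPipeline runs, none exists.
    jpaTm().transact(() -> jpaTm().put(DomainHistory.createSyntheticFrom(domain)));
  }
}
```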
* Set payload in success response after sending expiring certificate notification emails
* Modify log message and test cases for run() in sendExpiringCertificateNotificationEmailAction
* Resolve merge conflict
* Include reason and requestedByRegistrar in URS test file
* Modify test cases for new parameters in renew flow
* Add reason and registrar_request to renew domain command
* Update comments for new params in renew flow
* Make changes based on feedback