mirror of
https://github.com/google/nomulus.git
synced 2025-04-30 03:57:51 +02:00
Shorten and edit README file
------------- Created by MOE: https://github.com/google/moe MOE_MIGRATED_REVID=134412992
This commit is contained in:
parent
e19546ffb4
commit
6c610d49fe
1 changed files with 91 additions and 268 deletions
359
README.md
359
README.md
|
@ -1,284 +1,107 @@
|
|||
# Domain Registry
|
||||
|
||||
Domain Registry is a production service for managing registrations on top-level
|
||||
domains in a shared namespace. Domain Registry runs on [Google App Engine][gae]
|
||||
and is written primarily in Java. It achieves in a hundred thousand lines of
|
||||
code what [Jon Postel][postel] used to do on index cards.
|
||||
## Overview
|
||||
|
||||
This is the software that [Google Registry][google-registry] uses to operate
|
||||
TLDs such as .GOOGLE, .HOW, .SOY, and .みんな.
|
||||
Domain Registry is an open source, scalable, cloud-based service for operating
|
||||
[top-level domains](https://en.wikipedia.org/wiki/Top-level_domain) (TLDs). It
|
||||
is the authoritative source for the TLDs that it runs, meaning that it is
|
||||
responsible for tracking domain name ownership and handling registrations,
|
||||
renewals, availability checks, and WHOIS requests. End-user registrants (i.e.
|
||||
people or companies that want to register a domain name) use an intermediate
|
||||
domain name registrar acting on their behalf to interact with the registry.
|
||||
|
||||
For more in-depth documentation, including install and setup instructions, see
|
||||
the Markdown documents in the `docs` directory.
|
||||
Domain Registry runs on [Google App
|
||||
Engine](https://cloud.google.com/appengine/docs/about-the-standard-environment)
|
||||
and is written primarily in Java. It is the software that [Google
|
||||
Registry](https://www.registry.google/) uses to operate TLDs such as .GOOGLE,
|
||||
.HOW, .SOY, and .みんな. It can run any number of TLDs in a single shared registry
|
||||
system using full horizontal scaling. Its source code is publicly available in
|
||||
this repository under the [Apache 2.0 free and open source
|
||||
license](https://www.apache.org/licenses/LICENSE-2.0).
|
||||
|
||||
### What is a Registry?
|
||||
## Getting started
|
||||
|
||||
When it comes to internet land, ownership flows down the following hierarchy:
|
||||
The following resources provide information on getting the code and setting up a
|
||||
running system:
|
||||
|
||||
1. [ICANN][icann]
|
||||
2. [Registries][registry] (e.g. Google Registry)
|
||||
3. [Registrars][registrar] (e.g. Google Domains)
|
||||
4. Registrants (e.g. you)
|
||||
* [Install
|
||||
guide](https://github.com/google/domain-registry/blob/master/docs/install.md)
|
||||
* Frequently asked questions
|
||||
* [Other docs](https://github.com/google/domain-registry/tree/master/docs)
|
||||
* Javadocs
|
||||
* [Domain registry user
|
||||
group](https://groups.google.com/forum/#!forum/domain-registry-users), for
|
||||
any other questions
|
||||
|
||||
A registry is any organization that operates an entire top-level domain. For
|
||||
example, Verisign controls all the .COM domains and Affilias controls all the
|
||||
.ORG domains.
|
||||
|
||||
### How Scalable is Domain Registry?
|
||||
|
||||
We successfully verified that Domain Registry is able to perform 1,000 EPP
|
||||
"domain creates" per second, with 99th percentile latency at ~3 seconds, and
|
||||
95th percentile latency at ~1 second. Please note that 1,000 was the highest QPS
|
||||
our load tester allowed.
|
||||
|
||||
In theory, Domain Registry is infinitely scalable. The only limitation is that
|
||||
each individual EPP resource can only support one write per second, which in
|
||||
practice, is more like ten. However reads to a single resource are free and
|
||||
unlimited.
|
||||
|
||||
### How Reliable is Domain Registry?
|
||||
|
||||
Domain Registry achieves its scalability without sacrificing the level of
|
||||
correctness an engineer would expect from an ACID SQL database.
|
||||
|
||||
Domain Registry is built on top of [Google Cloud Datastore][datastore]. This is
|
||||
a global NoSQL database that provides an unlimited number of [Paxos][paxos]
|
||||
entity groups, each of which being able to scale to an unlimited size while
|
||||
supporting a single local transaction per second. Datastore also supports
|
||||
distributed transactions that span up to twenty-five entity groups. Transactions
|
||||
are limited to four minutes and ten megabytes in size. Furthermore, queries and
|
||||
indexes that span entity groups are always eventually consistent, which means
|
||||
they could take seconds, and very rarely, days to update. While most online
|
||||
services find eventual consistency useful, it is not appropriate for a service
|
||||
conducting financial exchanges. Therefore Domain Registry has been engineered to
|
||||
employ performance and complexity tradeoffs that allow strong consistency to be
|
||||
applied throughout the codebase.
|
||||
|
||||
Domain Registry has a commit log system. Commit logs are retained in datastore
|
||||
for thirty days. They are also streamed to Cloud Storage for backup purposes.
|
||||
Commit logs are written across one thousand entity group shards, each with a
|
||||
local timestamp. The commit log system is able to reconstruct a global partial
|
||||
ordering of transactions, based on these local timestamps. This is necessary in
|
||||
order to do restores. Each EPP resource entity also stores a map of its past
|
||||
mutations with 24-hour granularity. This makes it possible to have point-in-time
|
||||
projection queries with effectively no overhead.
|
||||
|
||||
The Registry Data Escrow (RDE) system is also built with reliability in mind. It
|
||||
executes on top of App Engine task queues, which can be double-executed and
|
||||
therefore require operations to be idempotent. RDE isn't idempotent. To work
|
||||
around this, RDE uses datastore transactions to achieve mutual exclusion and
|
||||
serialization. We call this the "Locking Rolling Cursor Pattern." One benefit of
|
||||
this pattern, is that if the escrow service should fall into a failure state for
|
||||
a few days, or weeks, it will automatically catch up on its work once the
|
||||
problem is resolved. RDE is also able to perform strongly consistent queries
|
||||
with snapshot isolation across the entire datastore. It does this by sharding
|
||||
global indexes into entity groups buckets (which can be queried with strong
|
||||
consistency) and then rewinding the entities to the desired point in time.
|
||||
|
||||
The Domain Registry codebase is also well tested. The core packages in the
|
||||
codebase (model, flows, rde, whois, etc.) have 95% test coverage.
|
||||
If you are thinking about running a production registry service using our
|
||||
platform, please drop by the user group and introduce yourself and your use
|
||||
case. To report issues or make contributions, use GitHub issues and pull
|
||||
requests.
|
||||
|
||||
## Capabilities
|
||||
|
||||
Domain Registry has the following capabilities, many of which are standard IETF
|
||||
services.
|
||||
Domain Registry has the following capabilities:
|
||||
|
||||
### Extensible Provisioning Protocol (EPP)
|
||||
|
||||
[EPP][epp] is the core service of the registry. It's an XML protocol that's used
|
||||
by registrars to register domains from the registry on behalf of registrants.
|
||||
Domain Registry implements this service as an App Engine HTTP servlet listening
|
||||
on the `/_dr/epp` path. Requests are forwarded to this path by a public-facing
|
||||
proxy listening on port 700. Poll message support is also included.
|
||||
|
||||
To supplement EPP, Domain Registry also provides a public API for performing
|
||||
domain availability checks. This service listens on the `/check` path.
|
||||
|
||||
* [RFC 5730: EPP](http://tools.ietf.org/html/rfc5730)
|
||||
* [RFC 5731: EPP Domain Mapping](http://tools.ietf.org/html/rfc5731)
|
||||
* [RFC 5732: EPP Host Mapping](http://tools.ietf.org/html/rfc5732)
|
||||
* [RFC 5733: EPP Contact Mapping](http://tools.ietf.org/html/rfc5733)
|
||||
* [RFC 3915: EPP Grace Period Mapping](http://tools.ietf.org/html/rfc3915)
|
||||
* [RFC 5734: EPP Transport over TCP](http://tools.ietf.org/html/rfc5734)
|
||||
* [RFC 5910: EPP DNSSEC Mapping](http://tools.ietf.org/html/rfc5910)
|
||||
* [Draft: EPP Launch Phase Mapping (Proposed)]
|
||||
(http://tools.ietf.org/html/draft-tan-epp-launchphase-11)
|
||||
|
||||
### Registry Data Escrow (RDE)
|
||||
|
||||
RDE and BRDA are implemented as a cron mapreduce that takes a strongly
|
||||
consistent point-in-time snapshot of the registration database, turns it into a
|
||||
gigantic XML file, and uploads it to an SFTP server run by a third party escrow
|
||||
provider. This happens nightly with RDE and weekly with BRDA.
|
||||
|
||||
This service exists for ICANN regulatory purposes. ICANN needs to know that,
|
||||
should a registry business ever implode, that they can quickly migrate their
|
||||
TLDs to a different company so that they'll continue to operate.
|
||||
|
||||
* [Draft: Registry Data Escrow Specification]
|
||||
(http://tools.ietf.org/html/draft-arias-noguchi-registry-data-escrow-06)
|
||||
* [Draft: Domain Name Registration Data (DNRD) Objects Mapping]
|
||||
(http://tools.ietf.org/html/draft-arias-noguchi-dnrd-objects-mapping-05)
|
||||
* [Draft: ICANN Registry Interfaces]
|
||||
(http://tools.ietf.org/html/draft-lozano-icann-registry-interfaces-05)
|
||||
|
||||
### Trademark Clearing House (TMCH)
|
||||
|
||||
Domain Registry integrates with ICANN and IBM's MarksDB in order to protect
|
||||
trademark holders, when new TLDs are being launched.
|
||||
|
||||
* [Draft: TMCH Functional Spec]
|
||||
(http://tools.ietf.org/html/draft-lozano-tmch-func-spec-08)
|
||||
* [Draft: Mark and Signed Mark Objects Mapping]
|
||||
(https://tools.ietf.org/html/draft-lozano-tmch-smd-02)
|
||||
|
||||
### WHOIS
|
||||
|
||||
[WHOIS][whois] is a simple text-based protocol that allows anyone to look up
|
||||
information about a domain registrant. Domain Registry implements this as an
|
||||
internal HTTP endpoint running on `/_dr/whois`. A separate proxy running on port
|
||||
43 forwards requests to that path. Domain Registry also implements a public HTTP
|
||||
endpoint that listens on the `/whois` path.
|
||||
|
||||
* [RFC 3912: WHOIS Protocol Specification]
|
||||
(https://tools.ietf.org/html/rfc3912)
|
||||
* [RFC 7485: Inventory and Analysis of Registration Objects]
|
||||
(http://tools.ietf.org/html/rfc7485)
|
||||
|
||||
### Registration Data Access Protocol (RDAP)
|
||||
|
||||
RDAP is the new standard for WHOIS. It provides much richer functionality, such
|
||||
as the ability to perform wildcard searches. Domain Registry makes this HTTP
|
||||
service available under the `/rdap/...` path.
|
||||
|
||||
* [RFC 7480: RDAP HTTP Usage](http://tools.ietf.org/html/rfc7480)
|
||||
* [RFC 7481: RDAP Security Services](http://tools.ietf.org/html/rfc7481)
|
||||
* [RFC 7482: RDAP Query Format](http://tools.ietf.org/html/rfc7482)
|
||||
* [RFC 7483: RDAP JSON Responses](http://tools.ietf.org/html/rfc7483)
|
||||
* [RFC 7484: RDAP Finding the Authoritative Registration Data]
|
||||
(http://tools.ietf.org/html/rfc7484)
|
||||
|
||||
### Backups
|
||||
|
||||
The registry provides a system for generating and restoring from backups with
|
||||
strong point-in-time consistency. Datastore backups are written out once daily
|
||||
to Cloud Storage using the built-in Datastore snapshot export functionality.
|
||||
Separately, entities called commit logs are continuously exported to track
|
||||
changes that occur in between the regularly scheduled backups.
|
||||
|
||||
A restore involves wiping out all entities in Datastore, importing the most
|
||||
recent complete daily backup snapshot, then replaying all of the commit logs
|
||||
since that snapshot. This yields a system state that is guaranteed
|
||||
transactionally consistent.
|
||||
|
||||
### Billing
|
||||
|
||||
The registry performs a regular daily export of `BillingEvent` entities from
|
||||
Cloud Datastore, where they are stored and updated by the running system, to
|
||||
[BigQuery][bigquery], where they can be analyzed using SQL scripts to generate
|
||||
monthly invoices per registrar.
|
||||
|
||||
### High availablity with horizontal scaling
|
||||
|
||||
Because the registry runs on the Google Cloud Platform stack, it benefits from
|
||||
high availability, automatic fail-over, and horizontal auto-scaling of compute
|
||||
and database resources. This makes it quite flexible for running TLDs of any
|
||||
size.
|
||||
|
||||
### Automated tests
|
||||
|
||||
The registry codebase includes ~400 test classes with ~4,000 total unit and
|
||||
integration tests. This limits regressions, ensures correct system
|
||||
functionality, and allows for easy continued future development and refactoring.
|
||||
|
||||
### DNS
|
||||
|
||||
An interface for DNS operations is provided, along with a sample implementation
|
||||
that uses the [Google Cloud DNS](https://cloud.google.com/dns/) API. A bulk
|
||||
export tool is also provided to export a zone file for an entire TLD in BIND
|
||||
format.
|
||||
|
||||
* [RFC 1034: Domain Names - Concepts and Facilities]
|
||||
(https://www.ietf.org/rfc/rfc1034.txt)
|
||||
* [RFC 1035: Domain Names - Implementation and Specification]
|
||||
(https://www.ietf.org/rfc/rfc1034.txt)
|
||||
|
||||
### Exports
|
||||
|
||||
The registry uses background batch processes to periodically export information
|
||||
from the running system, including billing information, all EPP entities,
|
||||
backups, lists of all registered domain names, registrar contact emails,
|
||||
ICANN-mandated reports, database snapshots, and reserved terms.
|
||||
|
||||
### Metrics and reporting
|
||||
|
||||
The registry records metrics and regularly exports them to BigQuery so that
|
||||
analyses can be run on them using full SQL queries. Metrics include which EPP
|
||||
commands were run and when and by whom, information on failed commands, activity
|
||||
per registrar, and length of each request.
|
||||
|
||||
[BigQuery][bigquery] reporting scripts are provided to generate the required
|
||||
per-TLD monthly [registry reports]
|
||||
(https://www.icann.org/resources/pages/registry-reports) for ICANN.
|
||||
|
||||
### Registrar console
|
||||
|
||||
The registry includes a web-based registrar console that registrars can access
|
||||
in a browser. It provides the ability for registrars to view their billing
|
||||
invoices in Google Drive, contact the registry provider, and modify WHOIS,
|
||||
security (including SSL certificates), and registrar contact settings. Main
|
||||
registry commands such as creating domains, hosts, and contacts must go through
|
||||
EPP and are not provided in the console.
|
||||
|
||||
### Admin tooling
|
||||
|
||||
The registry comes with a fully featured `registry_tool` command-line tool (see
|
||||
`docs/` for full documentation) that allows developers and support personnel of
|
||||
the registry to run a full range of commands, including creating new registrars,
|
||||
running arbitrary EPP commands, inspecting the state of important things in the
|
||||
system, and creating new TLDs.
|
||||
|
||||
### Plug-and-play pricing engines
|
||||
|
||||
The registry has the ability to configure per-TLD pricing engines to
|
||||
programmatically determine the price of domain names on the fly. An
|
||||
implementation is provided that uses the contents of a static list of prices
|
||||
(this being by far the most common type of premium pricing used for TLDs).
|
||||
* **[Extensible Provisioning Protocol
|
||||
(EPP)](https://en.wikipedia.org/wiki/Extensible_Provisioning_Protocol)**: An
|
||||
XML protocol that is the standard format for communication between
|
||||
registrars and registries. It includes operations for registering, renewing,
|
||||
checking, updating, and transferring domain names.
|
||||
* **[DNS](https://en.wikipedia.org/wiki/Domain_Name_System) interface**: The
|
||||
registry provides a pluggable interface that can be implemented to handle
|
||||
different DNS providers. It includes a sample implementation using Google
|
||||
Cloud DNS.
|
||||
* **[WHOIS](https://en.wikipedia.org/wiki/WHOIS)**: A text-based protocol that
|
||||
returns ownership and contact information on registered domain names.
|
||||
* **[Registration Data Access Protocol
|
||||
(RDAP)](https://en.wikipedia.org/wiki/Registration_Data_Access_Protocol)**:
|
||||
A JSON API that returns structured, machine-readable information about
|
||||
domain name ownership. It is essentially a newer version of WHOIS.
|
||||
* **[Registry Data Escrow (RDE)](https://icannwiki.com/Data_Escrow)**: A daily
|
||||
export of all ownership information for a TLD to a third party escrow
|
||||
provider to allow take-over by another registry operator in the event of
|
||||
serious failure. This is required by ICANN for all new gTLDs.
|
||||
* **Premium pricing**: Communicates prices for premium domain names (i.e.
|
||||
those that are highly desirable) and supports configurable premium
|
||||
registration and renewal prices. An extensible interface allows fully
|
||||
programmatic pricing.
|
||||
* **Billing history**: A full history of all billable events is recorded,
|
||||
suitable for ingestion into an invoicing system.
|
||||
* **Registration periods**: Qualified Launch Partner, Sunrise, Landrush, and
|
||||
General Availability periods of the standard gTLD lifecycle are all
|
||||
supported.
|
||||
* **Brand protection for trademark holders (via
|
||||
[TMCH](https://newgtlds.icann.org/en/about/trademark-clearinghouse/faqs))**:
|
||||
Allows rights-holders to protect their brands by blocking registration of
|
||||
domains using their trademark. This is required by ICANN for all new gTLDs.
|
||||
* **Registrar support console**: A self-service web console that registrars
|
||||
can use to manage their accounts in the registry system.
|
||||
* **Reporting**: Support for required external reporting (such as [ICANN
|
||||
monthly registry
|
||||
reports](https://www.icann.org/resources/pages/registry-reports),
|
||||
[CZDS](https://czds.icann.org), Billing and Registration Activity) as well
|
||||
as internal reporting using [BigQuery](https://cloud.google.com/bigquery/).
|
||||
* **Administrative tool**: Performs the full range of administrative tasks
|
||||
needed to manage a running registry system, including creating and
|
||||
configuring new TLDs.
|
||||
|
||||
## Known issues
|
||||
|
||||
There are a few things that the registry cannot currently do, and a few things
|
||||
that are out of scope that it will never do.
|
||||
Here are some additional things you will likely need or want that are not
|
||||
provided out of the box:
|
||||
|
||||
* You will need a DNS system in order to run a fully-fledged registry. If you
|
||||
are planning on using anything other than Google Cloud DNS you will need to
|
||||
provide an implementation.
|
||||
* You will need an invoicing system to convert the internal registry billing
|
||||
events into registrar invoices using whatever accounts receivable setup you
|
||||
already have. A partial implementation is provided that generates generic
|
||||
CSV invoices (see `MakeBillingTablesCommand`), but you will need to
|
||||
integrate it with your payments system.
|
||||
* You will likely need monitoring to continuously monitor the status of the
|
||||
system. Any of a large variety of tools can be used for this, or you can
|
||||
write your own.
|
||||
* You will need a proxy to forward traffic on EPP and WHOIS ports to the HTTPS
|
||||
endpoint on App Engine, as App Engine only allows incoming traffic on
|
||||
HTTP/HTTPS ports. Similarly, App Engine does not yet support IPv6, so your
|
||||
proxy would have to support that as well if you need IPv6 support. Future
|
||||
versions of [App Engine Flexible][flex] should provide these out of the box,
|
||||
but they aren't ready yet.
|
||||
|
||||
[bigquery]: https://cloud.google.com/bigquery/
|
||||
[datastore]: https://cloud.google.com/datastore/docs/concepts/overview
|
||||
[gae]: https://cloud.google.com/appengine/docs/about-the-standard-environment
|
||||
[bazel-install]: http://bazel.io/docs/install.html
|
||||
[epp]: https://en.wikipedia.org/wiki/Extensible_Provisioning_Protocol
|
||||
[flex]: https://cloud.google.com/appengine/docs/flexible/
|
||||
[google-registry]: https://www.registry.google/
|
||||
[gtld]: https://en.wikipedia.org/wiki/Generic_top-level_domain
|
||||
[icann]: https://en.wikipedia.org/wiki/ICANN
|
||||
[paxos]: https://en.wikipedia.org/wiki/Paxos_(computer_science)
|
||||
[postel]: https://en.wikipedia.org/wiki/Jon_Postel
|
||||
[registrar]: https://en.wikipedia.org/wiki/Domain_name_registrar
|
||||
[registry]: https://en.wikipedia.org/wiki/Domain_name_registry
|
||||
[whois]: https://en.wikipedia.org/wiki/WHOIS
|
||||
* A DNS system. An interface for DNS operations is provided so you can write
|
||||
an implementation for your chosen provider, along with a sample
|
||||
implementation that uses [Google Cloud DNS](https://cloud.google.com/dns/).
|
||||
If you are using Google Cloud DNS you may need to understand its
|
||||
capabilities and provide your own
|
||||
multi-[AS](https://en.wikipedia.org/wiki/Autonomous_system_\(Internet\))
|
||||
solution.
|
||||
* An invoicing/payments system in order to charge registrars for domain name
|
||||
registrations and accept payments.
|
||||
* System status and uptime monitoring.
|
||||
* A proxy to forward traffic on EPP and WHOIS ports to App Engine via HTTPS,
|
||||
since App Engine Standard only serves HTTP/S traffic. The proxy must support
|
||||
IPv4 and IPv6 access to comply with ICANN's requirements for gTLDs. There
|
||||
are plans to eliminate the need for a proxy by migrating to [App Engine
|
||||
Flexible](https://cloud.google.com/appengine/docs/flexible/) in the future.
|
||||
|
|
Loading…
Add table
Reference in a new issue