Mirror of https://github.com/google/nomulus.git, synced 2025-04-29 19:47:51 +02:00
Update IDN tables with latest approved by ICANN (#1995)
This also adds README files to explain the two different IDN table locations (which have different purposes). See http://b/278565478 for more information.
parent 4434d807ad
commit ad7d6f4873
4 changed files with 5657 additions and 6624 deletions
File diff suppressed because it is too large
@@ -1,21 +1,23 @@
# Registry: Charleston Road Registry Inc.
#
# Script: Latn
#
# Version: 1.0
#
# Effective Date: 04-12-2012
#
# Address: 1600 Amphitheatre Parkway Mountain View, CA 94043, USA
#
# Version: 2.0
# Effective Date: 2023-04-04
# URL: https://www.iana.org/domains/idn-tables/tables/google_latn_2.0.txt
# Policy: https://www.registry.google/about/policies/domainabuse/
# Contact Name: CRR Tech
# Email address: crr-tech@google.com
# Telephone: +1 (650) 253-0000
#
# Website: www.charlestonroadregistry.com
# Code points requiring context rules
#
# Code point Description of rule/Reference
#
# U+002D Label must neither start nor end with U+002D. Label
# HYPHEN-MINUS must not have U+002D in both third and fourth
# position. RFC 5891 (sec 4.2.3.1)
#
# Notes: This table describes codepoints allowed for the Latin script.

U+002D # HYPHEN-MINUS
#
U+0030 # DIGIT ZERO
U+0031 # DIGIT ONE
U+0032 # DIGIT TWO
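The context rule for U+002D stated in the table header can be illustrated with a short check. This is a minimal sketch, not code from the Nomulus repository: the function name `hyphen_rule_ok` is hypothetical, the check is assumed to run on the Unicode form of a single label, and rejecting an empty label is an added assumption.

```python
HYPHEN = "\u002d"  # HYPHEN-MINUS


def hyphen_rule_ok(label: str) -> bool:
    """Return True if `label` satisfies the U+002D context rule in the table header.

    Hypothetical helper for illustration only.
    """
    # Assumption: an empty label is treated as invalid.
    if not label:
        return False
    # The label must neither start nor end with U+002D.
    if label.startswith(HYPHEN) or label.endswith(HYPHEN):
        return False
    # The label must not have U+002D in both the third and fourth position.
    if len(label) >= 4 and label[2] == HYPHEN and label[3] == HYPHEN:
        return False
    return True


if __name__ == "__main__":
    for candidate in ["example", "-abc", "abc-", "ab--cd", "a-b-c"]:
        print(candidate, hyphen_rule_ok(candidate))
```

Note that an ASCII-encoded label beginning with `xn--` would fail this check by design, which is why the sketch assumes the Unicode form of the label as input.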
@@ -26,12 +28,7 @@ U+0036 # DIGIT SIX
U+0037 # DIGIT SEVEN
U+0038 # DIGIT EIGHT
U+0039 # DIGIT NINE
#
# The following code points are listed according to the
# European Ordering Rules (ENV 13710).
#
U+0061 # LATIN SMALL LETTER A
U+00E1 # LATIN SMALL LETTER A WITH ACUTE
U+00E0 # LATIN SMALL LETTER A WITH GRAVE
U+0103 # LATIN SMALL LETTER A WITH BREVE
U+00E2 # LATIN SMALL LETTER A WITH CIRCUMFLEX
@@ -39,14 +36,11 @@ U+00E5 # LATIN SMALL LETTER A WITH RING ABOVE
U+00E4 # LATIN SMALL LETTER A WITH DIAERESIS
U+00E3 # LATIN SMALL LETTER A WITH TILDE
U+0105 # LATIN SMALL LETTER A WITH OGONEK
U+0101 # LATIN SMALL LETTER A WITH MACRON
U+01CE # LATIN SMALL LETTER A WITH CARON
U+00E6 # LATIN SMALL LETTER AE
U+0062 # LATIN SMALL LETTER B
U+0063 # LATIN SMALL LETTER C
U+0107 # LATIN SMALL LETTER C WITH ACUTE
U+010D # LATIN SMALL LETTER C WITH CARON
U+010B # LATIN SMALL LETTER C WITH DOT ABOVE
U+00E7 # LATIN SMALL LETTER C WITH CEDILLA
U+0064 # LATIN SMALL LETTER D
U+010F # LATIN SMALL LETTER D WITH CARON
@@ -65,21 +59,14 @@ U+0259 # LATIN SMALL LETTER SCHWA
U+0066 # LATIN SMALL LETTER F
U+0067 # LATIN SMALL LETTER G
U+011F # LATIN SMALL LETTER G WITH BREVE
U+01E7 # LATIN SMALL LETTER G WITH CARON
U+0121 # LATIN SMALL LETTER G WITH DOT ABOVE
U+0123 # LATIN SMALL LETTER G WITH CEDILLA
U+01E5 # LATIN SMALL LETTER G WITH STROKE
U+0068 # LATIN SMALL LETTER H
U+0127 # LATIN SMALL LETTER H WITH STROKE
U+0069 # LATIN SMALL LETTER I
U+0131 # LATIN SMALL LETTER DOTLESS I
U+00ED # LATIN SMALL LETTER I WITH ACUTE
U+00EC # LATIN SMALL LETTER I WITH GRAVE
U+00EE # LATIN SMALL LETTER I WITH CIRCUMFLEX
U+00EF # LATIN SMALL LETTER I WITH DIAERESIS
U+012F # LATIN SMALL LETTER I WITH OGONEK
U+012B # LATIN SMALL LETTER I WITH MACRON
U+01D0 # LATIN SMALL LETTER I WITH CARON
U+006A # LATIN SMALL LETTER J
U+006B # LATIN SMALL LETTER K
U+01E9 # LATIN SMALL LETTER K WITH CARON
@@ -91,20 +78,15 @@ U+013C # LATIN SMALL LETTER L WITH CEDILLA
U+0142 # LATIN SMALL LETTER L WITH STROKE
U+006D # LATIN SMALL LETTER M
U+006E # LATIN SMALL LETTER N
U+0144 # LATIN SMALL LETTER N WITH ACUTE
U+0148 # LATIN SMALL LETTER N WITH CARON
U+00F1 # LATIN SMALL LETTER N WITH TILDE
U+0146 # LATIN SMALL LETTER N WITH CEDILLA
U+014B # LATIN SMALL LETTER ENG
U+006F # LATIN SMALL LETTER O
U+00F3 # LATIN SMALL LETTER O WITH ACUTE
U+00F2 # LATIN SMALL LETTER O WITH GRAVE
U+00F4 # LATIN SMALL LETTER O WITH CIRCUMFLEX
U+00F6 # LATIN SMALL LETTER O WITH DIAERESIS
U+0151 # LATIN SMALL LETTER O WITH DOUBLE ACUTE
U+00F5 # LATIN SMALL LETTER O WITH TILDE
U+014D # LATIN SMALL LETTER O WITH MACRON
U+01D2 # LATIN SMALL LETTER O WITH CARON
U+00F8 # LATIN SMALL LETTER O WITH STROKE
U+0153 # LATIN SMALL LIGATURE OE
U+0070 # LATIN SMALL LETTER P
@@ -112,41 +94,31 @@ U+0071 # LATIN SMALL LETTER Q
U+0072 # LATIN SMALL LETTER R
U+0155 # LATIN SMALL LETTER R WITH ACUTE
U+0159 # LATIN SMALL LETTER R WITH CARON
U+0157 # LATIN SMALL LETTER R WITH CEDILLA
U+0073 # LATIN SMALL LETTER S
U+015B # LATIN SMALL LETTER S WITH ACUTE
U+0161 # LATIN SMALL LETTER S WITH CARON
U+015F # LATIN SMALL LETTER S WITH CEDILLA
U+0074 # LATIN SMALL LETTER T
U+0165 # LATIN SMALL LETTER T WITH CARON
U+0163 # LATIN SMALL LETTER T WITH CEDILLA
U+0167 # LATIN SMALL LETTER T WITH STROKE
U+0075 # LATIN SMALL LETTER U
U+00FA # LATIN SMALL LETTER U WITH ACUTE
U+00F9 # LATIN SMALL LETTER U WITH GRAVE
U+00FB # LATIN SMALL LETTER U WITH CIRCUMFLEX
U+016F # LATIN SMALL LETTER U WITH RING ABOVE
U+00FC # LATIN SMALL LETTER U WITH DIAERESIS
U+0171 # LATIN SMALL LETTER U WITH DOUBLE ACUTE
U+0173 # LATIN SMALL LETTER U WITH OGONEK
U+016B # LATIN SMALL LETTER U WITH MACRON
U+01D4 # LATIN SMALL LETTER U WITH CARON
U+0076 # LATIN SMALL LETTER V
U+0077 # LATIN SMALL LETTER W
U+1E83 # LATIN SMALL LETTER W WITH ACUTE
U+1E81 # LATIN SMALL LETTER W WITH GRAVE
U+0175 # LATIN SMALL LETTER W WITH CIRCUMFLEX
U+1E85 # LATIN SMALL LETTER W WITH DIAERESIS
U+0078 # LATIN SMALL LETTER X
U+0079 # LATIN SMALL LETTER Y
U+00FD # LATIN SMALL LETTER Y WITH ACUTE
U+1EF3 # LATIN SMALL LETTER Y WITH GRAVE
U+0177 # LATIN SMALL LETTER Y WITH CIRCUMFLEX
U+00FF # LATIN SMALL LETTER Y WITH DIAERESIS
U+007A # LATIN SMALL LETTER Z
U+017A # LATIN SMALL LETTER Z WITH ACUTE
U+017E # LATIN SMALL LETTER Z WITH CARON
U+017C # LATIN SMALL LETTER Z WITH DOT ABOVE
U+0292 # LATIN SMALL LETTER EZH
U+01EF # LATIN SMALL LETTER EZH WITH CARON
U+00FE # LATIN SMALL LETTER THORN
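The table layout shown in this diff (comment lines starting with `#`, and entries of the form `U+XXXX # NAME`) lends itself to a simple reader. The following is a rough sketch under stated assumptions, not the parser used by Nomulus: `parse_idn_table`, `label_in_table`, and `SAMPLE` are hypothetical names, and the sample text is a small excerpt assembled from lines in the diff above.

```python
import re
from typing import Set

# Matches the code point at the start of an entry such as
# "U+00E1 # LATIN SMALL LETTER A WITH ACUTE".
ENTRY = re.compile(r"^U\+([0-9A-Fa-f]{4,6})\b")


def parse_idn_table(text: str) -> Set[int]:
    """Collect the code points listed in an IDN table (hypothetical helper)."""
    allowed: Set[int] = set()
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines and header/comment lines.
        if not line or line.startswith("#"):
            continue
        match = ENTRY.match(line)
        if match:
            allowed.add(int(match.group(1), 16))
    return allowed


def label_in_table(label: str, allowed: Set[int]) -> bool:
    """Return True if every character of `label` is a listed code point."""
    return all(ord(ch) in allowed for ch in label)


# Small excerpt built from lines in the diff above, for illustration only.
SAMPLE = """\
# Script: Latn
U+002D # HYPHEN-MINUS
#
U+0061 # LATIN SMALL LETTER A
U+00E1 # LATIN SMALL LETTER A WITH ACUTE
U+0062 # LATIN SMALL LETTER B
"""

if __name__ == "__main__":
    table = parse_idn_table(SAMPLE)
    print(sorted(hex(cp) for cp in table))
    print(label_in_table("\u00e1b-a", table))
```

Membership in the table is only one part of validating a label; context rules such as the U+002D restriction in the header would need a separate check.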
7
core/src/main/java/google/registry/idn/README.md
Normal file
@@ -0,0 +1,7 @@
# IDN tables

This directory contains the most recent version of our approved IDN tables. They
should match what is published (or will be published soon) by IANA. In the case
of IDN tables that have had multiple revisions over time, this directory should
contain only the latest version, not previous ones, even when said previous ones
might still be in use by previously launched TLDs.
@@ -0,0 +1,6 @@
# IDN tables

This directory contains the IDN tables that are actually in use by our running
system (and the Java code to enable such). This will include older versions of
IDN tables that are no longer published by IANA so long as those tables are
still in use.