Skip to content
/ fqdn Public

RFC-compliant FQDN validation and manipulation for Python.

License

Notifications You must be signed in to change notification settings

ypcrts/fqdn

Repository files navigation

Python FQDN Fully-Qualified Domain Names

License Status Build Status Coverage Status Latest PyPI Version Downloads Monthly Downloads Python Versions

This package validates Fully Qualified Domain Names (FQDNs) conforming to the Internet Engineering Task Force specification [1]. The design intent is to validate that a string would be traditionally acceptable as a public Internet hostname to RFC-conforming software, which is a strict subset of the logic in modern web browsers like Mozilla Firefox and Chromium that determines whether make a DNS lookup [2]. Configuration options can relax constraints so that short hostnames without periods or others with underscores will be valid. These relaxations are closer to how modern web browsers work [3] .

>>> from fqdn import FQDN
>>> domain = 'bbc.co.uk'
>>> bbc_fqdn = FQDN(domain)
>>> bbc_fqdn.is_valid
True
>>> bbc_fqdn.absolute
'bbc.co.uk.'
>>> bbc_fqdn.relative
'bbc.co.uk'

Equality checks are implemented case insensitive conforming to to the IETF specification [#equality].

>>> FQDN('BBC.CO.UK.') == FQDN('BbC.Co.uK')
True
>>> hash(FQDN('BBC.CO.UK.')) == hash(FQDN('BbC.Co.uK'))
True
[1]See IETF Specification.
[2]Certificate Authorities like Let's Encrypt run a narrower set of string validation logic to determine validity for issuance. This package is not intended to achieve functional parity with CA issuance, because they may have proprietary or custom login. Boulder's code is starkly different from Chromium's, as outlined in Issue #14.
[3]See Issue #14.

Standards Conformance

In the default configuration, this package adds only one additional constraint to the IETF specification, requiring a minimum of two labels, separated by periods. This extra restriction can be disabled. It is enabled by default to prevent breaking backwards compatibility. Review the tests for examples of the impact of this.

IETF Specification

The IETF specification restricts domain names to alphanumeric ASCII characters and hyphens as described below.

RFC 1123: Requirements for Internet Hosts - Application and Support, October 1989

This RFC is an official specification for the Internet community. It incorporates by reference, amends, corrects, and supplements the primary protocol standards documents relating to hosts.

2.1 Host Names and Numbers

The syntax of a legal Internet host name was specified in RFC-952 [DNS:4]. One aspect of host name syntax is hereby changed: the restriction on the first character is relaxed to allow either a letter or a digit. Host software MUST support this more liberal syntax.

Host software MUST handle host names of up to 63 characters and SHOULD handle host names of up to 255 characters.

Whenever a user inputs the identity of an Internet host, it SHOULD be possible to enter either (1) a host domain name or (2) an IP address in dotted-decimal ("#.#.#.#") form. The host SHOULD check the string syntactically for a dotted-decimal number before looking it up in the Domain Name System.

RFC 952: DoD Internet host table specification, October 1985

This RFC is the official specification of the format of the Internet Host Table.
<hname> ::= <name>*["."<name>]
<name>  ::= <let>[*[<let-or-digit-or-hyphen>]<let-or-digit>]

Commentary

RFC-1034: Domain Name Concepts and Facilities, November 1987

  • Section 3.5 specifies a "preferred name syntax", which is non-compulsory.

    3.5. Preferred name syntax

    The DNS specifications attempt to be as general as possible in the rules for constructing domain names. The idea is that the name of any existing object can be expressed as a domain name with minimal changes. However, when assigning a domain name for an object, the prudent user will select a name which satisfies both the rules of the domain system and any existing rules for the object, whether these rules are published or implied by existing programs.

    For example, when naming a mail domain, the user should satisfy both the rules of this memo and those in RFC-822. When creating a new host name, the old rules for HOSTS.TXT should be followed. This avoids problems when old software is converted to use domain names.

RFC-1035: Domain Names - Implementation and Specification, November 1987

  • Section 2.3.1 repeats the "preferred name syntax" proposal from RFC-1034.

RFC-2181: Clarification to the DNS Specification, July 1997

  • Section 11 comments that RFC-1035 does not restrict domain names to the preferred name syntax set out in it. Instead Internet hostnames are restricted more or less by a combination of tradition and RFC-2181, where this package finds itself.

RFC-3696: Application Techniques for Checking and Transformation of Names, February 2004

  • This memo provides fascinating commentary of the history of string validation for domain names.

Licenses

License Status Shield