`http://./` is a valid url #146

anonrig · 2023-01-28T02:05:12Z

`http://./` is accepted as valid input by both Safari and Chrome, but we reject it as invalid.

anonrig · 2023-01-28T03:26:28Z

I'm reopening this because we were skipping this URL in the input, but we shouldn't have.

lemire · 2023-01-28T04:11:30Z

Why? The standard is clear on this. Labels must be between 1 and 63 bytes. If we misread the standard, can you quote the relevant section?

anonrig · 2023-01-28T04:15:41Z

I haven't looked into the spec for this, but there is a section in WPT labeled "domains with empty labels": https://github.com/web-platform-tests/wpt/blob/master/url/resources/urltestdata.json#L3889

lemire · 2023-01-28T04:20:55Z

RFC 1034: Internally, programs that manipulate domain names should represent them
as sequences of labels, where each label is a length octet followed by
an octet string. Because all domain names end at the root, which has a
null string for a label, these internal representations can use a length
byte of zero to terminate a domain name.
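
To illustrate the representation described in that passage, here is a minimal sketch that turns a textual name into a sequence of length-prefixed labels ending in a zero byte. The function name `to_wire_format` is hypothetical and not part of any library; rejecting empty interior labels and labels over 63 octets is an assumption based on the limits quoted in this thread.

```cpp
#include <cstddef>
#include <cstdint>
#include <optional>
#include <string_view>
#include <vector>

// Hypothetical helper (an illustration only): encodes a textual domain name
// as length-prefixed labels terminated by the zero-length root label.
// Returns std::nullopt if any non-root label is empty or exceeds 63 octets.
std::optional<std::vector<std::uint8_t>> to_wire_format(std::string_view name) {
  std::vector<std::uint8_t> out;
  // A single trailing dot denotes the root; drop it before splitting.
  if (!name.empty() && name.back() == '.') {
    name.remove_suffix(1);
  }
  // Assumption: an empty name (or ".") is the root itself, a lone zero byte.
  if (name.empty()) {
    out.push_back(0);
    return out;
  }
  for (;;) {
    const std::size_t dot = name.find('.');
    const std::string_view label = name.substr(0, dot);
    if (label.empty() || label.size() > 63) {
      return std::nullopt;  // empty interior label or oversized label
    }
    out.push_back(static_cast<std::uint8_t>(label.size()));
    out.insert(out.end(), label.begin(), label.end());
    if (dot == std::string_view::npos) {
      break;
    }
    name.remove_prefix(dot + 1);
  }
  out.push_back(0);  // the root label terminates the name
  return out;
}
```

Under these assumptions, the host `.` from `http://./` encodes to the root alone, while a name such as `a..b` is rejected because of its empty interior label.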

lemire · 2023-01-28T04:22:19Z

One label is reserved, and that is
the null (i.e., zero length) label used for the root.

lemire · 2023-01-28T04:26:09Z

A label may contain zero to 63 characters. The null label, of length zero, is reserved for the root zone. https://en.m.wikipedia.org/wiki/Domain_Name_System

anonrig · 2023-01-28T04:26:11Z

Can you open an issue in the web-platform-tests repository? Even if you're right, we can't remove this test from the Node repository without changing the WPT.

lemire · 2023-01-28T04:27:55Z

"The hierarchy of domains descends from the right to the left label in the name; each label to the left specifies a subdivision, or subdomain of the domain to the right. For example: the label example specifies a node example.com as a subdomain of the com domain, and www is a label to create www.example.com, a subdomain of example.com. Each label may contain from 1 to 63 octets. The empty label is reserved for the root node and when fully qualified is expressed as the empty label terminated by a dot. The full domain name may not exceed a total length of 253 ASCII characters in its textual representation." https://en.m.wikipedia.org/wiki/Domain_name

lemire · 2023-01-28T04:58:21Z

Seems related to this: servo/rust-url#554

So something can be an invalid URL, but still pass through the algorithm.

miguelteixeiraa · 2023-01-28T14:25:28Z

About the link that Yagiz provided (WPT tests):

None of the tests/examples in the section mentioned (domains with empty labels) are FQDNs (fully qualified domain names).
To be considered an FQDN, the domain name must include a Second-Level Domain (SLD) and a Top-Level Domain (TLD).

An example of an FQDN is www.example.com, where "www" is the hostname (not required), "example" is the second-level domain (SLD), and ".com" is the top-level domain (TLD).

miguelteixeiraa · 2023-01-28T14:26:29Z

I'm looking for references that state these limits/sizes/rules only apply to FQDNs.

miguelteixeiraa · 2023-01-28T14:43:03Z

> I'm looking for references that state these limits/sizes/rules only apply to FQDNs.

I couldn't find anything, but we could think about it 🤔: apply the rules only when there is the basic structure to be an FQDN (at least 2 non-zero labels).

lemire · 2023-01-28T16:04:26Z

@miguelteixeiraa I think we should add a method to the URL struct, such as `bool is_fully_qualified_domain_name() const`, that would check that the host is indeed a fully qualified domain name, possibly adding other checks.

> I'm looking for references that state these limits/sizes/rules only apply to FQDNs.

I don't think they do.

> apply the rules only when there is the basic structure to be an FQDN (at least 2 non-zero labels)

My proposal is rather to parse the URL successfully, irrespective of label lengths and so forth, but to have a method like `is_valid_domain() const` that does additional checks.

lemire · 2023-01-28T16:08:27Z

@anonrig I don't think that http://./ is a valid URL. To prove me wrong, please register it. I don't think you can.

It is a valid URL string as per https://url.spec.whatwg.org/, but that specification mentions RFC 1034 only once, in passing, and does not appear to try to abide by it at all.

lemire · 2023-01-28T19:02:09Z

To be clear, I still think it is fine to parse it…

lemire · 2023-01-31T01:32:46Z

@anonrig

Is this resolved?

anonrig · 2023-01-31T01:38:04Z

We can close this for now 👍

anonrig added the bug label Jan 28, 2023

anonrig mentioned this issue Jan 28, 2023

test: add error case #147

Merged

anonrig closed this as not planned Jan 28, 2023

anonrig reopened this Jan 28, 2023

anonrig changed the title ~~http://./ is invalid~~ http://./ is a valid url Jan 28, 2023

anonrig assigned lemire Jan 28, 2023

anonrig added the specification issue label Jan 28, 2023

anonrig closed this as completed Jan 31, 2023
