test.resources.data.urltestdata_host_whatwg.txt Maven / Gradle / Ivy
Go to download
Show more of this group Show more artifacts with this name
Show all versions of galimatias Show documentation
Show all versions of galimatias Show documentation
Fork of the galimatias URL parsing and normalization library.
The newest version!
-- Based on http://trac.webkit.org/browser/trunk/LayoutTests/fast/url/host.html
-- Basic canonicalization, uppercase should be converted to lowercase
http://ExAmPlE.CoM http://other.com/ s:http p:/ h:example.com
-- US: Spaces should fail
http://example\sexample.com
-- WEBKIT: Spaces and some other characters should be escaped.
-- US: This should fail
http://Goo%20\sgoo%7C|.com
-- WEBKIT: Exciting different types of spaces!
-- US: This should fail
http://GOO\u00a0\u3000goo.com
-- Other types of space (no-break, zero-width, zero-width-no-break) are
-- name-prepped away to nothing.
http://GOO\u200b\u2060\ufeffgoo.com s:http p:/ h:googoo.com
-- Ideographic full stop (full-width period for Chinese, etc.) should be
-- treated as a dot.
http://www.foo\u3002bar.com s:http p:/ h:www.foo.bar.com
-- Invalid unicode characters should fail...
http://\ufdd0zyx.com
-- ...This is the same as previous but with with escaped.
http://%ef%b7%90zyx.com
-- Test name prepping, fullwidth input should be converted to ASCII and NOT
-- IDN-ized. This is "Go" in fullwidth UTF-8/UTF-16.
http://\uff27\uff4f.com s:http p:/ h:go.com
-- WEBKIT: Test that fullwidth escaped values are properly name-prepped,
-- WEBKIT: then converted or rejected.
-- WEBKIT: ...%41 in fullwidth = 'A' (also as escaped UTF-8 input)
--
-- WHATWG: URL spec forbids this:
-- WHATWG: https://www.w3.org/Bugs/Public/show_bug.cgi?id=24257
http://\uff05\uff14\uff11.com
http://%ef%bc%85%ef%bc%94%ef%bc%91.com
-- ...%00 in fullwidth should fail (also as escaped UTF-8 input)
-- TODO
-- http://\uff05\uff10\uff10.com
-- http://%ef%bc%85%ef%bc%90%ef%bc%90.com
-- Basic IDN support, UTF-8 and UTF-16 input should be converted to IDN
http://\u4f60\u597d\u4f60\u597d s:http p:/ h:xn--6qqa088eba
-- Invalid escaped characters should fail and the percents should be
-- escaped.
-- TODO: https://www.w3.org/Bugs/Public/show_bug.cgi?id=24191
-- http://%zz%66%a.com
-- If we get an invalid character that has been escaped.
-- TODO
-- http://%25
-- http://hello%00
-- Escaped numbers should be treated like IP addresses if they are.
-- TODO: Right now galimatias treats IPv4 addresses as domains in this case
-- Check this case later on
-- http://%30%78%63%30%2e%30%32%35%30.01 s:http p:/ h:127.0.0.1
-- http://%30%78%63%30%2e%30%32%35%30.01%2e
-- Invalid escaping should trigger the regular host error handling.
-- TODO
-- http://%3g%78%63%30%2e%30%32%35%30%2E.01
-- Something that isn't exactly an IP should get treated as a host and
-- spaces escaped.
http://192.168.0.1\shello
-- Fullwidth and escaped UTF-8 fullwidth should still be treated as IP.
-- These are "0Xc0.0250.01" in fullwidth.
-- TODO
-- http://\uff10\uff38\uff43\uff10\uff0e\uff10\uff12\uff15\uff10\uff0e\uff10\uff11 s:http p:/ h:192.168.0.1
-- Broken IP addresses.
-- TODO
-- http://192.168.0.257
http://[google.com]
-- Cyrillic letter followed buy ( should return punicode for ( escaped before punicode string was created. I.e.
-- if ( is escaped after punicode is created we would get xn--%28-8tb (incorrect).
-- TODO
-- http://\u0442(
-- IDN misc
http://ジェーピーニック.jp s:http h:xn--hckqz9bzb1cyrb.jp p:/
http://ß.com/ s:http h:ss.com p:/
http://☃.com/ s:http h:xn--n3h.com p:/
http://☃☃☃.com/ s:http h:xn--n3haa.com p:/
http://\uD83D\uDC35.com/ s:http h:xn--9o8h.com p:/
-- Percent-encoded ß
http://%C3%9F.com/ s:http h:ss.com p:/
-- GALIMATIAS: DNS length checks
http://000000000000000000000000000000000000000000000000000000000000000 s:http p:/ h:000000000000000000000000000000000000000000000000000000000000000
http://0000000000000000000000000000000000000000000000000000000000000000
http://0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0 s:http p:/ h:0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0
http://0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0
http://000000000000000000000000000000000000000000000000000000000000000.000000000000000000000000000000000000000000000000000000000000000.000000000000000000000000000000000000000000000000000000000000000.0000000000000000000000000000000000000000000000000000000000000 s:http p:/ h:000000000000000000000000000000000000000000000000000000000000000.000000000000000000000000000000000000000000000000000000000000000.000000000000000000000000000000000000000000000000000000000000000.0000000000000000000000000000000000000000000000000000000000000
http://000000000000000000000000000000000000000000000000000000000000000.000000000000000000000000000000000000000000000000000000000000000.000000000000000000000000000000000000000000000000000000000000000.000000000000000000000000000000000000000000000000000000000000000
http://0..0