URL canonicalization
Michael 'hacker' Krelin
hacker at klever.net
Wed Sep 14 14:57:46 PDT 2005
On Wed, Sep 14, 2005 at 03:53:28PM -0600, Dan Libby wrote:
> The spec says:
>
> "Note that the user can leave off "http://" and the trailing "/". A
> consumer must canonicalize the URL, following redirects and noting the
> final URL. The final, canonicalized URL is the user's identity URL."
>
> Okay, so case-insensitivity is fairly obvious. I'm already lower-casing
> everything. But what about http vs https? For example, should
I'm afraid you're overdoing it. You should not lowercase anything past
hostname.
> "https://sally.people.com/" be treated as a separate identity from
> "http://sally.people.com/"? Or should the protocol be ignored?
Clearly https://sally.people.com/ can have content different from
http://sally.people.com/ so you can't just ignore it.
Love,
H
> I suppose the issue can be broadened to: the spec is a bit vague about
> canonicalization of identity URLs. Can we get clarification?
>
> thanks,
>
> Dan Libby
>
>
>
More information about the yadis
mailing list