URL Normalization for De-Duplication of Web Pages

DOI: 10.1145/1645953.1646283