Logo do repositório
 
A carregar...
Miniatura
Publicação

On URL and content persistence

Utilize este identificador para referenciar este registo.
Nome:Descrição:Tamanho:Formato: 
05-21.pdf207.37 KBAdobe PDF Ver/Abrir

Orientador(es)

Resumo(s)

This report presents a study of URL and content persistence among 51 million pages from a national web harvested 8 times over almost 3 years. This study differs from previous ones because it describes the evolution of a large set of web pages for several years, studying in depth the characteristics of persistent data. We found that the persistence of URLs and contents follows a logarithmic distribution. We characterized persistent URLs and contents, and identified reasons for URL death. We found that lasting contents tend to be referenced by different URLs during their lifetime. On the other hand, half of the contents referenced by persistent URLs did not change

Descrição

Palavras-chave

URL persistence content persistence tomba

Contexto Educativo

Citação

Projetos de investigação

Unidades organizacionais

Fascículo

Editora

Department of Informatics, University of Lisbon

Licença CC