The Viuva Negra crawler

dc.contributor.author: Gomes, Daniel [por]
dc.contributor.author: Silva, Mário J. [por]
dc.date.accessioned: 2009-02-10T13:11:59Z [por]
dc.date.accessioned: 2014-11-14T16:23:51Z
dc.date.available: 2009-02-10T13:11:59Z [por]
dc.date.available: 2014-11-14T16:23:51Z
dc.date.issued: 2006-11 [por]
dc.description.abstract: This report discusses architectural aspects of web crawlers and details the design, implementation and evaluation of the Viuva Negra (VN) crawler. VN has been used for 4 years, feeding a search engine and an archive of the Portuguese web. In our experiments it crawled over 2 million documents per day, corresponding to 63 GB of data. We describe situations found on the web that are hazardous to crawling and the solutions adopted to mitigate their effects. The gathered information was integrated in a web warehouse that provides support for its automatic processing by text mining applications. [por]
dc.identifier.uri: http://hdl.handle.net/10451/14117 [por]
dc.identifier.uri: http://repositorio.ul.pt/handle/10455/3014 [por]
dc.language.iso: por [por]
dc.publisher: Department of Informatics, University of Lisbon [por]
dc.relation.ispartofseries: di-fcul-tr-06-21 [por]
dc.subject: Crawler design [por]
dc.subject: tumba! [por]
dc.subject: web partitioning, experiments [por]
dc.subject: harvesting [por]
dc.subject: Tomba [por]
dc.title: The Viuva Negra crawler [por]
dc.type: report
dspace.entity.type: Publication
rcaap.rights: openAccess [por]
rcaap.type: report [por]

Files

Name: 06-21.pdf
Size: 367.78 KB
Format: Adobe Portable Document Format