Martins, BrunoSilva, Mário J.2009-02-102014-11-142009-02-102014-11-142004-05http://hdl.handle.net/10451/14209http://repositorio.ul.pt/handle/10455/2914This report presents a statistical study of WPT-03, a text corpus built from the pages of the `Portuguese Web' collected in the repository of the tumba! search engine. We give a statistical analysis of the textual contents available in the Portuguese Web, including size distributions, the language of the pages, and the terms they containporA Statistical Study of the WPT-03 Corpusreport