A great deal of data from iWeb is available for download, in the same way that it already is available for COCA: word frequency, collocates, n-grams, full text data, etc.
Click on any of the links below for more information and samples of this data.
95% of the text from the 14 billion words of text, including a listing of all 22+ million web pages used in the corpus