The WARC files for this crawl are currently not publicly accessible.
The seed for Survey 6 was a list of 251million domains.
This crawl was run with no deduplication, at "level 0" (only archive the seed URLs and their embeds and do not follow any outlinks)