Second report (November 2018) - File Index
Links to 1GB WARC files and JSON metadata indexes of the URLs inside each WARC file
Source
This data file or API can be downloaded from:
https://datagovau.s3.ap-southeast-2.amazonaws.com/cd574697-6734-4443-b350-9cf9eae427a2/99f43557-1d3d-40e7-bc0c-665a4275d625/webcrawl02/index.htmlResources
Additional Information
Field | Value |
---|---|
Data last updated | unknown |
Metadata last updated | November 13, 2024 |
Format | HTML |
License | Other |
Datastore active | False |
Datastore contains all records of source file | False |
Has views | True |
Id | 25c03d81-6973-4839-a4fa-8f29a10a3710 |
Mimetype | text/html |
Package id | 99f43557-1d3d-40e7-bc0c-665a4275d625 |
Position | 3 |
State | active |
Zip extract | False |
Size | Unknown |
ID | 25c03d81-6973-4839-a4fa-8f29a10a3710 |
Harvested | False |
Package ID | 99f43557-1d3d-40e7-bc0c-665a4275d625 |
State | Archived |