Fourth report (May 2019) - File Index
Links to 1GB WARC files and JSON metadata indexes of the URLs inside each WARC file
Source
This data file or API can be downloaded from:
https://datagovau.s3.ap-southeast-2.amazonaws.com/cd574697-6734-4443-b350-9cf9eae427a2/99f43557-1d3d-40e7-bc0c-665a4275d625/webcrawl04/index.htmlResources
Additional Information
Field | Value |
---|---|
Data last updated | unknown |
Metadata last updated | November 13, 2024 |
Format | HTML |
License | Other |
Datastore active | False |
Datastore contains all records of source file | False |
Has views | True |
Id | e4d0d064-860b-4584-8ca5-27b5cacc8593 |
Mimetype | text/html |
Package id | 99f43557-1d3d-40e7-bc0c-665a4275d625 |
Position | 5 |
State | active |
Zip extract | False |
Size | Unknown |
ID | e4d0d064-860b-4584-8ca5-27b5cacc8593 |
Harvested | False |
Package ID | 99f43557-1d3d-40e7-bc0c-665a4275d625 |
State | Archived |