Third report (February 2019) - File Index
Links to 1GB WARC files and JSON metadata indexes of the URLs inside each WARC file
Source
This data file or API can be downloaded from:
https://datagovau.s3.ap-southeast-2.amazonaws.com/cd574697-6734-4443-b350-9cf9eae427a2/99f43557-1d3d-40e7-bc0c-665a4275d625/webcrawl03/index.htmlResources
Additional Information
Field | Value |
---|---|
Data last updated | unknown |
Metadata last updated | November 13, 2024 |
Format | HTML |
License | Other |
Datastore active | False |
Datastore contains all records of source file | False |
Has views | True |
Id | 2eaf0328-5e4a-4eb8-894a-d3bc30457dc0 |
Mimetype | text/html |
Package id | 99f43557-1d3d-40e7-bc0c-665a4275d625 |
Position | 4 |
State | active |
Zip extract | False |
Size | Unknown |
ID | 2eaf0328-5e4a-4eb8-894a-d3bc30457dc0 |
Harvested | False |
Package ID | 99f43557-1d3d-40e7-bc0c-665a4275d625 |
State | Archived |