From Geoscience Australia

Uncover-ML: a machine learning pipeline for geoscience data analysis.

ARCHIVED

Created 13/01/2025

Updated 13/01/2025

The geosciences are a data-rich domain in which Earth materials and processes are analysed at local to global scales. Often, however, we have only discrete measurements at specific locations and a limited understanding of how these features vary across the landscape. Earth system processes are inherently complex, and trans-disciplinary science will likely become increasingly important in finding solutions to future challenges associated with the environment, mineral and petroleum resources, and food security. Machine learning is an important approach for synthesising the increasing complexity and sheer volume of Earth science data, and is now widely used for prediction across many scientific disciplines. In this context, we have built a machine learning pipeline, called Uncover-ML, for supervised and unsupervised learning, prediction and classification. The Uncover-ML pipeline was developed through a partnership between CSIRO and Geoscience Australia, and is largely built around the Python scikit-learn machine learning library. In this paper, we briefly describe the architecture and components of Uncover-ML for feature extraction, data scaling, sample selection, predictive mapping, estimating model performance, model optimisation and estimating model uncertainties. Links to download the source code and information on how to implement the algorithms are also provided.

Citation: Wilford, J., Basak, S., Hassan, R., Moushall, B., McCalman, L., Steinberg, D. and Zhang, F., 2020. Uncover-ML: a machine learning pipeline for geoscience data analysis. In: Czarnota, K., Roach, I., Abbott, S., Haynes, M., Kositcin, N., Ray, A. and Slatter, E. (eds.) Exploring for the Future: Extended Abstracts, Geoscience Australia, Canberra, 1–4.
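Several of the stages named in the abstract (data scaling, estimating model performance, model optimisation and estimating model uncertainties) can be illustrated with plain scikit-learn. The sketch below is not Uncover-ML's own API or configuration. The synthetic data, the choice of a random forest, the parameter grid and every variable name are assumptions made purely for illustration.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV, KFold, cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic stand-in for covariate values sampled at 200 observation sites
# (assumption: real inputs would come from gridded geoscience datasets).
X = rng.normal(size=(200, 5))
y = 2.0 * X[:, 0] - X[:, 1] + rng.normal(scale=0.1, size=200)

# Data scaling followed by a supervised model.
pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("model", RandomForestRegressor(n_estimators=50, random_state=0)),
])

# Model optimisation: a small grid search over one hyperparameter.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
search = GridSearchCV(
    pipeline, {"model__max_depth": [3, None]}, cv=cv, scoring="r2"
)
search.fit(X, y)
best = search.best_estimator_

# Estimating model performance: cross-validated R^2 of the tuned pipeline.
scores = cross_val_score(best, X, y, cv=cv, scoring="r2")

# Predictive mapping with a simple uncertainty proxy: the spread of the
# individual tree predictions at each new location.
X_grid = rng.normal(size=(10, 5))
X_grid_scaled = best.named_steps["scale"].transform(X_grid)
per_tree = np.stack(
    [tree.predict(X_grid_scaled) for tree in best.named_steps["model"].estimators_]
)
prediction = per_tree.mean(axis=0)
spread = per_tree.std(axis=0)

print("mean cross-validated R^2:", scores.mean())
print("first prediction and spread:", prediction[0], spread[0])
```

The per-tree standard deviation is only one rough way to express prediction uncertainty. The abstract does not say which method the pipeline uses.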

Additional Info

Field                Value
Title                Uncover-ML: a machine learning pipeline for geoscience data analysis.
Language             eng
Licence              notspecified
Landing Page         https://devweb.dga.links.com.au/data/dataset/8f4fd27d-a8ea-4d85-801a-d38157b8b285
Contact Point        Geoscience Australia, clientservices@ga.gov.au
Reference Period     08/04/2019
Geospatial Coverage  {"type": "Polygon", "coordinates": [[[112.0, -44.0], [154.0, -44.0], [154.0, -9.0], [112.0, -9.0], [112.0, -44.0]]]}
Data Portal          data.gov.au
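
The Geospatial Coverage value above is a GeoJSON polygon whose coordinates are given as [longitude, latitude] pairs. As a minimal sketch of how the field can be read, the snippet below parses it with the Python standard library and derives a bounding box. The helper names `bounding_box` and `in_bounding_box` are hypothetical and not part of any portal API, and the test coordinates for Canberra are approximate.

```python
import json

# Geospatial Coverage value copied from the metadata record.
coverage = json.loads(
    '{"type": "Polygon", "coordinates": [[[112.0, -44.0], [154.0, -44.0], '
    '[154.0, -9.0], [112.0, -9.0], [112.0, -44.0]]]}'
)


def bounding_box(polygon):
    """Return (min_lon, min_lat, max_lon, max_lat) of the polygon's outer ring."""
    outer_ring = polygon["coordinates"][0]
    lons = [point[0] for point in outer_ring]
    lats = [point[1] for point in outer_ring]
    return min(lons), min(lats), max(lons), max(lats)


def in_bounding_box(bbox, lon, lat):
    """Check whether a point lies inside the bounding box (edges included)."""
    min_lon, min_lat, max_lon, max_lat = bbox
    return min_lon <= lon <= max_lon and min_lat <= lat <= max_lat


bbox = bounding_box(coverage)
print(bbox)  # (112.0, -44.0, 154.0, -9.0)

# Approximate coordinates for Canberra (assumption, for illustration only).
print(in_bounding_box(bbox, 149.13, -35.28))  # True
```

Because the polygon here is an axis-aligned rectangle, the bounding box and the polygon coincide. A general polygon would need a proper point-in-polygon test.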

Data Source

This dataset was originally found on data.gov.au under the title "Uncover-ML: a machine learning pipeline for geoscience data analysis." Please visit the source to access the original metadata of the dataset:
https://devweb.dga.links.com.au/data/dataset/uncover-ml-a-machine-learning-pipeline-for-geoscience-data-analysis
