A deep autoencoder network connected to geographical random forest for spatially aware geochemical anomaly detection
Machine learning (ML) and deep learning (DL) techniques have recently shown encouraging performance in recognizing metal-vectoring geochemical anomalies within complex Earth systems. However, the generalization of these techniques to detect subtle anomalies may be precluded due to overlooking non-st...
Uloženo v:
| Vydáno v: | Computers & geosciences Ročník 190; s. 105657 |
|---|---|
| Hlavní autoři: | , , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
Elsevier Ltd
01.08.2024
|
| Témata: | |
| ISSN: | 0098-3004 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Shrnutí: | Machine learning (ML) and deep learning (DL) techniques have recently shown encouraging performance in recognizing metal-vectoring geochemical anomalies within complex Earth systems. However, the generalization of these techniques to detect subtle anomalies may be precluded due to overlooking non-stationary spatial structures and intra-pattern local dependencies contained in geochemical exploration data. Motivated by this, we conceptualize in this paper an innovative algorithm connecting a DL architecture to a spatial ML processor to account for local neighborhood information and spatial non-stationarities in support of spatially aware anomaly detection. A deep autoencoder network (DAN) is trained to abstract deep feature codings (DFCs) of multi-element input data. The encoded DFCs represent the typical performance of a nonlinear Earth system, i.e., multi-element signatures of geochemical background populations developed by different geo-processes. A local version of the random forest algorithm, geographical random forest (GRF), is then connected to the input and code layers of the DAN processor to establish nonlinear and spatially aware regressions between original geochemical signals (dependent variables) and DFCs (independent variables). After contributions of the latter on the former are determined, residuals of GRF regressions are quantified and interpreted as spatially aware anomaly scores related to mineralization. The proposed algorithm (i.e., DAN‒GRF) is implemented in the R language environment and examined in a case study with stream sediment geochemical data pertaining to the Takht-e-Soleyman district, Iran. The high-scored anomalies mapped by DAN‒GRF, compared to those by the stand-alone DAN technique, indicated a stronger spatial correlation with locations of known metal occurrences, which was statistically confirmed by success-rate curves, Student's t‒statistic method, and prediction-area plots. The findings suggested that the proposed algorithm has an enhanced capability to recognize subtle multi-element geochemical anomalies and extract reliable insights into metal exploration targeting.
[Display omitted]
•A hybrid algorithm to recognize metal-vectoring geochemical anomaly patterns.•A deep autoencoder network to learn deep feature codings of multi-element input data.•A geographical random forest regression to quantify spatially aware anomaly scores.•A comparative experiment on a case study from Takht-e-Soleyman district, Iran.•Success-rate curves, Student's t‒statistic, and prediction-area plots for performance evaluation. |
|---|---|
| Bibliografie: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| ISSN: | 0098-3004 |
| DOI: | 10.1016/j.cageo.2024.105657 |