This paper compares three unsupervised machine-learning algorithms – local outlier factor (LOF), Isolation Forest (iForest) and one-class support vector machine (OCSVM) – for anomaly detection in a multivariate geochemical dataset from northeastern Iran. This area contains several Au, Cu and Pb–Zn mineral occurrences. The methodology incorporates single-element geochemistry, multivariate data analysis and application of the three unsupervised machine-learning algorithms. Principal component analysis revealed diverse elemental associations for the first seven principal components (PCs): PC1 shows a Co–Cr–Ni–V–Sn association indicating a lithological influence; PC2 shows a Au–Bi–Cu–W association suggesting epithermal Au mineralization; PC3 shows variability in Zn–V–Co–Sb–Cu–Cr; PC4 shows a Au–Cu–Ba–Sr–Ag association indicating Au and polymetallic mineralization; PC5 reflects Zn–Ag–Ni–Pb related to hydrothermal mineralization; and PC6 and PC7 show element associations suggesting epithermal and intrusive-related polymetallic mineralization. OCSVM performed slightly better than LOF and iForest in detecting anomalies associated with known Cu occurrences, and it successfully delineated dispersion from all known Au occurrences. LOF outperformed iForest and OCSVM in identifying all four Pb–Zn occurrences, and the three methods substantially limited the areas of the anomaly class. LOF produced a less cluttered anomaly map than the isolated patterns in the iForest map. LOF was accurate in identifying anomalies associated with Au–Pb mineralization, while iForest detected anomalies associated with Pb–Zn–Cu occurrences and a neighbouring Pb–Zn occurrence. OCSVM performed similarly in the northern and western areas but displayed unique discrepancies in the SE and west by detecting anomalies associated with two Cu occurrences and a Pb–Cu occurrence.
This study also examined the influence of the contamination fraction on the detection of geochemical anomalies, revealing a notable rise in the number of mineral occurrences delineated by anomalies when the contamination fraction increases from 5% to 10%. However, even with a 35% contamination fraction, some Cu occurrences remained outside the anomaly class, indicating that geochemical signals from some mineral occurrences may have been overlooked because of the sampling scheme.
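The role of the contamination fraction can be illustrated with a minimal sketch. It is not the authors' implementation: it covers LOF only, uses made-up two-variable data, and ignores ties in the k-th neighbour distance. The function names (`lof_scores`, `anomaly_flags`, `synthetic_samples`) and the choice of k = 10 are assumptions introduced here for illustration. The sketch computes LOF scores with the standard library and then assigns the highest-scoring fraction of samples to the anomaly class, so raising the fraction can only enlarge that class.

```python
import math
import random


def lof_scores(points, k=10):
    """Local outlier factor per point (ties in the k-th distance are ignored)."""
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]
    # Indices of the k nearest neighbours of each point, excluding itself.
    neigh = [
        sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])[:k]
        for i in range(n)
    ]
    # Distance from each point to its k-th nearest neighbour.
    k_dist = [dist[i][neigh[i][-1]] for i in range(n)]
    # Local reachability density: inverse of the mean reachability distance.
    lrd = [
        k / sum(max(k_dist[j], dist[i][j]) for j in neigh[i])
        for i in range(n)
    ]
    # LOF: mean neighbour density divided by the point's own density.
    return [sum(lrd[j] for j in neigh[i]) / (k * lrd[i]) for i in range(n)]


def anomaly_flags(scores, contamination):
    """Flag the highest-scoring fraction of samples as the anomaly class."""
    n_flag = max(1, round(contamination * len(scores)))
    cutoff = sorted(scores, reverse=True)[n_flag - 1]
    return [s >= cutoff for s in scores]


def synthetic_samples(seed=0):
    """95 background samples plus 5 isolated ones (hypothetical data)."""
    rng = random.Random(seed)
    background = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(95)]
    isolated = [
        (10 * math.cos(2 * math.pi * i / 5), 10 * math.sin(2 * math.pi * i / 5))
        for i in range(5)
    ]
    return background + isolated


samples = synthetic_samples()
scores = lof_scores(samples)
for fraction in (0.05, 0.10):
    flags = anomaly_flags(scores, fraction)
    print(f"contamination={fraction:.2f}: {sum(flags)} samples in the anomaly class")
```

Because the threshold is a rank cut on the scores, every sample flagged at 5% is also flagged at 10%. A sample whose score is close to background stays outside the anomaly class until the fraction becomes large, which is consistent with the abstract's observation that some occurrences remain undetected even at high contamination fractions.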
Research Article
September 13, 2024
Effectiveness of LOF, iForest and OCSVM in detecting anomalies in stream sediment geochemical data
Shahed Shahrestani 1,*; Emmanuel John M. Carranza 2
1 Karaj, Iran
2 Department of Geology, University of the Free State, Bloemfontein, South Africa
* Correspondence: [email protected]
Publisher: Geological Society of London
Received: 11 Mar 2024
Revision Received: 05 Jun 2024
Accepted: 06 Jun 2024
First Online: 13 Jun 2024
Online ISSN: 2041-4943
Print ISSN: 1467-7873
© 2024 The Author(s). Published by The Geological Society of London for GSL and AAG. All rights, including for text and data mining (TDM), artificial intelligence (AI) training, and similar technologies, are reserved. For permissions: https://www.lyellcollection.org/publishing-hub/permissions-policy. Publishing disclaimer: https://www.lyellcollection.org/publishing-hub/publishing-ethics
Geochemistry: Exploration, Environment, Analysis (2024) 24 (3): geochem2024-009.
Citation
Shahed Shahrestani, Emmanuel John M. Carranza; Effectiveness of LOF, iForest and OCSVM in detecting anomalies in stream sediment geochemical data. Geochemistry: Exploration, Environment, Analysis 2024; 24 (3): geochem2024-009. doi: https://doi.org/10.1144/geochem2024-009
Index Terms/Descriptors
- algorithms
- artificial intelligence
- Asia
- detection
- epithermal processes
- fluvial environment
- geochemical anomalies
- gold ores
- heavy mineral deposits
- hydrothermal alteration
- intrusions
- Iran
- metal ores
- metasomatism
- Middle East
- mineral deposits, genesis
- mineral exploration
- mines
- multivariate analysis
- placers
- polymetallic ores
- principal components analysis
- sediments
- statistical analysis
- stream placers
- stream sediments
- northeastern Iran
- machine learning
- Damghan Iran
- support vector machines
- Semnan Iran
- random forest
- Amirabad Iran
- Moalleman Iran
- iForest algorithm
- local outlier factor algorithm
- isolation forest