The article presents a practical approach to collecting and preprocessing geological and geophysical spatial data for use in machine learning models in geophysical applications. According to established principles of effort estimation in data analysis, confirmed by surveys among specialists, this stage is the most time- and resource-consuming one and accounts for up to 80% of the total effort in a hypothesis-testing project. The paper focuses on creating a consistent dataset that integrates geological and geophysical information on a given region. The problems of heterogeneous geodata sources are considered in terms of their format (vector/raster), scale, type of attribute information (quantitative/qualitative), and availability. A critical aspect is the formalization and synthesis of an algorithm for combining geospatial data and converting them into quantitative vectors. Data are combined using the concept of neighborhood, together with data selection techniques and a data consolidation strategy. The paper presents the general architecture of the software and hardware complex, which includes a module for data collection and transformation written in Python with the Pandas library and a data storage system based on the PostgreSQL DBMS (database management system) with the PostGIS extension. It is shown that a relational DBMS is sufficient for storing and processing data for the considered class of geophysical problems. If the problem dimension increases, it is proposed to scale the system using Big Data technology based on Apache Hadoop. A practical application of the proposed approach is demonstrated by the results of data collection for the Caucasus region and the eastern sector of the Russian Arctic.
Based on the prepared data, experiments were carried out with machine learning models to recognize the locations of potential strong earthquakes and to estimate the sensitivity of several geophysical features of these regions. The article presents the experimental results and an evaluation of their efficiency.
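The neighborhood-based conversion of heterogeneous geodata into quantitative vectors can be illustrated with a minimal Pandas sketch. It is not the authors' implementation: the column names (`node_id`, `lat`, `lon`, `anomaly`, `rock_type`), the function names, the 50 km radius, and the choice of aggregates (mean for a quantitative attribute, category shares for a qualitative one) are all assumptions made for illustration.

```python
import numpy as np
import pandas as pd

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, used for great-circle distances


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between one point and an array of points."""
    lat1, lon1, lat2, lon2 = map(np.radians, (lat1, lon1, lat2, lon2))
    a = (np.sin((lat2 - lat1) / 2) ** 2
         + np.cos(lat1) * np.cos(lat2) * np.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * np.arcsin(np.sqrt(a))


def neighborhood_features(nodes, obs, radius_km):
    """Build one quantitative feature vector per grid node (hypothetical schema).

    For every node, observations within radius_km are selected; the
    quantitative attribute is averaged and the qualitative attribute is
    converted into the share of each category in the neighborhood.
    """
    categories = sorted(obs["rock_type"].unique())
    obs_lat = obs["lat"].to_numpy()
    obs_lon = obs["lon"].to_numpy()
    rows = []
    for node in nodes.itertuples(index=False):
        dist = haversine_km(node.lat, node.lon, obs_lat, obs_lon)
        near = obs[dist <= radius_km]
        row = {"node_id": node.node_id, "n_obs": len(near)}
        # Quantitative attribute: neighborhood mean, NaN if nothing is nearby.
        row["anomaly_mean"] = near["anomaly"].mean() if len(near) else np.nan
        # Qualitative attribute: fraction of neighbors in each category.
        shares = near["rock_type"].value_counts(normalize=True)
        for cat in categories:
            row[f"share_{cat}"] = float(shares.get(cat, 0.0))
        rows.append(row)
    return pd.DataFrame(rows)


# Illustrative data only; the values are invented for the example.
nodes = pd.DataFrame({"node_id": [0, 1], "lat": [70.0, 60.0], "lon": [140.0, 100.0]})
obs = pd.DataFrame({
    "lat": [70.0, 70.05, 70.0],
    "lon": [140.1, 140.0, 150.0],
    "anomaly": [10.0, 20.0, 99.0],
    "rock_type": ["granite", "basalt", "granite"],
})
features = neighborhood_features(nodes, obs, radius_km=50.0)
```

Each row of `features` is a purely numeric vector that a machine learning model can consume. In a PostGIS-backed setup the radius selection would more naturally be done in the database; the in-memory loop here only keeps the sketch self-contained.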
Research Article
February 01, 2025
GENERALIZED DATASET OF GEOLOGICAL AND GEOPHYSICAL INFORMATION ON THE EASTERN SECTOR OF THE RUSSIAN ARCTIC FOR MACHINE LEARNING-BASED ANALYSIS
I.A. Lisenkov (1) ✉
A.A. Soloviev (1, 2)
V.A. Kuznetsov (3)
Yu.I. Nikolova (1)
1 Geophysical Center of the Russian Academy of Sciences, ul. Molodezhnaya 3, Moscow, 119296, Russia
2 Schmidt Institute of Physics of the Earth of the Russian Academy of Sciences, ul. Bolshaya Gruzinskaya 10, bld. 1, Moscow, 123242, Russia
3 National Research Nuclear University MEPhI, Kashirskoe shosse 31, Moscow, 115409, Russia
✉ E-mail: [email protected]
Publisher: Novosibirsk State University
Received: 03 Apr 2024
Accepted: 26 Aug 2024
First Online: 26 Nov 2024
Online ISSN: 1878-030X
Print ISSN: 1068-7971
© 2025, Novosibirsk State University
Russ. Geol. Geophys. (2025) 66 (2): 210–223.
Citation
I.A. Lisenkov, A.A. Soloviev, V.A. Kuznetsov, Yu.I. Nikolova; GENERALIZED DATASET OF GEOLOGICAL AND GEOPHYSICAL INFORMATION ON THE EASTERN SECTOR OF THE RUSSIAN ARCTIC FOR MACHINE LEARNING-BASED ANALYSIS. Russ. Geol. Geophys. 2025; 66 (2): 210–223. doi: https://doi.org/10.2113/RGG20244747
Index Terms/Descriptors
- algorithms
- Arctic region
- Armenia
- Azerbaijan
- Bouguer anomalies
- Caucasus
- Commonwealth of Independent States
- computer languages
- data processing
- earthquakes
- Europe
- experimental studies
- geographic information systems
- Georgian Republic
- gravity anomalies
- information systems
- lineaments
- magnetic anomalies
- magnetic properties
- neural networks
- relief
- Russian Arctic
- Russian Federation
- seismic risk
- machine learning
- Python computer language
- PostgreSQL
- QGIS
- Apache Hadoop
- AutoML