We have adopted a new tool to invert magnetotelluric data for the 1D model based on deep Q-networks (DQN), which works as a stochastic optimization method. By transforming the inversion problem into a Markov decision process, the tool learns by trial and error to find the optimal path for updating the model to fit the observed data. The DQN method converges to the target through different paths (e.g., Bayesian or other stochastic methods) and can partially provide the probability distribution of the inversion results, which can be used for uncertainty estimation. The DQN search space gradually decreases as the learning experience progresses, accelerating the single inversion and approximating the optimal result. To check the effectiveness of the DQN inversion, the five- and eight-layer models were separately designed to test the robustness of the DQN for the initial model and the noise level. A further comparison with Occam’s method and the Bayesian method indicated that our DQN obtained more robust inversion results for data contaminated by different noise levels. The inversion results with the survey data from Zhagaitunuoergong area, Inner Mongolia, China, well reveals the shape of the interface basement.

You do not have access to this content, please speak to your institutional administrator if you feel you should have access.