RESEARCH8 min read · 7 Jul 2023

AI in geology: transforming noisy curves into instant insights

How transformer-based deep learning - the same architecture family that powers ChatGPT and Vision Transformer - applies to one of the upstream industry's most stubborn problems: turning decades of scanned raster well-log images into clean, queryable digital curves. A condensed companion to the VeerNet AI whitepaper.

by The EarthScan Team · Research, engineering, and field

transformersvision-transformerraster-log-digitisationveernetwell-logsfracture-detectionfault-detectionwhitepaper-companion

“
The companies that win this transition will be the ones that capture institutional knowledge from senior staff before retirement closes that window. AI is not the prize - captured tacit knowledge is.
”

The ongoing digital revolution - what's commonly framed as the "fourth industrial revolution" - is marked by the integration of advanced technologies such as AI, robotics, and autonomous systems. Within that wave, deep learning has the potential to transform traditional reservoir engineering practices and address some of the upstream industry's longest-standing challenges.

This piece is a condensed companion to the VeerNet AI whitepaper - the in-depth treatment of how a transformer-based vision model handles raster log digitisation at scale.

The deep-learning lever in upstream

1970s

Earliest raster archives in the typical operator's portfolio

Day → minutes

Junior-petrophysicist interpretation time, before vs. after VeerNet

Operational levers automation pulls (efficiency / cost / quality / staff retention / customer outcomes)

Architecture under nearly every post-2017 AI breakthrough - the Transformer

Why digital transformation has been hard for oil & gas

The oil and gas industry, alongside mining, has been famously slow to embrace digital transformation. Three forces compound:

Resource and expertise gaps in AI - small in-house data-science teams and an external talent market that overwhelmingly chases consumer-tech salaries.
Slow technology-adoption cycles - capital projects measured in decades make "let's pilot this" a much harder sell than in software.
The Great Crew Change - a generational gap in the workforce as senior geoscientists retire faster than junior engineers gain enough field exposure to replace them. Domain knowledge walks out the door without a clean transfer path.

The article's real argument: 'AI is not the prize — captured tacit knowledge is.' SPE estimated in 2024 that ~50% of upstream technical staff retire within 10 years while only ~30% of replacement hires arrive with subsurface domain training. Drag the timeline toward the 10-year cliff: the retiring senior cohort sheds tacit knowledge (teal) while juniors backfill only ~30% trained (grey), opening a widening knowledge gap. The single orange element is the AI capture window — toggle it on and it bridges most of that gap before the cliff closes, compressing the transfer window 'from years to weeks'. Every on-canvas number — ~50%, ~30%, the 10-year window and 'years to weeks' — is the article's own; the cohort decay curves and the orange capture-band height carry no number and are schematic interpolations between those sourced endpoints, flagged on-canvas.

These aren't excuses. They're the operating constraints any AI strategy in this industry has to ship inside.

Automation as a forcing function

To stay competitive, integrating AI and ML has emerged as a vital lever. Automation streamlines processes, reduces manual labour where it doesn't add value, and optimises resource use. Large-language models like ChatGPT - built on the transformer architecture - point toward an immediate use case: automating repetitive interpretation tasks and capturing tacit knowledge from senior staff before they retire.

Implementing automation in upstream operations delivers, in roughly this order:

Improved operational efficiency - fewer manual handoffs in the interpretation pipeline.
Cost and time savings - labour and rework costs drop; cycle times shrink.
Improved quality and consistency - tasks performed identically every time, removing inter-interpreter variance.
Higher employee satisfaction - engineers redirected from tedious clean-up onto higher-judgment work.
Better customer outcomes - faster turnaround on internal and external deliverables.

Transformers - the architecture that changed the game

The Transformer (Vaswani et al., 2017) is the architecture under nearly every AI breakthrough since 2017: BERT, GPT, ChatGPT, AlphaFold, Stable Diffusion. The core idea - self-attention - lets a model directly compare every token in its input to every other token, capturing long-range dependencies that recurrent and convolutional architectures struggled with.

The Vision Transformer (ViT) (Dosovitskiy et al., 2021) extended this from text to images. Instead of treating an image as a 2D grid for convolutions, ViT splits it into patches, embeds each as a token, and applies the same self-attention mechanism. The benefit: ViT captures global image context - relationships between distant regions - in a way CNNs only approximate via deep stacks of receptive fields.

That global-context property is exactly what raster log interpretation needs. When you're identifying a fracture or a fault from a scanned log image, the diagnostic features aren't local - they span the full track and depend on context far above and below the candidate region.

VeerNet - transformers applied to raster well logs

VeerNet is the transformer-based deep-learning model EarthScan built specifically for raster log interpretation. It uses self-attention to identify individual curves from a single track in scanned legacy log images, providing fast, stable, scalable digitisation that replaces hours of manual mouse-clicking per log.

VeerNet was trained on a large in-house corpus of raster log images covering a wide range of vintages, scanning quality, and curve types. The training set captures the failure modes that off-the-shelf vision models miss: faded ink, overlapping curves, hand-annotated grid markings, and the typical scan artefacts of a 1970s log photographed in 2010 by an iPhone in a basement.

VeerNet AI architecture diagram - patch embedding, transformer encoder, and curve-segmentation head. — VeerNet's transformer-encoder architecture applied to raster log patches.

What VeerNet delivers in production:

Reduced time complexity and faster interpretation - what took a junior petrophysicist a day takes minutes.
Minimised systematic and random errors - eliminates click-jitter and inter-interpreter variance.
Interactive dashboard UX - interpreters review and correct, rather than tracing from scratch.
Scalable to different tracks and curves - same model, different curve types via fine-tuning.
Custom model training per reservoir - operators can fine-tune VeerNet on their own historical archives for region-specific accuracy.

Managing the AI technical debt that comes with this

Adopting AI is not free. Models accumulate AI technical debt - drift, retraining-pipeline staleness, training-data provenance gaps, brittle preprocessing - that can quietly degrade production performance.

Three recommendations:

Track model performance against ground truth continuously, not just at training time.
Budget for retraining cycles as a recurring operational cost, not a one-off project line item.
Invest in upskilling existing geoscientists, not just hiring new data scientists. The combination of subsurface knowledge plus AI literacy is rarer than either alone, and far more valuable.

Closing thought

AI - particularly deep learning models based on transformers - has the potential to revolutionise geology interpretation in oil and gas. Automation powered by these models drives efficiency, cost savings, and better decisions. The companies that win this transition will be the ones that:

Invest in AI skills among existing geoscience teams.
Treat AI tooling as production infrastructure, not as a research curio.
Use AI to capture institutional knowledge from senior staff before retirement closes that window.

For the full technical treatment - model architecture, training-data construction, benchmark comparisons against alternative approaches - see the VeerNet AI whitepaper.

Key takeaways

Transformers (and Vision Transformers) are the architecture under nearly every post-2017 AI breakthrough - including the transformer-residual hybrid that powers VeerNet.
The Great Crew Change is the AI urgency signal: senior geoscientists retiring faster than juniors gain field exposure. AI's role is to capture tacit knowledge while it's still in the building.
The transition winners will upskill existing geoscience teams rather than just hire new data scientists. The combination of subsurface knowledge plus AI literacy is rarer than either alone, and far more valuable.
AI technical debt is real - drift, retraining staleness, training-data provenance gaps. Treat AI tooling as production infrastructure, not a research curio.

Glossary

AI technical debt: The accumulating cost of running ML models in production - drift, training-data provenance gaps, brittle preprocessing, retraining-pipeline staleness. Analogous to software technical debt; ignored at your peril.
BERT: Bidirectional Encoder Representations from Transformers (Devlin et al., 2018) - the first transformer-based language model to demonstrate large transfer-learning gains across NLP benchmarks. The masked-language-model pretraining recipe that made transformers ubiquitous.
ChatGPT: OpenAI's transformer-based chatbot (released 2022) - the consumer-facing demonstration of large-language-model capabilities. Made 'self-attention' a household concept and accelerated transformer adoption across industries that had been transformer-skeptical.
Great Crew Change: A workforce phenomenon in oil & gas: senior geoscientists retiring faster than junior engineers gain enough field exposure to replace them. Domain knowledge walks out the door without a clean transfer path. AI's role: capture tacit knowledge while it's still in the building.
Patch embedding: The Vision Transformer's preprocessing step: split an input image into fixed-size patches (e.g. 16×16 pixels), flatten each, and project to a fixed-dimensional embedding. Each patch becomes one token in the transformer's input sequence - the bridge from images to sequence-of-tokens.
Self-attention: The core mechanism of the Transformer architecture (Vaswani et al., 2017). Each output element is computed as a weighted sum of all input elements. Captures long-range dependencies that local convolutional kernels and stepwise recurrences cannot see in a single layer.
Transformer: The neural-network architecture introduced in 'Attention Is All You Need' (Vaswani et al., 2017). Replaced recurrence and convolution with self-attention. The architectural foundation of nearly every AI breakthrough since 2017 - BERT, GPT, ChatGPT, Stable Diffusion, AlphaFold.
Vision Transformer (ViT): Dosovitskiy et al. (2021) - the application of the Transformer architecture to images. Splits an image into patches, treats each as a token, applies self-attention. Captures global image context that pure CNNs only approximate via deep stacks of receptive fields.

AI in geology: transforming noisy curves into instant insights

Why digital transformation has been hard for oil & gas

Automation as a forcing function

Transformers - the architecture that changed the game

VeerNet - transformers applied to raster well logs

Managing the AI technical debt that comes with this

Closing thought

Glossary

Related research

Digitization of raster logs - VeerNet, a U-Net + transformer hybrid

Geoscience Language Models: A Survey of LLMs for Reports, Logs, and Well Metadata

The Quiet Power of a Good Data Contract Between ML and Frontend

EarthScan insights, in your inbox.