The bottleneck in borehole-image interpretation is rarely the geology. It is the clock. A single high-resolution borehole image log runs over 200 metres, every planar feature that crosses the wellbore projects as a sinusoid, and an interpreter must pick each trace by hand, fit a curve, and recover three numbers — depth, dip, azimuth — at intervals down the whole well. In a fractured carbonate, that is days of expert time per well, and the queue only grows. For a mid-sized Middle East carbonate operator we worked with, the question that mattered to the subsurface manager was never the F1 score of any one model. It was simpler and harder: how many wells can my team turn around this quarter, and what is the expert time costing me to get there?
This case study is about the commercial arc — the before-and-after of converting a manual, multi-week-per-well interpretation process into a productised pipeline (AutoFrac for fracture and bedding sinusoids, AutoVug for vug morphology) that interprets at machine speed while the geoscientist stays in the loop as the validator, not the labourer.
At a glance
- Before: Manual sinusoid picking ran to days of expert time per well across a 16-well programme; vug identification, done by eye, took hours per well. Interpretation was the throughput ceiling for the subsurface team.
- After: The productised AutoFrac/AutoVug tools interpreted at roughly 30 seconds per metre — about 5x faster than the manual baseline — picking sinusoids across multiple wells in minutes and turning a multi-week backlog into same-day turnaround.
- Quality held: The companion well-to-well correlation tool reached 95% target-location precision and 90% stratigraphic success, with the operator's subsurface team reporting +60% interpretation productivity and +75% interpretation accuracy on the productised stack.
- The engineering, not just the science: A from-scratch Detection-Transformer (DETR-derived) model, trained under extreme well-data scarcity, served behind a containerised web application on an on-prem GPU stack with full MLOps experiment tracking — an applied-AI production system, not a notebook.
The manual baseline, costed honestly
Start with what the interpreter actually does. A high-resolution borehole image log is unwrapped into a 2D image; the interpreter scans it, picks every fracture and bedding sinusoid, and for each one records depth, dip, and azimuth. Vug analysis is a separate manual pass — identifying pore features by eye, zone by zone. On this operator's data, sinusoid picking ran to days per well, and vug identification took hours per well, both done by a scarce specialist.
That cost compounds. The engagement spanned roughly 16 wells of usable image-log data (with the broader operator inventory exceeding 80 image logs), so the manual process did not just delay one well — it set the throughput ceiling for the entire subsurface programme. Worse, manual picking is inconsistent between interpreters and across sessions, so the multi-week spend bought a result that still carried inter-interpreter variance. The operator was paying premium expert time for a process that was simultaneously slow and not perfectly repeatable.
The strategic reading is the one that matters to a subsurface manager: this is a capacity problem disguised as a quality problem. The team did not lack skill; it lacked hours. Any intervention that did not move the throughput needle — a marginally better accuracy figure, a prettier interface — would have left the actual constraint untouched.
What we built: a productised interpretation pipeline, not a model
The instinct in subsurface AI is to chase the model. We did build the model — a customised Detection Transformer that we covered in depth elsewhere — but the throughput dividend came from treating interpretation as a production system end to end, with the model as one component.
The engineering had four layers, each load-bearing for the commercial result:
- The model. A DETR-derived detector with a ResNet-10 backbone, transformer encoder/decoder, and a multi-task head that classifies fracture-vs-bedding and regresses depth, dip, and azimuth in a single forward pass — no per-pixel masks, no anchor boxes, no non-maximum suppression to tune. Set prediction with Hungarian matching is what lets overlapping sinusoids be recovered natively rather than fought with thresholds.
- The data engineering. The reservoir intervals were brutally sparse. We grew the training corpus from roughly 900 image–ground-truth pairs to more than 55,000 — a 65x increase — through overlapping patch extraction and geometry-preserving augmentation, the difference between a model that learned nothing and one that converged. The vug pipeline was a separate classical computer-vision stack (top-k mode subtraction, adaptive thresholding, contour filtering) tuned to recover per-vug morphology at 0.1 m resolution.
- The serving layer. Both tools were productised and exposed through a containerised web application so a geoscientist could run a well through interpretation from a browser — no model code, no GPU babysitting.
- The MLOps and infrastructure. The whole thing ran on an on-prem GPU stack with experiment-tracking, a versioned data store, and a custom MLOps layer — the production scaffolding that lets a model be retrained, versioned, and trusted rather than re-derived by hand each time.
That fourth layer is where most subsurface-AI pilots die, and it is the reason the speed-up was real and repeatable rather than a benchmark artefact. A model that runs once in a notebook does not change a team's throughput; a model behind a containerised app on managed infrastructure does.
The throughput dividend, in numbers
With the pipeline in place, interpretation ran at roughly 30 seconds per metre of image log — about 5x faster than the manual baseline. In practice that meant the model picked sinusoids across multiple wells in minutes where the manual process took days per well, and the vug tool did in minutes what previously took hours per well. The multi-week interpretation backlog collapsed into same-day turnaround for the subsurface team.
Crucially, speed did not come at the expense of trust. On the productised stack the operator's team reported +60% interpretation productivity and +75% interpretation accuracy, and the companion well-to-well correlation tool — which builds fracture and bedding density logs and correlates them across wells via kriging — hit 95% target-location precision and 90% stratigraphic success. The pipeline was faster and more consistent, because a model applies the same picking criteria to every metre of every well, removing the inter-interpreter variance that manual picking carries.
The point the exhibit above makes — drawn from a separate, larger portfolio engagement — generalises to this operator exactly: automation did not replace the geoscientist. It moved expert hours off rote tracing and onto the judgment calls that actually need a specialist. Anomalous intervals, ambiguous overlapping features, and final sign-off stayed with the interpreter. What changed was the ratio of judgment to labour. A geoscientist who had been spending days per well dragging a cursor over sinusoids was now reviewing model output and adjudicating edge cases — higher-leverage work, and far less of the rote kind.
Why this is an ROI story, not a research result
A 5x interpretation speed-up is not a vanity metric; it is a re-pricing of the constraint. Three commercial consequences followed directly for the operator, and they are the reasons a subsurface manager funds the next phase.
First, same-day turnaround changes the operating model. When interpretation takes days per well, image-log analysis is a back-office, after-the-fact activity. When it takes hours, it can sit closer to operations — informing completion and stimulation decisions inside a useful window rather than arriving after the fact. The value of an interpretation is partly a function of when it lands.
Second, the expert-hours freed are the real dividend, and they compound across the portfolio. Across a 16-well programme — against an inventory exceeding 80 image logs — the reclaimed specialist time is not a one-off saving; it is recurring capacity that grows with every new well the operator drills. That capacity can absorb the backlog, take on wells that were previously deprioritised for lack of interpreter time, or be redirected to the interpretation work that genuinely needs a human.
Third, consistency de-risks downstream decisions. A reservoir model built on machine-picked sinusoids inherits a single, repeatable picking standard rather than the variance of several interpreters working under deadline. The +75% accuracy figure the team reported is not just a quality number — it is a reduction in the silent uncertainty that propagates from picks into the static model and out into well placement.
The honest caveats
The numbers are real, but they are bounded, and a credible vendor says so. The headline speed and quality figures are the operator's own, measured on the productised tools over a confidential 16-well carbonate dataset; they are not a published, peer-reviewed benchmark, and the more conservative, F1-based model metrics we report elsewhere are the right reference for scientific claims. The corpus was vertical wells of a single reservoir — horizontal wells, where fracture sinusoid amplitude drops sharply, are a separate distribution that needed its own treatment. And the speed-up assumes the production scaffolding stays in place: the model behind a maintained, monitored, retrainable stack is what makes the throughput durable. A model handed over as a pickle file would have decayed back into a research artefact within a quarter.
None of that undercuts the commercial thesis. Across our subsurface engagements — with operators in the Middle East and the United States — the pattern that holds is this: the deep-learning model is necessary but not sufficient. The throughput dividend, the part a subsurface manager can take to the board, comes from engineering the model into a production system that an interpreter trusts and a team can run every day.
The throughput dividend, in one view
- Interpretation, not geology, was the constraint: manual sinusoid picking ran to days per well and vug ID to hours per well, setting the throughput ceiling for the whole subsurface programme across a 16-well engagement.
- The productised AutoFrac/AutoVug pipeline interpreted at roughly 30 seconds per metre — about 5x faster — collapsing a multi-week backlog into same-day turnaround, with the operator's team reporting +60% productivity and +75% interpretation accuracy and the correlation tool hitting 95% target-location precision / 90% stratigraphic success.
- The speed-up was real because it was a production system, not a model: a from-scratch DETR served behind a containerised web app on an on-prem GPU stack with experiment tracking and a custom MLOps layer — and the geoscientist stayed the validator, with expert hours moved off rote tracing onto judgment.
References
-
Productised tool performance (AutoFrac/AutoVug ~5x faster interpretation; Well-to-Well +60% productivity, +75% interpretation accuracy, 95% target-location precision, 90% stratigraphic success) and the on-prem MLOps/serving stack are drawn from the Phase-3 transition and project review materials for a confidential mid-sized Middle East carbonate operator; figures are the operator's own and are withheld in raw form under confidentiality.
-
Interpretation speed (~30 s/m, measured on the AutoFrac sinusoid-interpretation pipeline), manual baselines (sinusoid picking takes days per well; vug identification takes hours per well), and dataset growth (~900 to >55,000 image–ground-truth pairs, ~65x via overlap and augmentation) derived from internal Phase-3 reporting on the same engagement.
-
Model architecture (customised Detection Transformer, ResNet-10 backbone, multi-task depth/dip/azimuth head, Hungarian matching) detailed in the companion GeoBFDT case study; data and code withheld under operator confidentiality.