AI Table Extraction

From Scanned PDFs to
Structured Data.

Drag. Drop. Download.

NDepth extracts structured tabular data from scanned core analysis reports, mud logs, and subsurface PDFs. AI reads handwritten notes, faded tables, and complex multi-column layouts with high accuracy.

Drop PDF hereDepthPorPermSwLith5420.518.2%24532.1%SS5421.016.8%19835.4%SS5421.521.3%31228.7%SS5422.014.1%8741.2%LS5422.519.6%26730.5%SSExcel / CSVDrop files to extract

73K+
Wells Processed
<1 min
Per File
100+
Batch Upload
$10
Free Credit

Specialized AI for Every Document

Specialized AI models trained on thousands of geological documents for maximum accuracy.

Core Analysis

Extract Porosity, Permeability & More

Digitize decades of core analysis reports. Our AI reads handwritten notes, faded tables, and complex multi-column layouts with high accuracy.

Depth intervals automatically extracted and validated
Porosity, permeability, saturation values captured
Lithology descriptions parsed and structured
Multi-page tables merged automatically
Mud Log Shows

Hydrocarbon Show Extraction

Pull gas readings, show descriptions, and formation tops from mud logs - even from the messiest handwritten field reports.

Gas readings and show types extracted
Formation tops and depth intervals
Fluorescence and cut descriptions
Standardized show classification

How NDepth Works

Every document goes through a six-stage pipeline. AI handles the heavy lifting - you handle the final QC.

01

OCR & Enhancement

Advanced OCR with auto-rotation, deskew, contrast enhancement, and column detection. Low-confidence pages are automatically re-processed with enhanced imaging.

02

Structure Classification

Every page is classified by visual structure: tabular data, graphical logs, maps, photographs, or forms. Tables always take priority over other content types.

03

Table Type Classification

Tabular pages are classified into dozens of specific types using column fingerprinting, keyword matching, and header analysis. Each type gets a specialized extraction model.

04

Intelligent Extraction

Data is extracted with column alignment validation, depth continuity logic, ditto mark resolution, and precise handling of multi-page tables that span across pages.

05

Duplicate Detection

Multi-signal dedup using Jaccard similarity on headers, sample number overlap analysis, depth range intersection, and continuation detection to prevent data duplication.

06

Data Validation & QC

Values are validated against geological ranges (porosity 0-40%, grain density 2.0-3.5 g/cc). OCR errors are auto-corrected and out-of-range values are flagged for review.


Every Field, Automatically

NDepth identifies and standardizes well metadata from headers, cover pages, and document body text.

API Number

Standardized to 14 digits with state code validation

Well Name

Parsed from headers, cover pages, and body text

Operator

Matched to known operator names

Laboratory

Normalized lab names (Core Labs, Weatherford, etc.)

Formation

Geological formation and member identification

County & State

Full county name, 2-letter state abbreviation

Field Name

Oil/gas field identification from document context

Date

Normalized to YYYY-MM-DD from any format


Built for the Worst Documents

Faded 1960s photocopies. Handwritten notes. Rotated pages. Multi-column layouts. If it was important enough to scan, it's important enough for us to read.

Drag & Drop Upload

Upload single files or batch hundreds at once. PDFs, TIFFs, JPEGs, PNGs - even rotated and degraded scans. Processing starts automatically.

Batch upload 100+ filesAuto-rotation & deskewMulti-page supportDegraded scan handling

Metadata Extraction

API numbers, well names, operators, labs, formations, counties - all parsed from headers, cover pages, and body text. Standardized to PPDM format.

14-digit API normalizationOperator name matchingLab name normalizationPPDM compliant

Page Classification

Every page is classified by structure and content type. Core analysis, mud log shows, completion data, DSTs - each gets a specialized extraction pipeline.

Dozens of document typesColumn fingerprintingKeyword & header analysisConfidence scoring

Multi-Table Merging

Tables that span multiple pages are automatically detected and merged. Continuation markers, depth range analysis, and header matching ensure nothing is missed.

Cross-page mergingDepth continuity logicDitto mark resolutionContinuation detection

Excel & CSV Export

Download structured data as Excel spreadsheets or CSV files, ready for import into Petrel, Kingdom, or any analysis tool. One file or batch export entire orders.

Excel (.xlsx) exportCSV exportBatch downloadColumn-mapped output

QC Portal & Review

Side-by-side comparison of extracted data against the original document. Flag issues, approve extractions, and track quality across your entire dataset.

Side-by-side reviewIssue flaggingRange validationOCR correction audit

Upload Anything

NDepth handles any scan format, any orientation, any condition.

PDF
TIFF
JPEG
PNG
Multi-Page
Rotated

Ready to Try NDepth?

Start with $10 free credit. No credit card required. Upload your first document in minutes.