TL;DR
Built a computer vision-powered API that digitizes manually aggregated invoices, vendor quotes, and documents into structured data within Global Shop Solutions' ERP platform, achieving 95% ingestion and transcription accuracy
Delivered the project two weeks ahead of schedule with a model 15% more accurate than originally projected, exceeding the client's expectations on both timeline and performance
Automated the full document processing pipeline using GPT-4V and custom LLM field extraction, reducing overhead costs by 90% and freeing manufacturing teams to focus on higher-value work
The Challenge
Manufacturing operations generate a constant volume of paper-based and scanned documents. Invoices from suppliers, vendor quotes, purchase orders, and shipping records arrive in inconsistent formats, from handwritten notes to multi-page PDFs from dozens of different vendors. Extracting usable data from these documents requires manual entry, which is slow, error-prone, and expensive at scale.
Global Shop Solutions builds ERP software used by manufacturers around the world to manage inventory, scheduling, accounting, supply chain, and customer relationships. Their customers needed a way to get structured, actionable data from incoming documents directly into the ERP without manual transcription. The existing process required staff to open documents, read values, and key them into the system field by field. As document volumes grew, so did the overhead.
The core technical challenge was building a system that could handle the real-world messiness of manufacturing documents. Vendors do not use standardized invoice templates. Dates, gross totals, vendor identifiers, and line items appear in different locations, in different formats, and with different labels depending on the source. A reliable ingestion system needed to understand document structure at a level of nuance that simple OCR and template matching cannot provide.
Global Shop Solutions needed a custom AI model that could ingest documents regardless of format, extract the right fields with high accuracy, match vendor nomenclature to ERP records, and return structured data through an API that integrated cleanly with their existing platform.
Key Results
95% ingestion and transcription accuracy across real-world invoice and vendor document formats
90% reduction in overhead costs associated with manual document processing
Delivered two weeks ahead of the projected six to seven week timeline
Model accuracy 15% higher than originally projected at project kickoff
Smooth engineering handover enabling Global Shop Solutions team to maintain and extend the system independently
