Technology

Our Technology Platform Is Unique

Nominal training period for each document type. Depending on the document complexity 2 to 20 specimens of each document are required to build a template. Fully automated. flow ℠ provides unprecedented accuracy which removes the need for human intervention.

Intuitive quality control metrics. Our error checking algorithm provides yes or no assurance that the numbers are right. Our text OCR algorithm provides a good proxy for the accuracy of text OCR. Both can be used as alternative ways to evaluate accuracy. We will never provide a confidence score.

Structured Language Overview

Rigid Forms

These include tax forms & applications. All these forms have the same number of pages. and every elements is in the same place on the page.

Variable Size Field

These include the Universal Residential Loan Application or Form 1003. Here certain sections might balloon bigger or smaller depending upon the number of items.

Category 1 Document

Category 2 Document

Structured Documents

These include standardized contracts like the Form of Mortgage or signed Disclosure Statements.

Formal But Unstructured Documents

These include standarization contracts like the form of mortgage or signed Disclosure statements.

Category 3 Document

Category 4 Document

Our iceflake ℠ algorithm can categorize each document after seeing two specimens.  Our proprietary flow ℠ technology platform can abstract data from Category 1 – 3 documents. Category 4 documents require the customer to provide a list of data items wanted.  Once we have that list, flow ℠ can abstract the data.

To understand how flow ℠ works, it helps to compare against other technologies.  There are at least a dozen companies that have solved the problem of extracting data from Category 1 Documents or Rigid Forms.  They use spatial relationships on the pdf page itself to extract data with the expectation that the data will be on the same place on every page.  This model works for this very narrow application with accuracy levels approaching 100% for at least half a dozen providers that we have tested.  The challenge for most of these service providers is that building a model is time-consuming as evidenced by the one-time model fees charged which can range from $180 to $3,000 or annual minimums which can be $6000 per Category 1 Document.   

Providers have tried to expand this technology into other Categories with mixed success.  For example, a common method is to look for a floating field name and bring in the value below or to the right of that field.  This works well enough to make operations more efficient but still requires human intervention.  Publicly available reviews and case studies suggest that accuracy levels are 75 – 90% for Category 2 documents.

At the other end of the spectrum are Natural Language Processors.  Because of the complexity of Natural Language all these companies use AI or Machine Learning to abstract information from a document.  These technologies traffic in probabilities.  There are many possible answers and the model picks the most probable one.  Natural Language Processors do a very good job with Category 3 Structured Documents. Before Firehorse, AI companies were the only solution for Category 4 documents. They do a serviceable job but at a high price. 

Pure Natural Language Processors are not very accurate with the other Categories of Structured Language documents.  Most companies actually combine floating field techniques with Natural Language Processing to get to those 75 – 90% accuracy levels.  Natural Language Processors, AI and Machine Language companies for the most part focus on harder documents, either handwriting or poor quality pdf scans.  When every customers’ automation processes are built already for a significant amount of human intervention, Automation provider’s lowest hanging fruit is to improve bad document OCR rather than making good document OCR better.

flow ℠ does things differently.  It focuses on documents created and scanned from 2018 till now and used as the medium of information exchange between businesses and between businesses and consumers.  These documents have a high enough quality that flow abstracts data from them with near perfect accuracy.

snowflake ℠  sorts and categorizes Structured Documents.  Structured Documents follow a consistent internal structure which repeats from specimen to specimen.  Forms are an example of Structured Documents.  snowflake only needs 2 specimens of each type of Structured Document to categorize it accurately.  Unlike other classification technologies, snowflake does not look to see what is similar about each document, rather it looks to catalog differences.  There are an infinite variety of Structured Documents.  Every unique Structured Document may also have an infinite variety of field names.

is an App that effects a change.  From application to approval, from bidder to owner, delta ℠ processes flow ℠ to make a change in our world.  In cases where decisions can be made algorithmically, delta ℠ can make the decision.  In cases where human judgment is still required, delta ℠ presents flow ℠ in an intuitive design so that our customer can make the decision.  River deltas historically have been centers of trade & innovation.  It is no different now at Firehorse.

flow ℠ converts the entire Structured Document into text and abstracts required data with unprecedented accuracy.  Depending on the complexity, flow ℠ only needs two to twenty specimens of the same Structured Document to parse all the fields, tables and required words.  All text and structured data is stored in NoSQL databases where our error checking algorithm confirms the numbers and provides an accuracy measure for the text.   We do industry specific calculations or analysis and present the data in both raw and processed form.  Just as we sometimes get in the flow during moments of peak performance, flow ℠ is the magic part of our business. 

flow ℠ converts the entire Structured Document into text and abstracts required data with unprecedented accuracy.  Depending on the complexity, flow ℠ only needs two to twenty specimens of the same Structured Document to parse all the fields, tables and required words.  All text and structured data is stored in NoSQL databases where our error checking algorithm confirms the numbers and provides an accuracy measure for the text.   We do industry specific calculations or analysis and present the data in both raw and processed form.  Just as we sometimes get in the flow during moments of peak performance, flow ℠ is the magic part of our business. 

we're fast and affordable

Combine our Technology Products to Create the Perfect Platform