Technology
Our Technology Platform Is Unique
Nominal training period for each document type. Depending on the document complexity 2 to 20 specimens of each document are required to build a template. Fully automated. flow ℠ provides unprecedented accuracy which removes the need for human intervention.
Intuitive quality control metrics. Our error checking algorithm provides yes or no assurance that the numbers are right. Our text OCR algorithm provides a good proxy for the accuracy of text OCR. Both can be used as alternative ways to evaluate accuracy. We will never provide a confidence score.
Structured Language Overview
Rigid Forms
These include tax forms & applications. All these forms have the same number of pages. and every elements is in the same place on the page.
Variable Size Field
These include the Universal Residential Loan Application or Form 1003. Here certain sections might balloon bigger or smaller depending upon the number of items.
Category 1 Document
Category 2 Document
Structured Documents
These include standardized contracts like the Form of Mortgage or signed Disclosure Statements.
Formal But Unstructured Documents
Category 3 Document
Category 4 Document
Our iceflake ℠ algorithm can categorize each document after seeing two specimens. Our proprietary flow ℠ technology platform can abstract data from Category 1 – 3 documents. Category 4 documents require the customer to provide a list of data items wanted. Once we have that list, flow ℠ can abstract the data.
To understand how flow ℠ works, it helps to compare against other technologies. There are at least a dozen companies that have solved the problem of extracting data from Category 1 Documents or Rigid Forms. They use spatial relationships on the pdf page itself to extract data with the expectation that the data will be on the same place on every page. This model works for this very narrow application with accuracy levels approaching 100% for at least half a dozen providers that we have tested. The challenge for most of these service providers is that building a model is time-consuming as evidenced by the one-time model fees charged which can range from $180 to $3,000 or annual minimums which can be $6000 per Category 1 Document.
Providers have tried to expand this technology into other Categories with mixed success. For example, a common method is to look for a floating field name and bring in the value below or to the right of that field. This works well enough to make operations more efficient but still requires human intervention. Publicly available reviews and case studies suggest that accuracy levels are 75 – 90% for Category 2 documents.
At the other end of the spectrum are Natural Language Processors. Because of the complexity of Natural Language all these companies use AI or Machine Learning to abstract information from a document. These technologies traffic in probabilities. There are many possible answers and the model picks the most probable one. Natural Language Processors do a very good job with Category 3 Structured Documents. Before Firehorse, AI companies were the only solution for Category 4 documents. They do a serviceable job but at a high price.
Pure Natural Language Processors are not very accurate with the other Categories of Structured Language documents. Most companies actually combine floating field techniques with Natural Language Processing to get to those 75 – 90% accuracy levels. Natural Language Processors, AI and Machine Language companies for the most part focus on harder documents, either handwriting or poor quality pdf scans. When every customers’ automation processes are built already for a significant amount of human intervention, Automation provider’s lowest hanging fruit is to improve bad document OCR rather than making good document OCR better.
snowflake ℠ sorts and categorizes Structured Documents. Structured Documents follow a consistent internal structure which repeats from specimen to specimen. Forms are an example of Structured Documents. snowflake only needs 2 specimens of each type of Structured Document to categorize it accurately. Unlike other classification technologies, snowflake does not look to see what is similar about each document, rather it looks to catalog differences. There are an infinite variety of Structured Documents. Every unique Structured Document may also have an infinite variety of field names.
is an App that effects a change. From application to approval, from bidder to owner, delta ℠ processes flow ℠ to make a change in our world. In cases where decisions can be made algorithmically, delta ℠ can make the decision. In cases where human judgment is still required, delta ℠ presents flow ℠ in an intuitive design so that our customer can make the decision. River deltas historically have been centers of trade & innovation. It is no different now at Firehorse.
flow ℠ converts the entire Structured Document into text and abstracts required data with unprecedented accuracy. Depending on the complexity, flow ℠ only needs two to twenty specimens of the same Structured Document to parse all the fields, tables and required words. All text and structured data is stored in NoSQL databases where our error checking algorithm confirms the numbers and provides an accuracy measure for the text. We do industry specific calculations or analysis and present the data in both raw and processed form. Just as we sometimes get in the flow during moments of peak performance, flow ℠ is the magic part of our business.
flow ℠ converts the entire Structured Document into text and abstracts required data with unprecedented accuracy. Depending on the complexity, flow ℠ only needs two to twenty specimens of the same Structured Document to parse all the fields, tables and required words. All text and structured data is stored in NoSQL databases where our error checking algorithm confirms the numbers and provides an accuracy measure for the text. We do industry specific calculations or analysis and present the data in both raw and processed form. Just as we sometimes get in the flow during moments of peak performance, flow ℠ is the magic part of our business.
we're fast and affordable
Combine our Technology Products to Create the Perfect Platform
Predict
Once the data or numbers are in a format that can be analyzed, then you can compare outcomes vs. inputs.
Decide
Once you have all the data required to make the decision, it’s possible to make a decision. Without the data.
Convert
Fully automated flow provides unprecedented accuracy which removes the need for human intervention sort & categorize pdfs .
Identify
Delta℠ Mortgage Premium
Firehorse does the underwriting using our platform, which is why we can offer pricing half of the market rates.
Delta℠ Mortgage Direct
Firehorse provides the data; your underwriters can confirm or change the underwriting.