Common T12 Parsing Issues and Quick Fixes

Failed T12 parsing? Here's what may have gone wrong and what you can do to fix it.

While Archer has been trained on thousands of documents in many different formats, a few common issues can cause T12 parsing to fail or to produce a large delta from the NOI provided in the original document.

Below are some of the common issues that come up and how to fix them.

Original T12 document provided has only one column of data.

The parser cannot currently parse a single-column financial statement because it cannot determine whether the column is a YTD value, a single month, or a single annual value. We are working to enhance this further. For the time being, spreading the data over multiple months will help with parsing.

Original T12 document provided has more than 12 months of data + a total column.

We are currently able to parse up to 12 months of data + a total column. If there are extra columns to the right of these, they sometimes interfere with parsing. Removing those excess columns should help with parsing.
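As a rough illustration of the rule above, the sketch below flags header labels that fall outside "12 months + a total". It is not Archer's parser: the function name `find_excess_columns`, the `max_months` parameter, and the rule that a total column is one whose label starts with "total" are all assumptions made for this example.

```python
def find_excess_columns(headers, max_months=12):
    """Return labels of columns beyond `max_months` data columns plus one total.

    `headers` is the header row without the leading line-item column.
    Assumption: a column is the total if its label starts with "total".
    """
    excess = []
    months_seen = 0
    total_seen = False
    for label in headers:
        text = str(label).strip().lower()
        if total_seen:
            excess.append(label)      # anything to the right of the total
        elif text.startswith("total"):
            total_seen = True
        elif months_seen < max_months:
            months_seen += 1          # one of the first 12 data columns
        else:
            excess.append(label)      # a 13th (or later) data column
    return excess
```

Any label this returns is a candidate for removal before uploading the file again.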

Original T12 document provided doesn't clearly label headers with dates.

If dates (at least month & year) cannot be found in the header for a T12, then the parser doesn’t know which month to attribute the financials to, so it won’t be able to extract the data. Please make sure that you have a full date (or at least month & year) for your columns.
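To check a header row before uploading, something like the sketch below may help. It tries a handful of common date layouts and reports the labels that do not contain a recognizable month and year. The list of formats and the function names are assumptions made for this example, not the formats Archer actually accepts, and month names are matched using the default English locale.

```python
from datetime import datetime

# Hypothetical list of layouts to try; extend it to match your own files.
FORMATS = ("%b %Y", "%B %Y", "%b-%y", "%b-%Y",
           "%m/%Y", "%m/%d/%Y", "%Y-%m", "%Y-%m-%d")

def parse_month_year(label):
    """Return (year, month) if the label matches a known layout, else None."""
    text = str(label).strip()
    for fmt in FORMATS:
        try:
            parsed = datetime.strptime(text, fmt)
        except ValueError:
            continue                  # this layout did not match, try the next
        return parsed.year, parsed.month
    return None

def headers_missing_dates(headers):
    """Return the header labels that have no recognizable month and year."""
    return [h for h in headers if parse_month_year(h) is None]
```

For example, `headers_missing_dates(["Jan 2023", "Feb", "Mar-23"])` reports `"Feb"`, because it has a month but no year.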

Excess rows above or below the main body of data.

If there are excess rows above or below the main body of data, that can sometimes cause issues because the parser can struggle to know the start and end of the true data. Removing those may help with parsing.

Hidden Rows & Columns.

Check to see if there are any hidden rows or columns in Excel. Hidden rows and columns can contain data that interferes with the parsing of the main data. If there are any, please unhide them and remove anything that is not part of the T12 before uploading the file again.
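If you have many files to check, hidden rows and columns can also be found without opening Excel. An .xlsx file is a zip archive of XML parts, and hidden rows and column ranges carry a `hidden` attribute in each worksheet part. The sketch below uses only the Python standard library; the function names are made up for this example, it applies to .xlsx files only (not .xls), and the results are keyed by internal part name (such as `xl/worksheets/sheet1.xml`) rather than by tab name.

```python
import xml.etree.ElementTree as ET
import zipfile

NS = "{http://schemas.openxmlformats.org/spreadsheetml/2006/main}"

def _is_hidden(element):
    # Boolean attributes in these files are written as "1" or "true".
    return element.get("hidden") in ("1", "true")

def hidden_rows_and_columns(sheet_xml):
    """Return (hidden row numbers, hidden (first, last) column ranges)."""
    root = ET.fromstring(sheet_xml)
    rows = [int(r.get("r")) for r in root.iter(NS + "row")
            if _is_hidden(r) and r.get("r")]
    cols = [(int(c.get("min")), int(c.get("max")))
            for c in root.iter(NS + "col") if _is_hidden(c)]
    return rows, cols

def scan_workbook(path):
    """Map each worksheet part in an .xlsx file to its hidden rows/columns."""
    with zipfile.ZipFile(path) as archive:
        return {name: hidden_rows_and_columns(archive.read(name))
                for name in archive.namelist()
                if name.startswith("xl/worksheets/") and name.endswith(".xml")}
```

Column ranges are 1-based, so `(2, 3)` means columns B through C are hidden.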

Failed PDF Parsing

While Archer can read and parse PDF documents, there are instances where it won't recognize the format of the PDF and will map the data incorrectly. We have expanded our PDF parsing capabilities, but PDFs are more complex than Excel files because they may involve OCR, rotated pages, encryption, and so on. If PDF parsing fails and you are able to convert the document to an Excel file, please try that. (Or just send it to support@archer.re, and we'll parse it for you.)

Multiple Sheets/Tabs and Hidden Sheets/Tabs

If there are multiple sheets, the parser can struggle to find the sheet with the relevant T12 data. Removing any unnecessary sheets will help with parsing. The most common reason parsing has failed for customers is hidden tabs to the left of the main T12 financial data tab. So if everything else about the original document looks fine but parsing still failed, make sure to check for hidden tabs!
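Hidden tabs are easy to miss in Excel, so a quick script can be a useful double check. In an .xlsx file, the workbook part lists every sheet in tab order, and hidden sheets carry a `state` attribute of `hidden` or `veryHidden`. The sketch below reads that list with the Python standard library. The function names are made up for this example, and it assumes the workbook part lives at the conventional `xl/workbook.xml` path; it does not apply to .xls files.

```python
import xml.etree.ElementTree as ET
import zipfile

NS = "{http://schemas.openxmlformats.org/spreadsheetml/2006/main}"

def hidden_sheet_names(workbook_xml):
    """Return the names of sheets whose state is not 'visible', in tab order."""
    root = ET.fromstring(workbook_xml)
    return [s.get("name") for s in root.iter(NS + "sheet")
            if s.get("state", "visible") != "visible"]

def hidden_sheets_in_file(path):
    """Read the workbook part of an .xlsx file and list its hidden sheets."""
    with zipfile.ZipFile(path) as archive:
        # Assumption: the workbook part is at the conventional location.
        return hidden_sheet_names(archive.read("xl/workbook.xml"))
```

If the returned list is not empty, unhide those tabs in Excel and delete any that are not needed before uploading again.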
 
If you have a file that you cannot parse, please send it to support@archer.re.