Whether its an Invoice or Receipt, ChatGPT is terrible at reading scanned documents. At Pinsight, we are working on building high quality, open source vision-langauge model that is enable to processes multi-page scanned documents and product structured output.
We call our models DOLMA which stands for "Document Optimized Language Models for Automation".