Informonics - Software Developer
During the summer of 2020, I undertook a 12-week remote internship with Informonics, where I focused on developing their data extraction tool. My primary task involved enhancing the accuracy of the Optical Character Recognition (OCR) data extraction tool using AWS Textract and AWS Lambda with Python. Additionally, I revamped the document query page by introducing custom pagination, using Angular and AWS DynamoDB. This optimization allowed users to efficiently filter and sort documents without loading all documents at once. Furthermore, I implemented a document preview feature in Angular, facilitating zooming and page navigation, with image access facilitated through AWS S3. To enhance document organization, I developed a document tagging system. This involved processing stored tags from AWS S3 via an AWS Lambda function written in Python, and integrating this functionality into the Angular application. Lastly, I remodelled the OCR algorithm by incorporating open-source OCR software, resulting in more efficient scanning processes to lower scan costs.