Skip to main content

Pulled in at 11th hour on project converting from C++ to PHP for heavily used OCR of archival newspapers to web-database application.

Previous company was unable to finish project. Goal was to write an XML-path function that would search an XML OCR scan of newspaper for location of words, phrases, and sentences and return pixel locations for display of images with the words highlighted even if wrapped across columns, hyphenated, etc. Client was given successful function in time to meet deadline.

Taxonomy