Computer Vision RGB v/s HSV for Computer Vision Computer vision is an interdisciplinary scientific field that deals with how computers can be made to gain high-level understanding from digital images or videos. If we consider Digital images then it can be
Computer Vision Graph Convolution for Multimodal Information Extraction from Visually Rich Documents The objective of the paper is to extract structured information from unstructured documents such as invoices and receipts. Document Modeling Each document is modeled as a graph of text segments, where each text
Deep Learning Visual classification of document images Document image classification is not as well studied as natural image classification. We experimented with different neural network architectures on document image dataset. We discuss our preliminary results in this post.
Deep image prior Deep Image Prior defies the idea that "deep learning only works in the context of massive datasets or models pretrained on such datasets". This paper showed that some deep neural networks could be
Deep Learning Audio/Video receiver ports detection Object detection process applied to Audio/Video receiver back panel images.
Deep Learning Measuring feet using deep learning Applying object identification techniques to a simple biometrics problem