What is Computer Vision?

Computer vision is an interdisciplinary field that deals with how computers can be made for gaining high-level understanding from digital images or videos. From the perspective of engineering, it seeks to automate tasks that the human visual system can do.

Computer vision tasks include methods for acquiring, processing, analyzing and understanding digital images, and extraction of high-dimensional data from the real world in order to produce numerical or symbolic information, e.g., in the forms of decisions.Understanding in this context means the transformation of visual images (the input of the retina) into descriptions of the world that can interface with other thought processes and elicit appropriate action. This image understanding can be seen as the disentangling of symbolic information from image data using models constructed with the aid of geometry, physics, statistics, and learning theory.

As a scientific discipline, computer vision is concerned with the theory behind artificial systems that extract information from images. The image data can take many forms, such as video sequences, views from multiple cameras, or multi-dimensional data from a medical scanner. As a technological discipline, computer vision seeks to apply its theories and models for the construction of computer vision systems.

Sub-domains of computer vision include scene reconstruction, event detection, video tracking, object recognition, 3D pose estimation, learning, indexing, motion estimation, and image restoration.

Overview

The goal of Computer Vision is to emulate human vision using digital images through three main processing components, executed one after the other:

1. Image acquisition
2. Image processing
3. Image analysis and understanding

As our human visual understanding of world is reflected in our ability to make decisions through what we see, providing such a visual understanding to computers would allow them the same power

Image acquisition
Image acquisition is the process of translating the analog world around us into binary data composed of zeros and ones, interpreted as digital images.

Different tools have been created to build such data sets:

1. Webcams & embedded cameras
2. Digital compact cameras & DSLR

3. Consumer 3D cameras & laser range finders

Image processing

The second component of Computer Vision is the low-level processing of images. Algorithms are applied to the binary data acquired in the first step to infer low-level information on parts of the image. This type of information is characterized by image edges, point features or segments, for example. They are all the basic geometric elements that build objects in images.

This second step usually involves advanced applied mathematics algorithms and techniques.

Low-level image processing algorithms include

1. Edge detection
2. Segmentation
3. Classification

4. Feature detection and matching

Image processing

The second component of Computer Vision is the low-level processing of images. Algorithms are applied to the binary data acquired in the first step to infer low-level information on parts of the image. This type of information is characterized by image edges, point features or segments, for example. They are all the basic geometric elements that build objects in images.

This second step usually involves advanced applied mathematics algorithms and techniques.

Low-level image processing algorithms include:
1. Edge detection
2. Segmentation
3. Classification

4. Feature detection and matching

Applications of computer vision

Techniques developed for Computer Vision have many applications in the fields of robotics, human-computer interaction and visualization, to name a few:
1. Motion recognition
2. Augmented reality
3. Autonomous cars
4. Domestic/service robots

5. Image restoration such as denoising

Challenges in computer vision

When developing Computer Vision algorithms, one has to face different issues and challenges, related to the very nature of the data or event the application to be created and its context:
1. Noisy or incomplete data
2. Real-time processing
3. Limited resources: power, memory

Current research is focused on addressing these challenges to make the algorithms more robust and efficient in difficult conditions.

Credits
Richard Szeliski, “Computer Vision: Algorithms and Applications”, Springer Science & Business Media, 2010

Karen Reed, “8 Ways Technology is Improving your Health”, Positive Health Wellness, 2017

Header Ads

What is Computer Vision?

No comments:

Facebook

Social

Search This Blog

Blog Archive

Report Abuse

Contributors

Recent

Popular

Comments

Categories

Random Posts

Tags

Recent in Sports

Header Ads

What is Computer Vision?

You Might Also Like

No comments:

Facebook

Social

Search This Blog

Blog Archive

Report Abuse

Contributors

Recent

Popular

Comments

Categories

Random Posts

Tags

Recent in Sports