UiPath AI Computer Vision

Automate in dynamic interfaces and across virtual desktops

computerVision-valueprop-thumbnail@2x-1 Play Video Button

Your robot needs to be able to 'see' everything you're automating.

AI Computer Vision is an AI skill that enables all UiPath Robots to see every element on a computer screen. Now, businesses and RPA developers can automate tasks on most virtual desktop interface (VDI) environments—regardless of framework or operating system.

Automation without limitation

With AI Computer Vision, your robot can see on-screen elements with human-like recognition.

AI Computer Vision datasheet UiPath Chatbots product brochure arrow icon

Automation beyond selectors

Lets robots recognize and interact with more on-screen fields and components—even Flash, Silverlight, PDFs, and images

Reliable on VDIs and desktops

Relieves issues with failure-prone image automation techniques and with selector-based targeting on desktops

Broad range of interface types

Includes VDI environments (Citrix, VMWare, Microsoft RDP, VNC, and others) for desktop and web applications

enterprise-competency alt
Intelligent, intuitive capabilities

Provides details, validation, and notifications about on-screen selections via an on-screen wizard

Drag-and-drop AI

Does not require advanced image recognition techniques; the recorder can generate full vision-based automation

Cross-platform capabilities

Supports automation on Windows, Linux, Android, and other operating systems

Automation between VDI & non-VDI

Simplifies VDI-to-desktop automation by reducing necessary modifications

Multiple development options

Deploys via SaaS; available on-premises for Linux and Windows. Desktop Server is now also available.

Dynamic UI elements

Enables automations that include drop-down and checkbox elements

Why VDI automations need
AI Computer Vision

RPA depends on a robot’s ability to see selectors on a webpage or computer interface. But a virtual desktop interface (VDI) doesn’t present a traditional user interface; instead, it streams an image of a remote desktop.

VDIs make it virtually impossible for robots to easily or accurately recognize—much less interact with—the selectors they need to see.

But with AI Computer Vision, robots can “see” the elements they need—even through a VDI.


How does
AI Computer Vision

UiPath robots' human-like vision comes from a neural network with a combination of custom screen OCR and text fuzzy matching with a multi-anchoring system.

See how AI Computer Vision compares to traditional selector-based activities as you see it at work in this training video.

UiPath-AI-Computer-Vision-snapshot Play Video Button

Why RPA developers love AI Computer Vision

AI Computer Vision eliminates the reliance on selectors, while still maintaining familiar workflows for RPA developers. With an increase in visible screen elements, more automations are possible.


Give AI Computer Vision a try