Solutions
- Solutions
  - Agentic use cases
    Watch agentic automation use case videos and demos
    Webinars
    Learn best practices from industry experts.
    Customer stories
    Gather insight, read success stories, and more.
- By industry
  Banking & financial services
  Healthcare
  Insurance
  Public sector
  Manufacturing
  All industries
  By department
  Supply chain
  Finance & accounting
  HR
  QA / Testing
  Contact center
  All departments
  By technology
  Peak.ai
  Coded agents
  Microsoft
  SAP
  Agentic testing
  Technology solutions overview
  Agentic AI Summit recap: Embracing AI transformation enterprise-wide
  Read the blog
Platform
- - Agentic Automation
    Discover the place where agents think, robots do, and people lead.
  - Agentic Testing
    Explore agentic testing for the enterprise
  - Explore the Platform
  - View all products
  - Pricing
  - Support
- Agentic Automation
  Agentic Automation
  Discover the place where agents think, robots do, and people lead.
  Processes
  Model and orchestrate agents, robots, and people end-to-end
  Agentic orchestration
  Business process management (BPM)
  Process intelligence
  Workflows
  Plan, build, and deploy automated workflows
  Agentic & robotic workflows
  Human-in-the-loop
  Agent evaluation
  Activities
  Empower agents and robots with AI, API, and rules-based tools
  Agentic activities
  RPA & API
  Forms & apps
  Intelligent Document Processing (IDP)
  Foundation: Orchestrate with security, governance, and trust
  Join CEO Daniel Dines to meet the reimagined UiPath Platform™ for agentic automation
  Watch the launch
  Agentic Testing
  Agentic Testing
  Explore agentic testing for the enterprise
  Topics
  Unlock comprehensive testing for enterprises
  Enterprise applications
  Integrations
  SAP testing
  Test automation
  Products
  Take your testing to the next level with agentic testing
  Test Cloud
  Agent Builder for testers
  Autopilot™ for testers
  Explore agentic testing solutions for your enterprise
  Agentic testing is here. Catch all the buzz from our recent launch at the Agentic AI Summit.
  Watch the launch
Developers
- Developers
  - AgentPath
    Discover the developer's path to agentic automation.
    Academy
    Learn the skills of the future with free online automation training.
    Documentation
    Explore product documentation and guides.
- Learn
  Academy
  Academic Alliance
  AgentPath
  Certifications
  Digital credentials
  UiPath DevCon
  UiPath.ai
  Support
  Community
  Customer portal
  Customer support
  Documentation
  Forum
  Marketplace
  Latest
  Tech blog
  AI research
  Community blog
  Community Certification Framework Program 2025
  Start your expert training journey today and earn free UiPath certifications! Join to upskill and unlock your next professional level!
  Join now
Why agentic
- Why agentic
  - Customer stories
    Gather insight, read success stories, and more.
    Blog
    Get up close and personal with our people and products.
- Get started
  Agentic AI
  Agentic automation
  Agentic testing
  AI agents
  Enterprise AI
  Generative AI
  What is RPA
  Deep dive
  Events and webinars
  Customer stories
  Demos and videos
  White papers
  Analyst reports
  Blog
  See all resources
  Our partners
  UiPath Partner Network
  Find a partner
  Become a partner
  Business partner portal
  Technology partner portal
  Professional services
  See all partners
  Three days in Vegas to change the course of your business forever
  September 29 - October 2
  Secure your spot now

All

uipath.com

Forum

Docs

Close

Try UiPath Free

Product

AI Computer Vision

UIPATH AI COMPUTER VISION

Build resilient automations for dynamic interfaces and remote desktops

Try UiPath free

Join AI Summit

UiPath AI Computer Vision Demo – Automate in dynamic interfaces and across virtual desktops

Your robot needs to be able to 'see' everything you're automating.

AI Computer Vision enables all UiPath Robots to see every element of an interface. You can easily build a vision-based automation that will run on most virtual desktop interface (VDI) environments—regardless of framework or operating system.

Automation without limitation

With AI Computer Vision, your robot can see on-screen elements with human-like recognition.

AI Computer Vision brochure

Automation beyond selectors

Enable robots to recognize and interact with more on-screen fields and components—even Flash, Silverlight, PDFs, and images

Reliable on VDIs and desktops

Relieves issues with failure-prone image automation techniques and with selector-based targeting on desktops

Broad range of interface types

Includes VDI environments (Citrix, VMWare, Microsoft RDP, VNC, and others) for desktop and web applications

Intelligent, intuitive capabilities

Provides details, validation, and notifications about on-screen selections via an on-screen wizard. Uses the recorder to easily generate full vision-based automations

Run-time Auto-scroll Support

Easily automate scrollable content in webpages or apps using CV activities

Cross-platform capabilities

Automate for Windows, Linux, Android and other operating systems through remote desktops

Automation between VDI & non-VDI

Simplifies VDI-to-desktop automation by reducing necessary modifications

Multiple deployment options

Deploys via SaaS; available on-premises for Linux and Windows, or right from your desktop

Dynamic UI elements

Enables automations that include tables, drop-down lists, and checkbox elements

“We've been successful with UiPath products where other vendors couldn't do the job. Using AI Computer Vision, we can do it quickly and ensure it's consistent, without having to worry about constantly updating the automations.”

Dan Stoudt

Solutions Architect, ApprioHealth

See the case study

UiPath Academy Course

Ready to skill up with AI-based automations?

Get expert-level knowledge of how to utilize AI Computer Vision to build resilient automations

Start free training

Explore UiPath Academy

Why VDI automations need AI Computer Vision

RPA depends on a robot’s ability to see selectors on a webpage or computer interface.

But a virtual desktop interface (VDI) doesn’t present a traditional user interface; instead, it streams an image of a remote desktop.

VDIs make it virtually impossible for robots to easily or accurately recognize—much less interact with—the selectors they need to see.

But with AI Computer Vision, robots can “see” the elements they need—even through a VDI.

How does AI Computer Vision work?

UiPath robots' human-like vision is powered by a neural network with a combination of custom Screen OCR, text matching, and a multi-anchoring system.

Why RPA developers love AI Computer Vision

AI Computer Vision eliminates the reliance on selectors, while still maintaining familiar workflows for RPA developers. With an increase in visible screen elements, more automations are possible.

Also worth a read

Blog

Combining OCR With AI and RPA for Advanced Data Analysis

Unstructured data is everywhere, hiding in places like documents, audio files, videos, emails, images, and log files—the list goes on.

Read the blog

Documentation

AI Computer Vision Documentation

Check out the UIAutomation activities package and discover all the basic activities used for creating automation projects.

Read the docs

building the future one rpa robot at a time

Related asset

AI Computer Vision UiPath Forum

Join the AI Computer Vision Community Forum and get help with your automation projects , share feedback, report bugs or just drop us any questions you may have.

Give AI Computer Vision a try