UiPath AI Computer Vision Now Available in Public Preview

Solutions
- Solutions
  - Agentic use cases
    Watch agentic automation use case videos and demos
    Webinars
    Learn best practices from industry experts.
    Customer stories
    Gather insight, read success stories, and more.
- By industry
  Banking & financial services
  Healthcare
  Insurance
  Public sector
  Manufacturing
  All industries
  By department
  Supply chain
  Finance & accounting
  HR
  QA / Testing
  Contact center
  All departments
  By technology
  Peak.ai
  Coded agents
  Microsoft
  SAP
  Agentic testing
  Technology solutions overview
  Agentic AI Summit recap: Embracing AI transformation enterprise-wide
  Read the blog
Platform
- - Agentic Automation
    Discover the place where agents think, robots do, and people lead.
  - Agentic Testing
    Explore agentic testing for the enterprise
  - Explore the Platform
  - View all products
  - Pricing
  - Support
- Agentic Automation
  Agentic Automation
  Discover the place where agents think, robots do, and people lead.
  Processes
  Model and orchestrate agents, robots, and people end-to-end
  Agentic orchestration
  Business process management (BPM)
  Process intelligence
  Workflows
  Plan, build, and deploy automated workflows
  Agentic & robotic workflows
  Human-in-the-loop
  Agent evaluation
  Activities
  Empower agents and robots with AI, API, and rules-based tools
  Agentic activities
  RPA & API
  Forms & apps
  Intelligent Document Processing (IDP)
  Foundation: Orchestrate with security, governance, and trust
  Join CEO Daniel Dines to meet the reimagined UiPath Platform™ for agentic automation
  Watch the launch
  Agentic Testing
  Agentic Testing
  Explore agentic testing for the enterprise
  Topics
  Unlock comprehensive testing for enterprises
  Enterprise applications
  Integrations
  SAP testing
  Test automation
  Products
  Take your testing to the next level with agentic testing
  Test Cloud
  Agent Builder for testers
  Autopilot™ for testers
  Explore agentic testing solutions for your enterprise
  Agentic testing is here. Catch all the buzz from our recent launch at the Agentic AI Summit.
  Watch the launch
Developers
- Developers
  - AgentPath
    Discover the developer's path to agentic automation.
    Academy
    Learn the skills of the future with free online automation training.
    Documentation
    Explore product documentation and guides.
- Learn
  Academy
  Academic Alliance
  AgentPath
  Certifications
  Digital credentials
  UiPath DevCon
  UiPath.ai
  Support
  Community
  Customer portal
  Customer support
  Documentation
  Forum
  Marketplace
  Latest
  Tech blog
  AI research
  Community blog
  Community Certification Framework Program 2025
  Start your expert training journey today and earn free UiPath certifications! Join to upskill and unlock your next professional level!
  Join now
Why agentic
- Why agentic
  - Customer stories
    Gather insight, read success stories, and more.
    Blog
    Get up close and personal with our people and products.
- Get started
  Agentic AI
  Agentic automation
  Agentic testing
  AI agents
  Enterprise AI
  Generative AI
  What is RPA
  Deep dive
  Events and webinars
  Customer stories
  Demos and videos
  White papers
  Analyst reports
  Blog
  See all resources
  Our partners
  UiPath Partner Network
  Find a partner
  Become a partner
  Business partner portal
  Technology partner portal
  Professional services
  See all partners
  Three days in Vegas to change the course of your business forever
  September 29 - October 2
  Secure your spot now

All

uipath.com

Forum

Docs

Close

Try UiPath Free

UiPath Blog

Automation

Digital Transformation

Industry Solutions

Product

RPA

Community Blog

Resource Center

Newsroom

Blog

Product

The New UiPath AI Computer Vision Is Now in Public Preview

Cosmin Voicu

•February 19, 2019

UPDATE: This blog was originally published in February 2019 to announce the public preview. We're excited to share that UiPath AI Computer Vision is now publicly available. You can learn more on our AI Computer Vision page.

One of our driving tenants on the Artificial Intelligence (AI) team at UiPath is something we call “Pragmatic AI” – teaching our Robots AI skills to solve complex problems for our customers in the most effective way.

Reliably automating Virtual Desktop Environments (VDIs) such as Citrix, VMware, VNC, and Windows Remote Desktop has always been a tough nut to crack in Robotic Process Automation (RPA). There are hundreds of thousands of businesses globally using VDIs and virtualization in enterprises is growing by the day.

Finding simple solutions to complex problems is certainly not an easy task. Good things come to those who wait, so let me just say I’m super excited to announce the public preview of what we believe is a true breakthrough for the RPA industry: the new UiPath AI Computer Vision capability built on deep learning.

The challenge with automating VDI environments

The specific challenge when trying to automate VDI environments is RPA’s traditional reliance on selectors. These selectors work using the underlying properties of user interface (UI) elements and work great for identifying application elements (such as buttons, text-fields, etc.) when automating native desktop systems. However, this method completely breaks down when trying to automate the same software in a VDI environment.

The reason for the breakdown is that VDIs stream an image of the remote desktop, similar to how video-streaming services like Netflix do. There are simply no selectors to be identified in “video.”

Attempts to solve this challenge have used optical character recognition (OCR) and image matching, but even those attempts have led to reliability and maintenance issues, because even minor changes in the UI break the automations.

There has simply been no solution available in the market to enable effective automation of VDI environments. Until now.

Seamless automation across desktop and VDI environments

UiPath solves the challenges discussed above with an AI Computer Vision algorithm that enables human-like recognition of user interfaces, using a mix of AI, OCR , text fuzzy-matching, and an anchoring system to tie it all together.

This allows our Robots to “see” the screen and visually identify all the elements, rather than relying on their hidden properties, IDs, and other metadata.

In fact, this new AI Computer Vision capability isn’t just limited to VDI environments. It can also recognize elements across a wide range of cases where traditional UI automation methods struggle, including SAP, Flash, Silverlight, PDFs, and even images.

Unlike traditional image automation, our AI Computer Vision does not rely on image matching. As a result, it’s highly resilient to interface changes including color, font, size, and resolution changes. The AI Computer Vision handles all these changes at once and still finds the intended target.

See a demo of the new AI Computer Vision in action:

UiPath-AI-Computer-Vision-Now-Available-in-Public-Preview-|-UiPath-Video-3

AI Computer Vision - The path forward

Granted, this whole technology is still in its infancy, and we have big plans for it. Throughout the year we’ll add a few more usability improvements to this current version, with support for recording full automations using AI Computer Vision, then (and we’re really excited about this) in V2 we’ll bring a whole new level of capability and robustness.

We also have a kind request, because with your help, we can make AI Computer Vision both faster and better: please use the Report functionality in the wizard to alert us to any gaps. It's the best way to make it smarter and better for your needs.

Check out these additional resources to learn more:

Download the preview today
Watch the training video on UiPath Academy or YouTube
Connect with us directly on the UiPath Forum and let us know how our new AI Computer Vision algorithm is working for you and your organization

Topics:

Artificial Intelligence (AI)Product Releases

Cosmin Voicu

Principal Product Manager, UiPath