top of page
Blue Argus Demo
10:56
Blue Argus Demo
Learn about Blue Sky Robotics' Computer Vision Package: Blue Argus!
Features: Houston
00:33
Features: Houston
Blue Sky Robotics' low-code automation platform
Features: Analytics Dashboard
00:56
Features: Analytics Dashboard
Blue Sky Robotics' control center analytics dashboard
Meet the "Hands" of your robot!
00:30
Meet the "Hands" of your robot!
Meet the "Hands" of your robot! 🤖 End effectors are how robotic arms interact with their world. We’re breaking down the standard UFactory gripper—the versatile go-to for most of our automation tasks. 🦾✨ #UFactory #xArm #Robotics #Automation #Engineering #TechTips #shorts Learn more at https://f.mtr.cool/jenaqtawuz

What Is 3D Machine Vision and Why Does It Matter for Robot Automation?

  • Apr 6
  • 5 min read

Updated: Apr 13

Standard cameras see the world as a flat image. They can tell you that an object is present, what color it is, and roughly where it sits in a frame. What they cannot tell you is how far away it is, how it is tilted, or how its shape varies from one unit to the next.


That limitation matters enormously in robot automation. A robot arm acting on 2D image data alone is working with an incomplete picture. Move a part a few millimeters, rotate it slightly, or let two items overlap in a bin, and the system breaks down. 3D machine vision solves this by adding depth to the equation, giving robots the spatial awareness they need to handle real-world variability reliably.


This post explains what 3D machine vision is, how the core technologies differ from one another, what it enables in practice, and how Blue Sky Robotics' cobots integrate with it.


What 3D Machine Vision Is


3D machine vision is the use of sensors and software to capture three-dimensional data about a scene, producing a spatial map that includes not just the position of objects in X and Y, but their depth along the Z axis and their full surface geometry.


The output is typically a point cloud: a dense collection of data points, each representing a location in 3D space. From that point cloud, vision software can calculate object position, orientation, dimensions, surface flatness, and the presence or absence of specific features. The robot controller receives those calculations as coordinates and acts on them.


The practical difference over 2D vision is significant. A 2D system can tell a robot there is a box at position X, Y. A 3D system tells the robot the box is at X, Y, Z, tilted 12 degrees clockwise, with its top surface 47 mm above the conveyor. The robot can then plan a precise, collision-free grasp accordingly.


The Main 3D Vision Technologies


Not all 3D machine vision works the same way. Three core technologies dominate industrial applications, each with distinct strengths.


Structured light projects a known pattern of light, usually a grid or fringe pattern, onto the scene. A camera captures how the pattern deforms across the object's surface, and software reconstructs the 3D geometry from that deformation. Structured light produces highly accurate, dense point clouds and handles a wide range of surface types. It is the technology behind most industrial-grade 3D cameras used in bin picking, palletizing, and precision inspection, including the Mech-Eye series from Mech-Mind.


Stereo vision uses two cameras offset from each other, the way human eyes are, to calculate depth from the disparity between the two images. Stereo cameras are compact, relatively affordable, and well suited for robotics research and lighter-duty applications. The Intel RealSense D435 and Luxonis OAK-D, both of which integrate cleanly with UFactory's xArm SDK, use stereo vision.


Time-of-Flight (ToF) sensors emit pulses of infrared light and measure how long they take to return from the scene. This gives a depth map in real time at high frame rates, making ToF a strong choice for fast-moving applications and mobile robots. Industrial ToF sensors now achieve millimeter-level accuracy and maintain reliable performance in dusty, bright, or low-light conditions common on production floors.


Each technology involves tradeoffs between cost, accuracy, speed, and robustness on difficult surfaces. The right choice depends on the specific application.


What 3D Machine Vision Enables


The applications where 3D vision makes a meaningful difference over 2D share a common thread: the robot needs to handle variability rather than just repeatability.


Bin picking. Parts arrive in a bin in random orientations, often touching or stacked. A 3D vision system maps the entire bin, identifies pickable parts, calculates each part's orientation, and plans a grasp that avoids collisions with neighboring items. This is not possible with 2D vision alone.


Flexible palletizing and depalletizing. Mixed pallet loads, deformed bags, angled cases, and varying stack heights all require 3D spatial awareness to handle reliably at speed. Without it, the robot needs every case to arrive in exactly the same position, which defeats the purpose of vision guidance.


Inline dimensional inspection. 3D vision systems can measure part dimensions to sub-millimeter accuracy, verify surface flatness, detect dents or deformations, and flag parts that fall outside tolerance, all at line speed without pulling parts off for manual gauging.


Precise assembly and alignment. For tasks where a component needs to be placed to within fractions of a millimeter, 3D feedback lets the robot correct for the small positional errors that accumulate in real production environments.


3D Machine Vision and Cobot Arms


Every arm in Blue Sky Robotics' lineup supports 3D vision integration through open APIs, Python SDKs, and ROS compatibility. The combination of an accurate cobot arm and a well-calibrated 3D vision system is what makes flexible, autonomous automation cells practical for small and mid-size manufacturers.

For entry-level vision applications, the UFactory Lite 6 ($3,500) paired with a stereo depth camera is the most accessible starting point. UFactory's open-source vision SDK includes ready-to-run integration examples for the Intel RealSense and Luxonis OAK-D cameras.


For production-grade bin picking and inspection, the Fairino FR5 ($6,999) and FR10 ($10,199) offer the payload and reach to work alongside industrial structured-light cameras, including the Mech-Eye series. Both arms support ROS, which gives you access to the broader open-source 3D vision ecosystem.


The total cost of an entry-level 3D vision cell, including robot arm, depth camera, mounting hardware, and open-source vision software, starts well under $5,000. A production-ready cell with a structured-light camera and industrial software runs higher, but remains a fraction of what traditional integrator-built systems cost.


Getting Started


Use our Cobot Selector to match an arm to your application, or the Automation Analysis Tool to model whether a 3D vision cell makes financial sense for your specific workflow. When you are ready to see a live demonstration, book a session and we will walk through a cell design built around your use case.

Browse our full UFactory lineup and Fairino cobots with current pricing. To learn more about computer vision software visit Blue Argus.


FAQ


What is the difference between 2D and 3D machine vision?

2D machine vision captures flat images and can detect the presence, position, and appearance of objects in a single plane. 3D machine vision adds depth data, giving robots full spatial awareness including distance, orientation, and surface geometry. Most robotic manipulation tasks require 3D vision to handle variability reliably.


Which 3D vision technology is best for bin picking?

Structured light cameras are the standard choice for bin picking because they produce dense, accurate point clouds even on challenging surfaces. Stereo vision cameras are a lower-cost option for simpler applications. Time-of-Flight sensors are better suited for fast-moving or large-area applications where real-time depth mapping matters more than micron-level accuracy.


How accurate is 3D machine vision?

Accuracy varies by technology and hardware. Industrial structured-light systems can achieve depth resolution as fine as 0.02 millimeters for precision inspection tasks. Stereo cameras used in cobot applications typically deliver accuracy in the low single-digit millimeter range, which is sufficient for most pick and place work.

bottom of page