Computer vision

With a thought bubble containing a to-do list over its head, a robotic arm begins to complete kitchen tasks in three panels: opening a microwave, opening and closing a cupboard door, and placing a pot on a stove.

Multiple AI models help robots execute complex plans more transparently

A multimodal system uses models trained on language, vision, and action data to help robots develop and execute plans for household, construction, and manufacturing tasks.

January 8, 2024

Three rows show how a cat’s tail is animated in three ways. The rows are labeled, from top to bottom: Dirichlet, Weighted TV, and ARAP. The rows show very similar versions of the cat’s tail enclosed in a polygonal mesh marking animation points.

A flexible solution to help artists improve animation

This new method draws on 200-year-old geometric foundations to give artists control over the appearance of animated characters.

December 20, 2023

3 by 6 grid of photos. Rows depict tennis rackets, measuring tape/rulers, and hammers. Along the bottom are time measurements from 17 milliseconds to 10 seconds, and the objects are increasingly harder to recognize from left to right.

Image recognition accuracy: An unseen challenge confounding today’s AI

“Minimum viewing time” benchmark gauges image recognition complexity for AI systems by measuring the time needed for accurate human identification.

December 15, 2023

Justin Solomon stands in front of a wall and is lit with dramatic pink and blue light, with grid-like shadows on the wall.

A computer scientist pushes the boundaries of geometry

Justin Solomon applies modern geometric techniques to solve problems in computer vision, machine learning, statistics, and beyond.

December 12, 2023

Illustration of a disembodied brain with glowing tentacles reaching out to different squares of images at the ends

Synthetic imagery sets new bar in AI training efficiency

MIT CSAIL researchers innovate with synthetic imagery to train AI, paving the way for more efficient and bias-reduced machine learning.

November 20, 2023

Rendering shows a six-legged robot, standing against a black background, in the process of being 3D-printed. Near the back of the robot, floating black spheres are assembled and then cured by a blue UV light beaming down from above. On top, cameras point down to scan the action.

This 3D printer can watch itself fabricate objects

Computer vision enables contact-free 3D printing, letting engineers print with high-performance materials they couldn’t use before.

November 15, 2023

Two by two grid of images. At top left, a large robotic arm with objects it can pick up, including a white doll, a banana, multicolored building blocks, and green grapes. The other three panels show the same demonstration setup in different heat signatures.

Using language to give robots a better grasp of an open-ended world

By blending 2D images with foundation models to build 3D feature fields, a new MIT method helps robots understand and manipulate nearby objects with open-ended language prompts.

November 2, 2023

Hundreds of colorful dots represent 16 types of bikes. There are 16 bike icons that point to various clusters.

To excel at engineering design, generative AI must learn to innovate, study finds

AI models that prioritize similarity falter when asked to design something completely new.

October 19, 2023

View of two researchers down a dark corridor in a data center, making adjustments to hardware on large racks lining both walls.

New tools are available to help reduce the energy that AI models devour

Amid the race to make AI bigger and better, Lincoln Laboratory is developing ways to reduce power, train efficiently, and make energy use transparent.

October 5, 2023

Conceptual image of an open box that has sparks flying out on a black background. The lid of the box resembles that of a laptop computer screen.

From physics to generative AI: An AI model for advanced pattern generation

Inspired by physics, a new generative model, PFGM++, outperforms diffusion models in image generation.

September 27, 2023

Illustration of three human-like individuals in suits, with heads resembling computers and wires, sitting around a table

Multi-AI collaboration helps reasoning and factual accuracy in large language models

Researchers use multiple AI models to collaborate, debate, and improve their reasoning abilities to advance the performance of LLMs while increasing accountability and factual accuracy.

September 18, 2023

A silhouette of a running child is next to three stick-figure-like people made of colorful lines, with balls for joints.

A pose-mapping technique could remotely evaluate patients with cerebral palsy

The machine-learning method works on most mobile devices and could be expanded to assess other motor disorders outside of the doctor’s office.

September 14, 2023

Rendering shows a figure standing in castle ruins, and a wooden box in foreground.

Helping computer vision and language models understand what they see

Researchers use synthetic data to improve a model’s ability to grasp conceptual information, which could enhance automatic captioning and question-answering systems.

September 13, 2023

A busy city intersection where a bus is colored blue and pedestrians are colored red.

AI model speeds up high-resolution computer vision

The system could improve image quality in video streaming or help autonomous vehicles identify road hazards in real time.

September 12, 2023

Two side-by-side black and white images of the same brain scan. The one on the left is blurry, and the label "motion-corrupted" appears above it; the one on the right is clearer, and is labeled "motion-corrected."

MIT researchers combine deep learning and physics to fix motion-corrupted MRI scans

The challenge involves more than just a blurry JPEG. Fixing motion artifacts in medical imaging requires a more sophisticated approach.

August 17, 2023

MIT News | Massachusetts Institute of Technology
