
MIT Scientists Release Open-Source Photorealistic Simulator for Autonomous Driving


MIT researchers unveil the first open-source simulation engine capable of constructing realistic environments for deployable training and testing of autonomous vehicles.

Since they’ve proven to be productive test beds for safely trying out dangerous driving scenarios, hyper-realistic virtual worlds have been heralded as the best driving schools for autonomous vehicles (AVs). Tesla, Waymo, and other self-driving companies all rely heavily on data to power expensive, proprietary photorealistic simulators, because nuanced I-almost-crashed data usually isn’t the easiest or most desirable thing to gather and recreate.

VISTA 2.0 is an open-source simulation engine that can make realistic environments for training and testing self-driving cars. Credit: Image courtesy of MIT CSAIL

With this in mind, scientists from MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) created “VISTA 2.0,” a data-driven simulation engine where vehicles can learn to drive in the real world and recover from near-crash scenarios. What’s more, all of the code is being released open-source to the public.

“Today, only companies have software like the type of simulation environments and capabilities of VISTA 2.0, and this software is proprietary. With this release, the research community will have access to a powerful new tool for accelerating the research and development of adaptive robust control for autonomous driving,” says MIT Professor and CSAIL Director Daniela Rus, senior author of a paper about the research.


VISTA is a data-driven, photorealistic simulator for autonomous driving. It can simulate not just live video but lidar data and event cameras, and can also incorporate other simulated vehicles to model complex driving situations. VISTA is open source, and the code has been released publicly.

VISTA 2.0, which builds on the team’s previous model, VISTA, is fundamentally different from existing AV simulators because it is data-driven: it was built and photorealistically rendered from real-world data, which enables direct transfer to reality. While the initial iteration supported only single-car lane-following with one camera sensor, achieving high-fidelity data-driven simulation required rethinking the foundations of how different sensors and behavioral interactions can be synthesized.

Enter VISTA 2.0: a data-driven system that can simulate complex sensor types and massively interactive scenarios and intersections at scale. Using much less data than previous models, the team was able to train autonomous vehicles that could be substantially more robust than those trained on large amounts of real-world data.

“This is a massive jump in capabilities of data-driven simulation for autonomous vehicles, as well as the increase of scale and ability to handle greater driving complexity,” says Alexander Amini, CSAIL PhD student and co-lead author on two new papers, together with fellow PhD student Tsun-Hsuan Wang. “VISTA 2.0 demonstrates the ability to simulate sensor data far beyond 2D RGB cameras, but also extremely high dimensional 3D lidars with millions of points, irregularly timed event-based cameras, and even interactive and dynamic scenarios with other vehicles as well.”

The team of scientists was able to scale the complexity of the interactive driving tasks for things like overtaking, following, and negotiating, including multiagent scenarios in highly photorealistic environments.

Because most of our data (thankfully) is just run-of-the-mill, day-to-day driving, training AI models for autonomous vehicles involves hard-to-secure fodder of edge cases and strange, dangerous scenarios. Logically, we can’t crash into other cars just to teach a neural network how not to crash into other cars.

Recently, there’s been a shift away from more classic, human-designed simulation environments to those built up from real-world data. The latter have immense photorealism, but the former can easily model virtual cameras and lidars. With this paradigm shift, a key question has emerged: Can the richness and complexity of all of the sensors that autonomous vehicles need, such as lidar and event-based cameras that are more sparse, accurately be synthesized?

Lidar sensor data is much harder to interpret in a data-driven world: you’re effectively trying to generate brand-new 3D point clouds with millions of points from only sparse views of the world. To synthesize 3D lidar point clouds, the researchers took the data the car had collected, projected it into a 3D space built from the lidar returns, and then let a new virtual vehicle drive around locally from where the original vehicle was. Finally, with the help of neural networks, they projected all of that sensory information back into the frame of view of the new virtual vehicle.
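As a rough illustration of the geometric step described above (not the team’s actual pipeline, which also relies on neural networks to fill in occlusions and density), recorded lidar points can be lifted into a shared world frame and re-expressed in the frame of a virtual vehicle placed at a slightly different pose. The poses and function names below are purely illustrative:

```python
import numpy as np

def pose_to_matrix(x, y, yaw):
    """Build a 4x4 homogeneous transform for a planar pose (x, y, heading)."""
    c, s = np.cos(yaw), np.sin(yaw)
    T = np.eye(4)
    T[:3, :3] = np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
    T[:2, 3] = [x, y]
    return T

def reproject_lidar(points_ego, recorded_pose, virtual_pose):
    """Re-express a lidar sweep recorded in the logging car's frame
    in the frame of a virtual car placed at a nearby pose.

    points_ego    : (N, 3) points in the recording vehicle's frame
    recorded_pose : (x, y, yaw) of the recording vehicle in the world frame
    virtual_pose  : (x, y, yaw) of the virtual vehicle in the world frame
    """
    T_world_rec = pose_to_matrix(*recorded_pose)
    T_world_virt = pose_to_matrix(*virtual_pose)
    # Lift to homogeneous coordinates, then chain:
    # recorded ego frame -> world frame -> virtual ego frame.
    pts_h = np.hstack([points_ego, np.ones((points_ego.shape[0], 1))])
    pts_virtual = (np.linalg.inv(T_world_virt) @ T_world_rec @ pts_h.T).T
    return pts_virtual[:, :3]

# Example: the virtual car sits 1.5 m to the left of the log and is turned 5 degrees.
sweep = np.random.uniform(-30.0, 30.0, size=(100_000, 3))  # stand-in for a real lidar sweep
synthetic_view = reproject_lidar(sweep, recorded_pose=(0.0, 0.0, 0.0),
                                 virtual_pose=(0.0, 1.5, np.deg2rad(5.0)))
```

The re-projected cloud still has holes wherever the original sensor never saw the scene, which is where the learned rendering comes in.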

Together with the simulation of event-based cameras, which capture changes at rates upward of thousands of events per second, the simulator was capable of not only simulating this multimodal information, but also doing so in real time. This makes it possible not only to train neural nets offline, but also to test them online on the car in augmented-reality setups for safe evaluation. “The question of if multisensor simulation at this scale of complexity and photorealism was possible in the realm of data-driven simulation was very much an open question,” says Amini.
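Event cameras report asynchronous per-pixel brightness changes rather than whole frames. A common, generic way to synthesize such a stream from rendered video (offered here only as an illustrative sketch, not necessarily VISTA 2.0’s exact method) is to emit an event wherever the change in log intensity between consecutive frames exceeds a contrast threshold; the threshold below is made up for the example:

```python
import numpy as np

CONTRAST_THRESHOLD = 0.2  # illustrative value; real sensors have per-pixel thresholds

def frames_to_events(prev_frame, next_frame, t_prev, t_next,
                     threshold=CONTRAST_THRESHOLD):
    """Approximate an event stream from two grayscale frames in [0, 1].

    Each event is (x, y, timestamp, polarity), with polarity +1 where the pixel
    brightened and -1 where it darkened, fired only if the log-intensity change
    exceeds the contrast threshold.
    """
    eps = 1e-3  # avoid log(0) on black pixels
    d_log = np.log(next_frame + eps) - np.log(prev_frame + eps)
    ys, xs = np.nonzero(np.abs(d_log) >= threshold)
    polarity = np.sign(d_log[ys, xs])
    # Scatter event timestamps uniformly across the inter-frame interval.
    ts = np.random.uniform(t_prev, t_next, size=xs.shape)
    order = np.argsort(ts)
    return np.stack([xs[order], ys[order], ts[order], polarity[order]], axis=1)

# Example: two rendered frames 10 ms apart.
frame0 = np.random.rand(480, 640).astype(np.float32)
frame1 = np.clip(frame0 + 0.05 * np.random.randn(480, 640), 0.0, 1.0).astype(np.float32)
events = frames_to_events(frame0, frame1, t_prev=0.000, t_next=0.010)
print(events.shape)  # (num_events, 4): x, y, timestamp, polarity
```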

With that, the driving school becomes a party. In the simulation, you can move around, use different types of controllers, simulate different types of events, create interactive scenarios, and just drop in brand-new vehicles that weren’t even in the original data. The team tested lane following, lane turning, car following, and dicier scenarios like static and dynamic overtaking (seeing obstacles and moving around them so you don’t collide). With multiagent simulation, real and simulated agents interact, and new agents can be dropped into the scene and controlled any which way.
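The released code exposes this kind of scenario composition through its own Python API; the loop below is only a library-agnostic sketch of the pattern (every class and function name here is hypothetical), in which an extra virtual vehicle with a simple controller is dropped in front of the ego car and both are stepped together:

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict, Tuple

@dataclass
class VirtualAgent:
    """Minimal stand-in for a vehicle dropped into a replayed scene."""
    x: float
    y: float
    yaw: float
    speed: float
    policy: Callable[[Dict[str, float]], Tuple[float, float]]  # obs -> (steering rate, accel)

    def step(self, observation, dt=0.1):
        steering_rate, accel = self.policy(observation)
        self.speed = max(0.0, self.speed + accel * dt)
        self.yaw += steering_rate * dt
        self.x += self.speed * math.cos(self.yaw) * dt
        self.y += self.speed * math.sin(self.yaw) * dt

def hold_course(observation):
    """Trivial controller: keep heading and speed (placeholder for a learned policy)."""
    return 0.0, 0.0

# Ego car replayed from the log, plus a newly injected slower lead vehicle to overtake.
ego = VirtualAgent(x=0.0, y=0.0, yaw=0.0, speed=10.0, policy=hold_course)
lead = VirtualAgent(x=20.0, y=0.0, yaw=0.0, speed=6.0, policy=hold_course)

for _ in range(30):  # 3 seconds of simulated driving at dt = 0.1 s
    observation = {"gap_to_lead": lead.x - ego.x}
    ego.step(observation)
    lead.step(observation)

print(f"Gap to lead vehicle: {lead.x - ego.x:.1f} m")
```

In VISTA 2.0 itself, the injected agents’ camera, lidar, and event views are rendered photorealistically from the recorded data rather than from an analytic model like this one.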

Taking their full-scale car out into the “wild” — a.k.a. Devens, Massachusetts — the team saw immediate transferability of results, with both failures and successes. They were also able to demonstrate the bodacious, magic word of self-driving car models: “robust.” They showed that AVs, trained entirely in VISTA 2.0, were so robust in the real world that they could handle that elusive tail of challenging failures.

One guardrail humans rely on that can’t yet be simulated is human emotion: the friendly wave, nod, or blinker switch of acknowledgment. These are the kinds of nuances the team wants to implement in future work.

“The central algorithm of this research is how we can take a dataset and build a completely synthetic world for learning and autonomy,” says Amini. “It’s a platform that I believe one day could extend in many different axes across robotics. Not just autonomous driving, but many areas that rely on vision and complex behaviors. We’re excited to release VISTA 2.0 to help enable the community to collect their own datasets and convert them into virtual worlds where they can directly simulate their own virtual autonomous vehicles, drive around these virtual terrains, train autonomous vehicles in these worlds, and then can directly transfer them to full-sized, real self-driving cars.”

Reference: “VISTA 2.0: An Open, Data-driven Simulator for Multimodal Sensing and Policy Learning for Autonomous Vehicles” by Alexander Amini, Tsun-Hsuan Wang, Igor Gilitschenski, Wilko Schwarting, Zhijian Liu, Song Han, Sertac Karaman and Daniela Rus, 23 November 2021, arXiv:2111.12083 [cs.RO].

Amini and Wang wrote the paper alongside Zhijian Liu, MIT CSAIL PhD student; Igor Gilitschenski, assistant professor in computer science at the University of Toronto; Wilko Schwarting, AI research scientist and MIT CSAIL PhD ’20; Song Han, associate professor at MIT’s Department of Electrical Engineering and Computer Science; Sertac Karaman, associate professor of aeronautics and astronautics at MIT; and Daniela Rus, MIT professor and CSAIL director. The researchers presented the work at the IEEE International Conference on Robotics and Automation (ICRA) in Philadelphia.

This work was supported by the National Science Foundation and Toyota Research Institute. The team acknowledges the support of NVIDIA with the donation of the Drive AGX Pegasus.

By Rachel Gordon, MIT CSAIL
