Update (15.11.2024): DrivAerNet++ has been accepted to NeurIPS 2024! The full dataset will be released on Harvard Dataverse next week.
Update (11.09.2024): Due to the overwhelming interest and numerous inquiries from industry partners, we are excited to announce that we are now offering commercial licensing options for the DrivAerNet and DrivAerNet++ datasets. Please refer to the DrivAerNet/DrivAerNet++ Commercial Inquiry section.
Update (15.08.2024): DrivAerNet has been utilized in the IJCAI 2024 competition. All competitors' code and results are available in the open-source repository: Rapid Aerodynamic Drag Prediction for Arbitrary Vehicles in 3D Space.
Update (11.07.2024): DrivAerNet is now integrated into NVIDIA Modulus FIGConvUNet and AeroGraphNet
Our new preprint: DrivAerNet++ paper here
Video summary of DrivAerNet++ paper here
DrivAerNet paper: here
We present DrivAerNet++, the largest and most comprehensive multimodal dataset for aerodynamic car design. DrivAerNet++ comprises 8,000 diverse car designs modeled with high-fidelity computational fluid dynamics (CFD) simulations. The dataset includes diverse car configurations such as fastback, notchback, and estateback, with different underbody and wheel designs.
Design parameters for the generation of the DrivAerNet++ dataset. Several geometric parameters with significant impact on aerodynamics were selected and varied within a specific range. These parameter ranges were chosen to avoid values that are either difficult to manufacture or not aesthetically pleasing.
DrivAerNet++ covers all conventional car designs. The dataset encompasses various underbody and wheel designs to represent both internal combustion engine (ICE) and electric vehicles (EV). This extensive coverage allows for comprehensive studies on the impact of geometric variations on aerodynamic performance. By including a diverse set of car shapes, DrivAerNet++ facilitates the exploration of aerodynamic effects across different vehicle types, supporting both academic research and industrial applications.
Each 3D car geometry is parametrized with 26 parameters that completely describe the design. To create diverse car designs, we used two main morphing methods: morphing boxes and direct morphing. For a detailed description of the design parameters, their ranges, lower and upper bounds, please refer to the paper.
Dataset diversity and shape variation are crucial for developing robust deep learning models in aerodynamic car design. By providing a wide range of car shapes and configurations with high-fidelity CFD, DrivAerNet++ enables models to generalize better, supports exploration of unconventional designs, and enhances understanding of how geometric features impact aerodynamic performance.
- Parametric Models: Parametric models with tabular data, allowing extensive exploration of automotive design variations.
- Point Cloud Data: Point cloud data for each car design.
- 3D Car Meshes: Detailed 3D meshes of each car design, suitable for various machine learning applications.
- CFD Simulation Data: High-fidelity CFD simulation data for each car design, including 3D volumetric fields, surface fields, and streamlines.
- Aerodynamic Coefficients: Key aerodynamic metrics such as drag coefficient (Cd), lift coefficient (Cl), and more.
In addition to the CFD simulation data, our dataset includes detailed annotations for various car components (29 labels), such as wheels, side mirrors, and doors. These annotations are instrumental for a range of machine learning tasks, including classification, semantic segmentation, and object detection. The comprehensive labeling can also facilitate automated CFD meshing processes by providing precise information about different car components. By incorporating these labels, our dataset enhances the utility for developing and testing advanced algorithms in automotive design and analysis.
Running the high-fidelity CFD simulations for DrivAerNet++ required substantial computational resources. The simulations were conducted on the MIT Supercloud, leveraging parallelization across 60 nodes, totaling 2880 CPU cores, with each CFD case using 256 cores and 1000 GBs of memory. The full dataset requires 39 TB of storage space. The simulations took approximately 3 Γ 10βΆ CPU-hours to complete.
DrivAerNet++ supports a wide array of machine learning applications, including but not limited to:
- π Data-driven design optimization: Optimize car designs based on aerodynamic performance.
- π§ Generative AI: Train generative models to create new car designs based on performance or aesthetics.
- π― Surrogate models: Predict aerodynamic performance without full CFD simulations.
- π₯ CFD simulation acceleration: Speed up simulations using machine learning and multi-GPU techniques.
- π Reduced Order Modeling: Create data-driven reduced-order models for efficient & fast aerodynamic simulations.
- πΎ Large-Scale Data Handling: Efficiently store and manage large datasets from high-fidelity simulations.
- ποΈ Data Compression: Implement high-performance lossless compression techniques.
- π Part and shape classification: Classify car categories or components to enhance design analysis.
- π§ Automated CFD meshing: Automate the meshing process based on car components to streamline simulations.
DrivAerNet++ serves as a valuable benchmark dataset due to its size and diversity. It provides extensive coverage of various car designs and configurations, making it ideal for testing and validating machine learning models in aerodynamic design. We provide the train, test, and validation splits in the following folder: train_val_test_splits.
To replicate the results based on the parametric tabular data, please refer to the ParametricModels
directory and the code in AutoML_parametric.py
.
To replicate the results based on geometric deep learning methods, please refer to the code saved in the DeepSurrogates
directory.
Drag values for the 8k car designs can be found Here
DrivAerNet++ stands out as the largest and most comprehensive dataset in the field of car design.
We welcome contributions to improve the dataset or project. Please submit pull requests for review.
Maintained by the DeCoDE Lab at MIT. Report issues via GitHub issues.
- Tutorials: Link
To replicate the code and experiments from the first version of DrivAerNet, please refer to the folder: DrivAerNet_v1.
The code is distributed under the MIT License. The DrivAerNet/DrivAerNet++ dataset is distributed under the Creative Commons Attribution-NonCommercial (CC BY-NC) license. Full terms for the dataset license here.
If you are interested in the commercial use of the DrivAerNet or DrivAerNet++ datasets, please contact Mohamed Elrefaie (mohamed.elrefaie@mit.edu) and Faez Ahmed (faez@mit.edu) with the subject line: "DrivAerNet Commercial Inquiry".
To cite this work, please use the following reference:
@article{elrefaie2024drivaernet++,
title={DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks},
author={Elrefaie, Mohamed and Morar, Florin and Dai, Angela and Ahmed, Faez},
journal={arXiv preprint arXiv:2406.09624},
year={2024}
}
To cite the first version of DrivAerNet, please use the following reference:
@article{elrefaie2024drivaernet,
title={DrivAerNet: A Parametric Car Dataset for Data-Driven Aerodynamic Design and Graph-Based Drag Prediction},
author={Elrefaie, Mohamed and Dai, Angela and Ahmed, Faez},
journal={arXiv preprint arXiv:2403.08055},
year={2024}
}