At SimTech, we provide and actively develop an infrastructure and tools for the management of research data. We are integrating research data management into research, and conducting research relating to research data management. Our vision is to make available research data and software to every published article, make science reproducable and promote the transfer into practice.Therefore we supply the right tools for you to use our research data and software for your own projects:
Our research
The Research Data and Software Management team at SimTech initiates data and software management projects in close collaboration with researchers to enable individual RDM solutions as well as an integration to DaRUS. In the following, active as well as finished projects are listed.
Highlights
Fair-FLEXI
Description
For reliable simulation of complex flow problems, comparative data from good simplified benchmark problems are absolutely necessary for validation and verification of the computational program and modeling. Only in this way can the simulation of complex flows be trusted at all.
It is true that there are benchmark simulations for very basic flow problems and associated databases. These are often well maintained, but they are far away from a real application and most of them have reached a certain age. In addition to their application as validation data, there is also a great need for data from this side - to train ML models - due to the strong research activities in machine learning.
In this PN-1 project, benchmark problems for turbulent flows and their modeling are designed and carried out by using FLEXI. The SimTech RDM-Team assists and develops in close collaboration with the Institute of Aerodynamics and Gas Dynamics solutions to efficiently manage the big data that is generated in the process of CFD-Simulations. The goal is to provide a comprehensive yet easily accessible resource to validate simulation runs, but also to train.
Data Version Control for Simulation
Description
Simulation meta-data logging is a critical aspect that requires a well-structured approach to ensure rich metadata descriptions and seamless workflows for our applications. At SimTech, we have found a solution to this problem by implementing Data Version Control in simulation sciences and the intersection with Machine Learning. This software provides a well-designed backend structure and that combines and organizes input parameters, quality assessment metrics, and the model itself, while providing multiple interfaces for interactions.
We have already successfully applied this approach to molecular dynamics problems in chemistry and biochemistry, as well as in modeling biocatalytic systems. Our objective is to continue to apply this concept and software to our research and make it accessible across the simulation sciences community. With Data Version Control, we are confident that we can achieve our goals of streamlining workflows and producing rich metadata descriptions for our applications.
Project members
Fabian Zills - ICP Stuttgart
Marcelle Spera IBTB Stuttgart
Jan Range - EXC2075 RDM-Team Stuttgart
Data Modelling
Description
Deep eutectic solvents (DES) are promising solvents in biocatalysis, because they are designable, renewable, biodegradable, and cheap, and thus are an alternative to organic solvents for enzyme-catalyzed reactions that involve hydrophobic substrates. Their thermophysical properties (density, viscosity, thermodynamic activities) are obtained by molecular simulation and by experiment and are stored in the standardized data exchange format ThermoML.
The goal of the project is to establish an API as well as REST interface to write ThermoML documents by either converting existing CML documents or reading from modelling tools. Next, the data model will be projected onto DaRUS to make use of the federated database system to share ThermoML across multiple installation, but also to provide reasearchers from various fields the opportunity to combnine their metadata with ThermoML.
Publications
2024
- S. Malzacher et al., “The STRENDA Biocatalysis Guidelines for cataloguing metadata,” Nature Catalysis, vol. 7, no. 12, Art. no. 12, 2024, doi: 10.1038/s41929-024-01261-x.
- B. Flemisch et al., “Research Data Management in Simulation Science: Infrastructure, Tools, and Applications,” Datenbank-Spektrum, vol. 24, pp. 97–105, 2024, doi: 10.1007/s13222-024-00475-4.
- S. Hermann and J. Fehr, “Documenting Research in Simulation Science to Enhance Understanding for Reusability,” Royal Society Open Science, vol. 11, no. 10, Art. no. 10, Oct. 2024, doi: https://doi.org/10.1098/rsos.240776.
2023
- S. Lauterbach et al., “EnzymeML: seamless data flow and modeling of enzymatic data,” Nature Methods, vol. 20, no. 3, Art. no. 3, 2023, doi: 10.1038/s41592-022-01763-1.
- T. Giess, S. Itzigehl, J. Range, R. Schömig, J. R. Bruckner, and J. Pleiss, “FAIR and scalable management of small-angle X-ray scattering data,” Journal of Applied Crystallography, vol. 56, no. 2, Art. no. 2, Apr. 2023, doi: 10.1107/S1600576723001577.
2022
- C. Brecher et al., “Commitment zu aktivem Daten- und -softwaremanagement in großen Forschungsverbünden,” Bausteine Forschungsdatenmanagement, vol. 1, 2022, doi: 10.17192/BFDM.2022.1.8412.
- S. Hermann and J. Fehr, “Documenting research software in engineering science,” Scientific Reports, vol. 12, no. 1, Art. no. 1, 2022.
- M. Gültig, J. P. Range, B. Schmitz, and J. Pleiss, “Integration of Simulated and Experimentally Determined Thermophysical Properties of Aqueous Mixtures by ThermoML,” Journal of Chemical & Engineering Data, vol. 67, no. 11, Art. no. 11, 2022, doi: 10.1021/acs.jced.2c00391.
- J. Range et al., “EnzymeML—a data exchange format for biocatalysis and enzymology,” The FEBS Journal, vol. 289, no. 19, Art. no. 19, Oct. 2022, doi: https://doi.org/10.1111/febs.16318.
2020
- B. Flemisch et al., “Umgang mit Forschungssoftware an der Universität Stuttgart,” Universität Stuttgart, 2020. doi: 10.18419/OPUS-11178.
- X. Xu, J. Range, G. Gygli, and J. Pleiss, “Analysis of Thermophysical Properties of Deep Eutectic Solvents by Data Integration,” Journal of Chemical & Engineering Data, vol. 65, pp. 1172–1179, 2020, doi: https://doi.org/10.1021/acs.jced.9b00555.
Our network
Empowering researchers
RDM at SimTech offers the possibility to initiate data and software management projects in close collaboration with researchers. We are part of a large research data network, contribute to various working groups and offer workshops and seminars. The team is also closely involved in FoKUS, the Competence Center for Research Data Management from the University of Stuttgart, to support researchers in publishing data.
Workshops and seminars
Dataverse
- PyDataverse Working Group
Leading the maintenance and extension of PyDataverse in cooperation with IQSS, Harvard University. - pyDataverse
Presentation of the pyDataverse Generator, Dataverse Community Call, December 2023 - Easy Dataverse
Presentation of the tool Easy Dataverse, Dataverse IQSS Lunch, March 2023 - Easy Review
Presentation of the tool EasyReview, Dataverse Community Call, January 2023 and January 2024 - Rust Dataverse
Presentation of the Rust Dataverse library, Dataverse Community Call, August 2024
SIGDIUS seminars
- SIGDIUS seminars
The Special Interest Group Data Infrastructure (SIGDIUS) provides a forum for interested working groups that want to establish or further develop an RDM infrastructure at the working group or institute level.
Our Team
News & Events
December 2024
November 2024
August 2024
July 2024
June 2024
May 2024
April 2024
March 2024
February 2024
January 2024
December 2023
November 2023
October 2023
July 2023
Wouldn’t it be great if we would be able to combine any dataset with any other dataset we would want to?
European Commission, Reproducibility of scientific results in the EU
SimTech RDM committee
Research Data Management Team
Pfaffenwaldring 5a, 70569 Stuttgart