Research Data and Software Management

At SimTech, we provide and actively develop an infrastructure and tools for the management of research data. We are integrating research data management into research, and conducting research relating to research data management. Our vision is to make available research data and software to every published article, make science reproducable and promote the transfer into practice.Therefore we supply the right tools for you to use our research data and software for your own projects:

Our research

The Research Data and Software Management team at SimTech initiates data and software management projects in close collaboration with researchers to enable individual RDM solutions as well as an integration to DaRUS. In the following, active as well as finished projects are listed.

Highlights

Description

For reliable simulation of complex flow problems, comparative data from good simplified benchmark problems are absolutely necessary for validation and verification of the computational program and modeling. Only in this way can the simulation of complex flows be trusted at all.

It is true that there are benchmark simulations for very basic flow problems and associated databases. These are often well maintained, but they are far away from a real application and most of them have reached a certain age. In addition to their application as validation data, there is also a great need for data from this side - to train ML models - due to the strong research activities in machine learning.

In this PN-1 project, benchmark problems for turbulent flows and their modeling are designed and carried out by using FLEXI. The SimTech RDM-Team assists and develops in close collaboration with the Institute of Aerodynamics and Gas Dynamics solutions to efficiently manage the big data that is generated in the process of CFD-Simulations. The goal is to provide a comprehensive yet easily accessible resource to validate simulation runs, but also to train.

Description

Simulation meta-data logging is a critical aspect that requires a well-structured approach to ensure rich metadata descriptions and seamless workflows for our applications. At SimTech, we have found a solution to this problem by implementing Data Version Control in simulation sciences and the intersection with Machine Learning. This software provides a well-designed backend structure and that combines and organizes input parameters, quality assessment metrics, and the model itself, while providing multiple interfaces for interactions.

We have already successfully applied this approach to molecular dynamics problems in chemistry and biochemistry, as well as in modeling biocatalytic systems. Our objective is to continue to apply this concept and software to our research and make it accessible across the simulation sciences community. With Data Version Control, we are confident that we can achieve our goals of streamlining workflows and producing rich metadata descriptions for our applications.

Project members
Fabian Zills - ICP Stuttgart
Marcelle Spera IBTB Stuttgart
Jan Range - EXC2075 RDM-Team Stuttgart

Description

Deep eutectic solvents (DES) are promising solvents in biocatalysis, because they are designable, renewable, biodegradable, and cheap, and thus are an alternative to organic solvents for enzyme-catalyzed reactions that involve hydrophobic substrates. Their thermophysical properties (density, viscosity, thermodynamic activities) are obtained by molecular simulation and by experiment and are stored in the standardized data exchange format ThermoML.

The goal of the project is to establish an API as well as REST interface to write ThermoML documents by either converting existing CML document s or reading from modelling tools. Next, the data model will be projected onto DaRUS to make use of the federated database system to share ThermoML across multiple installation, but also to provide reasearchers from various fields the opportunity to combnine their metadata with ThermoML.

Publications

2024
1. S. Malzacher et al., “The STRENDA Biocatalysis Guidelines for cataloguing metadata,” Nature Catalysis, vol. 7, no. 12, Art. no. 12, 2024, doi: 10.1038/s41929-024-01261-x.
2. B. Flemisch et al., “Research Data Management in Simulation Science: Infrastructure, Tools, and Applications,” Datenbank-Spektrum, vol. 24, pp. 97–105, 2024, doi: 10.1007/s13222-024-00475-4.
3. S. Hermann and J. Fehr, “Documenting Research in Simulation Science to Enhance Understanding for Reusability,” Royal Society Open Science, vol. 11, no. 10, Art. no. 10, Oct. 2024, doi: https://doi.org/10.1098/rsos.240776.
2023
1. S. Lauterbach et al., “EnzymeML: seamless data flow and modeling of enzymatic data,” Nature Methods, vol. 20, no. 3, Art. no. 3, 2023, doi: 10.1038/s41592-022-01763-1.
2. T. Giess, S. Itzigehl, J. Range, R. Schömig, J. R. Bruckner, and J. Pleiss, “FAIR and scalable management of small-angle X-ray scattering data,” Journal of Applied Crystallography, vol. 56, no. 2, Art. no. 2, Apr. 2023, doi: 10.1107/S1600576723001577.
2022
1. C. Brecher et al., “Commitment zu aktivem Daten- und -softwaremanagement in großen Forschungsverbünden,” Bausteine Forschungsdatenmanagement, vol. 1, 2022, doi: 10.17192/BFDM.2022.1.8412.
2. S. Hermann and J. Fehr, “Documenting research software in engineering science,” Scientific Reports, vol. 12, no. 1, Art. no. 1, 2022.
3. M. Gültig, J. P. Range, B. Schmitz, and J. Pleiss, “Integration of Simulated and Experimentally Determined Thermophysical Properties of Aqueous Mixtures by ThermoML,” Journal of Chemical & Engineering Data, vol. 67, no. 11, Art. no. 11, 2022, doi: 10.1021/acs.jced.2c00391.
4. J. Range et al., “EnzymeML—a data exchange format for biocatalysis and enzymology,” The FEBS Journal, vol. 289, no. 19, Art. no. 19, Oct. 2022, doi: https://doi.org/10.1111/febs.16318.
2020
1. B. Flemisch et al., “Umgang mit Forschungssoftware an der Universität Stuttgart,” Universität Stuttgart, 2020. doi: 10.18419/OPUS-11178.
2. X. Xu, J. Range, G. Gygli, and J. Pleiss, “Analysis of Thermophysical Properties of Deep Eutectic Solvents by Data Integration,” Journal of Chemical & Engineering Data, vol. 65, pp. 1172–1179, 2020, doi: https://doi.org/10.1021/acs.jced.9b00555.

Our network

World

Europe

Germany

Stuttgart

Empowering researchers

RDM at SimTech offers the possibility to initiate data and software management projects in close collaboration with researchers. We are part of a large research data network, contribute to various working groups and offer workshops and seminars. The team is also closely involved in FoKUS, the Competence Center for Research Data Management from the University of Stuttgart, to support researchers in publishing data.

Workshops and seminars

">

Dataverse

PyDataverse Working Group
Leading the maintenance and extension of PyDataverse in cooperation with IQSS, Harvard University.
pyDataverse
Presentation of the pyDataverse Generator, Dataverse Community Call, December 2023
Easy Dataverse
Presentation of the tool Easy Dataverse, Dataverse IQSS Lunch, March 2023
Easy Review
Presentation of the tool EasyReview, Dataverse Community Call, January 2023 and January 2024
Rust Dataverse
Presentation of the Rust Dataverse library, Dataverse Community Call, August 2024

SIGDIUS seminars

SIGDIUS seminars
The Special Interest Group Data Infrastructure (SIGDIUS) provides a forum for interested working groups that want to establish or further develop an RDM infrastructure at the working group or institute level.

Our Team

Sibylle Hermann

Data and Software Steward: draw up strategies, workflows, standards and best practices for RDM, developing research software

Jan Range

Research Software Engineering: Dataverse Tools, Metadata harvesting and tracking, RDM Project Management

Sarbani Roy

Currently on maternity leave

Research Software Engineering: containerising, reproducibility, metadata harvesting and define metadata standards

Fangfang Wang

Research Software Engineering: reviewing dataset, managing Dataverse collection, developing metadata harvester

News & Events

January 2025

SIGDIUS Seminar - online - 2pm

Event
1/8/25

December 2024

SIGDIUS Seminar - online - 2pm

Event
12/4/24

November 2024

SIGDIUS Seminar - online - 2pm

Event
11/6/24

August 2024

Jan Range Introduces Rust Dataverse: A New Tool for Efficient Data Management

News
8/16/24

Sibylle Hermann new chair of the Commission for Research-related Services

News
8/15/24

July 2024

SIGDIUS Seminar - online - 2pm

Event
7/17/24

June 2024

Setting New Standards in Research Data Management for Simulation Science

News
6/14/24

SIGDIUS Seminar - online - 2pm

Event
6/12/24

May 2024

Advancements in Research Data Management at SimTech

News
5/31/24

SIGDIUS Seminar - online - 2pm

Event
5/8/24

April 2024

preCICE Workshop 2024 at University of Stuttgart: Call for Contributions open

News
4/8/24

March 2024

Software Carpentry Workshop

Event
3/19 – 3/22/24

February 2024

RDM: Sarbani Roy presents the Harvester-Curator

News
2/8/24

SIGDIUS Seminar - online - 2pm

Event
2/7/24

January 2024

Jan Range starts new working group to revamp pyDataverse

News
1/15/24

SIGDIUS Seminar - online - 2pm

Event
1/10/24

December 2023

SIGDIUS Seminar - online - 2pm

Event
12/6/23

Dataverse Community Call: Further development of pyDataverse

News
12/5/23

November 2023

SimTech research group contributes to AGU23: Wide. Open. Science.

News
11/10/23

Join Anneli Guthke’s sessions at EGU24

News
11/9/23

SIGDIUS Seminar - online - 2pm

Event
11/8/23

Best Paper Award at MTSR2023 for Björn Schembera and Dominik Göddeke

News
11/3/23

New DFG funding for open-source framework for complex flow simulations

News
11/2/23

October 2023

Jan Range presents EasyReview at CESSDA Tools Open Hour

Event
10/18/23

July 2023

SIGDIUS Seminar - online - 2pm

Event
7/12/23

Wouldn’t it be great if we would be able to combine any dataset with any other dataset we would want to?
European Commission, Reproducibility of scientific results in the EU

SimTech RDM committee

Flemisch, Bernd

Simulation Technology

E-Mail

Herschel, Melanie

Data Engineering

E-Mail

Pleiss, Jürgen

Technical Biochemistry

E-Mail

Uekermann, Benjamin

Usability and Sustainability of Simulation Software

E-Mail

Our research

Highlights

Fair-FLEXI

Description

Data Version Control for Simulation

Description

Data Modelling

Description

Publications

2024

2023

2022

2020

Our network

World

Europe

Germany

Stuttgart

Empowering researchers

Workshops and seminars

Dataverse

SIGDIUS seminars

Our Team

Sibylle Hermann

Jan Range

Sarbani Roy

Fangfang Wang

News & Events

January 2025

SIGDIUS Seminar - online - 2pm

December 2024

SIGDIUS Seminar - online - 2pm

November 2024

SIGDIUS Seminar - online - 2pm

August 2024

Jan Range Introduces Rust Dataverse: A New Tool for Efficient Data Management

Sibylle Hermann new chair of the Commission for Research-related Services

July 2024

SIGDIUS Seminar - online - 2pm

June 2024

Setting New Standards in Research Data Management for Simulation Science

SIGDIUS Seminar - online - 2pm

May 2024

Advancements in Research Data Management at SimTech

SIGDIUS Seminar - online - 2pm

April 2024

preCICE Workshop 2024 at University of Stuttgart: Call for Contributions open

March 2024

Software Carpentry Workshop

February 2024

RDM: Sarbani Roy presents the Harvester-Curator

SIGDIUS Seminar - online - 2pm

January 2024

Jan Range starts new working group to revamp pyDataverse

SIGDIUS Seminar - online - 2pm

December 2023

SIGDIUS Seminar - online - 2pm

Dataverse Community Call: Further development of pyDataverse

November 2023

SimTech research group contributes to AGU23: Wide. Open. Science.

Join Anneli Guthke’s sessions at EGU24

SIGDIUS Seminar - online - 2pm

Best Paper Award at MTSR2023 for Björn Schembera and Dominik Göddeke

New DFG funding for open-source framework for complex flow simulations

October 2023

Jan Range presents EasyReview at CESSDA Tools Open Hour

July 2023

SIGDIUS Seminar - online - 2pm

SimTech RDM committee

Research Data Management Team

Here you can reach us

Audience

Formalities

Services

Organization