Project

DMP Chef

Draft funder-compliant Data Management Plans with AI

Helping researchers create compliant and machine-actionable Data Management Plans in minutes.

DMP Chef logo

Overview

Data Management Plans (DMPs) are now commonly required by funders with all grant proposals. Researchers often struggle to prepare them due to the lack of training, the lack of support, the complexity of data, and the wide range of best practices, repositories, and metadata choices available. DMP Chef is an open-source (MIT License), Python-based pipeline that leverages AI to draft funder-compliant DMPs. This project is part of a broader extension of the DMP Tool platform. The ultimate goal is to integrate the DMP Chef pipeline into the DMP Tool platform, providing researchers with a familiar and convenient user interface that does not require any coding knowledge.

What is DMP Chef?

DMP Chef is an open-source, Python-based pipeline where researchers can easily generate drafts of funder-compliant DMPs tailored to their grant proposals and even create machine-actionable versions.

How does it work?

Researchers only need to answer a few questions about their grant proposal (target funding organization, data to be collected, etc.) and DMP Chef will generate a draft of the entire DMP in line with the requirements of the target funder.

What challenges does it address?

Writing a DMP is overwhelming, with each funder having its own policies, formats, and compliance criteria. Researchers often struggle to prepare a compliant DMP due to lack of training and support, and technical complexity. DMP Chef addresses this challenge by translating high-level project descriptions into funder-compliant DMP, using a tailored prompt, funder templates, funder guidelines, and an open source large language model (LLM) enhanced with a custom Retrieval-augmented generation (RAG) pipeline.

Why is this important?

A strong DMP is often a requirement not only for securing research funds but, as the first point of data documentation, also critical for adopting FAIR data practices. DMP Chef helps researchers create high-quality, DMPs that ensure compliance with funder policies and maximize the long-term impact of their data.

Development Approach

DMP chief is developed using an open-source approach. This allows for transparency and collaboration with the open source community. The GitHub repository will be linked here as we test and develop the platform.

GitHub logo

Team

Members

Researchers, engineers, and collaborators behind this project.

Research partners

We are collaborating on this project with the developers and maintainers of the DMP Tool at the California Digital Library (University of California).

Funding

This project is a National Science Foundation (NSF) and Chan Zuckerberg Initiative (CZI)-funded collaboration between the FAIR Data Innovations Hub and California Digital Library (University of California Office of the President) as part of a broader extension of the DMP Tool platform.

Project milestones

Key milestones in the early development of DMP Chef.

Phase 1

October 2025 - March 2026

We are investigating how different approaches like Retrieval Augmented Generation (RAG) can improve the performance of open-source LLMs like Llama in drafting DMPs that comply with various funders’ requirements like NIH and NSF.