The FAIR Data Pipeline (FDP) is a collection of software utilities for tracking the provenance of FAIR (Findable, Accessible, Interoperable, Reusable) data. It was originally developed to support reproducible epidemiological modelling workflows, a need spurred by the outbreak of COVID-19. FDP lets users automate a data processing pipeline that records every input, output, and piece of associated metadata in a FAIR manner. APIs are available in Python, R, Julia, C++, and Java.
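To make the idea of tracing inputs, outputs, and metadata concrete, the following is a minimal sketch of provenance recording in plain Python. It does not use the FDP API: the names `sha256_of`, `run_with_provenance`, and `to_upper`, the record layout, and the sample data are all hypothetical and chosen only for illustration. The sketch assumes each data product can be identified by a content hash.

```python
import hashlib
import json
from datetime import datetime, timezone


def sha256_of(data: bytes) -> str:
    """Return a hex content hash that identifies a data product."""
    return hashlib.sha256(data).hexdigest()


def run_with_provenance(step_name, inputs, transform):
    """Run one processing step and record what went in and what came out.

    inputs:    dict mapping a name to raw bytes (hypothetical layout)
    transform: callable taking that dict and returning a dict of outputs
    """
    outputs = transform(inputs)
    record = {
        "step": step_name,
        "run_at": datetime.now(timezone.utc).isoformat(),
        "inputs": {name: sha256_of(data) for name, data in sorted(inputs.items())},
        "outputs": {name: sha256_of(data) for name, data in sorted(outputs.items())},
    }
    return outputs, record


def to_upper(inputs):
    """Illustrative processing step: upper-case the input file's contents."""
    return {"cases_upper.csv": inputs["cases.csv"].upper()}


outputs, record = run_with_provenance(
    "normalise",
    {"cases.csv": b"region,count\nA,3\n"},
    to_upper,
)
print(json.dumps(record, indent=2))
```

Because each record stores content hashes rather than file names alone, a later run can check whether it consumed exactly the same inputs. A real pipeline would additionally persist these records in a registry alongside richer metadata such as authorship and software versions.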