Distance and Data Reuse

11.07.2025 | 10:00 - 11:00 h

HMC FAIR Friday

Speaker: Paul Groth, University of Amsterdam

Title: Distance and Data Reuse

Date: Friday, 11 July 2025, 10 am


The literature contains a myriad of recommendations, advice, and strictures about what data providers should do to facilitate data reuse. It can be overwhelming. Based on empirical work (analyzing data reuse proxies at scale, understanding data sensemaking and looking at how researchers search for data), I talk about what practices are a good place to start for helping others to reuse your data. I then introduce the construct of distance between data provider and data reuser to help understand where to invest for data, which is based on a recent paper in Harvard Data Science Review:

Borgman, C. L., & Groth, P. (2025). From Data Creator to Data Reuser: Distance Matters. Harvard Data Science Review.
https://doi.org/10.1162/99608f92.35d32cfc

About the speaker: Paul Groth is Professor of Algorithmic Data Science at the University of Amsterdam, where he leads the Intelligent Data Engineering Lab (INDElab). He received his Ph.D. in Computer Science from the University of Southampton in 2007 and has conducted research at the University of Southern California, VU Amsterdam, and Elsevier Labs. His research focuses on intelligent systems for processing large, contextualized knowledge – particularly in web and scientific applications such as data provenance, integration, and knowledge sharing.

He serves as Scientific Director of the UvA’s Data Science Center and Co-Scientific Director of two ICAI labs: the AI for Retail (AIR) Lab (with Ahold Delhaize) and the Discovery Lab (with Elsevier, UvA, and VU Amsterdam).

Previously, Paul led the design of a number of large scale data integration and knowledge graph construction efforts in the biomedical domain. Paul was co-chair of the W3C Provenance Working Group that created a standard for provenance interchange. He has also contributed to the emergence of community initiatives to build a better scholarly ecosystem including altmetrics and the FAIR data principles.

Registration open until 11. July 2025, 09:45 am: https://events.hifis.net/event/2486
(For organizational reasons, we cannot guarantee participation for registrations after 09:45 am)




The Helmholtz Metadata Collaboration (HMC) invites you to engage in an exciting series of talks on FAIR Data through the HMC FAIR Friday lecture series. Renowned experts from around the world will present key aspects of FAIR data, offering deep dives into the topic and sparking discussions on best practices and new perspectives. HMC FAIR Friday is aimed at professionals in research data management as well as researchers from all disciplines of the Helmholtz Association and beyond, who want to learn more about the importance and implementation of FAIR principles.