CSE Distinguished Seminar Series | The Challenge of Constructive Data Search

A central challenge in our modern information environment is how to find data and unify it from a multitude of diverse sources. The problem of data discovery remains a challenging activity in particular for researchers. I will present our work looking at how researchers go about searching and evaluating data. This is based on in-depth social science inquiry and a unique survey of over 1600 researchers. Based on these insights, I will outline the challenge of constructive data search - building datasets on the fly from multiple sources. Finally, I will discuss our work on the automatic construction of integrated data in the form of knowledge graphs.
Speaker: Prof. Paul Groth
Biografia: Paul Groth is Professor of Algorithmic Data Science at the University of Amsterdam where he leads the Intelligent Data Engineering Lab (INDElab). He holds a Ph.D. in Computer Science from the University of Southampton (2007) and has done research at the University of Southern California, the Vrije Universiteit Amsterdam and Elsevier Labs. His research focuses on intelligent systems for dealing with large amounts of diverse contextualized knowledge with a particular focus on web and science applications. This includes research in data provenance, data integration and knowledge sharing. Previously, Paul led the design of a number of large scale data integration and knowledge graph construction efforts in the biomedical domain. Paul was co-chair of the W3C Provenance Working Group that created a standard for provenance interchange. He has also contributed to the emergence of community initiatives to build a better scholarly ecosystem including altmetrics and the FAIR data principles.Paul is co-author of “Provenance: an Introduction to PROV” and “The Semantic Web Primer: 3rd Edition” as well as numerous academic articles. He blogs at http://thinklinks.wordpress.com.
