Speaker: Patrick Valduriez, Inria, University of Montpellier, CNRS, LIRMM
Location: DC 1304
Abstract: Data driven science requires manipulating large datasets coming from various data sources through complex workflows based on a variety of models and languages. With the increasing number of big data sources and models developed by different groups, it is hard to relate models and data and use them in unanticipated ways for specific data analysis. Current solutions are typically ad-hoc, specialized for particular data, models and workflow systems. In this talk, we focus on data driven life science and propose an open service-based architecture, Life Science Workflow Services (LifeSWS), which provides data analysis workflow services for life sciences. We illustrate our motivations and rationale for the architecture with real use cases from life science.
Bio: Patrick Valduriez is a director of research emeritus at Inria, France, the scientific director of the Inria-Brasil international lab. and the Chief Scientist Officer of the LeanXcale company (that delivers a NewSQL database).
He is currently a member of the Zenith team (between Inria and University of Montpellier at the LIRMM lab.), focusing on data science, in particular data management in large-scale distributed and parallel systems and scientific data management. He has authored and co-authored more than 400 technical papers and several textbooks, among which “Principles of Distributed Database Systems” (with Professor Tamer Özsu, University of Waterloo). He currently serves as associate editor of the Distributed and Parallel Databases journal. He has served as PC chair of major conferences such as SIGMOD and VLDB. He was the general chair of SIGMOD 2004, EDBT 2008 and VLDB 2009.
He received several best paper awards, including VLDB 2000. He was the recipient of the 1993 IBM scientific prize in Computer Science in France and the 2014 Innovation Award from Inria and the French Academy of Science. He is an ACM Fellow since 2013.