Rumi Chunara, Computer Science and Global Public Health
New York University
Aditya Parameswaran, Department of Computer Science
University of Illinois at Urbana-Champaign
Heng Ji, Rensselaer Polytechnic Institute
Kareem El Gebaly, PhD candidate
David R. Cheriton School of Computer Science
Analyzing relational data typically involves tasks that build familiarity with the data, yield insights, and lead to findings or conclusions. This process is usually carried out by data experts (data scientists), who share their output with a potentially less data-savvy audience (everyone).
Amira Ghenai, PhD candidate
David R. Cheriton School of Computer Science
People regularly use web search engines to investigate the efficacy of medical treatments. Search results can contain documents whose incorrect information contradicts the established medical understanding of whether a treatment helps with a health issue. People who are influenced by this incorrect information can make harmful decisions about the appropriate treatment.
Benny Kimelfeld, Technion
Part of the Data Systems Seminar Series (2016-2017)
Speaker: Hans-Peter Kriegel, Ludwig-Maximilians-Universität München
Part of the Data Systems Seminar Series (2016-2017)
Speaker: C. Mohan, IBM Almaden Research Center
Patrick Valduriez
Inria and Biology Computational Institute (IBC)
Abstract: The blooming of different cloud data management infrastructures, specialized for different kinds of data and tasks, has led to a wide diversification of DBMS interfaces and the loss of a common programming paradigm.
Speaker: Mohamed Sabri
Abstract: Linked data has been widely adopted over the last few years, with the size of the Linked Data Cloud almost doubling every year. However, there is still no well-defined mechanism to query such a Web of Data.
Speaker: Kareem El Gebaly
Abstract: This talk explores the idea of in-browser interactive analytics with split execution strategies where query operators are distributed between the frontend and backend servers.
Speaker: Ahmed El-Roby
Abstract: There is currently a large amount of publicly accessible structured data available as RDF data sets. For example, the Linked Open Data (LOD) cloud now consists of thousands of RDF data sets with over 30 billion triples, and the number and size of the data sets are continuously growing.
Speaker: Xu Chu
Abstract: Data quality is one of the most important problems in data management and data science, since dirty data often leads to inaccurate data analytics results and wrong business decisions.
Speaker: Shaikh Quader
Part of the Data Systems Seminar Series (2016-2017)
Speaker: Felix Naumann, Hasso Plattner Institute