Events

Panos Ipeirotis, Professor and George A. Kellner Faculty Fellow
Department of Information, Operations, and Management Sciences
New York University

Torben Bach Pedersen, Professor of Computer Science
Aalborg University, Denmark

Data collected from new sources such as sensors and smart devices is large, fast, and often complex. There is a universal wish to perform multidimensional OLAP-style analytics on such data, i.e., to turn it into “Big Multidimensional Data”. Supporting this is a multi-stage journey, requiring new tools and systems, and forming a new, extended data cycle with models as a key concept.

Paolo Atzeni, Database Professor and Head of the Department of Engineering
Università Roma Tre

NoSQL systems have gained their popularity for many reasons, including the flexibility they provide with modeling, which tries to relax the rigidity provided by the relational model and by the other structured models.

Speaker: Stratos Idreos, Harvard University

Speaker: Mohammad Sadoghi, UC Davis

A. Erdem Sarıyüce, University at Buffalo

Abstract: Finding dense substructures in a network is a fundamental graph mining operation, with applications in bioinformatics, social networks, and visualization to name a few. Yet most standard formulations of this problem (like clique, quasi-clique, densest at-least-k subgraph) are NP-hard. Furthermore, the goal is rarely to find the “true optimum” but to identify many (if not all) dense substructures, understand their distribution in the graph, and ideally determine relationships among them. In this talk, I will talk about a framework that we designed to find dense regions of the graph with hierarchical relations.

Panos K. Chrysanthis, University of Pittsburgh

Abstract: Online analytics, in most advanced scientific, business, and defense applications, rely heavily on the efficient execution of large numbers of Aggregate Continuous Queries (ACQs). ACQs continuously aggregate streaming data and periodically produce results such as max or average over a given window of the latest data. It was shown that in processing ACQs it is beneficial to use incremental evaluation, which involves storing and reusing calculations performed over the unchanged parts of the window, rather than performing the re-evaluation of the entire window after each update.

Speaker: Verena Kantere, University of Ottawa

Abstract: Big Data analytics in science and industry are performed on a range of heterogeneous data stores, both traditional and modern, and on a diversity of query engines. Workflows are difficult to design and implement since they span a variety of systems. To reduce development time and processing costs, some automation is needed. In this talk we will present a new platform to manage analytics workflows.

Monday, April 15, 2019 3:00 pm - 3:00 pm EDT (GMT -04:00)

DSG Seminar Series • Data Management meets Information Theory

Speaker: Dan Suciu, University of Washington

Speaker: Oliver Kennedy, University at Buffalo

Filter by:

DSG Seminar Series • Targeted Crowdsourcing with a Billion (Potential) Users

DSG Seminar Series • Managing Big Multidimensional Data – A Journey from Data Acquisition to Prescriptive Analytics

DSG Seminar Series • Data Models from Traditional Databases to NoSQL Systems

DSG Seminar Series • What if we could reason about the design space of data structures?

DSG Seminar Series • ExpoDB: Towards a Unified OLTP and OLAP Over a Secure Platform

DSG Seminar Series • Hierarchical Dense Subgraph Discovery: Models, Algorithms, Applications

DSG Seminar Series • Algorithms and Optimizations for Incremental Window-Based Aggregations

DSG Seminar Series • Adaptive Scalable Analytics in Multi-Engine Environments

DSG Seminar Series • Data Management meets Information Theory

DSG Seminar Series • Just-in-Time Indexing: Rethinking Data Layout as a Compiler Problem