Master’s Thesis Presentation • Data Systems • Triangle Count Estimation and Label Prediction Over Uncertain Streaming Graphs

Tuesday, July 2, 2024 10:30 am - 11:30 am EDT (GMT -04:00)

Please note: This master’s thesis presentation will take place online.

Ipsita Mohanty, Master’s candidate
David R. Cheriton School of Computer Science

Supervisor: Professor M. Tamer Özsu

This thesis aims to integrate the notions of uncertainty with graph stream processing, presenting probabilistic models to enhance real-time analytical capabilities in graph database systems. These systems are crucial for managing interconnected data in various domains, such as social networks, traffic networks, and genomic databases, where data often contains incomplete or probabilistic connections that complicate processing and analysis.

We develop and validate two main methodologies: a martingale-based approach for approximating triangle counts in edge uncertain streaming graphs and a Graph Neural Network (GNN)-based method for dynamic label prediction in attribute uncertain streaming graphs. Both methods demonstrate robust performance in handling dynamic and uncertain data, thus opening new avenues for future research in expanding the scope of graph-based analytics. This work lays the groundwork for future developments in uncertain graph processing, suggesting pathways to refine these approaches and explore new applications in dynamic environments.