Leventidis, A., M. Pekár Christensen, M. Lissandrini, L. Di Rocco, K. Hose, and R. Miller, "A Large Scale Test Corpus for Semantic Table Search", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Yu, A., P. Godfrey, L. Golab, D. Srivastava, and J. Szlichta, "CAMO: Explaining Consensus Across MOdels", IEEE International Conference on Data Engineering (ICDE), 2024.
Adeyemi, M., A. Oladipo, X. Zhang, D. Alfonso-Hermelo, M. Rezagholizadeh, B. Chen, A-H. Omotayo, I. Abdulmumin, N. A. Etori, T. Babatunde Musa, et al., "CIRAL: A Test Collection for CLIR Evaluations in African Languages", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Glavic, B., G. Mecca, R. Miller, P. Papotti, D. Santoro, and E. Veltri, "Comparing Incomplete Database Instances", Sistemi Evoluti per Basi di Dati (SEBD), 2024.
Yu, A., P. Godfrey, L. Golab, D. Srivastava, and J. Szlichta, "Exploring the Space of Model Comparisons", IEEE International Conference on Data Engineering (ICDE), 2024.
Rahmani, H. A., C. Siro, M. Aliannejadi, N. Craswell, C. Clarke, G. Faggioli, B. Mitra, P. Thomas, and E. Yilmaz, "LLM4Eval: Large Language Model for Evaluation in IR", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Khalaji, M., T. Brown, K. Daudjee, and V. Aksenov, "Practical Hardware Transactional vEB Trees", ACM Symposium on Principles & Practice of Parallel Programming (PPoPP), 2024.
Bonifati, A., T. Ozsu, Y. Tian, H. Voigt, W. Yu, and W. Zhang, "The Future of Graph Analytics", ACM International Conference on Management of Data (SIGMOD), 2024.
Azzopardi, L., C. Clarke, P. B. Kantor, B. Mitra, J. R. Trippas, and Z. Ren, "The Search Futures Workshop", European Conference on Information Retrieval (ECIR), 2024.
Kamalloo, E., S. Upadhyay, and J. Lin, "Towards Robust QA Evaluation via Open LLMs", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Azzopardi, L., C. Clarke, P. B. Kantor, B. Mitra, J. R. Trippas, Z. Ren, M. Aliannejadi, N. Arabzadeh, R. Chandrasekar, M. de Rijke, et al., "Report on the Search Futures Workshop at ECIR 2024", SIGIR Forum, vol. 58, issue 1, pp. 1--41, 2024.
Faggioli, G., L. Dietz, C. Clarke, G. Demartini, M. Hagen, C. Hauff, N. Kando, E. Kanoulas, M. Potthast, B. Stein, et al., "Who Determines What Is Relevant? Humans or AI? Why Not Both?", Communications of the ACM, vol. 67, issue 4, pp. 31--34, 2024.
Seifikar, M., L. Nhi Phan Minh, N. Arabzadeh, C. Clarke, and M. Smucker, "A Preference Judgment Tool for Authoritative Assessment", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Ma, X., H. Fun, X. Yin, A. Mallia, and J. Lin, "Enhancing Sparse Retrieval via Unsupervised Learning", ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region (SIGIR-AP), 2023.
Kamalloo, E., X. Zhang, O. Ogundepo, N. Thakur, D. Alfonso-Hermelo, M. Rezagholizadeh, and J. Lin, "Evaluating Embedding APIs for Information Retrieval", Association for Computational Linguistics (ACL), 2023.
Ilyas, I., JP. Lacerda, Y. Li, U. Farooq Minhas, A. Mousavi, J. Pound, T. Rekatsinas, and C. Sumanth, "Growing and Serving Large Open-Domain Knowledge Graphs", ACM International Conference on Management of Data (SIGMOD), 2023.
Bianchi, A., R. Karegar, P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "iORDER: Mining Implicit Domain Orders", IEEE International Conference on Data Engineering (ICDE), 2023.
Jin, G., X. Feng, Z. Chen, C. Liu, and S. Salihoglu, "KÙZU Graph Database Management System", Conference on Innovative Data Systems Research (CIDR), 2023.
Kamphuis, C., A. Lin, S. Yang, J. Lin, A. P. de Vries, and F. Hasibi, "MMEAD: MS MARCO Entity Annotations and Disambiguations", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Faggioli, G., L. Dietz, C. Clarke, G. Demartini, M. Hagen, C. Hauff, N. Kando, E. Kanoulas, M. Potthast, B. Stein, et al., "Perspectives on Large Language Models for Relevance Judgment", International Conference on the Theory of Information Retrieval (ICTIR), 2023.
Saxena, H., L. Golab, S. Idreos, and I. Ilyas, "Real-Time LSM-Trees for HTAP Workloads", IEEE International Conference on Data Engineering (ICDE), 2023.
Wang, Q., X. Hu, B. Dai, and K. Yi, "Change Propagation Without Joins", Proceedings of the VLDB Endowment (PVLDB), vol. 16, issue 5, pp. 1046--1058, 2023.
Nargesian, F., K. Q. Pu, B. Ghadiri Bashardoost, E. Zhu, and R. Miller, "Data Lake Organization", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 35, issue 1, pp. 237--250, 2023.
Koutrika, G., J. Yang, M. Athanassoulis, K. Stefanidis, J. Fan, A. Quamar, Y. Tian, A. Jindal, C. Binnig, J. Rogers, et al., "Front Matter", Proceedings of the VLDB Endowment (PVLDB), vol. 16, issue 12, 2023.
Mohoney, J., A. Pacaci, S. Rahman Chowdhury, A. Mousavi, I. Ilyas, U. Farooq Minhas, J. Pound, and T. Rekatsinas, "High-Throughput Vector Similarity Search in Knowledge Graphs", Proceedings of the ACM on Management of Data, vol. 1, issue 2, pp. 197:1--197:25, 2023.
Zhang, X., N. Thakur, O. Ogundepo, E. Kamalloo, D. Alfonso-Hermelo, X. Li, Q. Liu, M. Rezagholizadeh, and J. Lin, "MIRACL: A Multilingual Retrieval Dataset Covering 18 Diverse Languages", Transactions of the Association for Computational Linguistics, vol. 11, pp. 1114--1131, 2023.
Khatiwada, A., G. Fan, R. Shraga, Z. Chen, W. Gatterbauer, R. Miller, and M. Riedewald, "SANTOS: Relationship-Based Semantic Table Union Search", Proceedings of the ACM on Management of Data, vol. 1, issue 1, pp. 9:1--9:25, 2023.
Parsa, M. S., H. Shi, Y. Xu, A. Yim, Y. Yin, and L. Golab, "Analyzing Climate Change Discussions on Reddit", International Conference on Computational Science and Computational Intelligence (CSCI), 2022.
Karegar, R., M. Mirsafian, P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "Discovering Domain Orders via Order Dependencies", IEEE International Conference on Data Engineering (ICDE), 2022.
Vezvaei, A., L. Golab, M. Kargar, D. Srivastava, J. Szlichta, and M. Zihayat, "Fine-Tuning Dependencies With Parameters", International Conference on Extending Database Technology (EDBT), 2022.
Yan, X., C. Luo, C. Clarke, N. Craswell, E. M. Voorhees, and P. Castells, "Human Preferences as Dueling Bandits", International Conference on Research and Development in Information Retrieval (SIGIR), 2022.
Kamphuis, C., F. Hasibi, J. Lin, and A. P. de Vries, "REBL: Entity Linking at Scale (Prototype)", Conference on Design of Experimental Search & Information Retrieval Systems (DESIRES), 2022.
Kargar, M., L. Golab, D. Srivastava, J. Szlichta, and M. Zihayat, "Effective Keyword Search Over Weighted Graphs", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 34, issue 2, pp. 601--616, 2022.
Khatiwada, A., R. Shraga, W. Gatterbauer, and R. Miller, "Integrating Data Lake Tables", Proceedings of the VLDB Endowment (PVLDB), vol. 16, issue 4, pp. 932--945, 2022.
Maiyya, S., Y. Steinhart, D. Agrawal, P. Ananth, and A. El Abbadi, "ORTOA: One Round Trip Oblivious Access", IACR Cryptology ePrint Archive, pp. 1506, 2022.
Arabzadeh, N., A. Vtyurina, X. Yan, and C. Clarke, "Shallow Pooling for Sparse Labels", Information Retrieval Journal, vol. 25, issue 4, pp. 365--385, 2022.
Agarwal, P. K., X. Hu, S. Sintos, and J. Yang, "Dynamic Enumeration of Similarity Joins", International Colloquium on Automata, Languages and Programming (ICALP), 2021.
Langendoen, K., B. Glasbergen, and K. Daudjee, "NIR-Tree: A Non-Intersecting R-Tree", International Conference on Statistical and Scientific Database Management (SSDBM), 2021.
Nemec, J., H. Davoudi, L. Golab, M. Kargar, Y. Lytvyn, P. Mierzejewski, J. Szlichta, and M. Zihayat, "RW-Team: Robust Team Formation Using Random Walk", International Conference on Information and Knowledge Management (CIKM), 2021.
Pradeep, R., X. Ma, R. Frassetto Nogueira, and J. Lin, "Scientific Claim Verification With VerT5erini", International Workshop on Health Text Mining and Information Analysis (Louhi), 2021.
Anand, M., J. Zhang, S. Ding, J. Xin, and J. Lin, "Serverless BM25 Search and BERT Reranking", Conference on Design of Experimental Search & Information Retrieval Systems (DESIRES), 2021.
Abualsaud, M., M. Smucker, and C. Clarke, "Visualizing Searcher Gaze Patterns", Conference on Human Information Interaction and Retrieval (CHIIR), 2021.
Tang, R., K. Kumar, K. Chalkley, J. Xin, L. Zhang, W. Li, G. Yang, Y. Mao, J. Shin, G. Craig Murray, et al., "Voice Query Auto Completion", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021.
Clarke, C., A. Vtyurina, and M. Smucker, "Assessing Top- Preferences", ACM Transactions on Information Systems (TOIS), vol. 39, issue 3, pp. 33:1--33:21, 2021.
Ouellette, P., A. Sciortino, F. Nargesian, B. Ghadiri Bashardoost, E. Zhu, K. Q. Pu, and R. Miller, "RONIN: Data Lake Exploration", Proceedings of the VLDB Endowment (PVLDB), vol. 14, issue 12, pp. 2863--2866, 2021.
Ng, Y. Ki, D. J. Fraser, B. Kassaie, G. Labahn, M. S. Marzouk, F. Tompa, and K. Wang, "Dowsing for Math Answers With Tangent-L", Conference and Labs of the Evaluation Forum (CLEF), 2020.
Yates, A., K. Martin Jose, X. Zhang, and J. Lin, "Flexible IR Pipelines With Capreolus", International Conference on Information and Knowledge Management (CIKM), 2020.
Zeng, L., L. Zou, T. Ozsu, L. Hu, and F. Zhang, "GSI: GPU-friendly Subgraph Isomorphism", IEEE International Conference on Data Engineering (ICDE), 2020.
Clarke, C., A. Vtyurina, and M. Smucker, "Offline Evaluation Without Gain", International Conference on the Theory of Information Retrieval (ICTIR), 2020.
Nargesian, F., K. Q. Pu, E. Zhu, B. Ghadiri Bashardoost, and R. Miller, "Organizing Data Lakes for Navigation", ACM International Conference on Management of Data (SIGMOD), 2020.
Glasbergen, B., M. Abebe, K. Daudjee, D. Vogel, and J. Zhao, "Sentinel: Understanding Data Systems", ACM International Conference on Management of Data (SIGMOD), 2020.
Livshits, E., A. Heidari, I. Ilyas, and B. Kimelfeld, "Approximate Denial Constraints", Proceedings of the VLDB Endowment (PVLDB), vol. 13, issue 10, pp. 1682--1695, 2020.
Bashardoost, B. Ghadiri, R. Miller, K. A. Lyons, and F. Nargesian, "Knowledge Translation", Proceedings of the VLDB Endowment (PVLDB), vol. 13, issue 11, pp. 2018--2032, 2020.
Christodoulakis, C., E. B. Munson, M. Gabel, A. Demke Brown, and R. Miller, "Pytheas: Pattern-Based Table Discovery in CSV Files", Proceedings of the VLDB Endowment (PVLDB), vol. 13, issue 11, pp. 2075--2089, 2020.
Bryson, S., H. Davoudi, L. Golab, M. Kargar, Y. Lytvyn, P. Mierzejewski, J. Szlichta, and M. Zihayat, "Robust Keyword Search in Large Attributed Graphs", Information Retrieval Journal, vol. 23, issue 5, pp. 502--524, 2020.
Yilmaz, Z. Akkalyoncu, S. Wang, W. Yang, H. Zhang, and J. Lin, "Applying BERT to Document Retrieval With Birch", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019.
Davoudi, H., P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "Bring Order to Data", Alberto Mendelzon International Workshop on Foundations of Data Management (AMW), 2019.
Alonso, G., C. Binnig, I. Pandis, K. Salem, J. Skrzypczak, R. Stutsman, L. Thostrup, T. Wang, Z. Wang, and T. Ziegler, "DPI: The Data Processing Interface for Modern Networks", Conference on Innovative Data Systems Research (CIDR), 2019.
Cormack, G., H. Zhang, N. Ghelani, M. Abualsaud, M. Smucker, M. Grossman, S. Rahbariasl, and A. Ghenai, "Dynamic Sampling Meets Pooling", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Vollmer, M., L. Golab, K. Böhm, and D. Srivastava, "Informative Summarization of Numeric Data", International Conference on Statistical and Scientific Database Management (SSDBM), 2019.
Kazhamiaka, M., B. Naveed Memon, C. Kankanamge, S. Sahu, S. Rizvi, B. Wong, and K. Daudjee, "Sift: Resource-Efficient Consensus With RDMA", Conference on Emerging Network Experiment and Technology (CoNEXT), 2019.
Nargesian, F., E. Zhu, R. Miller, K. Q. Pu, and P. C. Arocena, "Data Lake Management: Challenges and Opportunities", Proceedings of the VLDB Endowment (PVLDB), vol. 12, issue 12, pp. 1986--1989, 2019.
Kotsogiannis, I., Y. Tao, X. He, M. Fanaeepour, A. Machanavajjhala, M. Hay, and G. Miklau, "PrivateSQL: A Differentially Private SQL Query Engine", Proceedings of the VLDB Endowment (PVLDB), vol. 12, issue 11, pp. 1371--1384, 2019.
Abualsaud, M., N. Ghelani, H. Zhang, M. Smucker, G. Cormack, and M. Grossman, "A System for Efficient High-Recall Retrieval", International Conference on Research and Development in Information Retrieval (SIGIR), 2018.
Mansour, E., D. Deng, R. Castro Fernandez, A. Ali Qahtan, W. Tao, Z. Abedjan, A. K. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, et al., "Building Data Civilizer Pipelines With an Advanced Workflow Engine", IEEE International Conference on Data Engineering (ICDE), 2018.
Langouri, M. Alipour, Z. Zheng, F. Chiang, L. Golab, and J. Szlichta, "Contextual Data Cleaning", IEEE International Conference on Data Engineering (ICDE), 2018.
Zihayat, M., A. An, L. Golab, M. Kargar, and J. Szlichta, "Effective Team Formation in Expert Networks", Alberto Mendelzon International Workshop on Foundations of Data Management (AMW), 2018.
Mihaylov, A., P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "FASTOD: Bringing Order to Data", IEEE International Conference on Data Engineering (ICDE), 2018.
Santoro, D., P. C. Arocena, B. Glavic, G. Mecca, R. Miller, and P. Papotti, "Let's Make It Dirty With BART!", Sistemi Evoluti per Basi di Dati (SEBD), 2018.
Zhao, Z., R. Christensen, F. Li, X. Hu, and K. Yi, "Random Sampling Over Joins Revisited", ACM International Conference on Management of Data (SIGMOD), 2018.
Abualsaud, M., G. Cormack, N. Ghelani, A. Ghenai, M. Grossman, S. Rahbariasl, H. Zhang, and M. Smucker, "UWaterlooMDS at the TREC 2018 Common Core Track", Text Retrieval Conference (TREC), 2018.
Gebaly, K. El, G. Feng, L. Golab, F. Korn, and D. Srivastava, "Explanation Tables", IEEE Data Engineering Bulletin, vol. 41, issue 3, pp. 43--51, 2018.
Nargesian, F., E. Zhu, K. Q. Pu, and R. Miller, "Table Union Search on Open Data", Proceedings of the VLDB Endowment (PVLDB), vol. 11, issue 7, pp. 813--825, 2018.
Fernandez, R. Castro, D. Deng, E. Mansour, A. Ali Qahtan, W. Tao, Z. Abedjan, A. K. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, et al., "A Demo of the Data Civilizer System", ACM International Conference on Management of Data (SIGMOD), 2017.
Ghelani, N., S. Mohammed, S. Wang, and J. Lin, "Event Detection on Curated Tweet Streams", International Conference on Research and Development in Information Retrieval (SIGIR), 2017.
Kankanamge, C., S. Sahu, A. Mhedhbi, J. Chen, and S. Salihoglu, "Graphflow: An Active Graph Database", ACM International Conference on Management of Data (SIGMOD), 2017.
Mohammed, S., M. Crane, and J. Lin, "Quantization in Append-Only Collections", International Conference on the Theory of Information Retrieval (ICTIR), 2017.
Deng, D., R. Castro Fernandez, Z. Abedjan, S. Wang, M. Stonebraker, A. K. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, and N. Tang, "The Data Civilizer System", Conference on Innovative Data Systems Research (CIDR), 2017.
Sadiq, S. Wasim, T. Dasu, X. Luna Dong, J. Freire, I. Ilyas, S. Link, R. J. Miller, F. Naumann, X. Zhou, and D. Srivastava, "Data Quality: The Role of Empiricism", SIGMOD Record, vol. 46, issue 4, pp. 35--43, 2017.
Deng, D., W. Tao, Z. Abedjan, A. K. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, M. Stonebraker, and N. Tang, "Entity Consolidation: The Golden Record Problem", ArXiv, vol. abs/1709.10436, 2017.
Zhu, E., K. Q. Pu, F. Nargesian, and R. Miller, "Interactive Navigation of Open Data Linkages", Proceedings of the VLDB Endowment (PVLDB), vol. 10, issue 12, pp. 1837--1840, 2017.
Konow, R., G. Navarro, C. Clarke, and A. López-Ortiz, "Inverted Treaps", ACM Transactions on Information Systems (TOIS), vol. 35, issue 3, pp. 22:1--22:45, 2017.
Mior, M. J., K. Salem, A. Aboulnaga, and R. Liu, "NoSE: Schema Design for NoSQL Applications", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 29, issue 10, pp. 2275--2289, 2017.
Allan, J., N. J. Belkin, P. N. Bennett, J. Callan, C. Clarke, F. Diaz, S. T. Dumais, N. Ferro, D. Harman, D. Hiemstra, et al., "Overview of Special Issue", SIGIR Forum, vol. 51, issue 2, pp. 1--25, 2017.
Farid, M. H., A. Roatis, I. Ilyas, H-F. Hoffmann, and X. Chu, "CLAMS: Bringing Quality to Data Lakes", ACM International Conference on Management of Data (SIGMOD), 2016.
Lyons, K. A., E. Stroulia, D. Luo, R. Miller, and V. Onut, "Data-Driven Knowledge Mobilization", Conference of the Centre for Advanced Studies on Collaborative Research (CASCON), 2016.
Baruah, G., H. Zhang, R. Guttikonda, J. Lin, M. Smucker, and O. Vechtomova, "Optimizing Nugget Annotations With Active Learning", International Conference on Information and Knowledge Management (CIKM), 2016.
Davila, K., R. Zanibbi, A. Kane, and F. Tompa, "Tangent-3 at the NTCIR-12 MathIR Task", Conference on Evaluation of Information Access Technologies (NTCIR), 2016.
Clarke, C., G. Cormack, J. Lin, and A. Roegiest, "Total Recall: Blue Sky on Mars", International Conference on the Theory of Information Retrieval (ICTIR), 2016.
Radhakrishnan, S., B. J. Muscedere, and K. Daudjee, "V-Hadoop: Virtualized Hadoop Using Containers", IEEE International Symposium on Network Computing and Applications (NCA), 2016.
Arocena, P. C., B. Glavic, G. Mecca, R. Miller, P. Papotti, and D. Santoro, "Benchmarking Data Curation Systems", IEEE Data Engineering Bulletin, vol. 39, issue 2, pp. 47--62, 2016.
Abedjan, Z., X. Chu, D. Deng, R. Castro Fernandez, I. Ilyas, M. Ouzzani, P. Papotti, M. Stonebraker, and N. Tang, "Detecting Data Errors: Where Are We and What Needs to Be Done?", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 12, pp. 993--1004, 2016.
Zhu, E., F. Nargesian, K. Q. Pu, and R. Miller, "LSH Ensemble: Internet-Scale Domain Search", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 12, pp. 1185--1196, 2016.
El-Roby, A., K. Ammar, A. Aboulnaga, and J. Lin, "Sapphire: Querying RDF Data Made Simple", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 13, pp. 1481--1484, 2016.
Shen, X., L. Zou, T. Ozsu, L. Chen, Y. Li, S. Han, and D. Zhao, "A Graph-Based RDF Triple Store", IEEE International Conference on Data Engineering (ICDE), 2015.
Liu, X., L. Golab, W. M. Golab, and I. Ilyas, "Benchmarking Smart Meter Data Analytics", International Conference on Extending Database Technology (EDBT), 2015.
Khayyat, Z., I. Ilyas, A. Jindal, S. Madden, M. Ouzzani, P. Papotti, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "BigDansing: A System for Big Data Cleansing", ACM International Conference on Management of Data (SIGMOD), 2015.
Baruah, G., M. Smucker, and C. Clarke, "Evaluating Streams of Evolving News Events", International Conference on Research and Development in Information Retrieval (SIGIR), 2015.
Salihoglu, S., J. Shin, V. Khanna, B. Quan Truong, and J. Widom, "Graft: A Debugging Tool for Apache Giraph", ACM International Conference on Management of Data (SIGMOD), 2015.
Ge, C., M. Kaufmann, L. Golab, P. M. Fischer, and A. K. Goel, "Indexing Bi-Temporal Windows", International Conference on Statistical and Scientific Database Management (SSDBM), 2015.
Hudek, A. K., D. Toman, and G. Weddell, "On Enumerating Query Plans Using Analytic Tableau", International Conference on Theorem Proving with Analytic Tableaux and Related Methods (TABLEAUX), 2015.
Hashemi, S. Hadi, C. Clarke, A. Dean-Hall, J. Kamps, and J. Kiseleva, "On the Reusability of Open Test Collections", International Conference on Research and Development in Information Retrieval (SIGIR), 2015.
Golab, L., F. Korn, F. Li, B. Saha, and D. Srivastava, "Size-Constrained Weighted Set Cover", IEEE International Conference on Data Engineering (ICDE), 2015.
Prokoshyna, N., J. Szlichta, F. Chiang, R. Miller, and D. Srivastava, "Combining Quantitative and Logical Data Cleaning", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 4, pp. 300--311, 2015.
Hanbury, A., H. Müller, K. Balog, T. Brodt, G. Cormack, I. Eggel, T. Gollub, F. Hopfgartner, J. Kalpathy-Cramer, N. Kando, et al., "Evaluation-as-a-Service: Overview and Outlook", ArXiv, vol. abs/1512.07454, 2015.
Arocena, P. C., R. Ciucanu, B. Glavic, and R. Miller, "Gain Control Over Your Integration Evaluations", Proceedings of the VLDB Endowment (PVLDB), vol. 8, issue 12, pp. 1960--1963, 2015.
Abu-Khzam, F. N., K. Daudjee, A. E. Mouawad, and N. Nishimura, "On Scalable Parallel Recursive Backtracking", Journal of Parallel and Distributed Computing, vol. 84, pp. 65--75, 2015.
Hopfgartner, F., A. Hanbury, H. Müller, N. Kando, S. Mercer, J. Kalpathy-Cramer, M. Potthast, T. Gollub, A. Krithara, J. Lin, et al., "Report on the Evaluation-as-a-Service (EaaS) Expert Workshop", SIGIR Forum, vol. 49, issue 1, pp. 57--65, 2015.
Arocena, P. C., B. Glavic, R. Ciucanu, and R. Miller, "The iBench Integration Metadata Generator", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 3, pp. 108--119, 2015.
Albakour, M-D., C. Macdonald, I. Ounis, C. Clarke, and V. Bicer, "Information Access in Smart Cities (I-Asc)", European Conference on Information Retrieval (ECIR), 2014.
Voorhees, E. M., J. Lin, and M. Efron, "On Run Diversity in Evaluation as a Service", International Conference on Research and Development in Information Retrieval (SIGIR), 2014.
Golab, L., H. J. Karloff, F. Korn, B. Saha, and D. Srivastava, "Discovering Conservation Rules", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 26, issue 6, pp. 1332--1348, 2014.
Ng, R. T., P. C. Arocena, D. Barbosa, G. Carenini, L. Celso Gomes, Jr., S. Jou, R. Anthony Leung, E. E. Milios, R. J. Miller, J. Mylopoulos, et al., Perspectives on Business Intelligence: Morgan & Claypool, 2013.
Stonebraker, M., D. Bruckner, I. Ilyas, G. Beskales, M. Cherniack, S. B. Zdonik, A. Pagan, and S. Xu, "Data Curation at Scale: The Data Tamer System", Conference on Innovative Data Systems Research (CIDR), 2013.
Dean-Hall, A., C. Clarke, J. Kamps, and P. Thomas, "Evaluating Contextual Suggestion", Conference on Evaluation of Information Access Technologies (NTCIR), 2013.
Konow, R., G. Navarro, C. Clarke, and A. López-Ortiz, "Faster and Smaller Inverted Indices With Treaps", International Conference on Research and Development in Information Retrieval (SIGIR), 2013.
Salihoglu, S., and J. Widom, "GPS: A Graph Processing System", International Conference on Statistical and Scientific Database Management (SSDBM), 2013.
Dallachiesa, M., A. Ebaid, A. Eldawy, A. K. Elmagarmid, I. Ilyas, M. Ouzzani, and N. Tang, "NADEEF: A Commodity Data Cleaning System", ACM International Conference on Management of Data (SIGMOD), 2013.
Northam, L., R. Smits, K. Daudjee, and J. Istead, "Ray Tracing in the Cloud Using MapReduce", International Conference on High Performance Computing & Simulation (HPCS), 2013.
Chu, X., I. Ilyas, and P. Papotti, "Discovering Denial Constraints", Proceedings of the VLDB Endowment (PVLDB), vol. 6, issue 13, pp. 1498--1509, 2013.
Hassanzadeh, O., K. Q. Pu, S. Hassas Yeganeh, R. Miller, L. Popa, M. A. Hernández, and H. Ho, "Discovering Linkage Points Over Web Data", Proceedings of the VLDB Endowment (PVLDB), vol. 6, issue 6, pp. 444--456, 2013.
Ebaid, A., A. K. Elmagarmid, I. Ilyas, M. Ouzzani, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "NADEEF: A Generalized Data Cleaning System", Proceedings of the VLDB Endowment (PVLDB), vol. 6, issue 12, pp. 1218--1221, 2013.
Golab, L., H. J. Karloff, F. Korn, B. Saha, and D. Srivastava, "Discovering Conservation Rules", IEEE International Conference on Data Engineering (ICDE), 2012.
Busch, M., K. Gade, B. Larson, P. Lok, S. Luckenbill, and J. Lin, "Earlybird: Real-Time Search at Twitter", IEEE International Conference on Data Engineering (ICDE), 2012.
McCullough, D., J. Lin, C. Macdonald, I. Ounis, and R. McCreadie, "Evaluating Real-Time Search Over Tweets", International Conference on Web and Social Media (ICWSM), 2012.
McCreadie, R., I. Soboroff, J. Lin, C. Macdonald, I. Ounis, and D. McCullough, "On Building a Reusable Twitter Corpus", International Conference on Research and Development in Information Retrieval (SIGIR), 2012.
Asadi, N., D. Metzler, and J. Lin, "Cross-Corpus Relevance Projection", International Conference on Research and Development in Information Retrieval (SIGIR), 2011.
Ozsu, T., P. Valduriez, S. Abiteboul, B. Kemme, R. Jiménez-Peris, and B. Chin Ooi, "Distributed Data Management in 2020?", IEEE International Conference on Data Engineering (ICDE), 2011.
Hassanzadeh, O., S. Hassas Yeganeh, and R. Miller, "Linking Semistructured Data on the Web", International Workshop on the Web and Databases (WebDB), 2011.
Miller, R. J., F. Tompa, S. A. McIlraith, J. Slonim, and E. S. K. Yu, "NSERC Business Intelligence Network: Selected Topics", Conference of the Centre for Advanced Studies on Collaborative Research (CASCON), 2011.
Glavic, B., J. Du, R. Miller, G. Alonso, and L. M. Haas, "Debugging Data Exchange With Vagabond", Proceedings of the VLDB Endowment (PVLDB), vol. 4, issue 12, pp. 1383--1386, 2011.
Yakout, M., A. K. Elmagarmid, J. Neville, M. Ouzzani, and I. Ilyas, "Guided Data Repair", Proceedings of the VLDB Endowment (PVLDB), vol. 4, issue 5, pp. 279--289, 2011.
Chockler, G. V., E. Dekel, J. F. JáJá, and J. Lin, "Special Issue on Cloud Computing", Journal of Parallel and Distributed Computing, vol. 71, issue 6, pp. 731, 2011.
Itakura, K. Y., and C. Clarke, "A Framework for BM25F-based XML Retrieval", International Conference on Research and Development in Information Retrieval (SIGIR), 2010.
Hassanzadeh, O., R. Xin, C. Fritz, Y. Yang, J. Du, M. Zhao, and R. Miller, "BibBase Triplified", International Conference on Semantic Systems (SEMANTiCS), 2010.
Zou, L., L. Chen, T. Ozsu, and D. Zhao, "Dynamic Skyline Queries in Large Graphs", International Conference on Database Systems for Advanced Applications (DASFAA), 2010.
Wang, L., J. Lin, and D. Metzler, "Learning to Efficiently Rank", International Conference on Research and Development in Information Retrieval (SIGIR), 2010.
Dolman, L., F. Tompa, I. Kiringa, R. Pottinger, and J. Mylopoulos, "Next Generation Business Intelligence (BI) Tools", Conference of the Centre for Advanced Studies on Collaborative Research (CASCON), 2010.
Kontchakov, R., C. Lutz, D. Toman, F. Wolter, and M. Zakharyaschev, "The Combined Approach to Query Answering in DL-Lite", International Conference on Principles of Knowledge Representation and Reasoning (KR), 2010.
Lo, E., C. Binnig, D. Kossmann, T. Ozsu, and W-K. Hon, "A Framework for Testing DBMS Features", The VLDB Journal, vol. 19, issue 2, pp. 203--230, 2010.
Srivastava, D., L. Golab, R. Greer, T. Johnson, J. Seidel, V. Shkapenyuk, O. Spatscheck, and J. Yates, "Enabling Real Time Data Analysis", Proceedings of the VLDB Endowment (PVLDB), vol. 3, issue 1, pp. 1--2, 2010.
Hentschel, M., L. M. Haas, and R. Miller, "Just-in-Time Data Integration in Action", Proceedings of the VLDB Endowment (PVLDB), vol. 3, issue 2, pp. 1621--1624, 2010.
Ben-David, S., R. J. Trefler, and G. Weddell, "Model Checking Using Description Logic", Journal of Logic and Computation, vol. 20, issue 1, pp. 111--131, 2010.
Ailamaki, A., L. M. Haas, H. V. Jagadish, D. Maier, T. Ozsu, and M. Winslett, "Time for Our Field to Grow Up", Proceedings of the VLDB Endowment (PVLDB), vol. 3, issue 2, pp. 1658, 2010.
Duchateau, F., R. Coletta, Z. Bellahsene, and R. Miller, "(Not) Yet Another Matcher", International Conference on Information and Knowledge Management (CIKM), 2009.
Fagin, R., L. M. Haas, M. A. Hernández, R. Miller, L. Popa, and Y. Velegrakis, "Clio: Schema Mapping Creation and Data Exchange", Description Logic, Theory Combination, and All That - Essays Dedicated to Franz Baader, 2009.
Tang, N., J. Xu Yu, H. Tang, T. Ozsu, and P. A. Boncz, "Materialized View Selection in XML Databases", International Conference on Database Systems for Advanced Applications (DASFAA), 2009.
Golab, L., T. Johnson, S. J. Seidel, and V. Shkapenyuk, "Stream Warehousing With DataDepot", ACM International Conference on Management of Data (SIGMOD), 2009.
Ashkan, A., and C. Clarke, "Term-Based Commercial Intent Analysis", International Conference on Research and Development in Information Retrieval (SIGIR), 2009.
Duchateau, F., R. Coletta, Z. Bellahsene, and R. Miller, "YAM: A Schema Matcher Factory", International Conference on Information and Knowledge Management (CIKM), 2009.
Wan, Q., R. Chi- Wing Wong, I. Ilyas, T. Ozsu, and Y. Peng, "Creating Competitive Products", Proceedings of the VLDB Endowment (PVLDB), vol. 2, issue 1, pp. 898--909, 2009.
Aboulnaga, A., K. Salem, A. A. Soror, U. Farooq Minhas, P. Kokosielis, and S. Kamath, "Deploying Database Appliances in the Cloud", IEEE Data Engineering Bulletin, vol. 32, issue 1, pp. 13--20, 2009.
Hassanzadeh, O., R. Xin, R. Miller, A. Kementsietsidis, L. Lim, and M. Wang, "Linkage Query Writer", Proceedings of the VLDB Endowment (PVLDB), vol. 2, issue 2, pp. 1590--1593, 2009.
Golab, L., H. J. Karloff, F. Korn, A. Saha, and D. Srivastava, "Sequential Dependencies", Proceedings of the VLDB Endowment (PVLDB), vol. 2, issue 1, pp. 574--585, 2009.
Clarke, C., G. Cormack, T. R. Lynam, C. Buckley, and D. Harman, "Swapping Documents and Terms", Information Retrieval Journal, vol. 12, issue 6, pp. 680--694, 2009.
Sarma, A. Das, A. Deshpande, T. Hubauer, I. Ilyas, B. König-Ries, M. Renz, and M. Theobald, "08421 Working Group: Lineage/Provenance", Dagstuhl Publications, 2008.
Reznik-Zellen, R., B. Stevens, M. Thorn, J. Morse, M. Smucker, J. Allan, D. M. Mimno, A. McCallum, and M. Tuominen, "InterNano: E-Science for the Nanomanufacturing Community", IEEE International Conference on e-Science (E-Science), 2008.
Clarke, C., M. Kolla, G. Cormack, O. Vechtomova, A. Ashkan, S. Büttcher, and I. MacKinnon, "Novelty and Diversity in Information Retrieval Evaluation", International Conference on Research and Development in Information Retrieval (SIGIR), 2008.
Golab, L., T. Johnson, N. Koudas, D. Srivastava, and D. Toman, "Optimizing Away Joins on Data Streams", International Conference on Extending Database Technology (EDBT), 2008.
Wang, Q., R. Li, L. Chen, J. Lian, and T. Ozsu, "Speed Up Semantic Search in P2p Networks", International Conference on Information and Knowledge Management (CIKM), 2008.
Korth, H. F., P. A. Bernstein, M. F. Fernández, L. Gruenwald, P. G. Kolaitis, K. S. McKinley, and T. Ozsu, "Paper and Proposal Reviews: Is the Process Flawed?", SIGMOD Record, vol. 37, issue 3, pp. 36--39, 2008.
Gil, J., W. Pugh, G. Weddell, and Y. Zibin, "Two-Dimensional Bidirectional Object Layout", ACM Transactions on Programming Languages and Systems (TOPLAS), vol. 30, issue 5, pp. 28:1--28:38, 2008.
Lushman, B., and G. Cormack, "A Larger Decidable Semiunification Problem", ACM-SIGPLAN International Conference on Principles and Practice of Declarative Programming (PPDP), 2007.
Hernández, M. A., H. Ho, L. Popa, A. Fuxman, R. Miller, T. Fukuda, and P. Papotti, "Creating Nested Mappings With Clio", IEEE International Conference on Data Engineering (ICDE), 2007.
Cormack, G., J. María Gó Hidalgo, and E. Puertas Sanz, "Spam Filtering for Short Messages", International Conference on Information and Knowledge Management (CIKM), 2007.
Dakdouk, R. Ramzi, S. Salihoglu, H. Wang, H. Xie, and Y. Richard Yang, "Interdomain Routing as Social Choice", International Conference on Distributed Computing Systems (ICDCS) - Workshops, 2006.
Fuxman, A., M. A. Hernández, C. T. Howard Ho, R. Miller, P. Papotti, and L. Popa, "Nested Mappings: Schema Mapping Reloaded", Very Large Data Bases Conference (VLDB), 2006.
Lynam, T. R., G. Cormack, and D. R. Cheriton, "On-Line Spam Filter Fusion", International Conference on Research and Development in Information Retrieval (SIGIR), 2006.
Fuxman, A., P. G. Kolaitis, R. Miller, and W. Chiew Tan, "Peer Data Exchange", ACM Transactions on Database Systems (TODS), vol. 31, issue 4, pp. 1454--1498, 2006.
Rodríguez-Gianolli, P., M. Garzetti, L. Jiang, A. Kementsietsidis, I. Kiringa, M. Masud, R. Miller, and J. Mylopoulos, "Data Sharing in the Hyperion Peer Database System", Very Large Data Bases Conference (VLDB), 2005.
Bernstein, P. A., D. J. DeWitt, A. Heuer, Z. G. Ives, C. S. Jensen, H. Meyer, T. Ozsu, R. T. Snodgrass, K-Y. Whang, and J. Widom, "Database Publication Practices", Very Large Data Bases Conference (VLDB), 2005.
Amer-Yahia, S., N. Koudas, A. Marian, D. Srivastava, and D. Toman, "Structure and Content Scoring for XML", Very Large Data Bases Conference (VLDB), 2005.
Hammad, M. A., M. F. Mokbel, M. H. Ali, W. G. Aref, A. Christine Catlin, A. K. Elmagarmid, M. Y. Eltabakh, M. G. Elfeky, T. M. Ghanem, R. Gwadera, et al., "Nile: A Query Processing Engine for Data Streams", IEEE International Conference on Data Engineering (ICDE), 2004.
Ilyas, I., R. Shah, W. G. Aref, J. Scott Vitter, and A. K. Elmagarmid, "Rank-Aware Query Optimization", ACM International Conference on Management of Data (SIGMOD), 2004.
Jaleel, N. Abdul, J. Allan, B. W. Croft, F. Diaz, L. S. Larkey, X. Li, M. Smucker, and C. Wade, "UMass at TREC 2004: Novelty and HARD", Text Retrieval Conference (TREC), 2004.
Ross, K. A., P. A. Boncz, I. Ilyas, V. Markl, and V. Vassalos, "Reminiscences on Influential Papers", SIGMOD Record, vol. 33, issue 4, pp. 91--92, 2004.
Chomicki, J., D. Q. Goldin, G. M. Kuper, and D. Toman, "Variable Independence in Constraint Databases", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 15, issue 6, pp. 1422--1436, 2003.
Aref, W. G., A. Christine Catlin, A. K. Elmagarmid, J. Fan, J. Guo, M. A. Hammad, I. Ilyas, M. S. Marzouk, S. Prabhakar, A. Rezgui, et al., "A Distributed Database Server for Continuous Media", IEEE International Conference on Data Engineering (ICDE), 2002.
Hernández, M. A., L. Popa, Y. Velegrakis, R. Miller, F. Naumann, and C-T. Ho, "Mapping XML and Relational Schemas With Clio", IEEE International Conference on Data Engineering (ICDE), 2002.
Dumais, S. T., M. Banko, E. Brill, J. Lin, and A. Y. Ng, "Web Question Answering: Is More Always Better?", International Conference on Research and Development in Information Retrieval (SIGIR), 2002.
Andritsos, P., R. Fagin, A. Fuxman, L. M. Haas, M. A. Hernández, C. T. Howard Ho, A. Kementsietsidis, R. Miller, F. Naumann, L. Popa, et al., "Schema Management", IEEE Data Engineering Bulletin, vol. 25, issue 3, pp. 32--38, 2002.
Clarke, C., G. Cormack, and T. R. Lynam, "Exploiting Redundancy in Question Answering", International Conference on Research and Development in Information Retrieval (SIGIR), 2001.
Goralwalla, I. A., Y. Leontiev, T. Ozsu, D. Szafron, and C. Combi, "Temporal Granularity: Completing the Puzzle", Journal of Intelligent Information Systems (JIIS), vol. 16, issue 1, pp. 41--63, 2001.
Miller, R., M. A. Hernández, L. M. Haas, L-L. Yan, C. T. Howard Ho, R. Fagin, and L. Popa, "The Clio Project: Managing Heterogeneity", SIGMOD Record, vol. 30, issue 1, pp. 78--83, 2001.
Cox, A., C. Clarke, and S. Elliott Sim, "A Model Independent Source Code Repository", Conference of the Centre for Advanced Studies on Collaborative Research (CASCON), 1999.
Sim, S. Elliott, C. Clarke, R. C. Holt, and A. Cox, "Browsing and Searching Software Architectures", IEEE International Conference on Software Maintenance and Evolution (ICSME), 1999.
Miller, R., and A. Gujarathi, "Mining for Program Structure", International Journal of Software Engineering and Knowledge Engineering (IJSEKE), vol. 9, issue 5, pp. 499--517, 1999.
Goralwalla, I. A., Y. Leontiev, T. Ozsu, D. Szafron, and C. Combi, "Temporal Granularity for Unanchored Temporal Data", International Conference on Information and Knowledge Management (CIKM), 1998.
Cowan, D. D., C. I. Mayfield, F. Tompa, and W. Gasparini, "New Role for Community Networks", Communications of the ACM, vol. 41, issue 4, pp. 61--63, 1998.
Goralwalla, I. A., Y. Leontiev, T. Ozsu, and D. Szafron, "Modeling Temporal Primitives: Back to Basics", International Conference on Information and Knowledge Management (CIKM), 1997.
Wong, J. W., K. A. Lyons, D. Evans, R. J. Velthuys, G. von Bochmann, E. Dubois, N. D. Georganas, G. W. Neufeld, T. Ozsu, J. Brinskelle, et al., "Enabling Technology for Distributed Multimedia Applications", IBM Systems Journal, vol. 36, issue 4, pp. 489--507, 1997.
Böhlen, M. H., J. Chomicki, R. T. Snodgrass, and D. Toman, "Querying TSQL2 Databases With Temporal Logic", International Conference on Extending Database Technology (EDBT), 1996.
Garcia-Molina, H., and K. Salem, "Non-Deterministic Queue Operations", Journal of Computer and System Sciences (JCSS), vol. 51, issue 2, pp. 211--222, 1995.
Ioannidis, Y. E., M. Livny, E. M. Haber, R. Miller, O. G. Tsatalos, and J. L. Wiener, "Desktop Experiment Management", IEEE Data Engineering Bulletin, vol. 16, issue 1, pp. 19--23, 1993.
Garcia-Molina, H., and K. Salem, "Main Memory Database Systems: An Overview", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 4, issue 6, pp. 509--516, 1992.
Cormack, G., and A. K. Wright, "Type-Dependent Parameter Inference", ACM-SIGPLAN Symposium on Programming Language Design and Implementation (PLDI), 1990.
Tompa, F., B. Botten, D. Godfrey, J. Norton, L. Schneider, and A. van Dam, "The Role of Videotex (Panel Session)", International Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 1982.
R. Bergeron, D., J. D. Gannon, D. P. Shecter, F. Tompa, and A. van Dam, "Systems Programming Languages", Advances in Computers, vol. 12, pp. 175--284, 1972.