Publications

Book

Lin, J., R. Nogueira, and A. Yates, Pretrained Transformers for Text Ranking: BERT and Beyond: Morgan & Claypool, 2021.
Ozsu, T., and P. Valduriez, Principles of Distributed Database Systems, 4th Edition: Springer, 2020.
Ilyas, I., and X. Chu, Data Cleaning: ACM, 2019.
Abedjan, Z., L. Golab, F. Naumann, and T. Papenbrock, Data Profiling: Morgan & Claypool, 2018.
Liu, L., and T. Ozsu, Encyclopedia of Database Systems, Second Edition: Springer, 2018.
Ng, R. T., P. C. Arocena, D. Barbosa, G. Carenini, L. Celso Gomes, Jr., S. Jou, R. Anthony Leung, E. E. Milios, R. J. Miller, J. Mylopoulos, et al., Perspectives on Business Intelligence: Morgan & Claypool, 2013.
Toman, D., and G. Weddell, Fundamentals of Physical Design and Query Compilation: Morgan & Claypool, 2011.
Ozsu, T., and P. Valduriez, Principles of Distributed Database Systems, Third Edition: Springer, 2011.
Ilyas, I., and M. A. Soliman, Probabilistic Ranking Techniques in Relational Databases: Morgan & Claypool, 2011.
Golab, L., and T. Ozsu, Data Stream Management: Morgan & Claypool, 2010.
Lin, J., and C. Dyer, Data-Intensive Text Processing With MapReduce: Morgan & Claypool, 2010.
Büttcher, S., C. Clarke, and G. Cormack, Information Retrieval - Implementing and Evaluating Search Engines: MIT Press, 2010.
Liu, L., and T. Ozsu, Encyclopedia of Database Systems: Springer, 2009.
Ozsu, T., and P. Valduriez, Principles of Distributed Database Systems, Second Edition: Prentice-Hall, 1999.
Ozsu, T., and P. Valduriez, Principles of Distributed Database Systems: Prentice-Hall, 1991.

Book Chapter

Ilyas, I., "Data Unification at Scale: Data Tamer", Making Databases Work: the Pragmatic Wisdom of Michael Stonebraker: ACM / Morgan & Claypool, 2019.
Salihoglu, S., and N. Yakovets, "Graph Query Processing", Encyclopedia of Big Data Technologies: Springer, 2019.
Golab, L., "Types of Stream Processing Algorithms", Encyclopedia of Big Data Technologies: Springer, 2019.
Chomicki, J., and D. Toman, "Abstract Versus Concrete Temporal Query Languages", Encyclopedia of Database Systems: Springer, 2018.
Machanavajjhala, A., and X. He, "Analyzing Your Location Data With Provable Privacy Guarantees", Springer Handbooks: Springer, 2018.
Ozsu, T., "Client-Server Architecture", Encyclopedia of Database Systems: Springer, 2018.
Ozsu, T., "Data Manipulation Language (DML)", Encyclopedia of Database Systems: Springer, 2018.
Golab, L., "Data Stream", Encyclopedia of Database Systems: Springer, 2018.
Ozsu, T., "Database", Encyclopedia of Database Systems: Springer, 2018.
Ozsu, T., "Database Administrator (DBA)", Encyclopedia of Database Systems: Springer, 2018.
Tompa, F., "Document Databases", Encyclopedia of Database Systems: Springer, 2018.
Tompa, F., "Enterprise Content Management", Encyclopedia of Database Systems: Springer, 2018.
Tompa, F., "Hypertexts", Encyclopedia of Database Systems: Springer, 2018.
Toman, D., "Point-Stamped Temporal Models", Encyclopedia of Database Systems: Springer, 2018.
Ilyas, I., "Rank-Aware Query Processing", Encyclopedia of Database Systems: Springer, 2018.
Ilyas, I., "Rank-Join", Encyclopedia of Database Systems: Springer, 2018.
Salem, K., "Sagas", Encyclopedia of Database Systems: Springer, 2018.
Fuxman, A., and R. Miller, "Schema Mapping", Encyclopedia of Database Systems: Springer, 2018.
Golab, L., "Stream Models", Encyclopedia of Database Systems: Springer, 2018.
Lin, J., "Summarization", Encyclopedia of Database Systems: Springer, 2018.
Chomicki, J., and D. Toman, "Temporal Logic in Database Query Languages", Encyclopedia of Database Systems: Springer, 2018.
Chomicki, J., and D. Toman, "Temporal Relational Calculus", Encyclopedia of Database Systems: Springer, 2018.
Roddick, J. F., and D. Toman, "Temporal Vacuuming", Encyclopedia of Database Systems: Springer, 2018.
Ilyas, I., "Top-K Queries", Encyclopedia of Database Systems: Springer, 2018.
Clarke, C., "Web Question Answering", Encyclopedia of Database Systems: Springer, 2018.
Shen, C., T. Shen, and J. Lin, "Comparative Assessment of Alignment Algorithms for NGS Data: Features, Considerations, Implementations, and Future", Algorithms for Next-Generation Sequencing Data, Techniques, Approaches, and Applications: Springer, 2017.
Ozsu, T., and P. Valduriez, "Distributed and Parallel Database Systems", Computing Handbook: Information Systems and Information Technology: CRC Press, 2014.
Golab, L., "Data Warehouse Quality: Summary and Outlook", Handbook of Data Quality: Springer, 2013.
Hassanzadeh, O., A. Kementsietsidis, L. Lim, R. Miller, and M. Wang, "Semantic Link Discovery Over Relational Data", Semantic Search over the Web: Springer, 2012.
Smucker, M., "Information Representation", Interactive Information Seeking, Behaviour and Retrieval: Facet Publishing, 2011.
Chomicki, J., and D. Toman, "Abstract Versus Concrete Temporal Query Languages", Encyclopedia of Database Systems: Springer, 2009.
Ozsu, T., "Client-Server DBMS", Encyclopedia of Database Systems: Springer, 2009.
Golab, L., "Data Stream", Encyclopedia of Database Systems: Springer, 2009.
Tompa, F., "Document Databases", Encyclopedia of Database Systems: Springer, 2009.
Tompa, F., "Enterprise Content Management", Encyclopedia of Database Systems: Springer, 2009.
Tompa, F., "Hypertexts", Encyclopedia of Database Systems: Springer, 2009.
Toman, D., "Point-Stamped Temporal Models", Encyclopedia of Database Systems: Springer, 2009.
Salem, K., "Sagas", Encyclopedia of Database Systems: Springer, 2009.
Fuxman, A., and R. Miller, "Schema Mapping", Encyclopedia of Database Systems: Springer, 2009.
Golab, L., "Stream Models", Encyclopedia of Database Systems: Springer, 2009.
Lin, J., "Summarization", Encyclopedia of Database Systems: Springer, 2009.
Chomicki, J., and D. Toman, "Temporal Logic in Database Query Languages", Encyclopedia of Database Systems: Springer, 2009.
Chomicki, J., and D. Toman, "Temporal Relational Calculus", Encyclopedia of Database Systems: Springer, 2009.
Roddick, J. F., and D. Toman, "Temporal Vacuuming", Encyclopedia of Database Systems: Springer, 2009.
Clarke, C., "Web Question Answering", Encyclopedia of Database Systems: Springer, 2009.
Chomicki, J., and D. Toman, "Temporal Databases", Foundations of Artificial Intelligence: Elsevier, 2005.
Ozsu, T., "Distributed Databases", Academic Press Reference: Academic Press, 2002.
Ozsu, T., and B. Bin Yao, "Building Component Database Systems Using CORBA", Component Database Systems: Morgan Kaufmann, 2001.
Toman, D., "SQL/TP: A Temporal Extension of SQL", International Symposium on the Applications of Constraint Databases (CDB): Springer, 2000.
Ozsu, T., and P. Valduriez, "Distributed and Parallel Database Systems", The Computer Science and Engineering Handbook: CRC Press, 1997.
Ozsu, T., and J. A. Blakeley, "Query Processing in Object-Oriented Database Systems", Modern Database Systems: The Object Model, Interoperability, and Beyond: ACM Press and Addison-Wesley, 1995.
Buchmann, A. P., T. Ozsu, D. Georgakopoulos, and F. Manola, "A Transaction Model for Active Distributed Object Systems", Database Transaction Models for Advanced Applications: Morgan Kaufmann, 1992.

Conference Paper

Leventidis, A., M. Pekár Christensen, M. Lissandrini, L. Di Rocco, K. Hose, and R. Miller, "A Large Scale Test Corpus for Semantic Table Search", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Arabzadeh, N., A. Bigdeli, and C. Clarke, "Adapting Standard Retrieval Benchmarks to Evaluate Generated Answers", European Conference on Information Retrieval (ECIR), 2024.
Usta, A., C. Liu, and S. Salihoglu, "Analysis of Open Government Datasets From a Data Design and Integration Perspective", International Conference on Extending Database Technology (EDBT), 2024.
Yu, A., P. Godfrey, L. Golab, D. Srivastava, and J. Szlichta, "CAMO: Explaining Consensus Across MOdels", IEEE International Conference on Data Engineering (ICDE), 2024.
Li, M., H. Zhuang, K. Hui, Z. Qin, J. Lin, R. Jagerman, X. Wang, and M. Bendersky, "Can Query Expansion Improve Generalization of Strong Cross-Encoder Rankers?", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Zhang, C., M. Li, and J. Lin, "CELI: Simple Yet Effective Approach to Enhance Out-of-Domain Generalization of Cross-Encoders", North American Chapter of the Association for Computational Linguistics (NAACL), 2024.
Adeyemi, M., A. Oladipo, X. Zhang, D. Alfonso-Hermelo, M. Rezagholizadeh, B. Chen, A-H. Omotayo, I. Abdulmumin, N. A. Etori, T. Babatunde Musa, et al., "CIRAL: A Test Collection for CLIR Evaluations in African Languages", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Glavic, B., G. Mecca, R. Miller, P. Papotti, D. Santoro, and E. Veltri, "Comparing Incomplete Database Instances", Sistemi Evoluti per Basi di Dati (SEBD), 2024.
Mousavi, A., X. Zhan, H. Bai, P. Shi, T. Rekatsinas, B. Han, Y. Li, J. Pound, J. M. Susskind, N. Schluter, et al., "Construction of Paired Knowledge Graph - Text Datasets Informed by Cyclic Evaluation", International Conference on Computational Linguistics (COLING), 2024.
Dehghan, M., M. Ali Alomrani, S. Bagga, D. Alfonso-Hermelo, K. Bibi, A. Ghaddar, Y. Zhang, X. Li, J. Hao, Q. Liu, et al., "EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-Based Question Answering Systems", Association for Computational Linguistics (ACL), 2024.
Golzadeh, K., L. Golab, and J. Szlichta, "Explaining Expert Search Systems With ExES", IEEE International Conference on Data Engineering (ICDE), 2024.
Yu, A., P. Godfrey, L. Golab, D. Srivastava, and J. Szlichta, "Exploring the Space of Model Comparisons", IEEE International Conference on Data Engineering (ICDE), 2024.
Hu, X., and S. Sintos, "Finding Smallest Witnesses for Conjunctive Queries", International Conference on Database Theory (ICDT), 2024.
Ma, X., L. Wang, N. Yang, F. Wei, and J. Lin, "Fine-Tuning LLaMA for Multi-Stage Text Retrieval", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Tang, R., X. Crystina Zhang, X. Ma, J. Lin, and F. Türe, "Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models", North American Chapter of the Association for Computational Linguistics (NAACL), 2024.
Arabzadeh, N., and C. Clarke, "Fréchet Distance for Offline Evaluation of Information Retrieval Systems With Sparse Labels", Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024.
Fan, G., R. Shraga, and R. Miller, "Gen-T: Table Reclamation in Data Lakes", IEEE International Conference on Data Engineering (ICDE), 2024.
Lin, J., J. Li, J. Gao, W. Ma, and Y. Liu, "Jointly Modeling Spatio-Temporal Features of Tactile Signals for Action Classification", AAAI Conference on Artificial Intelligence (AAAI), 2024.
Arabzadeh, N., K. Golzadeh, C. Risi, C. Clarke, and J. Zhao, "KnowFIRES: A Knowledge-Graph Framework for Interpreting Retrieved Entities From Search", European Conference on Information Retrieval (ECIR), 2024.
Thakur, N., J. Ni, G. Hernández Ábrego, J. Wieting, J. Lin, and D. Cer, "Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval", North American Chapter of the Association for Computational Linguistics (NAACL), 2024.
Rahmani, H. A., C. Siro, M. Aliannejadi, N. Craswell, C. Clarke, G. Faggioli, B. Mitra, P. Thomas, and E. Yilmaz, "LLM4Eval: Large Language Model for Evaluation in IR", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Hebert, L., G. Sahu, Y. Guo, N. Kishore Sreenivas, L. Golab, and R. Cohen, "Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media", AAAI Conference on Artificial Intelligence (AAAI), 2024.
Oladipo, A., M. Adeyemi, and J. Lin, "On Backbones and Training Regimes for Dense Retrieval in African Languages", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Esmaeilzadeh, A., J. Rorseth, A. Yu, P. Godfrey, L. Golab, D. Srivastava, J. Szlichta, and K. Taghva, "On Integrating the Data-Science and Machine-Learning Pipelines for Responsible AI", Workshop in Governance, Understanding and Integration of Data for Effective and Responsible AI (GUIDE-AI), 2024.
Feng, E., D. Toman, and G. Weddell, "On Mixed Semantics of Path Description Dependencies in FunDL", International Workshop on Description Logics (DL), 2024.
Sahu, S., and S. Salihoglu, "Optimizing Differential Computation for Large-Scale Graph Processing", International Workshop on Graph Data Management Experiences and Systems (GRADES), 2024.
Maiyya, S., Y. Steinhart, A. Davila, J. Du, D. Agrawal, P. Ananth, and A. El Abbadi, "ORTOA: A Family of One Round Trip Protocols for Operation-Type Obliviousness", International Conference on Extending Database Technology (EDBT), 2024.
Zhou, A., Y. Wang, L. Chen, and T. Ozsu, "Positive Communities on Signed Graphs That Are Not Echo Chambers: A Clique-Based Approach", IEEE International Conference on Data Engineering (ICDE), 2024.
Khalaji, M., T. Brown, K. Daudjee, and V. Aksenov, "Practical Hardware Transactional vEB Trees", ACM Symposium on Principles & Practice of Parallel Programming (PPoPP), 2024.
Rorseth, J., P. Godfrey, L. Golab, D. Srivastava, and J. Szlichta, "RAGE Against the Machine: Retrieval-Augmented LLM Explanations", IEEE International Conference on Data Engineering (ICDE), 2024.
Zong, S., S. Kolagati, A. Chaudhary, J. Seltzer, and J. Lin, "Reflections on the Coding Ability of LLMs for Analyzing Market Research Surveys", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Kamalloo, E., N. Thakur, C. Lassance, X. Ma, J-H. Yang, and J. Lin, "Resources for Brewing BEIR: Reproducible Reference Models and Statistical Analyses", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Zhang, S., X. He, A. Kundu, S. Mehrotra, and S. Sharma, "Secure Normal Form: Mediation Among Cross Cryptographic Leakages in Encrypted Databases", IEEE International Conference on Data Engineering (ICDE), 2024.
Glavic, B., G. Mecca, R. Miller, P. Papotti, D. Santoro, and E. Veltri, "Similarity Measures for Incomplete Database Instances", International Conference on Extending Database Technology (EDBT), 2024.
Thakur, N., L. Bonifacio, M. Fröbe, A. Bondarenko, E. Kamalloo, M. Potthast, M. Hagen, and J. Lin, "Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Bonifati, A., T. Ozsu, Y. Tian, H. Voigt, W. Yu, and W. Zhang, "The Future of Graph Analytics", ACM International Conference on Management of Data (SIGMOD), 2024.
Azzopardi, L., C. Clarke, P. B. Kantor, B. Mitra, J. R. Trippas, and Z. Ren, "The Search Futures Workshop", European Conference on Information Retrieval (ECIR), 2024.
Pradeep, R., and J. Lin, "Towards Automated End-to-End Health Misinformation Free Search With a Large Language Model", European Conference on Information Retrieval (ECIR), 2024.
Rorseth, J., P. Godfrey, L. Golab, D. Srivastava, and J. Szlichta, "Towards Explainability in Retrieval-Augmented LLMs", IEEE International Conference on Data Engineering (ICDE), 2024.
Kamalloo, E., S. Upadhyay, and J. Lin, "Towards Robust QA Evaluation via Open LLMs", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Cormack, G., M. Grossman, A. Harbison, T. O'Halloran, and B. McManus, "Unbiased Validation of Technology-Assisted Review for eDiscovery", International Conference on Research and Development in Information Retrieval (SIGIR), 2024.
Xian, J., T. Teofili, R. Pradeep, and J. Lin, "Vector Search With OpenAI Embeddings: Lucene Is All You Need", Web Search and Data Mining (WSDM), 2024.
Jiang, Z., M. Y. R. Yang, M. Tsirlin, R. Tang, Y. Dai, and J. Lin, "'Low-Resource' Text Classification: A Parameter-Free Classification Method With Compressors", Association for Computational Linguistics (ACL), 2023.
Arabzadeh, N., O. Kmet, B. Carterette, C. Clarke, C. Hauff, and P. Chandar, "A Is for Adele: An Offline Evaluation Metric for Instant Search", International Conference on the Theory of Information Retrieval (ICTIR), 2023.
Seifikar, M., L. Nhi Phan Minh, N. Arabzadeh, C. Clarke, and M. Smucker, "A Preference Judgment Tool for Authoritative Assessment", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Fernando, L., H. Bindra, and K. Daudjee, "An Experimental Analysis of Quantile Sketches Over Data Streams", International Conference on Extending Database Technology (EDBT), 2023.
Zhang, C., A. Bonifati, and T. Ozsu, "An Overview of Reachability Indexes on Graphs", ACM International Conference on Management of Data (SIGMOD), 2023.
Ma, X., T. Teofili, and J. Lin, "Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes", International Conference on Information and Knowledge Management (CIKM), 2023.
Zhong, W., Y. Xie, and J. Lin, "Answer Retrieval for Math Questions Using Structural and Dense Retrieval", Conference and Labs of the Evaluation Forum (CLEF), 2023.
Yang, J-H., C. Lassance, R. Sampaio de Rezende, K. Srinivasan, M. Redi, S. Clinchant, and J. Lin, "AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Oladipo, A., M. Adeyemi, O. Ahia, A. Toluwase Owodunni, O. Ogundepo, D. Ifeoluwa Adelani, and J. Lin, "Better Quality Pre-Training Data and T5 Models for African Languages", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Adeyemi, M., A. Oladipo, X. Zhang, D. Alfonso-Hermelo, M. Rezagholizadeh, B. Chen, and J. Lin, "CIRAL at FIRE 2023: Cross-Lingual Information Retrieval for African Languages", Forum for Information Retrieval Evaluation (FIRE), 2023.
Li, M., S-C. Lin, B. Oguz, A. Ghoshal, J. Lin, Y. Mehdad, W-tau. Yih, and X. Chen, "CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval", Association for Computational Linguistics (ACL), 2023.
Rorseth, J., P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "CREDENCE: Counterfactual Explanations for Document Ranking", IEEE International Conference on Data Engineering (ICDE), 2023.
Khatiwada, A., R. Shraga, and R. Miller, "DIALITE: Discover, Align and Integrate Open Data Tables", ACM International Conference on Management of Data (SIGMOD), 2023.
Ghazi, B., X. Hu, R. Kumar, and P. Manurangsi, "Differentially Private Data Release Over Multiple Tables", ACM Symposium on Principles of Database Systems (PODS), 2023.
Wang, R., J. Wang, P. Kadam, T. Ozsu, and W. G. Aref, "dLSM: An LSM-Based Index for Memory Disaggregation", IEEE International Conference on Data Engineering (ICDE), 2023.
Chai, A., A. Vezvaei, L. Golab, M. Kargar, D. Srivastava, J. Szlichta, and M. Zihayat, "EAGER: Explainable Question Answering Using Knowledge Graphs", International Workshop on Graph Data Management Experiences and Systems (GRADES), 2023.
Ma, X., H. Fun, X. Yin, A. Mallia, and J. Lin, "Enhancing Sparse Retrieval via Unsupervised Learning", ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region (SIGIR-AP), 2023.
Kamalloo, E., X. Zhang, O. Ogundepo, N. Thakur, D. Alfonso-Hermelo, M. Rezagholizadeh, and J. Lin, "Evaluating Embedding APIs for Information Retrieval", Association for Computational Linguistics (ACL), 2023.
Kamalloo, E., N. Dziri, C. Clarke, and D. Rafiei, "Evaluating Open-Domain Question Answering in the Era of Large Language Models", Association for Computational Linguistics (ACL), 2023.
Hebert, L., L. Golab, P. Poupart, and R. Cohen, "FedFormer: Contextual Federation With Attention in Reinforcement Learning", International Joint Conference on Autonomous Agents & Multiagent Systems (AAMAS), 2023.
Bayat, F. Fatahi, K. Qian, B. Han, Y. Sang, A. Belyi, S. Khorshidi, F. Wu, I. Ilyas, and Y. Li, "FLEEK: Factual Error Detection and Correction With Evidence Retrieved From External Knowledge", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Piktus, A., O. Ogundepo, C. Akiki, A. Oladipo, X. Zhang, H. Schoelkopf, S. Biderman, M. Potthast, and J. Lin, "GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration", Association for Computational Linguistics (ACL), 2023.
Hu, L., L. Zou, and T. Ozsu, "GAMMA: A Graph Pattern Mining Framework for Large Graphs on GPU", IEEE International Conference on Data Engineering (ICDE), 2023.
Deep, S., X. Hu, and P. Koutris, "General Space-Time Tradeoffs via Relational Queries", International Symposium on Algorithms and Data Structures (WADS), 2023.
Pang, Y., L. Yang, L. Zou, and T. Ozsu, "gFOV: A Full-Stack SPARQL Query Optimizer & Plan Visualizer", International Conference on Information and Knowledge Management (CIKM), 2023.
Liu, C., A. Usta, J. Zhao, and S. Salihoglu, "Governor: Turning Open Government Data Portals Into Interactive Databases", ACM Conference on Human Factors in Computing Systems (CHI), 2023.
Ilyas, I., JP. Lacerda, Y. Li, U. Farooq Minhas, A. Mousavi, J. Pound, T. Rekatsinas, and C. Sumanth, "Growing and Serving Large Open-Domain Knowledge Graphs", ACM International Conference on Management of Data (SIGMOD), 2023.
Pradeep, R., K. Hui, J. Gupta, Á. D. Lelkes, H. Zhuang, J. Lin, D. Metzler, and V. Q. Tran, "How Does Generative Retrieval Scale to Millions of Passages?", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Lin, S-C., A. Asai, M. Li, B. Oguz, J. Lin, Y. Mehdad, W-tau. Yih, and X. Chen, "How to Train Your Dragon: Diverse Augmentation Towards Generalizable Dense Retrieval", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Conia, S., M. Li, D. Lee, U. Farooq Minhas, I. Ilyas, and Y. Li, "Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Esmaeilzadeh, A., L. Golab, and K. Taghva, "InfoMoD: Information-Theoretic Model Diagnostics", International Conference on Statistical and Scientific Database Management (SSDBM), 2023.
Bianchi, A., R. Karegar, P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "iORDER: Mining Implicit Domain Orders", IEEE International Conference on Data Engineering (ICDE), 2023.
Jin, G., X. Feng, Z. Chen, C. Liu, and S. Salihoglu, "KÙZU Graph Database Management System", Conference on Innovative Data Systems Research (CIDR), 2023.
Kamalloo, E., C. Clarke, and D. Rafiei, "Limitations of Open-Domain Question Answering Benchmarks for Document-Level Reasoning", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Buchanan, G. Robert, D. McKay, and C. Clarke, "Made to Measure: A Workshop on Human-Centred Metrics for Information Seeking", Conference on Human Information Interaction and Retrieval (CHIIR), 2023.
Lin, S-C., A. Ahmad, and J. Lin, "mAggretriever: A Simple Yet Effective Approach to Zero-Shot Multilingual Dense Retrieval", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Kamphuis, C., A. Lin, S. Yang, J. Lin, A. P. de Vries, and F. Hasibi, "MMEAD: MS MARCO Entity Annotations and Disambiguations", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Ghazi, B., X. Hu, R. Kumar, and P. Manurangsi, "On Differentially Private Sampling From Gaussian and Product Distributions", Conference on Neural Information Processing Systems (NeurIPS), 2023.
Ghasemitaheri, S., A. Holcomb, L. Golab, and S. Keshav, "On the Data Quality of Remotely Sensed Forest Maps", Very Large Data Bases Conference (VLDB), 2023.
Zhong, W., S-C. Lin, J-H. Yang, and J. Lin, "One Blade for One Purpose: Advancing Math Information Retrieval Using Hybrid Search", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Xin, J., R. Tang, Z. Jiang, Y. Yu, and J. Lin, "Operator Selection and Ordering in a Pipeline Approach to Efficiency Optimizations for Transformers", Association for Computational Linguistics (ACL), 2023.
Adeyemi, M., A. Oladipo, X. Crystina Zhang, D. Alfonso-Hermelo, M. Rezagholizadeh, B. Chen, and J. Lin, "Overview of the CIRAL Track at FIRE 2023: Cross-Lingual Information Retrieval for African Languages", Forum for Information Retrieval Evaluation (FIRE), 2023.
Feng, E., A. Borgida, E. Franconi, P. F. Patel-Schneider, D. Toman, and G. Weddell, "Path Description Dependencies in Feature-Based DLs", International Workshop on Description Logics (DL), 2023.
Faggioli, G., L. Dietz, C. Clarke, G. Demartini, M. Hagen, C. Hauff, N. Kando, E. Kanoulas, M. Potthast, B. Stein, et al., "Perspectives on Large Language Models for Relevance Judgment", International Conference on the Theory of Information Retrieval (ICTIR), 2023.
Tamber, M. Singh, R. Pradeep, and J. Lin, "Pre-Processing Matters! Improved Wikipedia Corpora for Open-Domain Question Answering", European Conference on Information Retrieval (ECIR), 2023.
Gao, L., X. Ma, J. Lin, and J. Callan, "Precise Zero-Shot Dense Retrieval Without Relevance Labels", Association for Computational Linguistics (ACL), 2023.
Ehrlinger, L., H. Harmouch, I. Ilyas, and F. Naumann, "Preface QDB", Very Large Data Bases Conference (VLDB), 2023.
Ozsu, T., and X. Xue, "Preface SDA", Very Large Data Bases Conference (VLDB), 2023.
Clarke, C., F. Diaz, and N. Arabzadeh, "Preference-Based Offline Evaluation", Web Search and Data Mining (WSDM), 2023.
Pradeep, R., H. Chen, L. Gu, M. Singh Tamber, and J. Lin, "PyGaggle: A Gaggle of Resources for Open-Domain Question Answering", European Conference on Information Retrieval (ECIR), 2023.
Saxena, H., L. Golab, S. Idreos, and I. Ilyas, "Real-Time LSM-Trees for HTAP Workloads", IEEE International Conference on Data Engineering (ICDE), 2023.
Huo, S., N. Arabzadeh, and C. Clarke, "Retrieving Supporting Evidence for Generative Question Answering", ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region (SIGIR-AP), 2023.
Li, M., S-C. Lin, X. Ma, and J. Lin, "SLIM: Sparsified Late Interaction for Multi-Vector Retrieval With Inverted Indexes", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Akiki, C., O. Ogundepo, A. Piktus, X. Zhang, A. Oladipo, J. Lin, and M. Potthast, "Spacerini: Plug-and-Play Search Engines With Pyserini and Hugging Face", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Thakur, N., K. Wang, I. Gurevych, and J. Lin, "SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-Shot Neural Sparse Retrieval", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Fan, G., J. Wang, Y. Li, and R. Miller, "Table Discovery in Data Lakes: State-of-the-Art and Future Directions", ACM International Conference on Management of Data (SIGMOD), 2023.
O'Halloran, T., B. McManus, A. Harbison, M. Grossman, and G. Cormack, "Technology-Assisted Review for Spreadsheets and Noisy Text", ACM Symposium on Document Engineering (DocEng), 2023.
Gao, L., X. Ma, J. Lin, and J. Callan, "Tevatron: An Efficient and Flexible Toolkit for Neural Retrieval", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Usta, A., and S. Salihoglu, "To Join or Not to Join: An Analysis on the Usefulness of Joining Tables in Open Government Data Portals", Very Large Data Bases Conference (VLDB), 2023.
Tang, R., L. Liu, A. Pandey, Z. Jiang, G. Yang, K. Kumar, P. Stenetorp, J. Lin, and F. Türe, "What the DAAM: Interpreting Stable Diffusion Using Cross Attention", Association for Computational Linguistics (ACL), 2023.
Amiri, M. Javad, D. Shu, S. Maiyya, D. Agrawal, and A. El Abbadi, "Ziziphus: Scalable Data Management Across Byzantine Edge Servers", IEEE International Conference on Data Engineering (ICDE), 2023.
Trotman, A., J. Mackenzie, P. Parameswaran, and J. Lin, "A Common Framework for Exploring Document-at-a-Time and Score-at-a-Time Retrieval Methods", International Conference on Research and Development in Information Retrieval (SIGIR), 2022.
Borgida, A., E. Franconi, D. Toman, and G. Weddell, "Accessing Document Data Sources Using Referring Expression Types", International Workshop on Description Logics (DL), 2022.
Ogundepo, O., X. Zhang, S. Sun, K. Duh, and J. Lin, "AfriCLIRMatrix: Enabling Cross-Lingual Information Retrieval for African Languages", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
Devins, J., J. Tibshirani, and J. Lin, "Aligning the Research and Practice of Building Search Applications: Elasticsearch and Pyserini", Web Search and Data Mining (WSDM), 2022.
Parsa, M. S., H. Shi, Y. Xu, A. Yim, Y. Yin, and L. Golab, "Analyzing Climate Change Discussions on Reddit", International Conference on Computational Science and Computational Intelligence (CSCI), 2022.
Ma, X., K. Sun, R. Pradeep, M. Li, and J. Lin, "Another Look at DPR: Reproduction of Training and Replication of Retrieval", European Conference on Information Retrieval (ECIR), 2022.
Liu, Y., C. Hu, and J. Lin, "Another Look at Information Retrieval as Statistical Translation", International Conference on Research and Development in Information Retrieval (SIGIR), 2022.
Zhong, W., Y. Xie, and J. Lin, "Applying Structural and Dense Semantic Matching for the ARQMath Lab 2022, CLEF", Conference and Labs of the Evaluation Forum (CLEF), 2022.
Li, M., X. Zhang, J. Xin, H. Zhang, and J. Lin, "Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
Hu, X., S. Sintos, J. Gao, P. K. Agarwal, and J. Yang, "Computing Complex Temporal Join Queries Efficiently", ACM International Conference on Management of Data (SIGMOD), 2022.
Chambers, O., R. Cohen, M. Grossman, and Q. Chen, "Creating a User Model to Support User-Specific Explanations of AI Systems", User Modeling, Adaptation, and Personalization (UMAP), 2022.
Shi, P., L. Song, L. Jin, H. Mi, H. Bai, J. Lin, and D. Yu, "Cross-Lingual Text-to-SQL Semantic Parsing With Representation Mixup", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
Karegar, R., M. Mirsafian, P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "Discovering Domain Orders via Order Dependencies", IEEE International Conference on Data Engineering (ICDE), 2022.
Ma, X., R. Pradeep, R. Nogueira, and J. Lin, "Document Expansion Baselines and Learned Sparse Lexical Representations for MS MARCO V1 and V2", International Conference on Research and Development in Information Retrieval (SIGIR), 2022.
Kane, A., Y. Ki Ng, and F. Tompa, "Dowsing for Answers to Math Questions: Doing Better With Less", Conference and Labs of the Evaluation Forum (CLEF), 2022.
Shehata, D., N. Arabzadeh, and C. Clarke, "Early Stage Sparse Retrieval With Entity Linking", International Conference on Information and Knowledge Management (CIKM), 2022.
Pacaci, A., A. Bonifati, and T. Ozsu, "Evaluating Complex Queries on Streaming Graphs", IEEE International Conference on Data Engineering (ICDE), 2022.
Zhong, W., J-H. Yang, Y. Xie, and J. Lin, "Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
Chen, Y., G. Xiao, T. Ozsu, Z. Tang, A. Y. Zomaya, and K. Li, "Exploiting Hierarchical Parallelism and Reusability in Tensor Kernel Processing on Heterogeneous HPC Systems", IEEE International Conference on Data Engineering (ICDE), 2022.
Jiang, Z., Y. Dai, J. Xin, M. Li, and J. Lin, "Few-Shot Non-Parametric Learning With Deep Latent Variable Model", Conference on Neural Information Processing Systems (NeurIPS), 2022.
Vezvaei, A., L. Golab, M. Kargar, D. Srivastava, J. Szlichta, and M. Zihayat, "Fine-Tuning Dependencies With Parameters", International Conference on Extending Database Technology (EDBT), 2022.
Toman, D., and G. Weddell, "First Order Rewritability in Ontology-Mediated Querying in Horn Description Logics", AAAI Conference on Artificial Intelligence (AAAI), 2022.
Seltzer, J., K. Cheng, S. Zong, and J. Lin, "Flipping the Script: Inverse Information Seeking Dialogues for Market Research", International Conference on Research and Development in Information Retrieval (SIGIR), 2022.
Lin, J., D. Campos, N. Craswell, B. Mitra, and E. Yilmaz, "Fostering Coopetition While Plugging Leaks: The Design and Implementation of the MS MARCO Leaderboards", International Conference on Research and Development in Information Retrieval (SIGIR), 2022.
Chopra, S., and L. Golab, "Gender Differences in Early Career Performance Reviews: A Text Mining Study", International Conference on Extending Database Technology (EDBT), 2022.
Kalavri, V., and S. Salihoglu, "GRADES-NDA'22: 5th International Workshop on Graph Data Management Experiences and Systems (GRADES) and Network Data Analytics (NDA)", ACM International Conference on Management of Data (SIGMOD), 2022.
Jin, G., N. Anzum, and S. Salihoglu, "GRainDB: A Relational-Core Graph-Relational DBMS", Conference on Innovative Data Systems Research (CIDR), 2022.
Dehghan, M., D. Kumar, and L. Golab, "GRS: Combining Generation and Revision in Unsupervised Sentence Simplification", Association for Computational Linguistics (ACL), 2022.
Yan, X., C. Luo, C. Clarke, N. Craswell, E. M. Voorhees, and P. Castells, "Human Preferences as Dueling Bandits", International Conference on Research and Development in Information Retrieval (SIGIR), 2022.
Guo, R., V. Guo, A. Kim, J. Hildred, and K. Daudjee, "Hydrozoa: Dynamic Hybrid-Parallel DNN Training on Serverless Containers", Conference on Machine Learning and Systems (MLSys), 2022.
Zhong, Y., J. Xiao, T. Vetterli, M. Matin, E. Loo, J. Lin, R. Bourgon, and O. Shapira, "Improving Precancerous Case Characterization via Transformer-Based Ensemble Learning", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
Li, H., S. Zhuang, A. Mourad, X. Ma, J. Lin, and G. Zuccon, "Improving Query Representations for Dense Retrieval With Pseudo Relevance Feedback: A Reproducibility Study", European Conference on Information Retrieval (ECIR), 2022.
Yang, M. Y. R., S. Yang, and J. Lin, "Integration of Text and Geospatial Search for Hydrographic Datasets Using the Lucene Search Library", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2022.
Zhang, D., A. Vakili Tahami, M. Abualsaud, and M. Smucker, "Learning Trustworthy Web Sources to Derive Correct Answers and Reduce Health Misinformation in Search", International Conference on Research and Development in Information Retrieval (SIGIR), 2022.
Feng, E., D. Toman, and G. Weddell, "Magic Sets in Interpolation-Based Rule Driven Query Optimization", International Web Rule Symposium (RuleML), 2022.
Peng, P., T. Ozsu, L. Zou, C. Yan, and C. Liu, "MPC: Minimum Property-Cut RDF Graph Partitioning", IEEE International Conference on Data Engineering (ICDE), 2022.
Pradeep, R., Y. Li, Y. Wang, and J. Lin, "Neural Query Synthesis and Domain-Specific Ranking Templates for Multi-Stage Clinical Trial Matching", International Conference on Research and Development in Information Retrieval (SIGIR), 2022.
Craswell, N., B. Mitra, E. Yilmaz, D. Campos, J. Lin, E. M. Voorhees, and I. Soboroff, "Overview of the TREC 2022 Deep Learning Track", Text Retrieval Conference (TREC), 2022.
Hebert, L., L. Golab, and R. Cohen, "Predicting Hateful Discussions on Reddit Using Graph Transformer Networks and Communal Context", IEEE/WIC/ACM International Conference on Web Intelligence (WI), 2022.
Abebe, M., H. Lazu, and K. Daudjee, "Proteus: Autonomous Adaptive Storage for Mixed Workloads", ACM International Conference on Management of Data (SIGMOD), 2022.
Li, H., S. Zhuang, X. Ma, J. Lin, and G. Zuccon, "Pseudo-Relevance Feedback With Dense Retrievers in Pyserini", Australasian Document Computing Symposium (ADCS), 2022.
Maiyya, S., S. Ibrahim, C. Scarberry, D. Agrawal, A. El Abbadi, H. Lin, S. Tessaro, and V. Zakhary, "QuORAM: A Quorum-Replicated Fault Tolerant ORAM Datastore", USENIX Security Symposium, 2022.
Kamphuis, C., F. Hasibi, J. Lin, and A. P. de Vries, "REBL: Entity Linking at Scale (Prototype)", Conference on Design of Experimental Search & Information Retrieval Systems (DESIRES), 2022.
Ilyas, I., T. Rekatsinas, V. Konda, J. Pound, X. Qi, and M. A. Soliman, "Saga: A Platform for Continuous Construction and Serving of Knowledge at Scale", ACM International Conference on Management of Data (SIGMOD), 2022.
Hu, X., Y. Liu, H. Xiu, P. K. Agarwal, D. Panigrahi, S. Roy, and J. Yang, "Selectivity Functions of Range Queries Are Learnable", ACM International Conference on Management of Data (SIGMOD), 2022.
Lin, J., D. Alfonso-Hermelo, V. Jeronymo, E. Kamalloo, C. Lassance, R. Frassetto Nogueira, O. Ogundepo, M. Rezagholizadeh, N. Thakur, J-H. Yang, et al., "Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval", Text Retrieval Conference (TREC), 2022.
Tang, R., K. Kumar, G. Yang, A. Pandey, Y. Mao, V. Belyaev, M. Emmadi, C. G. Murray, F. Türe, and J. Lin, "SpeechNet: Weakly Supervised, End-to-End Speech Recognition at Industrial Scale", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
Pradeep, R., Y. Liu, X. Zhang, Y. Li, A. Yates, and J. Lin, "Squeezing Water From a Stone: A Bag of Tricks for Further Improving Cross-Encoder Effectiveness for Reranking", European Conference on Information Retrieval (ECIR), 2022.
Tang, R., K. Kumar, J. Xin, P. Vyas, W. Li, G. Yang, Y. Mao, C. G. Murray, and J. Lin, "Temporal Early Exiting for Streaming Speech Commands Recognition", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022.
Abualsaud, M., and M. Smucker, "The Dark Side of Relevance: The Effect of Non-Relevant Results on Search Behavior", Conference on Human Information Interaction and Retrieval (CHIIR), 2022.
Mohapatra, S., S. Sasy, X. He, G. Kamath, and O. Thakkar, "The Role of Adaptive Optimizers for Honest Private Hyperparameter Selection", AAAI Conference on Artificial Intelligence (AAAI), 2022.
Li, H., S. Wang, S. Zhuang, A. Mourad, X. Ma, J. Lin, and G. Zuccon, "To Interpolate or Not to Interpolate: PRF, Dense and Sparse Retrievers", International Conference on Research and Development in Information Retrieval (SIGIR), 2022.
Voorhees, E. M., N. Craswell, and J. Lin, "Too Many Relevants: Whither Cranfield Test Collections?", International Conference on Research and Development in Information Retrieval (SIGIR), 2022.
Xue, H., F. D. Salim, Y. Ren, and C. Clarke, "Translating Human Mobility Forecasting Through Natural Language Generation", Web Search and Data Mining (WSDM), 2022.
Borgida, A., E. Franconi, D. Toman, and G. Weddell, "Understanding Document Data Sources Using Ontologies With Referring Expressions", Australian Joint Conference on Artificial Intelligence (AUS-AI), 2022.
Arabzadeh, N., M. Seifikar, and C. Clarke, "Unsupervised Question Clarity Prediction Through Retrieved Item Coherency", International Conference on Information and Knowledge Management (CIKM), 2022.
Tahami, A. Vakili, D. Zhang, and M. Smucker, "UWaterlooMDS at the TREC 2022 Health Misinformation Track", Text Retrieval Conference (TREC), 2022.
Durvasula, S., R. Kiguru, S. Mathur, J. Xu, J. Lin, and N. Vijaykumar, "VoxelCache: Accelerating Online Mapping in Robotics and 3D Reconstruction Tasks", International Conference on Parallel Architectures and Compilation Techniques (PACT), 2022.
Huo, S., X. Yan, and C. Clarke, "WaterlooClarke at the TREC 2022 Conversational Assistant Track", Text Retrieval Conference (TREC), 2022.
Shi, P., R. Zhang, H. Bai, and J. Lin, "XRICL: Cross-Lingual Retrieval-Augmented In-Context Learning for Cross-Lingual Text-to-SQL Semantic Parsing", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022.
Mhedhbi, A., P. Gupta, S. Khaliq, and S. Salihoglu, "A+ Indexes: Tunable and Space-Efficient Adjacency Lists in Graph Database Management Systems", IEEE International Conference on Data Engineering (ICDE), 2021.
Parsa, M. S., and L. Golab, "Academic Integrity in Online Education During the COVID-19 Pandemic: A Social Media Mining Study", Educational Data Mining (EDM), 2021.
Hu, X., P. Koutris, and S. Blanas, "Algorithms for a Topology-Aware Massively Parallel Computation Model", ACM Symposium on Principles of Database Systems (PODS), 2021.
Chopra, S., and L. Golab, "Analyzing Ranking Strategies to Characterize Competition for Co-Operative Work Placements", Educational Data Mining (EDM), 2021.
Zhong, W., X. Zhang, J. Xin, R. Zanibbi, and J. Lin, "Approach Zero and Anserini at the CLEF-2021 ARQMath Track: Applying Substructure Search and BM25 on Operator Tree Path Tokens", Conference and Labs of the Evaluation Forum (CLEF), 2021.
Brown, D. G., L. Byl, and M. Grossman, "Are Machine Learning Corpora 'Fair Dealing' Under Canadian Law?", International Conference on Computational Creativity (ICCC), 2021.
Xin, J., R. Tang, Y. Yu, and J. Lin, "BERxiT: Early Exiting for BERT With Better Fine-Tuning and Extension to Regression", Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021.
Alway, K., E. Blais, and S. Salihoglu, "Box Covers and Domain Orderings for Beyond Worst-Case Join Processing", International Conference on Database Theory (ICDT), 2021.
Zhang, E., S-C. Lin, J-H. Yang, R. Pradeep, R. Nogueira, and J. Lin, "Chatty Goose: A Python Framework for Conversational Search", International Conference on Research and Development in Information Retrieval (SIGIR), 2021.
Zhang, X., A. Yates, and J. Lin, "Comparing Score Aggregation Approaches for Document Retrieval With Pretrained Transformers", European Conference on Information Retrieval (ECIR), 2021.
Lin, S-C., J-H. Yang, and J. Lin, "Contextualized Query Embeddings for Conversational Search", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021.
Hu, X., "Cover or Pack: New Upper and Lower Bounds for Massively Parallel Joins", ACM Symposium on Principles of Database Systems (PODS), 2021.
Glasbergen, B., F. Wu, and K. Daudjee, "Dendrite: Bolt-on Adaptivity for Data Systems", ACM International Conference on Management of Data (SIGMOD), 2021.
Leventidis, A., L. Di Rocco, W. Gatterbauer, R. Miller, and M. Riedewald, "DomainNet: Homograph Detection for Data Lake Disambiguation", International Conference on Extending Database Technology (EDBT), 2021.
Zhang, M., L. Tan, Z. Fu, K. Xiong, J. Lin, M. Li, and Z. Tu, "Don't Change Me! User-Controllable Selective Paraphrase Generation", Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021.
Ng, Y. Ki, D. J. Fraser, B. Kassaie, and F. Tompa, "Dowsing for Answers to Math Questions: Ongoing Viability of Traditional MathIR", Conference and Labs of the Evaluation Forum (CLEF), 2021.
Ng, Y. Ki, D. J. Fraser, B. Kassaie, and F. Tompa, "Dowsing for Math Answers", Conference and Labs of the Evaluation Forum (CLEF), 2021.
Xia, S., B. Chang, K. Knopf, Y. He, Y. Tao, and X. He, "DPGraph: A Benchmark Platform for Differentially Private Graph Analysis", ACM International Conference on Management of Data (SIGMOD), 2021.
Agarwal, P. K., X. Hu, S. Sintos, and J. Yang, "Dynamic Enumeration of Similarity Joins", International Colloquium on Automata, Languages and Programming (ICALP), 2021.
Kargar, M., L. Golab, D. Srivastava, J. Szlichta, and M. Zihayat, "Effective Keyword Search in Weighted Graphs (Extended Abstract)", IEEE International Conference on Data Engineering (ICDE), 2021.
Karegar, R., P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "Efficient Discovery of Approximate Order Dependencies", International Conference on Extending Database Technology (EDBT), 2021.
Hofstätter, S., S-C. Lin, J-H. Yang, J. Lin, and A. Hanbury, "Efficiently Teaching an Effective Dense Retriever With Balanced Topic Aware Sampling", International Conference on Research and Development in Information Retrieval (SIGIR), 2021.
Deep, S., X. Hu, and P. Koutris, "Enumeration Algorithms for Conjunctive Queries With Projection", International Conference on Database Theory (ICDT), 2021.
Clarke, C., C. Luo, and M. Smucker, "Evaluation Measures Based on Preference Graphs", International Conference on Research and Development in Information Retrieval (SIGIR), 2021.
Golab, L., and D. Srivastava, "Exploring Data Using Patterns: A Survey and Open Problems", International Workshop on Data Warehousing and OLAP (DOLAP), 2021.
Jiang, K., R. Pradeep, and J. Lin, "Exploring Listwise Evidence Reasoning With T5 for Fact Verification", Association for Computational Linguistics (ACL), 2021.
Chen, H. H., S. Mohapatra, G. Michalopoulos, X. He, and I. McKillop, "Federated Deep Learning Architecture for Personalized Healthcare", Medical Informatics Europe (MIE), 2021.
Toman, D., and G. Weddell, "FO Rewritability for OMQ Using Beth Definability and Interpolation", International Workshop on Description Logics (DL), 2021.
Sahu, S., and S. Salihoglu, "Graphsurge: Graph Analytics on View Collections Using Differential Computation", ACM International Conference on Management of Data (SIGMOD), 2021.
Jiang, Z., R. Tang, J. Xin, and J. Lin, "How Does BERT Rerank Passages? An Attribution Analysis With Information Bottlenecks", Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2021.
Lin, S-C., J-H. Yang, and J. Lin, "In-Batch Negatives for Knowledge Distillation With Tightly-Coupled Teachers for Dense Retrieval", Workshop on Representation Learning for NLP (RepL4NLP), 2021.
Farhat, O., K. Daudjee, and L. Querzoni, "Klink: Progress-Aware Scheduling for Streaming Data Systems", ACM International Conference on Management of Data (SIGMOD), 2021.
Xia, S., N. Anzum, S. Salihoglu, and J. Zhao, "KTabulator: Interactive Ad Hoc Table Creation Using Knowledge Graphs", ACM Conference on Human Factors in Computing Systems (CHI), 2021.
Zhang, Y., C. Hu, Y. Liu, H. Fang, and J. Lin, "Learning to Rank in the Age of Muppets: Effectiveness-Efficiency Tradeoffs in Multi-Stage Ranking", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021.
Craswell, N., B. Mitra, E. Yilmaz, D. Campos, and J. Lin, "MS MARCO: Benchmarking Ranking Models in the Large-Data Regime", International Conference on Research and Development in Information Retrieval (SIGIR), 2021.
Li, M., M. Li, K. Xiong, and J. Lin, "Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question Answering", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021.
Langendoen, K., B. Glasbergen, and K. Daudjee, "NIR-Tree: A Non-Intersecting R-Tree", International Conference on Statistical and Scientific Database Management (SSDBM), 2021.
Lin, J., X. Ma, J. Mackenzie, and A. Mallia, "On the Separation of Logical and Physical Ranking Models for Text Retrieval Applications", Conference on Design of Experimental Search & Information Retrieval Systems (DESIRES), 2021.
Craswell, N., B. Mitra, E. Yilmaz, D. Campos, and J. Lin, "Overview of the TREC 2021 Deep Learning Track", Text Retrieval Conference (TREC), 2021.
Clarke, C., M. Maistro, and M. Smucker, "Overview of the TREC 2021 Health Misinformation Track", Text Retrieval Conference (TREC), 2021.
Shafieinejad, M., F. Kerschbaum, and I. Ilyas, "PCOR: Private Contextual Outlier Release via Differentially Private Search", ACM International Conference on Management of Data (SIGMOD), 2021.
He, X., J. Rogers, J. Bater, A. Machanavajjhala, C. Wang, and X. Wang, "Practical Security and Privacy for Database Systems", ACM International Conference on Management of Data (SIGMOD), 2021.
Arabzadeh, N., X. Yan, and C. Clarke, "Predicting Efficiency/Effectiveness Trade-Offs for Dense vs. Sparse Retrieval Strategy Selection", International Conference on Information and Knowledge Management (CIKM), 2021.
Yates, A., R. Nogueira, and J. Lin, "Pretrained Transformers for Text Ranking: BERT and Beyond", International Conference on Research and Development in Information Retrieval (SIGIR), 2021.
Yates, A., R. Nogueira, and J. Lin, "Pretrained Transformers for Text Ranking: BERT and Beyond", Web Search and Data Mining (WSDM), 2021.
Toman, D., and G. Weddell, "Projective Beth Definability and Craig Interpolation for Relational Query Optimization (Material to Accompany Invited Talk)", International Conference on Principles of Knowledge Representation and Reasoning (KR), 2021.
Livshits, E., R. Kochirgan, S. Tsur, I. Ilyas, B. Kimelfeld, and S. Roy, "Properties of Inconsistency Measures for Databases", ACM International Conference on Management of Data (SIGMOD), 2021.
Zhong, W., and J. Lin, "PYA0: A Python Toolkit for Accessible Math-Aware Search", International Conference on Research and Development in Information Retrieval (SIGIR), 2021.
Lin, J., X. Ma, S-C. Lin, J-H. Yang, R. Pradeep, and R. Nogueira, "Pyserini: A Python Toolkit for Reproducible Information Retrieval Research With Sparse and Dense Representations", International Conference on Research and Development in Information Retrieval (SIGIR), 2021.
Anzum, N., and S. Salihoglu, "R2GSync and Edge Views: Practical RDBMS to GDBMS Synchronization", ACM International Conference on Management of Data (SIGMOD), 2021.
Odunayo, O., N. N. Sookoo, G. Bathla, A. Cavallin, B. D. Persaud, K. Szigeti, P. Van Cappellen, and J. Lin, "Rescuing Historical Climate Observations to Support Hydrological Research: A Case Study of Solar Radiation Data", ACM Symposium on Document Engineering (DocEng), 2021.
Nemec, J., H. Davoudi, L. Golab, M. Kargar, Y. Lytvyn, P. Mierzejewski, J. Szlichta, and M. Zihayat, "RW-Team: Robust Team Formation Using Random Walk", International Conference on Information and Knowledge Management (CIKM), 2021.
Maiyya, S., I. Ahmad, D. Agrawal, and A. El Abbadi, "Samya: A Geo-Distributed Data System for High Contention Aggregate Data", IEEE International Conference on Data Engineering (ICDE), 2021.
Pradeep, R., X. Ma, R. Frassetto Nogueira, and J. Lin, "Scientific Claim Verification With VerT5erini", International Workshop on Health Text Mining and Information Analysis (Louhi), 2021.
Bai, H., P. Shi, J. Lin, Y. Xie, L. Tan, K. Xiong, W. Gao, and M. Li, "Segatron: Segment-Aware Transformer for Language Modeling and Understanding", AAAI Conference on Artificial Intelligence (AAAI), 2021.
Bai, H., P. Shi, J. Lin, L. Tan, K. Xiong, W. Gao, J. Liu, and M. Li, "Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation With GPT2", Association for Computational Linguistics (ACL), 2021.
Anand, M., J. Zhang, S. Ding, J. Xin, and J. Lin, "Serverless BM25 Search and BERT Reranking", Conference on Design of Experimental Search & Information Retrieval Systems (DESIRES), 2021.
Lin, J., D. Campos, N. Craswell, B. Mitra, and E. Yilmaz, "Significant Improvements Over the State of the Art? A Case Study of the MS MARCO Document Ranking Leaderboard", International Conference on Research and Development in Information Retrieval (SIGIR), 2021.
Ma, X., M. Li, K. Sun, J. Xin, and J. Lin, "Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021.
Xin, J., R. Tang, Y. Yu, and J. Lin, "The Art of Abstention: Selective Prediction and Error Regularization for Natural Language Processing", Association for Computational Linguistics (ACL), 2021.
Han, X., Y. Liu, and J. Lin, "The Simplest Thing That Can Possibly Work: (Pseudo-)Relevance Feedback via Text Classification", International Conference on the Theory of Information Retrieval (ICTIR), 2021.
Mitra, A., C. Gorenflo, L. Golab, and S. Keshav, "TimeFabric: Trusted Time for Permissioned Blockchains", International Symposium on Foundations and Applications of Blockchain (FAB), 2021.
Bashardoost, B. Ghadiri, K. A. Lyons, and R. Miller, "Towards Knowledge Exchange: State-of-the-Art and Open Problems", Conference on Current Trends in Theory and Practice of Computer Science (SOFSEM), 2021.
Deshmukh, A. Anand, Q. Zhang, M. Li, J. Lin, and L. Mou, "Unsupervised Chunking as Syntactic Structure Induction With a Knowledge-Transfer Approach", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021.
Abualsaud, M., K. Ghajar, L. Nhi Phan Minh, D. Zhang, I. Xiangyi Chen, M. Smucker, and A. Vakili Tahami, "UWaterlooMDS at the TREC 2021 Health Misinformation Track", Text Retrieval Conference (TREC), 2021.
Pradeep, R., X. Ma, R. Nogueira, and J. Lin, "Vera: Prediction Techniques for Reducing Harmful Misinformation in Consumer Health Search", International Conference on Research and Development in Information Retrieval (SIGIR), 2021.
Abualsaud, M., M. Smucker, and C. Clarke, "Visualizing Searcher Gaze Patterns", Conference on Human Information Interaction and Retrieval (CHIIR), 2021.
Tang, R., K. Kumar, K. Chalkley, J. Xin, L. Zhang, W. Li, G. Yang, Y. Mao, J. Shin, G. Craig Murray, et al., "Voice Query Auto Completion", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021.
Yan, X., C. Clarke, and N. Arabzadeh, "WaterlooClarke at the TREC 2021 Conversational Assistant Track", Text Retrieval Conference (TREC), 2021.
Kassaie, B., and F. Tompa, "A Framework for Extracted View Maintenance", ACM Symposium on Document Engineering (DocEng), 2020.
Yilmaz, Z. Akkalyoncu, C. Clarke, and J. Lin, "A Lightweight Environment for Learning Experimental IR Research Practices", International Conference on Research and Development in Information Retrieval (SIGIR), 2020.
Zhang, X., A. Yates, and J. Lin, "A Little Bit Is Worse Than None: Ranking With Limited Training Data", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020.
Vtyurina, A., C. Clarke, E. Law, J. R. Trippas, and H. Bota, "A Mixed-Method Analysis of Text and Audio Search Interfaces With Varying Task Complexity", International Conference on the Theory of Information Retrieval (ICTIR), 2020.
Ghenai, A., M. Smucker, and C. Clarke, "A Think-Aloud Study to Understand Factors Affecting Online Health Search", Conference on Human Information Interaction and Retrieval (CHIIR), 2020.
Gauch, M., J. Bai, J. Mai, and J. Lin, "An Open-Source Interface to the Canadian Surface Prediction Archive", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2020.
Tu, Z., W. Yang, Z. Fu, Y. Xie, L. Tan, K. Xiong, M. Li, and J. Lin, "Approximate Nearest Neighbor Search and Lightweight Dense Vector Reranking in Multi-Stage Retrieval Architectures", International Conference on the Theory of Information Retrieval (ICTIR), 2020.
Wu, R., A. Zhang, I. Ilyas, and T. Rekatsinas, "Attention-Based Learning for Missing Data Imputation in HoloClean", Conference on Machine Learning and Systems (MLSys), 2020.
Agrawal, D., A. El Abbadi, M. Javad Amiri, S. Maiyya, and V. Zakhary, "Blockchains and Databases: Opportunities and Challenges for the Permissioned and the Permissionless", Symposium on Advances in Databases and Information Systems (ADBIS), 2020.
Yates, A., S. Arora, X. Zhang, W. Yang, K. Martin Jose, and J. Lin, "Capreolus: A Toolkit for End-to-End Neural Ad Hoc Retrieval", Web Search and Data Mining (WSDM), 2020.
Glasbergen, B., K. Langendoen, M. Abebe, and K. Daudjee, "ChronoCache: Predictive and Adaptive Mid-Tier Query Result Caching", ACM International Conference on Management of Data (SIGMOD), 2020.
Tao, Y., X. He, A. Machanavajjhala, and S. Roy, "Computing Local Sensitivities of Counting Queries With Joins", ACM International Conference on Management of Data (SIGMOD), 2020.
Agarwal, R. Raj, D. Kumar, L. Golab, and S. Keshav, "Consentio: Managing Consent to Data Access Using Permissioned Blockchains", IEEE International Conference on Blockchain and Cryptocurrency (ICBC), 2020.
Adewoye, T., X. Han, N. Ruest, I. Milligan, S. Fritz, and J. Lin, "Content-Based Exploration of Archival Images Using Neural Networks", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2020.
Zhang, E., N. Gupta, R. Tang, X. Han, R. Pradeep, K. Lu, Y. Zhang, R. Nogueira, K. Cho, H. Fang, et al., "Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020.
Shi, P., H. Bai, and J. Lin, "Cross-Lingual Training of Neural Models for Document Ranking", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020.
Chowdhury, A. Roy, C. Wang, X. He, A. Machanavajjhala, and S. Jha, "Cryptε: Crypto-Assisted Differential Privacy on Untrusted Servers", ACM International Conference on Management of Data (SIGMOD), 2020.
Ding, S., E. Zhang, and J. Lin, "Cydex: Neural Search Infrastructure for the Scholarly Literature", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020.
Xin, J., R. Tang, J. Lee, Y. Yu, and J. Lin, "DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference", Association for Computational Linguistics (ACL), 2020.
Yang, J-H., S-C. Lin, R. Nogueira, M-F. Tsai, C-J. Wang, and J. Lin, "Designing Templates for Eliciting Commonsense Knowledge From Pretrained Sequence-to-Sequence Models", International Conference on Computational Linguistics (COLING), 2020.
Xie, Y., W. Yang, L. Tan, K. Xiong, N. Jing Yuan, B. Huai, M. Li, and J. Lin, "Distant Supervision for Multi-Stage Fine-Tuning in Retrieval-Based Question Answering", The Web Conference (WWW), 2020.
Nogueira, R., Z. Jiang, R. Pradeep, and J. Lin, "Document Ranking With a Pretrained Sequence-to-Sequence Model", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020.
Ng, Y. Ki, D. J. Fraser, B. Kassaie, G. Labahn, M. S. Marzouk, F. Tompa, and K. Wang, "Dowsing for Math Answers With Tangent-L", Conference and Labs of the Evaluation Forum (CLEF), 2020.
Abebe, M., B. Glasbergen, and K. Daudjee, "DynaMast: Adaptive Dynamic Mastering for Replicated Systems", IEEE International Conference on Data Engineering (ICDE), 2020.
Xin, J., R. Nogueira, Y. Yu, and J. Lin, "Early Exiting BERT for Efficient Document Ranking", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020.
Zhang, X., T. Ozsu, and L. Chen, "ELite: Cost-Effective Approximation of Exploration-Based Graph Analysis", ACM International Conference on Management of Data (SIGMOD), 2020.
Szlichta, J., P. Godfrey, L. Golab, M. Kargar, and D. Srivastava, "Erratum for Discovering Order Dependencies Through Order Compatibility (EDBT 2019)", International Conference on Extending Database Technology (EDBT), 2020.
Nogueira, R., Z. Jiang, K. Cho, and J. Lin, "Evaluating Pretrained Transformer Models for Citation Recommendation", International Workshop on Bibliometric-enhanced Information Retrieval (BIR), 2020.
Adhikari, A., A. Ram, R. Tang, W. L. Hamilton, and J. Lin, "Exploring the Limits of Simple Learners in Knowledge Distillation for Document Classification With DocBERT", Workshop on Representation Learning for NLP (RepL4NLP), 2020.
Deep, S., X. Hu, and P. Koutris, "Fast Join Project Query Evaluation Using Matrix Multiplication", ACM International Conference on Management of Data (SIGMOD), 2020.
Maiyya, S., D. Hyun Bum Cho, D. Agrawal, and A. El Abbadi, "Fides: Managing Data on Untrusted Infrastructure", IEEE International Conference on Distributed Computing Systems (ICDCS), 2020.
Toman, D., and G. Weddell, "First Order Rewritability for Ontology Mediated Querying in Horn-DLFD", International Workshop on Description Logics (DL), 2020.
Yates, A., K. Martin Jose, X. Zhang, and J. Lin, "Flexible IR Pipelines With Capreolus", International Conference on Information and Knowledge Management (CIKM), 2020.
Grand, A., R. Muir, J. Ferenczi, and J. Lin, "From MAXSCORE to Block-Max WAND: The Story of How Lucene Significantly Improved Query Evaluation Performance", European Conference on Information Retrieval (ECIR), 2020.
Yan, D., G. Guo, M. Mashiur Rahman Chowdhury, T. Ozsu, W-S. Ku, and J. C. S. Lui, "G-Thinker: A Distributed Framework for Mining Subgraphs in a Big Graph", IEEE International Conference on Data Engineering (ICDE), 2020.
Lin, J., C. Zhong, D. Hu, C. Rudin, and M. I. Seltzer, "Generalized and Scalable Optimal Sparse Decision Trees", International Conference on Machine Learning (ICML), 2020.
Zeng, L., L. Zou, T. Ozsu, L. Hu, and F. Zhang, "GSI: GPU-friendly Subgraph Isomorphism", IEEE International Conference on Data Engineering (ICDE), 2020.
Pradeep, R., X. Ma, X. Zhang, H. Cui, R. Xu, R. Nogueira, and J. Lin, "H2oloo at TREC 2020: When All You Got Is a Hammer... Deep Learning, Health Misinformation, and Precision Medicine", Text Retrieval Conference (TREC), 2020.
Jiang, Z., R. Tang, J. Xin, and J. Lin, "Inserting Information Bottleneck for Attribution in Transformers", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020.
Kumar, D., L. Mou, L. Golab, and O. Vechtomova, "Iterative Edit-Based Unsupervised Sentence Simplification", Association for Computational Linguistics (ACL), 2020.
Farhat, O., H. Bindra, and K. Daudjee, "Leaving Stragglers at the Window: Low-Latency Stream Sampling With Accuracy Guarantees", Distributed Event-Based Systems (DEBS), 2020.
Xiang, Z., B. Ding, X. He, and J. Zhou, "Linear and Range Counting Under Metric-Based Local Differential Privacy", International Symposium on Information Theory (ISIT), 2020.
Agarwal, R. Raj, R. Cohen, L. Golab, and A. Tsang, "Locating Influential Agents in Social Networks: Budget-Constrained Seed Set Selection", Canadian Conference on Artificial Intelligence (AI), 2020.
Buchanan, G., D. McKay, C. Clarke, L. Azzopardi, and J. R. Trippas, "Made to Measure: A Workshop on Human-Centred Metrics for Information Seeking", Conference on Human Information Interaction and Retrieval (CHIIR), 2020.
Li, Q., T. Ozsu, and H. Xiong, "Message From the General Chairs of DSC 2020", International Conference on Data Science in Cyberspace (DSC), 2020.
Grossman, M., G. Cormack, and B. Pham, "MRG_UWaterloo Participation in the TREC 2020 Precision Medicine Track", Text Retrieval Conference (TREC), 2020.
Clarke, C., M. Smucker, and A. Vtyurina, "Offline Evaluation by Maximum Similarity to an Ideal Ranking", International Conference on Information and Knowledge Management (CIKM), 2020.
Clarke, C., A. Vtyurina, and M. Smucker, "Offline Evaluation Without Gain", International Conference on the Theory of Information Retrieval (ICTIR), 2020.
Nargesian, F., K. Q. Pu, E. Zhu, B. Ghadiri Bashardoost, and R. Miller, "Organizing Data Lakes for Navigation", ACM International Conference on Management of Data (SIGMOD), 2020.
Clarke, C., S. Rizvi, M. Smucker, M. Maistro, and G. Zuccon, "Overview of the TREC 2020 Health Misinformation Track", Text Retrieval Conference (TREC), 2020.
Hu, X., and K. Yi, "Parallel Algorithms for Sparse Matrix Multiplication and Join-Aggregate Queries", ACM Symposium on Principles of Database Systems (PODS), 2020.
Meng, X., and L. Golab, "Parallel Scheduling of Data-Intensive Tasks", European Conference on Parallel Processing (Euro-Par), 2020.
Khan, A., and L. Golab, "Reddit Mining to Understand Gendered Movements", International Conference on Extending Database Technology (EDBT), 2020.
Jacobs, A., S. Chopra, and L. Golab, "Reddit Mining to Understand Women's Issues in STEM", International Conference on Extending Database Technology (EDBT), 2020.
Pacaci, A., A. Bonifati, and T. Ozsu, "Regular Path Query Evaluation on Streaming Graphs", ACM International Conference on Management of Data (SIGMOD), 2020.
Lin, J., and Q. Zhang, "Reproducibility Is a Process, Not an Achievement: The Replicability of IR Reproducibility Experiments", European Conference on Information Retrieval (ECIR), 2020.
Guo, R. Benson, and K. Daudjee, "Research Challenges in Deep Reinforcement Learning-Based Join Query Optimization", ACM International Conference on Management of Data (SIGMOD), 2020.
Mior, M. J., and K. Salem, "ReSpark: Automatic Caching for Iterative Applications in Apache Spark", IEEE International Conference on Big Data (IEEE BigData), 2020.
Amiri, M. Javad, S. Maiyya, D. Agrawal, and A. El Abbadi, "SeeMoRe: A Fault-Tolerant Protocol for Hybrid Cloud Environments", IEEE International Conference on Data Engineering (ICDE), 2020.
Glasbergen, B., M. Abebe, K. Daudjee, D. Vogel, and J. Zhao, "Sentinel: Understanding Data Systems", ACM International Conference on Management of Data (SIGMOD), 2020.
Tang, R., J. Lee, J. Xin, X. Liu, Y. Yu, and J. Lin, "Showing Your Work Doesn't Always Work", Association for Computational Linguistics (ACL), 2020.
Satuluri, V., Y. Wu, X. Zheng, Y. Qian, B. Wichers, Q. Dai, G. Ming Tang, J. Jiang, and J. Lin, "SimClusters: Community-Based Representations for Heterogeneous Recommendations at Twitter", ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2020.
Parsa, M. S., and L. Golab, "Social Media Mining to Understand the Impact of Co-Operative Education on Mental Health", Educational Data Mining (EDM), 2020.
Ozsu, T., "Streaming Graph Processing and Analytics", Distributed Event-Based Systems (DEBS), 2020.
Lin, J., J. M. Mackenzie, C. Kamphuis, C. Macdonald, A. Mallia, M. Siedlaczek, A. Trotman, and A. P. de Vries, "Supporting Interoperability Between Open-Source Search Engines With the Common Index File Format", International Conference on Research and Development in Information Retrieval (SIGIR), 2020.
Naseem, S. Saad, D. Kumar, M. S. Parsa, and L. Golab, "Text Mining of COVID-19 Discussions on Reddit", IEEE/WIC/ACM International Conference on Web Intelligence (WI), 2020.
Ruest, N., J. Lin, I. Milligan, and S. Fritz, "The Archives Unleashed Project: Technology, Process, and Community to Improve Scholarly Access to Web Archives", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2020.
Lin, S-C., J-H. Yang, and J. Lin, "TREC 2020 Notebook: CAsT Track", Text Retrieval Conference (TREC), 2020.
Shahidi, H., M. Li, and J. Lin, "Two Birds, One Stone: A Simple, Unified Model for Text Generation From Structured and Unstructured Data", Association for Computational Linguistics (ACL), 2020.
Sequiera, R., L. Tan, Y. Zhang, and J. Lin, "Update Delivery Mechanisms for Prospective Information Needs: A Reproducibility Study", Conference on Human Information Interaction and Retrieval (CHIIR), 2020.
Arabzadeh, N., and C. Clarke, "WaterlooClarke at the TREC 2020 Conversational Assistant Track", Text Retrieval Conference (TREC), 2020.
Lin, J., I. Milligan, D. W. Oard, N. Ruest, and K. Shilton, "We Could, but Should We?: Ethical Considerations for Providing Access to GeoCities and Other Historical Digital Collections", Conference on Human Information Interaction and Retrieval (CHIIR), 2020.
Kamphuis, C., A. P. de Vries, L. Boytsov, and J. Lin, "Which BM25 Do You Mean? A Large-Scale Reproducibility Study of Scoring Variants", European Conference on Information Retrieval (ECIR), 2020.
Gorenflo, C., L. Golab, and S. Keshav, "XOX Fabric: A Hybrid Approach to Blockchain Transaction Execution", IEEE International Conference on Blockchain and Cryptocurrency (ICBC), 2020.
De Sa, C., I. Ilyas, B. Kimelfeld, C. Ré, and T. Rekatsinas, "A Formal Framework for Probabilistic Unclean Databases", International Conference on Database Theory (ICDT), 2019.
Kushagra, S., H. Saxena, I. Ilyas, and S. Ben-David, "A Semi-Supervised Framework of Clustering Selection for De-Duplication", IEEE International Conference on Data Engineering (ICDE), 2019.
Yang, H-W., Y. Zou, P. Shi, W. Lu, J. Lin, and X. Sun, "Aligning Cross-Lingual Entities With Multi-Aspect Information", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019.
Ge, C., X. He, I. Ilyas, and A. Machanavajjhala, "APEx: Accuracy-Aware Differentially Private Data Exploration", ACM International Conference on Management of Data (SIGMOD), 2019.
Yilmaz, Z. Akkalyoncu, S. Wang, W. Yang, H. Zhang, and J. Lin, "Applying BERT to Document Retrieval With Birch", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019.
Heidari, A., I. Ilyas, and T. Rekatsinas, "Approximate Inference in Structured Instances With Noisy Categorical Observations", Conference on Uncertainty in Artificial Intelligence (UAI), 2019.
Rao, J., L. Liu, Y. Tay, H-W. Yang, P. Shi, and J. Lin, "Bridging the Gap Between Relevance Matching and Semantic Matching for Short Text Similarity Modeling", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019.
Davoudi, H., P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "Bring Order to Data", Alberto Mendelzon International Workshop on Foundations of Data Management (AMW), 2019.
Milligan, I., N. Casemajor, S. Fritz, J. Lin, N. Ruest, M. S. Weber, and N. Worby, "Building Community and Tools for Analyzing Web Archives Through Datathons", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2019.
Ilyas, I., "Building Scalable Machine Learning Solutions for Data Cleaning", Datenbanksysteme für Business, Technologie und Web (BTW), 2019.
Türe, F., J. Rao, R. Tang, and J. Lin, "Challenges and Opportunities in Understanding Spoken Queries Directed at Modern Entertainment Platforms", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Yang, W., K. Lu, P. Yang, and J. Lin, "Critically Examining the "Neural Hype": Weak Baselines and the Additivity of Effectiveness Gains From Neural Ranking Models", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Yilmaz, Z. Akkalyoncu, W. Yang, H. Zhang, and J. Lin, "Cross-Domain Modeling of Sentence-Level Evidence for Document Retrieval", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019.
Neumann, T., and K. Salem, "DaMoN 19: The 15th International Workshop on Data Management on New Hardware", ACM International Conference on Management of Data (SIGMOD), 2019.
Maiyya, S., V. Zakhary, M. Javad Amiri, D. Agrawal, and A. El Abbadi, "Database and Distributed Computing Foundations of Blockchains", ACM International Conference on Management of Data (SIGMOD), 2019.
Yang, W., L. Tan, C. Lu, A. Cui, H. Li, X. Chen, K. Xiong, M. Wang, M. Li, J. Pei, et al., "Detecting Customer Complaint Escalation With Recurrent Neural Networks and Manually-Engineered Features", North American Chapter of the Association for Computational Linguistics (NAACL), 2019.
Saxena, H., L. Golab, and I. Ilyas, "Distributed Discovery of Functional Dependencies", IEEE International Conference on Data Engineering (ICDE), 2019.
Alonso, G., C. Binnig, I. Pandis, K. Salem, J. Skrzypczak, R. Stutsman, L. Thostrup, T. Wang, Z. Wang, and T. Ziegler, "DPI: The Data Processing Interface for Modern Networks", Conference on Innovative Data Systems Research (CIDR), 2019.
Cormack, G., H. Zhang, N. Ghelani, M. Abualsaud, M. Smucker, M. Grossman, S. Rahbariasl, and A. Ghenai, "Dynamic Sampling Meets Pooling", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Yang, W., Y. Xie, A. Lin, X. Li, L. Tan, K. Xiong, M. Li, and J. Lin, "End-to-End Open-Domain Question Answering With BERTserini", North American Chapter of the Association for Computational Linguistics (NAACL), 2019.
Toman, D., and G. Weddell, "Exhaustive Query Answering via Referring Expressions", International Workshop on Description Logics (DL), 2019.
Pacaci, A., and T. Ozsu, "Experimental Analysis of Streaming Algorithms for Graph Partitioning", ACM International Conference on Management of Data (SIGMOD), 2019.
Le Guilly, M., J-M. Petit, V-M. Scuturici, and I. Ilyas, "ExplIQuE: Interactive Databases Exploration With SQL", International Conference on Information and Knowledge Management (CIKM), 2019.
Gorenflo, C., S. Lee, L. Golab, and S. Keshav, "FastFabric: Scaling Hyperledger Fabric to 20,000 Transactions Per Second", IEEE International Conference on Blockchain and Cryptocurrency (ICBC), 2019.
Toman, D., and G. Weddell, "Finding ALL Answers to OBDA Queries Using Referring Expressions", Australian Joint Conference on Artificial Intelligence (AUS-AI), 2019.
McIntyre, S., D. Toman, and G. Weddell, "FunDL - A Family of Feature-Based Description Logics, With Applications in Querying Structured Data Sources", Description Logic, Theory Combination, and All That - Essays Dedicated to Franz Baader, 2019.
Chopra, S., A. Khan, M. Mirsafian, and L. Golab, "Gender Differences in Science and Engineering: A Data Mining Approach", International Conference on Extending Database Technology (EDBT), 2019.
Chopra, S., A. Khan, M. Mirsafian, and L. Golab, "Gender Differences in Work-Integrated Learning Assessments", Educational Data Mining (EDM), 2019.
Anzum, N., S. Salihoglu, and D. Vogel, "GraphWrangler: An Interactive Graph View on Relational Data", ACM International Conference on Management of Data (SIGMOD), 2019.
Heidari, A., J. McGrath, I. Ilyas, and T. Rekatsinas, "HoloDetect: Few-Shot Learning for Error Detection", ACM International Conference on Management of Data (SIGMOD), 2019.
Lee, J., R. Tang, and J. Lin, "Honkling: In-Browser Personalization for Ubiquitous Keyword Spotting", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019.
McCoy, A. B., D. F. Sittig, J. Lin, and A. Wright, "Identification and Ranking of Biomedical Informatics Researcher Citation Statistics Through a Google Scholar Scraper", American Medical Informatics Association Annual Symposium (AMIA), 2019.
Toman, D., and G. Weddell, "Identity Resolution in Ontology Based Data Access to Structured Data Sources", Pacific Rim International Conference on Artificial Intelligence (PRICAI), 2019.
Liu, L., W. Yang, J. Rao, R. Tang, and J. Lin, "Incorporating Contextual and Syntactic Structures Improves Semantic Similarity Modeling", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019.
Clancy, R., J. Lee, Z. Akkalyoncu Yilmaz, and J. Lin, "Information Retrieval Meets Scalable Text Analytics: Solr Integration With Spark", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Vollmer, M., L. Golab, K. Böhm, and D. Srivastava, "Informative Summarization of Numeric Data", International Conference on Statistical and Scientific Database Management (SSDBM), 2019.
Hu, X., and K. Yi, "Instance and Output Optimal Parallel Algorithms for Acyclic Joins", ACM Symposium on Principles of Database Systems (PODS), 2019.
Zhu, E., D. Deng, F. Nargesian, and R. Miller, "JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes", ACM International Conference on Management of Data (SIGMOD), 2019.
Clarke, C., "Length Normalization in the Era of Neural Rankers", International Workshop on Evaluating Information Access (EVIA), 2019.
Gorenflo, C., L. Golab, and S. Keshav, "Mitigating Trust Issues in Electric Vehicle Charging Using a Blockchain", Energy-Efficient Computing and Networking (e-Energy), 2019.
Rao, J., W. Yang, Y. Zhang, F. Türe, and J. Lin, "Multi-Perspective Relevance Matching With Hierarchical ConvNets for Social Media Search", AAAI Conference on Artificial Intelligence (AAAI), 2019.
Tang, R., Y. Lu, and J. Lin, "Natural Language Generation for Effective Knowledge Distillation", Workshop on Deep Learning Approaches for Low-Resource Natural Language Processing (DeepLo), 2019.
McIntyre, S., A. Borgida, D. Toman, and G. Weddell, "On Limited Conjunctions and Partial Features in Parameter-Tractable Feature Logics", AAAI Conference on Artificial Intelligence (AAAI), 2019.
Borgida, A., D. Toman, and G. Weddell, "On Special Description Logics for Processes and Plans", International Workshop on Description Logics (DL), 2019.
Kumar, D., R. Cohen, and L. Golab, "Online Abuse Detection: The Value of Preprocessing and Neural Attention Models", Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2019.
Clancy, R., N. Ferro, C. Hauff, J. Lin, T. Sakai, and Z. Zhong Wu, "Overview of the 2019 Open-Source IR Replicability Challenge (OSIRRC 2019)", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Abualsaud, M., and M. Smucker, "Patterns of Search Result Examination: Query to First Action", International Conference on Information and Knowledge Management (CIKM), 2019.
Kassaie, B., and F. Tompa, "Predictable and Consistent Information Extraction", ACM Symposium on Document Engineering (DocEng), 2019.
Rogers, J., J. Bater, X. He, A. Machanavajjhala, M. Suresh, and X. Wang, "Privacy Changes Everything", Very Large Data Bases Conference (VLDB), 2019.
Cormack, G., and M. Grossman, "Quantifying Bias and Variance of System Rankings", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Yang, J-H., S-C. Lin, C-J. Wang, J. Lin, and M-F. Tsai, "Query and Answer Expansion From Conversation History", Text Retrieval Conference (TREC), 2019.
Yang, P., and J. Lin, "Reproducing and Generalizing Semantic Term Matching in Axiomatic Information Retrieval", European Conference on Information Retrieval (ECIR), 2019.
Adhikari, A., A. Ram, R. Tang, and J. Lin, "Rethinking Complex Neural Network Architectures for Document Classification", North American Chapter of the Association for Computational Linguistics (NAACL), 2019.
Yang, H-W., L. Liu, I. Milligan, N. Ruest, and J. Lin, "Scalable Content-Based Analysis of Images in Web Archives With TensorFlow and the Archives Unleashed Toolkit", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2019.
Kushagra, S., S. Ben-David, and I. Ilyas, "Semi-Supervised Clustering for De-Duplication", International Conference on Artificial Intelligence and Statistics (AISTATS), 2019.
Kazhamiaka, M., B. Naveed Memon, C. Kankanamge, S. Sahu, S. Rizvi, B. Wong, and K. Daudjee, "Sift: Resource-Efficient Consensus With RDMA", Conference on Emerging Network Experiment and Technology (CoNEXT), 2019.
Shi, P., J. Rao, and J. Lin, "Simple Attention-Based Representation Learning for Ranking Short Social Media Posts", North American Chapter of the Association for Computational Linguistics (NAACL), 2019.
Yu, R., Y. Xie, and J. Lin, "Simple Techniques for Cross-Collection Relevance Feedback", European Conference on Information Retrieval (ECIR), 2019.
Clancy, R., T. Eskildsen, N. Ruest, and J. Lin, "Solr Integration in the Anserini Information Retrieval Toolkit", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Yan, D., G. Guo, M. Mashiur Ra Chowdhury, T. Ozsu, J. C. S. Lui, and W. Tan, "T-Thinker: A Task-Centric Distributed Framework for Compute-Intensive Divide-and-Conquer Algorithms", ACM Symposium on Principles & Practice of Parallel Programming (PPoPP), 2019.
Deschamps, R., N. Ruest, J. Lin, S. Fritz, and I. Milligan, "The Archives Unleashed Notebook: Madlibs for Jumpstarting Scholarly Exploration of Web Archives", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2019.
Deschamps, R., S. Fritz, J. Lin, I. Milligan, and N. Ruest, "The Cost of a WARC: Analyzing Web Archives in the Cloud", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2019.
Lin, J., and P. Yang, "The Impact of Score Ties on Repeatability in Document Ranking", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Clancy, R., N. Ferro, C. Hauff, J. Lin, T. Sakai, and Z. Zhong Wu, "The SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019)", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Li, Y., L. Zou, T. Ozsu, and D. Zhao, "Time Constrained Continuous Subgraph Search Over Streaming Graphs", IEEE International Conference on Data Engineering (ICDE), 2019.
Rahbariasl, S., and M. Smucker, "Time-Limits and Summaries for Faster Relevance Assessing", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Bashardoost, B. Ghadiri, R. Miller, and K. A. Lyons, "Towards a Benchmark for Knowledge Base Exchange", ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2019.
Cormack, G., and M. Grossman, "Unbiased Low-Variance Estimators for Precision and Related Information Retrieval Effectiveness Measures", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Lee, J., R. Tang, and J. Lin, "Universal Voice-Enabled User Interfaces Using JavaScript", International Conference on Intelligent User Interfaces (IUI), 2019.
Clancy, R., Z. Akkalyoncu Yilmaz, Z. Zhong Wu, and J. Lin, "University of Waterloo Docker Images for OSIRRC at SIGIR 2019", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Deng, D., W. Tao, Z. Abedjan, A. K. Elmagarmid, I. Ilyas, G. Li, S. Madden, M. Ouzzani, M. Stonebraker, and N. Tang, "Unsupervised String Transformation Learning for Entity Consolidation", IEEE International Conference on Data Engineering (ICDE), 2019.
Abualsaud, M., F. C. Beylunioglu, M. Smucker, and R. P. Duimering, "UWaterlooMDS at the TREC 2019 Decision Track", Text Retrieval Conference (TREC), 2019.
Ruest, N., I. Milligan, and J. Lin, "Warclight: A Rails Engine for Web Archive Discovery", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2019.
Abebe, M., B. Glasbergen, and K. Daudjee, "WatDFS: A Project for Understanding Distributed Systems in the Undergraduate Curriculum", Technical Symposium on Computer Science Education (SIGCSE), 2019.
Clarke, C., "WaterlooClarke at the TREC 2019 Conversational Assistant Track", Text Retrieval Conference (TREC), 2019.
Xin, J., J. Lin, and Y. Yu, "What Part of the Neural Network Does This? Understanding LSTMs by Measuring and Dissecting Neurons", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019.
Tang, R., F. Türe, and J. Lin, "Yelling at Your TV: An Analysis of Speech Recognition Errors and Subsequent User Behavior on Entertainment Systems", International Conference on Research and Development in Information Retrieval (SIGIR), 2019.
Zhang, H., M. Abualsaud, and M. Smucker, "A Study of Immediate Requery Behavior in Search", Conference on Human Information Interaction and Retrieval (CHIIR), 2018.
Abualsaud, M., N. Ghelani, H. Zhang, M. Smucker, G. Cormack, and M. Grossman, "A System for Efficient High-Recall Retrieval", International Conference on Research and Development in Information Retrieval (SIGIR), 2018.
Koutris, P., S. Salihoglu, and D. Suciu, "Algorithmic Aspects of Parallel Query Processing", ACM International Conference on Management of Data (SIGMOD), 2018.
Tang, R., W. Wang, Z. Tu, and J. Lin, "An Experimental Analysis of the Power Consumption of Convolutional Neural Networks for Keyword Spotting", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018.
Glasbergen, B., M. Abebe, K. Daudjee, S. Foggo, and A. Pacaci, "Apollo: Learning Query Correlations for Predictive Caching in Geo-Distributed Systems", International Conference on Extending Database Technology (EDBT), 2018.
Cormack, G., and M. Grossman, "Beyond Pooling", International Conference on Research and Development in Information Retrieval (SIGIR), 2018.
Mansour, E., D. Deng, R. Castro Fernandez, A. Ali Qahtan, W. Tao, Z. Abedjan, A. K. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, et al., "Building Data Civilizer Pipelines With an Advanced Workflow Engine", IEEE International Conference on Data Engineering (ICDE), 2018.
Yan, X., L. Yang, H. Zhang, X. Charles Lin, B. Wong, K. Salem, and T. Brecht, "Carousel: Low-Latency Transaction Processing for Globally-Distributed Data", ACM International Conference on Management of Data (SIGMOD), 2018.
Fraser, D. J., A. Kane, and F. Tompa, "Choosing Math Features for BM25 Ranking With Tangent-L", ACM Symposium on Document Engineering (DocEng), 2018.
Liang, Y., Z. Tu, L. Huang, and J. Lin, "CNNs for NLP in the Browser: Client-Side Deployment and Visualization Opportunities", North American Chapter of the Association for Computational Linguistics (NAACL), 2018.
Lin, J., "Computing Without Servers, V8, Rocket Ships, and Other Batsh*t Crazy Ideas in Data Systems", Conference on Design of Experimental Search & Information Retrieval Systems (DESIRES), 2018.
Langouri, M. Alipour, Z. Zheng, F. Chiang, L. Golab, and J. Szlichta, "Contextual Data Cleaning", IEEE International Conference on Data Engineering (ICDE), 2018.
Chopra, S., Y. Helen Jiang, A. Toulis, and L. Golab, "Data Analytics to Improve Co-Operative Education", International Conference on Extending Database Technology (EDBT), 2018.
Tang, R., and J. Lin, "Deep Residual Learning for Small-Footprint Keyword Spotting", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018.
Pacaci, A., and T. Ozsu, "Distribution-Aware Stream Partitioning for Distributed Stream Processing Systems", ACM International Conference on Management of Data (SIGMOD), 2018.
Arora, V., R. Kumar Sure Babu, S. Maiyya, D. Agrawal, A. El Abbadi, X. Xue, Y. Zhi, and J. Zhu, "Dynamic Timestamp Allocation for Reducing Transaction Aborts", IEEE International Conference on Cloud Computing (CLOUD), 2018.
Abebe, M., K. Daudjee, B. Glasbergen, and Y. Tian, "EC-Store: Bridging the Gap Between Storage and Latency in Distributed Erasure Coded Systems", IEEE International Conference on Distributed Computing Systems (ICDCS), 2018.
Zihayat, M., A. An, L. Golab, M. Kargar, and J. Szlichta, "Effective Team Formation in Expert Networks", Alberto Mendelzon International Workshop on Foundations of Data Management (AMW), 2018.
Zhang, H., M. Abualsaud, N. Ghelani, M. Smucker, G. Cormack, and M. Grossman, "Effective User Interaction for High-Recall Retrieval: Less Is More", International Conference on Information and Knowledge Management (CIKM), 2018.
Azmy, M., P. Shi, J. Lin, and I. Ilyas, "Farewell Freebase: Migrating the SimpleQuestions Dataset to DBpedia", International Conference on Computational Linguistics (COLING), 2018.
Tompa, F., "Fashioning a Search Engine to Support Humanities Research", ACM Symposium on Document Engineering (DocEng), 2018.
Mihaylov, A., P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "FASTOD: Bringing Order to Data", IEEE International Conference on Data Engineering (ICDE), 2018.
Zheng, Z., M. Alipour, Z. Qu, I. Currie, F. Chiang, L. Golab, and J. Szlichta, "FastOFD: Contextual Data Cleaning With Ontology Functional Dependencies", International Conference on Extending Database Technology (EDBT), 2018.
Chopra, S., H. Gautreau, A. Khan, M. Mirsafian, and L. Golab, "Gender Differences in Undergraduate Engineering Applicants: A Text Mining Approach", Educational Data Mining (EDM), 2018.
Yu, R., Y. Xie, and J. Lin, "H2oloo at TREC 2018: Cross-Collection Relevance Transfer for the Common Core Track", Text Retrieval Conference (TREC), 2018.
Toman, D., and G. Weddell, "Identity Resolution in Conjunctive Querying Over DL-Based Knowledge Bases", International Workshop on Description Logics (DL), 2018.
Chopra, S., and L. Golab, "Job Description Mining to Understand Work-Integrated Learning", Educational Data Mining (EDM), 2018.
Santoro, D., P. C. Arocena, B. Glavic, G. Mecca, R. Miller, and P. Papotti, "Let's Make It Dirty With BART!", Sistemi Evoluti per Basi di Dati (SEBD), 2018.
Grossman, M., and G. Cormack, "MRG_UWaterloo Participation in the TREC 2018 Common Core Track", Text Retrieval Conference (TREC), 2018.
Peng, P., L. Zou, T. Ozsu, and D. Zhao, "Multi-Query Optimization in Federated RDF Systems", International Conference on Database Systems for Advanced Applications (DASFAA), 2018.
Rao, J., F. Türe, and J. Lin, "Multi-Task Learning With Neural Networks for Voice Query Understanding on an Entertainment Platform", ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2018.
McIntyre, S., A. Borgida, D. Toman, and G. Weddell, "On Limited Conjunctions in Polynomial Feature Logics, With Applications in OBDA", International Conference on Principles of Knowledge Representation and Reasoning (KR), 2018.
Sequiera, R., L. Tan, and J. Lin, "Overview of the TREC 2018 Real-Time Summarization Track", Text Retrieval Conference (TREC), 2018.
Tu, Z., M. Li, and J. Lin, "Pay-Per-Request Deployment of Neural Network Models Using Serverless Architectures", North American Chapter of the Association for Computational Linguistics (NAACL), 2018.
Mackenzie, J. M., S. J. Culpepper, R. Blanco, M. Crane, C. Clarke, and J. Lin, "Query Driven Algorithm Selection in Early Stage Retrieval", Web Search and Data Mining (WSDM), 2018.
Memon, B. Naveed, X. Charles Lin, A. Mufti, A. Scott Wesley, T. Brecht, K. Salem, B. Wong, and B. Cassell, "RaMP: A Lightweight RDMA Abstraction for Loosely Coupled Applications", USENIX Workshop on Hot Topics in Cloud Computing (HotCloud), 2018.
Zhao, Z., R. Christensen, F. Li, X. Hu, and K. Yi, "Random Sampling Over Joins Revisited", ACM International Conference on Management of Data (SIGMOD), 2018.
Grewal, A., J. Jiang, G. Lam, T. Jung, L. Vuddemarri, Q. Li, A. Landge, and J. Lin, "RecService: Distributed Real-Time Graph Processing at Twitter", USENIX Workshop on Hot Topics in Cloud Computing (HotCloud), 2018.
Ghelani, N., G. Cormack, and M. Smucker, "Refresh Strategies in Continuous Active Learning", International Conference on Research and Development in Information Retrieval (SIGIR), 2018.
Mior, M. J., and K. Salem, "Renormalization of NoSQL Database Schemas", International Conference on Conceptual Modeling (ER), 2018.
Yang, P., S. Thiagarajan, and J. Lin, "Robust, Scalable, Real-Time Event Time Series Aggregation at Twitter", ACM International Conference on Management of Data (SIGMOD), 2018.
Fernandez, R. Castro, E. Mansour, A. Ali Qahtan, A. K. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, M. Stonebraker, and N. Tang, "Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery", IEEE International Conference on Data Engineering (ICDE), 2018.
Kim, Y., and J. Lin, "Serverless Data Analytics With Flint", IEEE International Conference on Cloud Computing (CLOUD), 2018.
Aleardi, L. Castelli, S. Salihoglu, G. Singh, and M. Ovsjanikov, "Spectral Measures of Distortion for Change Detection in Dynamic Graphs", International Workshop on Complex Networks & Their Applications, 2018.
Kane, A., and F. Tompa, "Split-Lists and Initial Thresholds for WAND-based Search", International Conference on Research and Development in Information Retrieval (SIGIR), 2018.
Gao, L., L. Golab, T. Ozsu, and G. Aluç, "Stream WatDiv: A Streaming RDF Benchmark", ACM International Conference on Management of Data (SIGMOD), 2018.
Mohammed, S., P. Shi, and J. Lin, "Strong Baselines for Simple Question Answering Over Knowledge Graphs With and Without Neural Networks", North American Chapter of the Association for Computational Linguistics (NAACL), 2018.
Cormack, G., and M. Grossman, "Technology-Assisted Review in Empirical Medicine: Waterloo Participation in CLEF eHealth 2018", Conference and Labs of the Evaluation Forum (CLEF), 2018.
Grewal, A., and J. Lin, "The Evolution of Content Analysis for Personalized Recommendations at Twitter", International Conference on Research and Development in Information Retrieval (SIGIR), 2018.
Cormack, G., and M. Grossman, "The Quest for Total Recall", ACM Symposium on Document Engineering (DocEng), 2018.
Ma, W., M. C. Keet, W. Oldford, D. Toman, and G. Weddell, "The Utility of the Abstract Relational Model and Attribute Paths in SQL", International Conference on Knowledge Engineering and Knowledge Management (EKAW), 2018.
Glasbergen, B., M. Abebe, and K. Daudjee, "Tutorial: Adaptive Replication and Partitioning in Data Systems", International Middleware Conference (Middleware), 2018.
Lin, J., S. Mohammed, R. Sequiera, and L. Tan, "Update Delivery Mechanisms for Prospective Information Needs: An Analysis of Attention in Mobile Users", International Conference on Research and Development in Information Retrieval (SIGIR), 2018.
Abualsaud, M., G. Cormack, N. Ghelani, A. Ghenai, M. Grossman, S. Rahbariasl, H. Zhang, and M. Smucker, "UWaterlooMDS at the TREC 2018 Common Core Track", Text Retrieval Conference (TREC), 2018.
Rao, J., F. Türe, and J. Lin, "What Do Viewers Say to Their TVs?: An Analysis of Voice Queries to Entertainment Systems", International Conference on Research and Development in Information Retrieval (SIGIR), 2018.
Korkmaz, M., M. Karsten, K. Salem, and S. Salihoglu, "Workload-Aware CPU Performance Scaling for Transactional Database Systems", ACM International Conference on Management of Data (SIGMOD), 2018.
Kimmig, A., A. Memory, R. Miller, and L. Getoor, "A Collective, Probabilistic Approach to Schema Mapping", IEEE International Conference on Data Engineering (ICDE), 2017.
Crane, M., S. J. Culpepper, J. Lin, J. M. Mackenzie, and A. Trotman, "A Comparison of Document-at-a-Time and Score-at-a-Time Query Evaluation", Web Search and Data Mining (WSDM), 2017.
Baruah, G., R. McCreadie, and J. Lin, "A Comparison of Nuggets and Clusters for Evaluating Timeline Summaries", International Conference on Information and Knowledge Management (CIKM), 2017.
Fernandez, R. Castro, D. Deng, E. Mansour, A. Ali Qahtan, W. Tao, Z. Abedjan, A. K. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, et al., "A Demo of the Data Civilizer System", ACM International Conference on Management of Data (SIGMOD), 2017.
Karyakin, A., and K. Salem, "An Analysis of Memory Power Consumption in Database Systems", International Workshop on Data Management on New Hardware (DaMoN), 2017.
Crane, M., and J. Lin, "An Exploration of Serverless Architectures for Information Retrieval", International Conference on the Theory of Information Retrieval (ICTIR), 2017.
He, H., K. Ganjam, N. Jain, J. Lundin, R. White, and J. Lin, "An Insight Extraction System on BioMedical Literature With Deep Neural Networks", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2017.
Toman, D., and G. Weddell, "An Interpolation-Based Compiler and Optimizer for Relational Queries (System Design Report)", International Conference on Logic Programming and Automated Reasoning (LPAR), 2017.
Yang, P., H. Fang, and J. Lin, "Anserini: Enabling the Use of Lucene for Information Retrieval Research", International Conference on Research and Development in Information Retrieval (SIGIR), 2017.
Zihayat, M., A. An, L. Golab, M. Kargar, and J. Szlichta, "Authority-Based Team Discovery in Social Networks", International Conference on Extending Database Technology (EDBT), 2017.
Grossman, M., G. Cormack, and A. Roegiest, "Automatic and Semi-Automatic Document Selection for Technology-Assisted Review", International Conference on Research and Development in Information Retrieval (SIGIR), 2017.
Zhang, H., J. Rao, J. Lin, and M. Smucker, "Automatically Extracting High-Quality Negative Examples for Answer Selection in Question Answering", International Conference on Research and Development in Information Retrieval (SIGIR), 2017.
He, X., A. Machanavajjhala, C. J. Flynn, and D. Srivastava, "Composing Differential Privacy and Secure Computation: A Case Study on Scaling Private Record Linkage", Conference on Computer and Communications Security (CCS), 2017.
Borgida, A., D. Toman, and G. Weddell, "Concerning Referring Expressions in Query Answers", International Joint Conference on Artificial Intelligence (IJCAI), 2017.
Abedjan, Z., L. Golab, and F. Naumann, "Data Profiling: A Tutorial", ACM International Conference on Management of Data (SIGMOD), 2017.
Bejnordi, B. Ehteshami, J. Lin, B. Glass, M. Mullooly, G. L. Gierach, M. E. Sherman, N. Karssemeijer, J. van der Laak, and A. H. Beck, "Deep Learning-Based Assessment of Tumor-Associated Stroma for Diagnosing Breast Cancer in Histopathology Images", IEEE International Symposium on Biomedical Imaging (ISBI), 2017.
Du, J., R. Miller, B. Glavic, and W. Tan, "DeepSea: Progressive Workload-Aware Partitioning of Materialized Views in Scalable Data Analytics", International Conference on Extending Database Technology (EDBT), 2017.
Machanavajjhala, A., X. He, and M. Hay, "Differential Privacy in the Wild: A Tutorial on Current Practices & Open Challenges", ACM International Conference on Management of Data (SIGMOD), 2017.
Pacaci, A., A. Zhou, J. Lin, and T. Ozsu, "Do We Need Specialized Graph Databases?: Benchmarking Real-Time Social Networking Applications", International Workshop on Graph Data Management Experiences and Systems (GRADES), 2017.
Baskaran, S., A. Keller, F. Chiang, L. Golab, and J. Szlichta, "Efficient Discovery of Ontology Functional Dependencies", International Conference on Information and Knowledge Management (CIKM), 2017.
Ghelani, N., S. Mohammed, S. Wang, and J. Lin, "Event Detection on Curated Tweet Streams", International Conference on Research and Development in Information Retrieval (SIGIR), 2017.
Rao, J., H. He, and J. Lin, "Experiments With Convolutional Neural Network Models for Answer Selection", International Conference on Research and Development in Information Retrieval (SIGIR), 2017.
Vtyurina, A., D. Savenkov, E. Agichtein, and C. Clarke, "Exploring Conversational Search With Humans, Assistants, and Wizards", ACM Conference on Human Factors in Computing Systems (CHI), 2017.
Sequiera, R., and J. Lin, "Finally, a Downloadable Test Collection of Tweets", International Conference on Research and Development in Information Retrieval (SIGIR), 2017.
Toulis, A., and L. Golab, "Graph Mining to Characterize Competition for Employment", ACM International Conference on Management of Data (SIGMOD), 2017.
Kankanamge, C., S. Sahu, A. Mhedhbi, J. Chen, and S. Salihoglu, "Graphflow: An Active Graph Database", ACM International Conference on Management of Data (SIGMOD), 2017.
Afrati, F. N., M. R. Joglekar, C. Ré, S. Salihoglu, and J. D. Ullman, "GYM: A Multiround Distributed Join Algorithm", International Conference on Database Theory (ICDT), 2017.
Fink, S. Dominik, L. Golab, S. Keshav, and H. de Meer, "How Similar Is the Usage of Electric Cars and Electric Bicycles?", Energy-Efficient Computing and Networking (e-Energy), 2017.
Gebaly, K. El, and J. Lin, "In-Browser Interactive SQL Analytics With Afterburner", ACM International Conference on Management of Data (SIGMOD), 2017.
Lamb, C., D. G. Brown, and C. Clarke, "Incorporating Novelty, Meaning, Reaction and Craft Into Computational Poetry: A Negative Experimental Result", International Conference on Computational Creativity (ICCC), 2017.
Gorenflo, C., L. Golab, and S. Keshav, "Managing Sensor Data Streams: Lessons Learned From the WeBike Project", International Conference on Statistical and Scientific Database Management (SSDBM), 2017.
Rao, J., F. Türe, X. Niu, and J. Lin, "Mining the Temporal Statistics of Query Terms for Searching Social Media Posts", International Conference on the Theory of Information Retrieval (ICTIR), 2017.
Grossman, M., and G. Cormack, "MRG_UWaterloo and WaterlooCormack Participation in the TREC 2017 Common Core Track", Text Retrieval Conference (TREC), 2017.
Cormack, G., and M. Grossman, "Navigating Imprecision in Relevance Assessments on the Road to Total Recall: Roger and Me", International Conference on Research and Development in Information Retrieval (SIGIR), 2017.
Cui, X., M. Mior, B. Wong, K. Daudjee, and S. Rizvi, "Netstore: Leveraging Network Optimizations to Improve Distributed Transaction Processing Performance", International Middleware Conference (Middleware), 2017.
Toman, D., and G. Weddell, "On Partial Features in the DLF Dialects of Description Logic With Inverse Features", International Workshop on Description Logics (DL), 2017.
Tan, L., G. Baruah, and J. Lin, "On the Reusability of "Living Labs" Test Collections: A Case Study of Real-Time Summarization", International Conference on Research and Development in Information Retrieval (SIGIR), 2017.
Roegiest, A., L. Tan, and J. Lin, "Online in-Situ Interleaved Evaluation of Real-Time Push Notification Systems", International Conference on Research and Development in Information Retrieval (SIGIR), 2017.
Meng, X., and L. Golab, "Optimal Reducer Placement to Minimize Data Transfer in MapReduce-style Processing", IEEE International Conference on Big Data (IEEE BigData), 2017.
Hu, X., Y. Tao, and K. Yi, "Output-Optimal Parallel Algorithms for Similarity Joins", ACM Symposium on Principles of Database Systems (PODS), 2017.
Lin, J., S. Mohammed, R. Sequiera, L. Tan, N. Ghelani, M. Abualsaud, R. McCreadie, D. Milajevs, and E. M. Voorhees, "Overview of the TREC 2017 Real-Time Summarization Track", Text Retrieval Conference (TREC), 2017.
Clarke, C., N. Kando, and T. Sakai, "Preface From NTCIR-13 General Chairs", Conference on Evaluation of Information Access Technologies (NTCIR), 2017.
Mohammed, S., M. Crane, and J. Lin, "Quantization in Append-Only Collections", International Conference on the Theory of Information Retrieval (ICTIR), 2017.
Mate, J., K. Daudjee, and S. Kamali, "Robust Multi-Tenant Server Consolidation in the Cloud for Data Analytics Workloads", IEEE International Conference on Distributed Computing Systems (ICDCS), 2017.
Feng, G., L. Golab, and D. Srivastava, "Scalable Informative Rule Mining", IEEE International Conference on Data Engineering (ICDE), 2017.
Lyons, K. A., E. Stroulia, R. Miller, and K. S. Booth, "Second Annual Workshop on Data Driven Knowledge Mobilization", Conference of the Centre for Advanced Studies on Collaborative Research (CASCON), 2017.
Kane, A., and F. Tompa, "Small-Term Distribution for Disk-Based Search", ACM Symposium on Document Engineering (DocEng), 2017.
Toulis, A., and L. Golab, "Social Media Mining to Understand Public Mental Health", Very Large Data Bases Conference (VLDB), 2017.
Rao, J., F. Türe, H. He, O. Jojic, and J. Lin, "Talking to Your TV: Context-Aware Voice Search With Hierarchical Recurrent Neural Networks", International Conference on Information and Knowledge Management (CIKM), 2017.
Cormack, G., and M. Grossman, "Technology-Assisted Review in Empirical Medicine: Waterloo Participation in CLEF eHealth 2017", Conference and Labs of the Evaluation Forum (CLEF), 2017.
Clarke, C., G. Cormack, J. Lin, and A. Roegiest, "Ten Blue Links on Mars", The Web Conference (WWW), 2017.
Deng, D., R. Castro Fernandez, Z. Abedjan, S. Wang, M. Stonebraker, A. K. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, and N. Tang, "The Data Civilizer System", Conference on Innovative Data Systems Research (CIDR), 2017.
Miller, R., "The Future of Data Integration", ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2017.
Azzopardi, L., M. Crane, H. Fang, G. Ingersoll, J. Lin, Y. Moshfeghi, H. Scells, P. Yang, and G. Zuccon, "The Lucene for Information Access and Retrieval Research (LIARR) Workshop at SIGIR 2017", International Conference on Research and Development in Information Retrieval (SIGIR), 2017.
Baruah, G., and J. Lin, "The Pareto Frontier of Utility Models as a Framework for Evaluating Push Notification Systems", International Conference on the Theory of Information Retrieval (ICTIR), 2017.
Pogacar, F. A., A. Ghenai, M. Smucker, and C. Clarke, "The Positive and Negative Influence of Search Results on People's Decisions About the Efficacy of Medical Treatments", International Conference on the Theory of Information Retrieval (ICTIR), 2017.
Wang, Z., B. Lin, I. Milligan, and J. Lin, "Topic Shifts Between Two US Presidential Administrations", Web Archiving and Digital Libraries Workshop (WADL), 2017.
Zhang, H., M. Abualsaud, N. Ghelani, A. Ghosh, M. Smucker, G. Cormack, and M. Grossman, "UWaterlooMDS at the TREC 2017 Common Core Track", Text Retrieval Conference (TREC), 2017.
Christodoulakis, C., E. Kandogan, I. G. Terrizzano, and R. Miller, "VIQS: Visual Interactive Exploration of Query Semantics", International Conference on Intelligent User Interfaces (IUI), 2017.
Cormack, G., and M. Grossman, ""When to Stop" Waterloo (Cormack) Participation in the TREC 2016 Total Recall Track", Text Retrieval Conference (TREC), 2016.
Agrawal, S., and K. Daudjee, "A Performance Comparison of Algorithms for Byzantine Agreement in Distributed Systems", European Dependable Computing Conference (EDCC), 2016.
Roegiest, A., L. Tan, J. Lin, and C. Clarke, "A Platform for Streaming Push Notifications to Mobile Assessors", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Wu, G. Zhiping, and F. Tompa, "A Space-Efficient Data Structure for Fast Access Control in ECM Systems", ACM Symposium on Access Control Models and Technologies (SACMAT), 2016.
Roegiest, A., and G. Cormack, "An Architecture for Privacy-Preserving and Replicable High-Recall Retrieval Experiments", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Hashemi, S. Hadi, C. Clarke, A. Dean-Hall, J. Kamps, and J. Kiseleva, "An Easter Egg Hunting Approach to Test Collection Building in Dynamic Domains", Conference on Evaluation of Information Access Technologies (NTCIR), 2016.
Tan, L., A. Roegiest, J. Lin, and C. Clarke, "An Exploration of Evaluation Metrics for Mobile Push Notifications", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Al-Harbi, A. Lafi, and M. Smucker, "Are Secondary Assessors Uncertain When They Disagree About Relevance Judgements?", Conference on Human Information Interaction and Retrieval (CHIIR), 2016.
Santoro, D., P. C. Arocena, B. Glavic, G. Mecca, R. Miller, and P. Papotti, "BART in Action: Error Generation and Empirical Evaluations of Data-Cleaning Systems", ACM International Conference on Management of Data (SIGMOD), 2016.
Buntain, C., and J. Lin, "Burst Detection in Social Media Streams for Tracking Interest Profiles in Real Time", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Farid, M. H., A. Roatis, I. Ilyas, H-F. Hoffmann, and X. Chu, "CLAMS: Bringing Quality to Data Lakes", ACM International Conference on Management of Data (SIGMOD), 2016.
Rao, J., X. Niu, and J. Lin, "Compressing and Decoding Term Statistics Time Series", European Conference on Information Retrieval (ECIR), 2016.
Milligan, I., N. Ruest, and J. Lin, "Content Selection and Curation for Web Archiving: The Gatekeepers vs. the Masses", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2016.
Cafarella, M. J., I. Ilyas, M. Kornacker, T. Kraska, and C. Ré, "Dark Data: Are We Solving the Right Problems?", IEEE International Conference on Data Engineering (ICDE), 2016.
Chu, X., I. Ilyas, S. Krishnan, and J. Wang, "Data Cleaning: Overview and Emerging Challenges", ACM International Conference on Management of Data (SIGMOD), 2016.
Abedjan, Z., L. Golab, and F. Naumann, "Data Profiling", IEEE International Conference on Data Engineering (ICDE), 2016.
Lyons, K. A., E. Stroulia, D. Luo, R. Miller, and V. Onut, "Data-Driven Knowledge Mobilization", Conference of the Centre for Advanced Studies on Collaborative Research (CASCON), 2016.
Abedjan, Z., J. Morcos, I. Ilyas, M. Ouzzani, P. Papotti, and M. Stonebraker, "DataXFormer: A Robust Transformation Discovery System", IEEE International Conference on Data Engineering (ICDE), 2016.
Jackson, A., J. Lin, I. Milligan, and N. Ruest, "Desiderata for Exploratory Search Interfaces to Web Archives in Support of Scholarly Activities", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2016.
Buntain, C., J. Lin, and J. Golbeck, "Discovering Key Moments in Social Media Streams", Consumer Communications and Networking Conference (CCNC), 2016.
Culpepper, J. S., C. Clarke, and J. Lin, "Dynamic Cutoff Prediction in Multi-Stage Retrieval Systems", Australasian Document Computing Symposium (ADCS), 2016.
Kargar, M., L. Golab, and J. Szlichta, "eGraphSearch: Effective Keyword Search in Graphs", International Conference on Information and Knowledge Management (CIKM), 2016.
Cormack, G., and M. Grossman, "Engineering Quality and Reliability in Technology-Assisted Review", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Bommannavar, P., J. Lin, and A. Rajaraman, "Estimating Topical Volume in Social Media Streams", ACM Symposium on Applied Computing (SAC), 2016.
Lamb, C., D. G. Brown, and C. Clarke, "Evaluating Digital Poetry: Insights From the CAT", International Conference on Computational Creativity (ICCC), 2016.
Oard, D. W., K. Shilton, and J. Lin, "Evaluating Search Among Secrets", Conference on Evaluation of Information Access Technologies (NTCIR), 2016.
Milligan, I., J. Lin, J. Wiebe, and A. Zhou, "Exploring and Discovering Archive-It Collections With Warcbase", Digital Humanities Conference (DH), 2016.
Roegiest, A., and G. Cormack, "Impact of Review-Set Selection on Human Assessment for Text Classification", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Trotman, A., and J. Lin, "In Vacuo and in Situ Evaluation of SIMD Codecs", Australasian Document Computing Symposium (ADCS), 2016.
Qian, X., J. Lin, and A. Roegiest, "Interleaved Evaluation for Retrospective Summarization and Prospective Notification on Document Streams", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Farid, M. H., I. Ilyas, S. Euijong Whang, and C. Yu, "LONLIES: Estimating Property Values for Long Tail Entities", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Smucker, M., and C. Clarke, "Modeling Optimal Switching Behavior", Conference on Human Information Interaction and Retrieval (CHIIR), 2016.
Zanibbi, R., K. Davila, A. Kane, and F. Tompa, "Multi-Stage Math Formula Search: Using Appearance-Based Similarity Metrics at Scale", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Rao, J., H. He, and J. Lin, "Noise-Contrastive Estimation for Answer Selection With Deep Neural Networks", International Conference on Information and Knowledge Management (CIKM), 2016.
Mior, M. J., K. Salem, A. Aboulnaga, and R. Liu, "NoSE: Schema Design for NoSQL Applications", IEEE International Conference on Data Engineering (ICDE), 2016.
Jacques, J. St., D. Toman, and G. Weddell, "Object-Relational Queries over CFDI^∀−_nc Knowledge Bases: OBDA for the SQL-Literate", International Joint Conference on Artificial Intelligence (IJCAI), 2016.
Jacques, J. St., D. Toman, and G. Weddell, "Object-Relational Queries Over CFDI_nc Knowledge Bases: OBDA for the SQL-Literate (Extended Abstract)", International Workshop on Description Logics (DL), 2016.
Jiang, Y. Helen, and L. Golab, "On Competition for Undergraduate Co-Op Placements: A Graph Mining Approach", Educational Data Mining (EDM), 2016.
Toman, D., and G. Weddell, "On Partial Features in the DLF Family of Description Logics", Pacific Rim International Conference on Artificial Intelligence (PRICAI), 2016.
Borgida, A., D. Toman, and G. Weddell, "On Referring Expressions in Information Systems Derived From Conceptual Modelling", International Conference on Conceptual Modeling (ER), 2016.
Borgida, A., D. Toman, and G. Weddell, "On Referring Expressions in Query Answering Over First Order Knowledge Bases", International Conference on Principles of Knowledge Representation and Reasoning (KR), 2016.
Toman, D., and G. Weddell, "Ontology Based Data Access With Referring Expressions for Logics With the Tree Model Property - (Extended Abstract)", Australian Joint Conference on Artificial Intelligence (AUS-AI), 2016.
Baruah, G., H. Zhang, R. Guttikonda, J. Lin, M. Smucker, and O. Vechtomova, "Optimizing Nugget Annotations With Active Learning", International Conference on Information and Knowledge Management (CIKM), 2016.
Hashemi, S. Hadi, J. Kamps, J. Kiseleva, C. Clarke, and E. M. Voorhees, "Overview of the TREC 2016 Contextual Suggestion Track", Text Retrieval Conference (TREC), 2016.
Lin, J., A. Roegiest, L. Tan, R. McCreadie, E. M. Voorhees, and F. Diaz, "Overview of the TREC 2016 Real-Time Summarization Track", Text Retrieval Conference (TREC), 2016.
He, H., and J. Lin, "Pairwise Word Interaction Modeling With Deep Neural Networks for Semantic Similarity Measurement", North American Chapter of the Association for Computational Linguistics (NAACL), 2016.
Bonenfant, M., B. C. Desai, D. Desai, B. C. M. Fung, T. Ozsu, and J. D. Ullman, "Panel: The State of Data: Invited Paper From Panelists", International Database Engineering and Applications Symposium (IDEAS), 2016.
Yang, G. Hui, I. Soboroff, L. Xiong, C. Clarke, and S. L. Garfinkel, "Privacy-Preserving IR 2016: Differential Privacy, Search, and Social Media", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Lin, J., Z. Tu, M. Rose, and P. White, "Prizm: A Wireless Access Point for Proxy-Based Web Lifelogging", ACM International Conference on Multimedia (MM), 2016.
Han, M., and K. Daudjee, "Providing Serializability for Pregel-Like Graph Processing Systems", International Conference on Extending Database Technology (EDBT), 2016.
Gebhard, L., L. Golab, S. Keshav, and H. de Meer, "Range Prediction for Electric Bicycles", Energy-Efficient Computing and Networking (e-Energy), 2016.
Elbagoury, A., M. Crane, and J. Lin, "Rank-at-a-Time Query Processing", International Conference on the Theory of Information Retrieval (ICTIR), 2016.
Paik, J. H., and J. Lin, "Retrievability in API-Based "Evaluation as a Service"", International Conference on the Theory of Information Retrieval (ICTIR), 2016.
Zhang, H., J. Lin, G. Cormack, and M. Smucker, "Sampling Strategies and Active Learning for Volume Estimation", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Cormack, G., and M. Grossman, "Scalability of Continuous Active Learning for Reliable High-Recall Text Classification", International Conference on Information and Knowledge Management (CIKM), 2016.
Murdock, V., C. Clarke, J. Kamps, and J. Karlgren, "Second Workshop on Search and Exploration of X-Rated Information (SEXI'16): WSDM Workshop Summary", Web Search and Data Mining (WSDM), 2016.
Moschitti, A., L. Màrquez, P. Nakov, E. Agichtein, C. Clarke, and I. Szpektor, "SIGIR 2016 Workshop WebQA II: Web Question Answering Beyond Factoids", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Tan, L., A. Roegiest, C. Clarke, and J. Lin, "Simple Dynamic Emission Strategies for Microblog Filtering", International Conference on Research and Development in Information Retrieval (SIGIR), 2016.
Davila, K., R. Zanibbi, A. Kane, and F. Tompa, "Tangent-3 at the NTCIR-12 MathIR Task", Conference on Evaluation of Information Access Technologies (NTCIR), 2016.
Rao, J., and J. Lin, "Temporal Query Expansion Using a Continuous Hidden Markov Model", International Conference on the Theory of Information Retrieval (ICTIR), 2016.
Clarke, C., G. Cormack, J. Lin, and A. Roegiest, "Total Recall: Blue Sky on Mars", International Conference on the Theory of Information Retrieval (ICTIR), 2016.
Lin, J., M. Crane, A. Trotman, J. Callan, I. Chattopadhyaya, J. Foley, G. Ingersoll, C. Macdonald, and S. Vigna, "Toward Reproducible Baselines: The Open-Source IR Reproducibility Challenge", European Conference on Information Retrieval (ECIR), 2016.
Hu, X., and K. Yi, "Towards a Worst-Case I/O-Optimal Algorithm for Acyclic Joins", ACM Symposium on Principles of Database Systems (PODS), 2016.
Grossman, M., G. Cormack, and A. Roegiest, "TREC 2016 Total Recall Track Overview", Text Retrieval Conference (TREC), 2016.
He, H., J. Wieting, K. Gimpel, J. Rao, and J. Lin, "UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement", International Workshop on Semantic Evaluation (SemEval), 2016.
Ehsan, N., F. Tompa, and A. Shakery, "Using a Dictionary and N-Gram Alignment to Improve Fine-Grained Cross-Language Plagiarism Detection", ACM Symposium on Document Engineering (DocEng), 2016.
Radhakrishnan, S., B. J. Muscedere, and K. Daudjee, "V-Hadoop: Virtualized Hadoop Using Containers", IEEE International Symposium on Network Computing and Applications (NCA), 2016.
Hartig, O., and T. Ozsu, "Walking Without a Map: Ranking-Based Traversal for Querying Linked Data", International Semantic Web Conference (ISWC), 2016.
Ozsu, T., "Web Data Management in the RDF Age: Keynote Talk Abstract", International Database Engineering and Applications Symposium (IDEAS), 2016.
Shen, X., L. Zou, T. Ozsu, L. Chen, Y. Li, S. Han, and D. Zhao, "A Graph-Based RDF Triple Store", IEEE International Conference on Data Engineering (ICDE), 2015.
Wu, J., T. Kinash, D. Toman, and G. Weddell, "Absorption for ABoxes and TBoxes With General Value Restrictions", Australian Joint Conference on Artificial Intelligence (AUS-AI), 2015.
Lin, J., and A. Trotman, "Anytime Ranking for Impact-Ordered Indexes", International Conference on the Theory of Information Retrieval (ICTIR), 2015.
Wang, Y., G. Sherman, J. Lin, and M. Efron, "Assessor Differences and User Preferences in Tweet Timeline Generation", International Conference on Research and Development in Information Retrieval (SIGIR), 2015.
Hassanzadeh, O., and R. Miller, "Automatic Curation of Clinical Trials Data in LinkedCT", International Semantic Web Conference (ISWC), 2015.
Liu, X., L. Golab, W. M. Golab, and I. Ilyas, "Benchmarking Smart Meter Data Analytics", International Conference on Extending Database Technology (EDBT), 2015.
Khayyat, Z., I. Ilyas, A. Jindal, S. Madden, M. Ouzzani, P. Papotti, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "BigDansing: A System for Big Data Cleansing", ACM International Conference on Management of Data (SIGMOD), 2015.
Lin, J., "Building a Self-Contained Search Engine in the Browser", International Conference on the Theory of Information Retrieval (ICTIR), 2015.
Buntain, C., and J. Lin, "Burst Detection in Social Media Streams for Tracking Interest Profiles in Real Time", Text Retrieval Conference (TREC), 2015.
Bär, A., L. Golab, S. Ruehrup, M. Schiavone, and P. Casas, "Cache-Oblivious Scheduling of Shared Workloads", IEEE International Conference on Data Engineering (ICDE), 2015.
Kiseleva, J., J. Kamps, and C. Clarke, "Contextual Search and Exploration", Russian Summer School on Information Retrieval (RuSSIR), 2015.
Kim, J., K. Salem, K. Daudjee, A. Aboulnaga, and X. Pan, "Database High Availability Using SHADOW Systems", ACM Symposium on Cloud Computing (SoCC), 2015.
Morcos, J., Z. Abedjan, I. Ilyas, M. Ouzzani, P. Papotti, and M. Stonebraker, "DataXFormer: An Interactive Data Transformation Tool", ACM International Conference on Management of Data (SIGMOD), 2015.
Abedjan, Z., J. Morcos, M. N. Gubanov, I. Ilyas, M. Stonebraker, P. Papotti, and M. Ouzzani, "DataXFormer: Leveraging the Web for Semantic Transformations", Conference on Innovative Data Systems Research (CIDR), 2015.
Sittig, D. F., A. B. McCoy, A. Wright, and J. Lin, "Developing an Open-Source Bibliometric Ranking Website Using Google Scholar Citation Profiles for Researchers in the Field of Biomedical Informatics", World Congress on Medical and Health (Medical) Informatics (MedInfo), 2015.
Saxena, H., and K. Salem, "EdgeX: Edge Replication for Web Applications", IEEE International Conference on Cloud Computing (CLOUD), 2015.
Drzadzewski, G., and F. Tompa, "Enhancing Exploration With a Faceted Browser Through Summarization", ACM Symposium on Document Engineering (DocEng), 2015.
Baruah, G., M. Smucker, and C. Clarke, "Evaluating Streams of Evolving News Events", International Conference on Research and Development in Information Retrieval (SIGIR), 2015.
Aluç, G., T. Ozsu, K. Daudjee, and O. Hartig, "Executing Queries Over Schemaless RDF Databases", IEEE International Conference on Data Engineering (ICDE), 2015.
Salihoglu, S., J. Shin, V. Khanna, B. Quan Truong, and J. Widom, "Graft: A Debugging Tool for Apache Giraph", ACM International Conference on Management of Data (SIGMOD), 2015.
Bislimovska, B., G. Aluç, T. Ozsu, and P. Fraternali, "Graph Search of Software Models Using Multidimensional Scaling", International Conference on Extending Database Technology (EDBT), 2015.
Petroni, F., L. Querzoni, K. Daudjee, S. Kamali, and G. Iacoboni, "HDRF: Stream-Based Partitioning for Power-Law Graphs", International Conference on Information and Knowledge Management (CIKM), 2015.
Nicoara, D., S. Kamali, K. Daudjee, and L. Chen, "Hermes: Dynamic Partitioning for Distributed Social Network Graph Databases", International Conference on Extending Database Technology (EDBT), 2015.
Lamb, C., D. G. Brown, and C. Clarke, "Human Competence in Creativity Evaluation", International Conference on Computational Creativity (ICCC), 2015.
Weissman, S., S. Ayhan, J. Bradley, and J. Lin, "Identifying Duplicate and Contradictory Information in Wikipedia", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2015.
Roegiest, A., G. Cormack, C. Clarke, and M. Grossman, "Impact of Surrogate Assessments on High-Recall Retrieval", International Conference on Research and Development in Information Retrieval (SIGIR), 2015.
Ge, C., M. Kaufmann, L. Golab, P. M. Fischer, and A. K. Goel, "Indexing Bi-Temporal Windows", International Conference on Statistical and Scientific Database Management (SSDBM), 2015.
Clarke, C., M. Smucker, and E. Yilmaz, "IR Evaluation: Modeling User Behavior for Measuring Effectiveness", International Conference on Research and Development in Information Retrieval (SIGIR), 2015.
Chu, X., J. Morcos, I. Ilyas, M. Ouzzani, P. Papotti, N. Tang, and Y. Ye, "KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing", ACM International Conference on Management of Data (SIGMOD), 2015.
Kandogan, E., M. Roth, P. M. Schwarz, J. Hui, I. G. Terrizzano, C. Christodoulakis, and R. Miller, "LabBook: Metadata-Driven Social Collaborative Data Analysis", IEEE International Conference on Big Data (IEEE BigData), 2015.
Salihoglu, S., "Let's Rethink Join Optimization in Distributed Systems", Conference on Innovative Data Systems Research (CIDR), 2015.
Tan, L., H. Zhang, C. Clarke, and M. Smucker, "Lexical Comparison Between Wikipedia and Twitter Corpora by Using Word Embeddings", Association for Computational Linguistics (ACL), 2015.
Hassanzadeh, O., R. Miller, F. Nargesian, and E. Zhu, "LinkedCT Live: Platform for Online Curation of Clinical Trials Data", International Semantic Web Conference (ISWC), 2015.
Cormack, G., and M. Grossman, "Multi-Faceted Recall of Continuous Active Learning for Technology-Assisted Review", International Conference on Research and Development in Information Retrieval (SIGIR), 2015.
He, H., K. Gimpel, and J. Lin, "Multi-Perspective Sentence Similarity Modeling With Convolutional Neural Networks", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2015.
Szlichta, J., L. Golab, and D. Srivastava, "On Axiomatization and Inference Complexity Over a Hierarchy of Functional Dependencies", Alberto Mendelzon International Workshop on Foundations of Data Management (AMW), 2015.
Hudek, A. K., D. Toman, and G. Weddell, "On Enumerating Query Plans Using Analytic Tableau", International Conference on Theorem Proving with Analytic Tableaux and Related Methods (TABLEAUX), 2015.
Toman, D., and G. Weddell, "On the Krom Extension of CFDI^∀−_nc", Australian Joint Conference on Artificial Intelligence (AUS-AI), 2015.
Hashemi, S. Hadi, C. Clarke, A. Dean-Hall, J. Kamps, and J. Kiseleva, "On the Reusability of Open Test Collections", International Conference on Research and Development in Information Retrieval (SIGIR), 2015.
Toman, D., and G. Weddell, "On the Utility of CFDI", International Workshop on Description Logics (DL), 2015.
Dean-Hall, A., C. Clarke, J. Kamps, and J. Kiseleva, "Online Evaluation of Point-of-Interest Recommendation Systems", European Conference on Information Retrieval (ECIR), 2015.
Dean-Hall, A., C. Clarke, J. Kamps, J. Kiseleva, and E. M. Voorhees, "Overview of the TREC 2015 Contextual Suggestion Track", Text Retrieval Conference (TREC), 2015.
Lin, J., M. Efron, G. Sherman, Y. Wang, and E. M. Voorhees, "Overview of the TREC-2015 Microblog Track", Text Retrieval Conference (TREC), 2015.
Fillottrani, P. R., M. C. Keet, and D. Toman, "Polynomial Encoding of ORM Conceptual Models in CFDI", International Workshop on Description Logics (DL), 2015.
Baruah, G., A. Roegiest, and M. Smucker, "Pooling for User-Oriented Evaluation Measures", International Conference on the Theory of Information Retrieval (ICTIR), 2015.
Rao, J., J. Lin, and M. Efron, "Reproducible Experiments on Lexical and Temporal Feedback for Tweet Search", European Conference on Information Retrieval (ECIR), 2015.
Arguello, J., F. Diaz, J. Lin, and A. Trotman, "SIGIR 2015 Workshop on Reproducibility, Inexplicability, and Generalizability of Results (RIGOR)", International Conference on Research and Development in Information Retrieval (SIGIR), 2015.
Borgida, A., D. Toman, and G. Weddell, "Singular Referring Expressions in Conjunctive Query Answers: The Case for a CFD DL Dialect", International Workshop on Description Logics (DL), 2015.
Golab, L., F. Korn, F. Li, B. Saha, and D. Srivastava, "Size-Constrained Weighted Set Cover", IEEE International Conference on Data Engineering (ICDE), 2015.
Liu, X., L. Golab, and I. Ilyas, "SMAS: A Smart Meter Data Analytics System", IEEE International Conference on Data Engineering (ICDE), 2015.
Wang, Y., and J. Lin, "The Feasibility of Brute Force Scans for Real-Time Tweet Search", International Conference on the Theory of Information Retrieval (ICTIR), 2015.
Dean-Hall, A., and C. Clarke, "The Power of Contextual Suggestion", European Conference on Information Retrieval (ECIR), 2015.
Lin, J., "The Sum of All Human Knowledge in Your Pocket: Full-Text Searchable Wikipedia on a Raspberry Pi", ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2015.
Korkmaz, M., A. Karyakin, M. Karsten, and K. Salem, "Towards Dynamic Green-Sizing for Database Servers", Very Large Data Bases Conference (VLDB), 2015.
Roegiest, A., G. Cormack, C. Clarke, and M. Grossman, "TREC 2015 Total Recall Track Overview", Text Retrieval Conference (TREC), 2015.
Tan, L., A. Roegiest, and C. Clarke, "University of Waterloo at TREC 2015 Microblog Track", Text Retrieval Conference (TREC), 2015.
Bashardoost, B. Ghadiri, C. Christodoulakis, S. Hassas Yeganeh, R. Miller, K. A. Lyons, and O. Hassanzadeh, "VizCurator: A Visual Tool for Curating Open Data", The Web Conference (WWW), 2015.
Cormack, G., and M. Grossman, "Waterloo (Cormack) Participation in the TREC 2015 Total Recall Track", Text Retrieval Conference (TREC), 2015.
Ghenai, A., E. Khalilov, P. Valov, and C. Clarke, "WaterlooClarke: TREC 2015 Clinical Decision Support Track", Text Retrieval Conference (TREC), 2015.
Hoffmann, H., P. Addala, and C. Clarke, "WaterlooClarke: TREC 2015 Contextual Suggestion Track", Text Retrieval Conference (TREC), 2015.
Vtyurina, A., A. Dey, B. Sarrafzadeh, and C. Clarke, "WaterlooClarke: TREC 2015 LiveQA Track", Text Retrieval Conference (TREC), 2015.
Abualsaud, M., M. Ghaznavi, D. Recoskie, and C. Clarke, "WaterlooClarke: TREC 2015 Microblog Track", Text Retrieval Conference (TREC), 2015.
Raza, A., D. M. Rotondo, and C. Clarke, "WaterlooClarke: TREC 2015 Temporal Summarization Track", Text Retrieval Conference (TREC), 2015.
Zhang, H., W. Lin, Y. Wang, C. Clarke, and M. Smucker, "WaterlooClarke: TREC 2015 Total Recall Track", Text Retrieval Conference (TREC), 2015.
Agichtein, E., D. Carmel, C. Clarke, P. Paritosh, D. Pelleg, and I. Szpektor, "Web Question Answering: Beyond Factoids: SIGIR 2015 Workshop", International Conference on Research and Development in Information Retrieval (SIGIR), 2015.
Gao, P. Xiang, L. Golab, and S. Keshav, "What's Wrong With My Solar Panels: A Data-Driven Approach", International Conference on Extending Database Technology (EDBT), 2015.
Kim, J., K. Salem, and K. Daudjee, "Write Amplification: An Analysis of In-Memory Database Durability Techniques", Very Large Data Bases Conference (VLDB), 2015.
Al-Harbi, A. Lafi, and M. Smucker, "A Qualitative Exploration of Secondary Assessor Relevance Judging Behavior", International Conference on Information Interaction in Context (IIiX), 2014.
Afrati, F. N., A. Das Sarma, A. Rajaraman, P. Rule, S. Salihoglu, and J. D. Ullman, "Anchor-Points Algorithms for Hamming and Edit Distances Using MapReduce", International Conference on Database Theory (ICDT), 2014.
Dean-Hall, A., and C. Clarke, "Assessing Contextual Suggestion", Conference on Evaluation of Information Access Technologies (NTCIR), 2014.
Miller, R., "Big Data Curation", Joint International Conference on Data Science & Management of Data (COMAD), 2014.
He, X., A. Machanavajjhala, and B. Ding, "Blowfish Privacy: Tuning Privacy-Utility Trade-Offs Using Policies", ACM International Conference on Management of Data (SIGMOD), 2014.
Mühleisen, H., T. Samar, J. Lin, and A. P. de Vries, "Column Stores as an IR Prototyping Tool", European Conference on Information Retrieval (ECIR), 2014.
Ardakanian, O., N. Koochakzadeh, R. Preet Singh, L. Golab, and S. Keshav, "Computing Electricity Consumption Profiles From Household Smart Meter Data", International Conference on Extending Database Technology (EDBT), 2014.
Volkovs, M., F. Chiang, J. Szlichta, and R. Miller, "Continuous Data Cleaning", IEEE International Conference on Data Engineering (ICDE), 2014.
Robinson, N., S. A. McIlraith, and D. Toman, "Cost-Based Query Optimization via AI Planning", AAAI Conference on Artificial Intelligence (AAAI), 2014.
Gebremeskel, G. G., J. He, A. P. de Vries, and J. Lin, "Cumulative Citation Recommendation: A Feature-Aware Comparison of Approaches", International Conference on Database and Expert Systems Applications (DEXA) - Workshops, 2014.
Syed, S. Javaad, Y. Helen Jiang, and L. Golab, "Data Mining of Undergraduate Course Evaluations", Educational Data Mining (EDM), 2014.
Golab, L., and T. Johnson, "Data Stream Warehousing", IEEE International Conference on Data Engineering (ICDE), 2014.
Bär, A., P. Casas, L. Golab, and A. Finamore, "DBStream: An Online Aggregation, Filtering and Processing System for Network Traffic Monitoring", International Conference on Wireless Communications and Mobile Computing (IWCMC), 2014.
Chalamalla, A., I. Ilyas, M. Ouzzani, and P. Papotti, "Descriptive and Prescriptive Data Cleaning", ACM International Conference on Management of Data (SIGMOD), 2014.
Golab, L., M. Hadjieleftheriou, H. J. Karloff, and B. Saha, "Distributed Data Placement to Minimize Communication Costs via Graph Partitioning", International Conference on Statistical and Scientific Database Management (SSDBM), 2014.
Aluç, G., O. Hartig, T. Ozsu, and K. Daudjee, "Diversified Stress Testing of RDF Data Management Systems", International Semantic Web Conference (ISWC), 2014.
Said, A., A. Bellogín, J. Lin, and A. P. de Vries, "Do Recommendations Matter?: News Recommendation in Real Life", Conference on Computer Supported Cooperative Work (CSCW), 2014.
Wu, G. Zhiping, and F. Tompa, "Effective and Efficient Bitmaps for Access Control", Data Compression Conference (DCC), 2014.
Cormack, G., and M. Grossman, "Evaluation of Machine-Learning Protocols for Technology-Assisted Review in Electronic Discovery", International Conference on Research and Development in Information Retrieval (SIGIR), 2014.
Salihoglu, S., and J. Widom, "HelP: High-Level Primitives for Large-Scale Graph Processing", ACM International Conference on Management of Data (SIGMOD), 2014.
Albakour, M-D., C. Macdonald, I. Ounis, C. Clarke, and V. Bicer, "Information Access in Smart Cities (i-ASC)", European Conference on Information Retrieval (ECIR), 2014.
Myers, S. A., A. Sharma, P. Gupta, and J. Lin, "Information Network or Social Network?: The Structure of the Twitter Follow Graph", The Web Conference (WWW), 2014.
Lin, J., M. Gholami, and J. Rao, "Infrastructure for Supporting Exploration and Discovery in Web Archives", The Web Conference (WWW), 2014.
Lin, J., and M. Efron, "Infrastructure Support for Evaluation as a Service", The Web Conference (WWW), 2014.
Carpenter, T., L. Golab, and S. Javaad Syed, "Is the Grass Greener?: Mining Electric Vehicle Opinions", Energy-Efficient Computing and Networking (e-Energy), 2014.
Bär, A., A. Finamore, P. Casas, L. Golab, and M. Mellia, "Large-Scale Network Traffic Monitoring With DBStream, a System for Rolling Big Data Analysis", IEEE International Conference on Big Data (IEEE BigData), 2014.
Avram, C-A., K. Salem, and B. Wong, "Latency Amplification: Characterizing the Impact of Web Page Content on Load Times", IEEE International Symposium on Reliable Distributed Systems (SRDS), 2014.
Wang, L., J. Lin, D. Metzler, and J. Han, "Learning to Efficiently Rank on Big Data", The Web Conference (WWW), 2014.
Hartig, O., and T. Ozsu, "Linked Data Query Processing", IEEE International Conference on Data Engineering (ICDE), 2014.
Singh, A. K., X. Cui, B. Cassell, B. Wong, and K. Daudjee, "MicroFuge: A Middleware Approach to Providing Performance Isolation in Cloud Storage Systems", IEEE International Conference on Distributed Computing Systems (ICDCS), 2014.
Smucker, M., X. Sunny Guo, and A. Toulis, "Mouse Movement During Relevance Judging: Implications for Determining User Attention", International Conference on Research and Development in Information Retrieval (SIGIR), 2014.
Elmagarmid, A. K., I. Ilyas, M. Ouzzani, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "NADEEF/ER: Generic and Interactive Entity Resolution", ACM International Conference on Management of Data (SIGMOD), 2014.
Mühleisen, H., T. Samar, J. Lin, and A. P. de Vries, "Old Dogs Are Great at New Tricks: Column Stores for IR Prototyping", International Conference on Research and Development in Information Retrieval (SIGIR), 2014.
Toman, D., and G. Weddell, "On Adding Inverse Features to the Description Logic CFD^∀_nc", Pacific Rim International Conference on Artificial Intelligence (PRICAI), 2014.
Voorhees, E. M., J. Lin, and M. Efron, "On Run Diversity in Evaluation as a Service", International Conference on Research and Development in Information Retrieval (SIGIR), 2014.
Daudjee, K., S. Kamali, and A. López-Ortiz, "On the Online Fault-Tolerant Server Consolidation Problem", ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), 2014.
Kumar, K. A., J. Gluck, A. Deshpande, and J. Lin, "Optimization Techniques for 'Scaling Down' Hadoop on Multi-Core, Shared-Memory Systems", International Conference on Extending Database Technology (EDBT), 2014.
Dean-Hall, A., C. Clarke, J. Kamps, P. Thomas, and E. M. Voorhees, "Overview of the TREC 2014 Contextual Suggestion Track", Text Retrieval Conference (TREC), 2014.
Lin, J., Y. Wang, M. Efron, and G. Sherman, "Overview of the TREC-2014 Microblog Track", Text Retrieval Conference (TREC), 2014.
Rao, J., J. Lin, and H. Samet, "Partitioning Strategies for Spatio-Textual Similarity Join", ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems (GIS), 2014.
Jiang, Y. Helen, R. Levman, L. Golab, and J. Nathwani, "Predicting Peak-Demand Days in the Ontario Peak Reduction Program for Large Consumers", Energy-Efficient Computing and Networking (e-Energy), 2014.
Toman, D., and G. Weddell, "Pushing the CFDnc Envelope", International Workshop on Description Logics (DL), 2014.
Li, F., T. Ozsu, G. Chen, and B. Chin Ooi, "R-Store: A Scalable Distributed System for Supporting Real-Time Analytics", IEEE International Conference on Data Engineering (ICDE), 2014.
Hartig, O., and T. Ozsu, "Reachable Subwebs for Traversal-Based Query Execution", The Web Conference (WWW), 2014.
Chu, X., I. Ilyas, P. Papotti, and Y. Ye, "RuleMiner: Data Quality Rules Discovery", IEEE International Conference on Data Engineering (ICDE), 2014.
Hong, S., S. Salihoglu, J. Widom, and K. Olukotun, "Simplifying Scalable Graph Processing With a Domain-Specific Language", IEEE/ACM International Symposium on Code Generation and Optimization (CGO), 2014.
Kane, A., and F. Tompa, "Skewed Partial Bitvectors for List Intersection", International Conference on Research and Development in Information Retrieval (SIGIR), 2014.
Tan, L., and C. Clarke, "Succinct Queries for Linking and Tracking News in Social Media", International Conference on Information and Knowledge Management (CIKM), 2014.
Lin, J., K. Kraus, and R. L. Punzalan, "Supporting 'Distant Reading' for Web Archives", Digital Humanities Conference (DH), 2014.
Efron, M., J. Lin, J. He, and A. P. de Vries, "Temporal Feedback for Tweet Search With Non-Parametric Density Estimation", International Conference on Research and Development in Information Retrieval (SIGIR), 2014.
Baruah, G., A. Roegiest, and M. Smucker, "The Effect of Expanding Relevance Judgements With Duplicates", International Conference on Research and Development in Information Retrieval (SIGIR), 2014.
Wang, Y., and J. Lin, "The Impact of Future Term Statistics in Real-Time Tweet Search", European Conference on Information Retrieval (ECIR), 2014.
Clarke, C., and M. Smucker, "Time Well Spent", International Conference on Information Interaction in Context (IIiX), 2014.
Li, L., and M. Smucker, "Tolerance of Effectiveness Measures to Relevance Judging Errors", European Conference on Information Retrieval (ECIR), 2014.
Tan, L., A. Dean-Hall, P. Addala, and C. Clarke, "University of Waterloo at TREC 2014 Contextual Suggestion: Experiments With Suggestion Clustering", Text Retrieval Conference (TREC), 2014.
Wongsuphasawat, K., and J. Lin, "Using Visualizations to Monitor Changes and Harvest Insights From a Global-Scale Logging Infrastructure at Twitter", IEEE Conference on Visual Analytics Science and Technology (VAST), 2014.
Xu, Z., D. Goldwasser, B. B. Bederson, and J. Lin, "Visual Analytics of MOOCs at Maryland", ACM Conference on Learning @ Scale (L@S), 2014.
Christodoulakis, C., C. Faloutsos, and R. Miller, "VoidWiz: Resolving Incompleteness Using Network Effects", IEEE International Conference on Data Engineering (ICDE), 2014.
Said, A., J. Lin, A. Bellogín, and A. P. de Vries, "A Month in the Life of a Production News Recommender System", International Conference on Information and Knowledge Management (CIKM), 2013.
Wu, J., T. Kinash, D. Toman, and G. Weddell, "Absorption for ABoxes With Local Universal Restrictions", International Workshop on Description Logics (DL), 2013.
Mehdad, Y., G. Carenini, F. Tompa, and R. T. Ng, "Abstractive Meeting Summarization With Entailment and Fusion", European Workshop on Natural Language Generation (ENLG), 2013.
Balkesen, C., N. Tatbul, and T. Ozsu, "Adaptive Input Admission and Management for Parallel Stream Processing", Distributed Event-Based Systems (DEBS), 2013.
Deziel, M., D. Olawo, L. Truchon, and L. Golab, "Analyzing the Mental Health of Engineering Students Using Classification and Regression", Educational Data Mining (EDM), 2013.
Toman, D., and G. Weddell, "CFDnc: A PTIME Description Logic With Functional Constraints and Disjointness", International Workshop on Description Logics (DL), 2013.
Balog, K., D. Elsweiler, E. Kanoulas, L. Kelly, and M. Smucker, "CIKM 2013 Workshop on Living Labs for Information Retrieval Evaluation", International Conference on Information and Knowledge Management (CIKM), 2013.
Whissell, J. S., and C. Clarke, "Classification-Based Clustering Evaluation", IEEE International Conference on Data Mining (ICDM), 2013.
Toman, D., and G. Weddell, "Conjunctive Query Answering in CFDnc: A PTIME Description Logic With Functional Constraints and Disjointness", Australian Joint Conference on Artificial Intelligence (AUS-AI), 2013.
Bellogín, A., G. G. Gebremeskel, J. He, A. Said, T. Samar, A. P. de Vries, J. Lin, and J. B. P. Vuurens, "CWI and TU Delft Notebook TREC 2013: Contextual Suggestion, Federated Web Search, KBA, and Web Tracks", Text Retrieval Conference (TREC), 2013.
Stonebraker, M., D. Bruckner, I. Ilyas, G. Beskales, M. Cherniack, S. B. Zdonik, A. Pagan, and S. Xu, "Data Curation at Scale: The Data Tamer System", Conference on Innovative Data Systems Research (CIDR), 2013.
Lei, B., I. Surya, S. Kamali, and K. Daudjee, "Data Partitioning for Video-on-Demand Services", IEEE International Symposium on Network Computing and Applications (NCA), 2013.
Golab, L., and T. Johnson, "Data Stream Warehousing", ACM International Conference on Management of Data (SIGMOD), 2013.
Asadi, N., J. Lin, and M. Busch, "Dynamic Memory Allocation Policies for Postings in Real-Time Twitter Search", ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2013.
Whissell, J. S., and C. Clarke, "Effective Measures for Inter-Document Similarity", International Conference on Information and Knowledge Management (CIKM), 2013.
Asadi, N., and J. Lin, "Effectiveness/Efficiency Tradeoffs for Candidate Generation in Multi-Stage Retrieval Architectures", International Conference on Research and Development in Information Retrieval (SIGIR), 2013.
Dean-Hall, A., C. Clarke, J. Kamps, and P. Thomas, "Evaluating Contextual Suggestion", Conference on Evaluation of Information Access Technologies (NTCIR), 2013.
Mishne, G., J. Dalton, Z. Li, A. Sharma, and J. Lin, "Fast Data in the Era of Big Data: Twitter's Real-Time Related Query Suggestion Architecture", ACM International Conference on Management of Data (SIGMOD), 2013.
Konow, R., G. Navarro, C. Clarke, and A. López-Ortiz, "Faster and Smaller Inverted Indices With Treaps", International Conference on Research and Development in Information Retrieval (SIGIR), 2013.
Türe, F., and J. Lin, "Flat vs. Hierarchical Phrase-Based Translation Models for Cross-Language Information Retrieval", International Conference on Research and Development in Information Retrieval (SIGIR), 2013.
Salihoglu, S., and J. Widom, "GPS: A Graph Processing System", International Conference on Statistical and Scientific Database Management (SSDBM), 2013.
Chu, X., I. Ilyas, and P. Papotti, "Holistic Data Cleaning: Putting Violations Into Context", IEEE International Conference on Data Engineering (ICDE), 2013.
Ge, C., and L. Golab, "Lazy Data Structure Maintenance for Main-Memory Analytics Over Sliding Windows", International Workshop on Data Warehousing and OLAP (DOLAP), 2013.
Balkesen, C., J. Teubner, G. Alonso, and T. Ozsu, "Main-Memory Hash Joins on Multi-Core CPUs: Tuning to the Underlying Hardware", IEEE International Conference on Data Engineering (ICDE), 2013.
Agrawal, D., A. El Abbadi, H. A. Mahmoud, F. Nawab, and K. Salem, "Managing Geo-Replicated Data in Multi-Datacenters", Databases in Networked Information Systems (DNIS), 2013.
He, H., J. Lin, and A. Lopez, "Massively Parallel Suffix Array Queries and On-Demand Phrase Extraction for Statistical Machine Translation Using GPUs", North American Chapter of the Association for Computational Linguistics (NAACL), 2013.
Jin, C., R. Liu, and K. Salem, "Materialized Views for Eventually Consistent Record Stores", IEEE International Conference on Data Engineering (ICDE), 2013.
Eidelman, V., K. Wu, F. Türe, P. Resnik, and J. Lin, "Mr. MIRA: Open-Source Large-Margin Structured Learning on MapReduce", Association for Computational Linguistics (ACL), 2013.
Dallachiesa, M., A. Ebaid, A. Eldawy, A. K. Elmagarmid, I. Ilyas, M. Ouzzani, and N. Tang, "NADEEF: A Commodity Data Cleaning System", ACM International Conference on Management of Data (SIGMOD), 2013.
Clarke, C., "Nugget-Based Computation of Graded Relevance", Conference on Evaluation of Information Access Technologies (NTCIR), 2013.
Beskales, G., I. Ilyas, L. Golab, and A. Galiullin, "On the Relative Trust Between Inconsistent Data and Inaccurate Constraints", IEEE International Conference on Data Engineering (ICDE), 2013.
Dean-Hall, A., C. Clarke, N. Simone, J. Kamps, P. Thomas, and E. M. Voorhees, "Overview of the TREC 2013 Contextual Suggestion Track", Text Retrieval Conference (TREC), 2013.
Smucker, M., G. Kazai, and M. Lease, "Overview of the TREC 2013 Crowdsourcing Track", Text Retrieval Conference (TREC), 2013.
Lin, J., and M. Efron, "Overview of the TREC-2013 Microblog Track", Text Retrieval Conference (TREC), 2013.
Glavic, B., J. Siddique, P. Andritsos, and R. Miller, "Provenance for Data Mining", Workshop on the Theory and Practice of Provenance (TaPP), 2013.
Northam, L., R. Smits, K. Daudjee, and J. Istead, "Ray Tracing in the Cloud Using MapReduce", International Conference on High Performance Computing & Simulation (HPCS), 2013.
Kamali, S., and F. Tompa, "Retrieving Documents With Mathematical Content", International Conference on Research and Development in Information Retrieval (SIGIR), 2013.
Murdock, V., C. Clarke, J. Kamps, and J. Karlgren, "Search and Exploration of X-Rated Information (SEXI 2013)", Web Search and Data Mining (WSDM), 2013.
Clarke, C., L. Freund, M. Smucker, and E. Yilmaz, "SIGIR 2013 Workshop on Modeling User Behavior for Information Retrieval Evaluation", International Conference on Research and Development in Information Retrieval (SIGIR), 2013.
Kamali, S., and F. Tompa, "Structural Similarity Search for Mathematics Retrieval", International Conference on Intelligent Computer Mathematics (CICM), 2013.
Lutz, C., I. Seylan, D. Toman, and F. Wolter, "The Combined Approach to OBDA: Taming Role Hierarchies Using Filters", International Semantic Web Conference (ISWC), 2013.
Sakai, T., Z. Dou, and C. Clarke, "The Impact of Intent Selection on Diversified Search Evaluation", International Conference on Research and Development in Information Retrieval (SIGIR), 2013.
Clarke, C., "Time-Biased Gain", Conference on Evaluation of Information Access Technologies (NTCIR), 2013.
Eidelman, V., K. Wu, F. Türe, P. Resnik, and J. Lin, "Towards Efficient Large-Scale Feature-Rich Statistical Machine Translation", Conference on Machine Translation (WMT), 2013.
Asadi, N., and J. Lin, "Training Efficient Tree-Based Models for Document Ranking", European Conference on Information Retrieval (ECIR), 2013.
Forsyth, S., and K. Daudjee, "Update Management in Decentralized Social Networks", International Conference on Distributed Computing Systems (ICDCS) - Workshops, 2013.
Glavic, B., R. Miller, and G. Alonso, "Using SQL for Efficient Generation and Querying of Provenance Information", In Search of Elegance in the Theory and Practice of Computation - Essays Dedicated to Peter Buneman, 2013.
Arocena, P. C., B. Glavic, and R. Miller, "Value Invention in Data Exchange", ACM International Conference on Management of Data (SIGMOD), 2013.
Rios, M., and J. Lin, "Visualizing the 'Pulse' of World Cities on Twitter", International Conference on Web and Social Media (ICWSM), 2013.
DeWitt, D. J., I. Ilyas, J. F. Naughton, and M. Stonebraker, "We Are Drowning in a Sea of Least Publishable Units (LPUs)", ACM International Conference on Management of Data (SIGMOD), 2013.
Ammar, K., and T. Ozsu, "WGB: Towards a Universal Graph Benchmark", Workshop on Big Data Benchmarking (WBDB), 2013.
Gupta, P., A. Goel, J. Lin, A. Sharma, D. Wang, and R. Zadeh, "WTF: The Who to Follow Service at Twitter", The Web Conference (WWW), 2013.
Macdonald, C., J. Wang, and C. Clarke, "2nd International Workshop on Diversity in Document Retrieval (DDR 2012)", Web Search and Data Mining (WSDM), 2012.
Golab, L., T. Johnson, S. Sen, and J. Yates, "A Sequence-Oriented Stream Warehouse Paradigm for Network Monitoring Applications", Passive and Active Network Measurement Conference (PAM), 2012.
Lin, J., and G. Mishne, "A Study of 'Churn' in Tweets and Real-Time Search Queries", International Conference on Web and Social Media (ICWSM), 2012.
Wu, J., A. K. Hudek, D. Toman, and G. Weddell, "Absorption for ABoxes", International Workshop on Description Logics (DL), 2012.
Wu, J., A. K. Hudek, D. Toman, and G. Weddell, "Assertion Absorption in Object Queries Over Knowledge Bases", International Conference on Principles of Knowledge Representation and Reasoning (KR), 2012.
Chiang, F., P. Andritsos, E. Zhu, and R. Miller, "AutoDict: Automated Dictionary Discovery", IEEE International Conference on Data Engineering (ICDE), 2012.
Chiang, F., and R. Miller, "Automated Dictionary Discovery for the Online Marketplace", iConference, 2012.
Türe, F., J. Lin, and D. W. Oard, "Combining Statistical Translation Techniques for Cross-Language Information Retrieval", International Conference on Computational Linguistics (COLING), 2012.
Afrati, F. N., M. Balazinska, A. Das Sarma, B. Howe, S. Salihoglu, and J. D. Ullman, "Designing Good Algorithms for MapReduce and B