Publications

2019

De Sa, C., I. Ilyas, B. Kimelfeld, C. Ré, and T. Rekatsinas, "A Formal Framework for Probabilistic Unclean Databases.", ICDT, 2019.
Alonso, G., C. Binnig, I. Pandis, K. Salem, J. Skrzypczak, R. Stutsman, L. Thostrup, T. Wang, Z. Wang, and T. Ziegler, "DPI: The Data Processing Interface for Modern Networks.", CIDR, 2019.
Chopra, S., A. Khan, M. Mirsafian, and L. Golab, "Gender Differences in Science and Engineering: A Data Mining Approach.", EDBT/ICDT Workshops, 2019.
Yu, R., Y. Xie, and J. Lin, "Simple Techniques for Cross-Collection Relevance Feedback.", ECIR (1), 2019.
Yan, D., G. Guo, M. Chowdhury, M. Özsu, J. Lui, and W. Tan, "T-thinker: a task-centric distributed framework for compute-intensive divide-and-conquer algorithms.", PPoPP, pp. 411–412, 2019.
Lee, J., R. Tang, and J. Lin, "Universal voice-enabled user interfaces using JavaScript.", IUI Companion, pp. 81–82, 2019.
Abebe, M., B. Glasbergen, and K. Daudjee, "WatDFS: A Project for Understanding Distributed Systems in the Undergraduate Curriculum.", SIGCSE, pp. 920–926, 2019.
Aluç, G., M. Özsu, and K. Daudjee, "Building self-clustering RDF databases using Tunable-LSH.", VLDB J., vol. 28, no. 2, 2019.
Yang, W., Y. Xie, L. Tan, K. Xiong, M. Li, and J. Lin, "Data Augmentation for BERT Fine-Tuning in Open-Domain Question Answering.", CoRR, vol. abs/1904.06652, 2019.
Ilyas, I., "Data unification at scale: data tamer.", Making Databases Work, 2019.
Tang, R., Y. Lu, L. Liu, L. Mou, O. Vechtomova, and J. Lin, "Distilling Task-Specific Knowledge from BERT into Simple Neural Networks.", CoRR, vol. abs/1903.12136, 2019.
Saxena, H., L. Golab, and I. Ilyas, "Distributed Dependency Discovery.", CoRR, vol. abs/1903.05228, 2019.
Adhikari, A., A. Ram, R. Tang, and J. Lin, "DocBERT: BERT for Document Classification.", CoRR, vol. abs/1904.08398, 2019.
Nogueira, R., W. Yang, J. Lin, and K. Cho, "Document Expansion by Query Prediction.", CoRR, vol. abs/1904.08375, 2019.
Yang, W., Y. Xie, A. Lin, X. Li, L. Tan, K. Xiong, M. Li, and J. Lin, "End-to-End Open-Domain Question Answering with BERTserini.", CoRR, vol. abs/1902.01718, 2019.
Gorenflo, C., S. Lee, L. Golab, and S. Keshav, "FastFabric: Scaling Hyperledger Fabric to 20,000 Transactions per Second.", CoRR, vol. abs/1901.00910, 2019.
Salihoglu, S., and N. Yakovets, "Graph Query Processing.", Encyclopedia of Big Data Technologies, 2019.
Heidari, A., J. McGrath, I. Ilyas, and T. Rekatsinas, "HoloDetect: Few-Shot Learning for Error Detection.", CoRR, vol. abs/1904.02285, 2019.
Azmy, M., P. Shi, J. Lin, and I. Ilyas, "Matching Entities Across Different Knowledge Graphs with Graph Embeddings.", CoRR, vol. abs/1903.06607, 2019.
Mhedhbi, A., and S. Salihoglu, "Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins.", CoRR, vol. abs/1903.02076, 2019.
Livshits, E., I. Ilyas, B. Kimelfeld, and S. Roy, "Principles of Progress Indicators for Database Repairing.", CoRR, vol. abs/1904.06492, 2019.
Yang, W., H. Zhang, and J. Lin, "Simple Applications of BERT for Ad Hoc Document Retrieval.", CoRR, vol. abs/1903.10972, 2019.
Shi, P., and J. Lin, "Simple BERT Models for Relation Extraction and Semantic Role Labeling.", CoRR, vol. abs/1904.05255, 2019.
Golab, L., "Types of Stream Processing Algorithms.", Encyclopedia of Big Data Technologies, 2019.
"Foreword.", Making Databases Work, 2019.

2018

Zhang, H., M. Abualsaud, and M. Smucker, "A Study of Immediate Requery Behavior in Search.", CHIIR, pp. 181–190, 2018.
Abualsaud, M., N. Ghelani, H. Zhang, M. Smucker, G. Cormack, and M. Grossman, "A System for Efficient High-Recall Retrieval.", SIGIR, pp. 1317–1320, 2018.
Koutris, P., S. Salihoglu, and D. Suciu, "Algorithmic Aspects of Parallel Query Processing.", SIGMOD Conference, pp. 1659–1664, 2018.
Tang, R., W. Wang, Z. Tu, and J. Lin, "An Experimental Analysis of the Power Consumption of Convolutional Neural Networks for Keyword Spotting.", ICASSP, pp. 5479–5483, 2018.
Glasbergen, B., M. Abebe, K. Daudjee, S. Foggo, and A. Pacaci, "Apollo: Learning Query Correlations for Predictive Caching in Geo-Distributed Systems.", EDBT, pp. 253–264, 2018.
Cormack, G., and M. Grossman, "Beyond Pooling.", SIGIR, pp. 1169–1172, 2018.
Mansour, E., D. Deng, R. Fernandez, A. Qahtan, W. Tao, Z. Abedjan, A. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, et al., "Building Data Civilizer Pipelines with an Advanced Workflow Engine.", ICDE, pp. 1593–1596, 2018.
Yan, X., L. Yang, H. Zhang, X. Lin, B. Wong, K. Salem, and T. Brecht, "Carousel: Low-Latency Transaction Processing for Globally-Distributed Data.", SIGMOD Conference, pp. 231–243, 2018.
Fraser, D., A. Kane, and F. Tompa, "Choosing Math Features for BM25 Ranking with Tangent-L.", DocEng, pp. 17:1-17:10, 2018.
Langouri, M., Z. Zheng, F. Chiang, L. Golab, and J. Szlichta, "Contextual Data Cleaning.", ICDE Workshops, pp. 21–24, 2018.
Chopra, S., Y. Jiang, A. Toulis, and L. Golab, "Data Analytics to Improve Co-Operative Education.", EDBT/ICDT Workshops, pp. 16–21, 2018.
Tang, R., and J. Lin, "Deep Residual Learning for Small-Footprint Keyword Spotting.", ICASSP, pp. 5484–5488, 2018.
Pacaci, A., and M. Özsu, "Distribution-Aware Stream Partitioning for Distributed Stream Processing Systems.", BeyondMR@SIGMOD, pp. 6:1-6:10, 2018.
Abebe, M., K. Daudjee, B. Glasbergen, and Y. Tian, "EC-Store: Bridging the Gap between Storage and Latency in Distributed Erasure Coded Systems.", ICDCS, pp. 255–266, 2018.
Zihayat, M., A. An, L. Golab, M. Kargar, and J. Szlichta, "Effective Team Formation in Expert Networks.", AMW, 2018.
Zhang, H., M. Abualsaud, N. Ghelani, M. Smucker, G. Cormack, and M. Grossman, "Effective User Interaction for High-Recall Retrieval: Less is More.", CIKM, pp. 187–196, 2018.
Azmy, M., P. Shi, J. Lin, and I. Ilyas, "Farewell Freebase: Migrating the SimpleQuestions Dataset to DBpedia.", COLING, pp. 2093–2103, 2018.
Mihaylov, A., P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "FASTOD: Bringing Order to Data.", ICDE, pp. 1561–1564, 2018.
Zheng, Z., M. Alipour, Z. Qu, I. Currie, F. Chiang, L. Golab, and J. Szlichta, "FastOFD: Contextual Data Cleaning with Ontology Functional Dependencies.", EDBT, pp. 694–697, 2018.
Chopra, S., H. Gautreau, A. Khan, M. Mirsafian, and L. Golab, "Gender Differences in Undergraduate Engineering Applicants: A Text Mining Approach.", EDM, 2018.
Toman, D., and G. Weddell, "Identity Resolution in Conjunctive Querying over DL-Based Knowledge Bases.", Description Logics, 2018.
Peng, P., L. Zou, M. Özsu, and D. Zhao, "Multi-query Optimization in Federated RDF Systems.", DASFAA (1), pp. 745–765, 2018.
McIntyre, S., A. Borgida, D. Toman, and G. Weddell, "On Limited Conjunctions in Polynomial Feature Logics, with Applications in OBDA.", KR, pp. 655–656, 2018.
Mackenzie, J., J. Culpepper, R. Blanco, M. Crane, C. Clarke, and J. Lin, "Query Driven Algorithm Selection in Early Stage Retrieval.", WSDM, pp. 396–404, 2018.
Memon, B., X. Lin, A. Mufti, A. Wesley, T. Brecht, K. Salem, B. Wong, and B. Cassell, "RaMP: A Lightweight RDMA Abstraction for Loosely Coupled Applications.", HotCloud, 2018.
Grewal, A., J. Jiang, G. Lam, T. Jung, L. Vuddemarri, Q. Li, A. Landge, and J. Lin, "RecService: Distributed Real-Time Graph Processing at Twitter.", HotCloud, 2018.
Ghelani, N., G. Cormack, and M. Smucker, "Refresh Strategies in Continuous Active Learning.", ProfS/KG4IR/Data:Search@SIGIR, pp. 18–23, 2018.
Mior, M., and K. Salem, "Renormalization of NoSQL Database Schemas.", ER, pp. 479–487, 2018.
Yang, P., S. Thiagarajan, and J. Lin, "Robust, Scalable, Real-Time Event Time Series Aggregation at Twitter.", SIGMOD Conference, pp. 595–599, 2018.
Fernandez, R., E. Mansour, A. Qahtan, A. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, M. Stonebraker, and N. Tang, "Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery.", ICDE, pp. 989–1000, 2018.
Kim, Y., and J. Lin, "Serverless Data Analytics with Flint.", IEEE CLOUD, pp. 451–455, 2018.
Aleardi, L., S. Salihoglu, G. Singh, and M. Ovsjanikov, "Spectral Measures of Distortion for Change Detection in Dynamic Graphs.", COMPLEX NETWORKS (2), pp. 54–66, 2018.
Kane, A., and F. Tompa, "Split-Lists and Initial Thresholds for WAND-based Search.", SIGIR, pp. 877–880, 2018.
Gao, L., L. Golab, M. Özsu, and G. Aluç, "Stream WatDiv: A Streaming RDF Benchmark.", SBD@SIGMOD, pp. 3:1-3:6, 2018.
Mohammed, S., P. Shi, and J. Lin, "Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks.", NAACL-HLT (2), pp. 291–296, 2018.
Grewal, A., and J. Lin, "The Evolution of Content Analysis for Personalized Recommendations at Twitter.", SIGIR, pp. 1355–1356, 2018.
Cormack, G., and M. Grossman, "The Quest for Total Recall.", DocEng, pp. 6:1-6:2, 2018.
Ma, W., C. Keet, W. Oldford, D. Toman, and G. Weddell, "The Utility of the Abstract Relational Model and Attribute Paths in SQL.", EKAW, pp. 195–211, 2018.
Glasbergen, B., M. Abebe, and K. Daudjee, "Tutorial: Adaptive Replication and Partitioning in Data Systems.", Middleware (Tutorials), pp. 1:1-1:5, 2018.
Lin, J., S. Mohammed, R. Sequiera, and L. Tan, "Update Delivery Mechanisms for Prospective Information Needs: An Analysis of Attention in Mobile Users.", SIGIR, pp. 785–794, 2018.
Rao, J., F. Türe, and J. Lin, "What Do Viewers Say to Their TVs?: An Analysis of Voice Queries to Entertainment Systems.", SIGIR, pp. 1213–1216, 2018.
Korkmaz, M., M. Karsten, K. Salem, and S. Salihoglu, "Workload-Aware CPU Performance Scaling for Transactional Database Systems.", SIGMOD Conference, pp. 291–306, 2018.
Liang, Y., Z. Tu, L. Huang, and J. Lin, "CNNs for NLP in the Browser: Client-Side Deployment and Visualization Opportunities.", NAACL-HLT (Demonstrations), pp. 61–65, 2018.
Tompa, F., "Fashioning a Search Engine to Support Humanities Research.", DocEng, pp. 32:1-32:10, 2018.
Tompa, F., "Hypertexts.", Encyclopedia of Database Systems (2nd ed.), 2018.
Grossman, M., and G. Cormack, "MRG_UWaterloo Participation in the TREC 2018 Common Core Track.", TREC, 2018.
Tu, Z., M. Li, and J. Lin, "Pay-Per-Request Deployment of Neural Network Models Using Serverless Architectures.", NAACL-HLT (Demonstrations), pp. 6–10, 2018.
Abualsaud, M., G. Cormack, N. Ghelani, A. Ghenai, M. Grossman, S. Rahbariasl, H. Zhang, and M. Smucker, "UWaterlooMDS at the TREC 2018 Common Core Track.", TREC, 2018.
De Sa, C., I. Ilyas, B. Kimelfeld, C. Ré, and T. Rekatsinas, "A Formal Framework For Probabilistic Unclean Databases.", CoRR, vol. abs/1801.06750, 2018.
Ren, Y., M. Tomko, F. Salim, J. Chan, C. Clarke, and M. Sanderson, "A Location-Query-Browse Graph for Contextual Recommendation.", IEEE Trans. Knowl. Data Eng., vol. 30, no. 2, pp. 204–218, 2018.
Chomicki, J., and D. Toman, "Abstract Versus Concrete Temporal Query Languages.", Encyclopedia of Database Systems (2nd ed.), 2018.
Tang, R., and J. Lin, "Adaptive Pruning of Neural Language Models for Mobile Devices.", CoRR, vol. abs/1809.10282, 2018.
Koutris, P., S. Salihoglu, and D. Suciu, "Algorithmic Aspects of Parallel Data Processing.", Foundations and Trends in Databases, vol. 8, no. 4, pp. 239–370, 2018.
Yang, P., H. Fang, and J. Lin, "Anserini: Reproducible Ranking Baselines Using Lucene.", J. Data and Information Quality, vol. 10, no. 4, pp. 16:1-16:20, 2018.
Tang, G., S. Keshav, L. Golab, and K. Wu, "Bikeshare Pool Sizing for Bike-and-Ride Multimodal Transit.", IEEE Trans. Intelligent Transportation Systems, vol. 19, no. 7, pp. 2279–2289, 2018.
Özsu, M., "Client-Server Architecture.", Encyclopedia of Database Systems (2nd ed.), 2018.
Stonebraker, M., and I. Ilyas, "Data Integration: The Current Status and the Way Forward.", IEEE Data Eng. Bull., vol. 41, no. 2, pp. 3–9, 2018.
Özsu, M., "Data Manipulation Language (DML).", Encyclopedia of Database Systems (2nd ed.), 2018.
Golab, L., "Data Stream.", Encyclopedia of Database Systems (2nd ed.), 2018.
Özsu, M., "Database Administrator (DBA).", Encyclopedia of Database Systems (2nd ed.), 2018.
Özsu, M., "Database.", Encyclopedia of Database Systems (2nd ed.), 2018.
Ammar, K., F. McSherry, S. Salihoglu, and M. Joglekar, "Distributed Evaluation of Subgraph Queries Using Worst-case Optimal and Low-Memory Dataflows.", PVLDB, vol. 11, no. 6, pp. 691–704, 2018.
Ammar, K., F. McSherry, S. Salihoglu, and M. Joglekar, "Distributed Evaluation of Subgraph Queries Using Worst-case Optimal Low-Memory Dataflows.", CoRR, vol. abs/1802.03760, 2018.
Tompa, F., "Document Databases.", Encyclopedia of Database Systems (2nd ed.), 2018.
Szlichta, J., P. Godfrey, L. Golab, M. Kargar, and D. Srivastava, "Effective and complete discovery of bidirectional order dependencies via set-based axioms.", VLDB J., vol. 27, no. 4, pp. 573–591, 2018.
Tompa, F., "Enterprise Content Management.", Encyclopedia of Database Systems (2nd ed.), 2018.
Lamb, C., D. Brown, and C. Clarke, "Evaluating Computational Creativity: An Interdisciplinary Tutorial.", ACM Comput. Surv., vol. 51, no. 2, pp. 28:1-28:34, 2018.
Zhang, H., G. Cormack, M. Grossman, and M. Smucker, "Evaluating Sentence-Level Relevance Feedback for High-Recall Information Retrieval.", CoRR, vol. abs/1803.08988, 2018.
Hopfgartner, F., A. Hanbury, H. Müller, I. Eggel, K. Balog, T. Brodt, G. Cormack, J. Lin, J. Kalpathy-Cramer, N. Kando, et al., "Evaluation-as-a-Service for the Computational Sciences: Overview and Outlook.", J. Data and Information Quality, vol. 10, no. 4, pp. 15:1-15:32, 2018.
Ammar, K., and M. Özsu, "Experimental Analysis of Distributed Graph Systems.", PVLDB, vol. 11, no. 10, pp. 1151–1164, 2018.
Ammar, K., and M. Özsu, "Experimental Analysis of Distributed Graph Systems.", CoRR, vol. abs/1806.08082, 2018.
Gebaly, K., G. Feng, L. Golab, F. Korn, and D. Srivastava, "Explanation Tables.", IEEE Data Eng. Bull., vol. 41, no. 3, pp. 43–51, 2018.
Tang, R., A. Adhikari, and J. Lin, "FLOPs as a Direct Optimization Objective for Learning Sparse Neural Networks.", CoRR, vol. abs/1811.03060, 2018.
Gebaly, K., and J. Lin, "In-Browser Split-Execution Support for Interactive Analytics in the Cloud.", CoRR, vol. abs/1804.08822, 2018.
Rao, J., W. Yang, Y. Zhang, F. Türe, and J. Lin, "Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search.", CoRR, vol. abs/1805.08159, 2018.
Toman, D., "Point-Stamped Temporal Models.", Encyclopedia of Database Systems (2nd ed.), 2018.
Tang, R., and J. Lin, "Progress and Tradeoffs in Neural Language Models.", CoRR, vol. abs/1811.00942, 2018.
Ilyas, I., "Rank-Aware Query Processing.", Encyclopedia of Database Systems (2nd ed.), 2018.
Ilyas, I., "Rank-Join.", Encyclopedia of Database Systems (2nd ed.), 2018.
Lin, J., and P. Yang, "Repeatability Corner Cases in Document Ranking: The Impact of Score Ties.", CoRR, vol. abs/1807.05798, 2018.
Liu, Y., M. Kato, C. Clarke, N. Kando, and T. Sakai, "Report on NTCIR-13: The Thirteenth Round of NII Testbeds and Community for Information Access Research.", SIGIR Forum, vol. 52, no. 1, pp. 102–110, 2018.
Culpepper, J., F. Diaz, and M. Smucker, "Research Frontiers in Information Retrieval: Report from the Third Strategic Workshop on Information Retrieval in Lorne (SWIRL 2018).", SIGIR Forum, vol. 52, no. 1, pp. 34–90, 2018.
Salihoglu, S., and M. Özsu, "Response to “Scale Up or Scale Out for Graph Processing”.", IEEE Internet Computing, vol. 22, no. 5, pp. 18–24, 2018.
Salem, K., "Sagas.", Encyclopedia of Database Systems (2nd ed.), 2018.
El-Roby, A., K. Ammar, A. Aboulnaga, and J. Lin, "Sapphire: Querying RDF Data Made Simple.", CoRR, vol. abs/1805.11728, 2018.
Lin, J., "Scale Up or Scale Out for Graph Processing?", IEEE Internet Computing, vol. 22, no. 3, pp. 72–78, 2018.
Kushagra, S., S. Ben-David, and I. Ilyas, "Semi-supervised clustering for de-duplication.", CoRR, vol. abs/1810.04361, 2018.
Kim, Y., and J. Lin, "Serverless Data Analytics with Flint.", CoRR, vol. abs/1803.06354, 2018.
Shi, P., J. Rao, and J. Lin, "Simple Attention-Based Representation Learning for Ranking Short Social Media Posts.", CoRR, vol. abs/1811.01013, 2018.
Golab, L., "Stream Models.", Encyclopedia of Database Systems (2nd ed.), 2018.
Tang, R., G. Yang, H. Wei, Y. Mao, F. Türe, and J. Lin, "Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks.", CoRR, vol. abs/1812.07754, 2018.
Chomicki, J., and D. Toman, "Temporal Logic in Database Query Languages.", Encyclopedia of Database Systems (2nd ed.), 2018.
Chomicki, J., and D. Toman, "Temporal Relational Calculus.", Encyclopedia of Database Systems (2nd ed.), 2018.
Roddick, J., and D. Toman, "Temporal Vacuuming.", Encyclopedia of Database Systems (2nd ed.), 2018.
Lin, J., "The Neural Hype and Comparisons Against Weak Baselines.", SIGIR Forum, vol. 52, no. 2, pp. 40–51, 2018.
Li, Y., L. Zou, M. Özsu, and D. Zhao, "Time Constrained Continuous Subgraph Search over Streaming Graphs.", CoRR, vol. abs/1801.09240, 2018.
Ilyas, I., "Top-k Queries.", Encyclopedia of Database Systems (2nd ed.), 2018.
Clarke, C., "Web Question Answering.", Encyclopedia of Database Systems (2nd ed.), 2018.
Lin, J., "Summarization.", Encyclopedia of Database Systems (2nd ed.), 2018.

2017

Crane, M., J. Culpepper, J. Lin, J. Mackenzie, and A. Trotman, "A Comparison of Document-at-a-Time and Score-at-a-Time Query Evaluation.", WSDM, pp. 201–210, 2017.
Baruah, G., R. McCreadie, and J. Lin, "A Comparison of Nuggets and Clusters for Evaluating Timeline Summaries.", CIKM, pp. 67–76, 2017.
Fernandez, R., D. Deng, E. Mansour, A. Qahtan, W. Tao, Z. Abedjan, A. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, et al., "A Demo of the Data Civilizer System.", SIGMOD Conference, pp. 1639–1642, 2017.
Karyakin, A., and K. Salem, "An analysis of memory power consumption in database systems.", DaMoN, pp. 2:1-2:9, 2017.
Crane, M., and J. Lin, "An Exploration of Serverless Architectures for Information Retrieval.", ICTIR, pp. 241–244, 2017.
He, H., K. Ganjam, N. Jain, J. Lundin, R. White, and J. Lin, "An Insight Extraction System on BioMedical Literature with Deep Neural Networks.", EMNLP, pp. 2691–2701, 2017.
Yang, P., H. Fang, and J. Lin, "Anserini: Enabling the Use of Lucene for Information Retrieval Research.", SIGIR, pp. 1253–1256, 2017.
Zihayat, M., A. An, L. Golab, M. Kargar, and J. Szlichta, "Authority-based Team Discovery in Social Networks.", EDBT, pp. 498–501, 2017.
Grossman, M., G. Cormack, and A. Roegiest, "Automatic and Semi-Automatic Document Selection for Technology-Assisted Review.", SIGIR, pp. 905–908, 2017.
Zhang, H., J. Rao, J. Lin, and M. Smucker, "Automatically Extracting High-Quality Negative Examples for Answer Selection in Question Answering.", SIGIR, pp. 797–800, 2017.
Borgida, A., D. Toman, and G. Weddell, "Concerning Referring Expressions in Query Answers.", IJCAI, pp. 4791–4795, 2017.
Abedjan, Z., L. Golab, and F. Naumann, "Data Profiling: A Tutorial.", SIGMOD Conference, pp. 1747–1751, 2017.
Pacaci, A., A. Zhou, J. Lin, and M. Özsu, "Do We Need Specialized Graph Databases?: Benchmarking Real-Time Social Networking Applications.", GRADES@SIGMOD/PODS, pp. 12:1-12:7, 2017.
Baskaran, S., A. Keller, F. Chiang, L. Golab, and J. Szlichta, "Efficient Discovery of Ontology Functional Dependencies.", CIKM, pp. 1847–1856, 2017.
Ghelani, N., S. Mohammed, S. Wang, and J. Lin, "Event Detection on Curated Tweet Streams.", SIGIR, pp. 1325–1328, 2017.
Rao, J., H. He, and J. Lin, "Experiments with Convolutional Neural Network Models for Answer Selection.", SIGIR, pp. 1217–1220, 2017.
Vtyurina, A., D. Savenkov, E. Agichtein, and C. Clarke, "Exploring Conversational Search With Humans, Assistants, and Wizards.", CHI Extended Abstracts, pp. 2187–2193, 2017.
Sequiera, R., and J. Lin, "Finally, a Downloadable Test Collection of Tweets.", SIGIR, pp. 1225–1228, 2017.
Toulis, A., and L. Golab, "Graph Mining to Characterize Competition for Employment.", NDA@SIGMOD, pp. 3:1-3:7, 2017.
Kankanamge, C., S. Sahu, A. Mhedhbi, J. Chen, and S. Salihoglu, "Graphflow: An Active Graph Database.", SIGMOD Conference, pp. 1695–1698, 2017.
Afrati, F., M. Joglekar, C. Ré, S. Salihoglu, and J. Ullman, "GYM: A Multiround Distributed Join Algorithm.", ICDT, pp. 4:1-4:18, 2017.
Fink, S., L. Golab, S. Keshav, and H. de Meer, "How Similar is the Usage of Electric Cars and Electric Bicycles?", e-Energy, pp. 334–340, 2017.
Gebaly, K., and J. Lin, "In-Browser Interactive SQL Analytics with Afterburner.", SIGMOD Conference, pp. 1623–1626, 2017.
Gorenflo, C., L. Golab, and S. Keshav, "Managing Sensor Data Streams: Lessons Learned from the WeBike Project.", SSDBM, pp. 1:1-1:11, 2017.
Rao, J., F. Türe, X. Niu, and J. Lin, "Mining the Temporal Statistics of Query Terms for Searching Social Media Posts.", ICTIR, pp. 133–140, 2017.
Cui, X., M. Mior, B. Wong, K. Daudjee, and S. Rizvi, "Netstore: leveraging network optimizations to improve distributed transaction processing performance.", ACTIVE@Middleware, pp. 1–10, 2017.
Roegiest, A., L. Tan, and J. Lin, "Online In-Situ Interleaved Evaluation of Real-Time Push Notification Systems.", SIGIR, pp. 415–424, 2017.
Meng, X., and L. Golab, "Optimal reducer placement to minimize data transfer in MapReduce-style processing.", BigData, pp. 339–346, 2017.
Lin, J., S. Mohammed, R. Sequiera, L. Tan, N. Ghelani, M. Abualsaud, R. McCreadie, D. Milajevs, and E. Voorhees, "Overview of the TREC 2017 Real-Time Summarization Track.", TREC, 2017.
Mohammed, S., M. Crane, and J. Lin, "Quantization in Append-Only Collections.", ICTIR, pp. 265–268, 2017.
Mate, J., K. Daudjee, and S. Kamali, "Robust Multi-tenant Server Consolidation in the Cloud for Data Analytics Workloads.", ICDCS, pp. 2111–2118, 2017.
Feng, G., L. Golab, and D. Srivastava, "Scalable Informative Rule Mining.", ICDE, pp. 437–448, 2017.
Kane, A., and F. Tompa, "Small-Term Distribution for Disk-Based Search.", DocEng, pp. 49–58, 2017.
Toulis, A., and L. Golab, "Social Media Mining to Understand Public Mental Health.", DMAH@VLDB, pp. 55–70, 2017.
Rao, J., F. Türe, H. He, O. Jojic, and J. Lin, "Talking to Your TV: Context-Aware Voice Search with Hierarchical Recurrent Neural Networks.", CIKM, pp. 557–566, 2017.
Clarke, C., G. Cormack, J. Lin, and A. Roegiest, "Ten Blue Links on Mars.", WWW, pp. 273–281, 2017.
Deng, D., R. Fernandez, Z. Abedjan, S. Wang, M. Stonebraker, A. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, and N. Tang, "The Data Civilizer System.", CIDR, 2017.
Azzopardi, L., M. Crane, H. Fang, G. Ingersoll, J. Lin, Y. Moshfeghi, H. Scells, P. Yang, and G. Zuccon, "The Lucene for Information Access and Retrieval Research (LIARR) Workshop at SIGIR 2017.", SIGIR, pp. 1429–1430, 2017.
Pogacar, F., A. Ghenai, M. Smucker, and C. Clarke, "The Positive and Negative Influence of Search Results on People’s Decisions about the Efficacy of Medical Treatments.", ICTIR, pp. 209–216, 2017.
Zhang, H., M. Abualsaud, N. Ghelani, A. Ghosh, M. Smucker, G. Cormack, and M. Grossman, "UWaterlooMDS at the TREC 2017 Common Core Track.", TREC, 2017.
Tang, R., W. Wang, Z. Tu, and J. Lin, "An Experimental Analysis of the Power Consumption of Convolutional Neural Networks for Keyword Spotting.", CoRR, vol. abs/1711.00333, 2017.
Tu, Z., M. Crane, R. Sequiera, J. Zhang, and J. Lin, "An Exploration of Approaches to Integrating Neural Reranking Models in Multi-Stage Ranking Architectures.", CoRR, vol. abs/1707.08275, 2017.
Abdelaziz, I., R. Harbi, S. Salihoglu, and P. Kalnis, "Combining Vertex-Centric Graph Processing with SPARQL for Large-Scale RDF Data Analytics.", IEEE Trans. Parallel Distrib. Syst., vol. 28, no. 12, pp. 3374–3388, 2017.
Sadiq, S., T. Dasu, X. Dong, J. Freire, I. Ilyas, S. Link, R. Miller, F. Naumann, X. Zhou, and D. Srivastava, "Data Quality: The Role of Empiricism.", SIGMOD Record, vol. 46, no. 4, pp. 35–43, 2017.
Tang, R., and J. Lin, "Deep Residual Learning for Small-Footprint Keyword Spotting.", CoRR, vol. abs/1710.10361, 2017.
Mohammed, S., N. Ghelani, and J. Lin, "Distant Supervision for Topic Classification of Tweets in Curated Streams.", CoRR, vol. abs/1704.06726, 2017.
Szlichta, J., P. Godfrey, L. Golab, M. Kargar, and D. Srivastava, "Effective and Complete Discovery of Order Dependencies via Set-based Axiomatization.", PVLDB, vol. 10, no. 7, pp. 721–732, 2017.
Mackenzie, J., J. Culpepper, R. Blanco, M. Crane, C. Clarke, and J. Lin, "Efficient and Effective Tail Latency Minimization in Multi-Stage Retrieval Systems.", CoRR, vol. abs/1704.03970, 2017.
Deng, D., W. Tao, Z. Abedjan, A. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, M. Stonebraker, and N. Tang, "Entity Consolidation: The Golden Record Problem.", CoRR, vol. abs/1709.10436, 2017.
Sequiera, R., G. Baruah, Z. Tu, S. Mohammed, J. Rao, H. Zhang, and J. Lin, "Exploring the Effectiveness of Convolutional Neural Networks for Answer Selection in End-to-End Question Answering.", CoRR, vol. abs/1707.07804, 2017.
Yan, D., H. Chen, J. Cheng, M. Özsu, Q. Zhang, and J. Lui, "G-thinker: Big Graph Mining Made Easier and Faster.", CoRR, vol. abs/1709.03110, 2017.
Zou, L., and M. Özsu, "Graph-Based RDF Data Management.", Data Science and Engineering, vol. 2, no. 1, pp. 56–70, 2017.
Rekatsinas, T., X. Chu, I. Ilyas, and C. Ré, "HoloClean: Holistic Data Repairs with Probabilistic Inference.", PVLDB, vol. 10, no. 11, pp. 1190–1201, 2017.
Rekatsinas, T., X. Chu, I. Ilyas, and C. Ré, "HoloClean: Holistic Data Repairs with Probabilistic Inference.", CoRR, vol. abs/1702.00820, 2017.
Vadehra, A., M. Grossman, and G. Cormack, "Impact of Feature Selection on Micro-Text Classification.", CoRR, vol. abs/1708.08123, 2017.
Lin, J., "In Defense of MapReduce.", IEEE Internet Computing, vol. 21, no. 3, pp. 94–98, 2017.
Rao, J., H. He, H. Zhang, F. Türe, R. Sequiera, S. Mohammed, and J. Lin, "Integrating Lexical and Temporal Signals in Neural Ranking Models for Searching Social Media Streams.", CoRR, vol. abs/1707.07792, 2017.
Konow, R., G. Navarro, C. Clarke, and A. López-Ortiz, "Inverted Treaps.", ACM Trans. Inf. Syst., vol. 35, no. 3, pp. 22:1-22:45, 2017.
"Logic programming approach to automata-based decision procedures.", J. Log. Algebr. Meth. Program., vol. 86, no. 1, pp. 391–407, 2017.
Mior, M., K. Salem, A. Aboulnaga, and R. Liu, "NoSE: Schema Design for NoSQL Applications.", IEEE Trans. Knowl. Data Eng., vol. 29, no. 10, pp. 2275–2289, 2017.
Allan, J., N. Belkin, P. Bennett, J. Callan, C. Clarke, F. Diaz, S. Dumais, N. Ferro, D. Harman, D. Hiemstra, et al., "Overview of Special Issue.", SIGIR Forum, vol. 51, no. 2, pp. 1–25, 2017.
Ge, C., I. Ilyas, X. He, and A. Machanavajjhala, "Private Exploration Primitives for Data Cleaning.", CoRR, vol. abs/1712.10266, 2017.
Liu, X., L. Golab, W. Golab, I. Ilyas, and S. Jin, "Smart Meter Data Analytics: Systems, Algorithms, and Benchmarking.", ACM Trans. Database Syst., vol. 42, no. 1, pp. 2:1-2:39, 2017.
Mohammed, S., P. Shi, and J. Lin, "Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks.", CoRR, vol. abs/1712.01969, 2017.
Rao, J., F. Türe, H. He, O. Jojic, and J. Lin, "Talking to Your TV: Context-Aware Voice Search with Hierarchical Recurrent Neural Networks.", CoRR, vol. abs/1705.04892, 2017.
Lin, J., "The Lambda and the Kappa.", IEEE Internet Computing, vol. 21, no. 5, pp. 60–66, 2017.
Lin, J., and A. Trotman, "The role of index compression in score-at-a-time query evaluation.", Inf. Retr. Journal, vol. 20, no. 3, pp. 199–220, 2017.
Sahu, S., A. Mhedhbi, S. Salihoglu, J. Lin, and M. Özsu, "The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing.", PVLDB, vol. 11, no. 4, pp. 420–431, 2017.
Sahu, S., A. Mhedhbi, S. Salihoglu, J. Lin, and M. Özsu, "The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing: A User Survey.", CoRR, vol. abs/1709.03188, 2017.
Yang, Y., L. Golab, and M. Özsu, "ViewDF: Declarative incremental view maintenance for streaming data.", Inf. Syst., vol. 71, pp. 55–67, 2017.
Lin, J., I. Milligan, J. Wiebe, and A. Zhou, "Warcbase: Scalable Analytics Infrastructure for Exploring Web Archives.", JOCCH, vol. 10, no. 4, pp. 22:1-22:30, 2017.
Shen, C., T. Shen, and J. Lin, "Comparative Assessment of Alignment Algorithms for NGS Data: Features, Considerations, Implementations, and Future.", Algorithms for Next-Generation Sequencing Data, pp. 187–202, 2017.

2016

Jacques, J. St., D. Toman, and G. Weddell, "Object-Relational Queries over CFDI_nc Knowledge Bases: OBDA for the SQL-Literate.", IJCAI, pp. 1258–1264, 2016.
Agrawal, S., and K. Daudjee, "A Performance Comparison of Algorithms for Byzantine Agreement in Distributed Systems.", EDCC, pp. 249–260, 2016.
Roegiest, A., L. Tan, J. Lin, and C. Clarke, "A Platform for Streaming Push Notifications to Mobile Assessors.", SIGIR, pp. 1077–1080, 2016.
Wu, G., and F. Tompa, "A Space-Efficient Data Structure for Fast Access Control in ECM Systems.", SACMAT, pp. 191–201, 2016.
Roegiest, A., and G. Cormack, "An Architecture for Privacy-Preserving and Replicable High-Recall Retrieval Experiments.", SIGIR, pp. 1085–1088, 2016.
Hashemi, S., C. Clarke, A. Dean-Hall, J. Kamps, and J. Kiseleva, "An Easter Egg Hunting Approach to Test Collection Building in Dynamic Domains.", EVIA@NTCIR, 2016.
Tan, L., A. Roegiest, J. Lin, and C. Clarke, "An Exploration of Evaluation Metrics for Mobile Push Notifications.", SIGIR, pp. 741–744, 2016.
Al-Harbi, A., and M. Smucker, "Are Secondary Assessors Uncertain When They Disagree About Relevance Judgements?", CHIIR, pp. 233–236, 2016.
Farid, M., A. Roatis, I. Ilyas, H-F. Hoffmann, and X. Chu, "CLAMS: Bringing Quality to Data Lakes.", SIGMOD Conference, pp. 2089–2092, 2016.
Rao, J., X. Niu, and J. Lin, "Compressing and Decoding Term Statistics Time Series.", ECIR, pp. 675–681, 2016.
Milligan, I., N. Ruest, and J. Lin, "Content Selection and Curation for Web Archiving: The Gatekeepers vs. the Masses.", JCDL, pp. 107–110, 2016.
Cafarella, M., I. Ilyas, M. Kornacker, T. Kraska, and C. Ré, "Dark Data: Are we solving the right problems?", ICDE, pp. 1444–1445, 2016.
Chu, X., I. Ilyas, S. Krishnan, and J. Wang, "Data Cleaning: Overview and Emerging Challenges.", SIGMOD Conference, pp. 2201–2206, 2016.
Abedjan, Z., L. Golab, and F. Naumann, "Data profiling.", ICDE, pp. 1432–1435, 2016.
Abedjan, Z., J. Morcos, I. Ilyas, M. Ouzzani, P. Papotti, and M. Stonebraker, "DataXFormer: A robust transformation discovery system.", ICDE, pp. 1134–1145, 2016.
Jackson, A., J. Lin, I. Milligan, and N. Ruest, "Desiderata for Exploratory Search Interfaces to Web Archives in Support of Scholarly Activities.", JCDL, pp. 103–106, 2016.
Buntain, C., J. Lin, and J. Golbeck, "Discovering key moments in social media streams.", CCNC, pp. 366–374, 2016.
Culpepper, J., C. Clarke, and J. Lin, "Dynamic Cutoff Prediction in Multi-Stage Retrieval Systems.", ADCS, pp. 17–24, 2016.
Kargar, M., L. Golab, and J. Szlichta, "eGraphSearch: Effective Keyword Search in Graphs.", CIKM, pp. 2461–2464, 2016.
Cormack, G., and M. Grossman, "Engineering Quality and Reliability in Technology-Assisted Review.", SIGIR, pp. 75–84, 2016.
Bommannavar, P., J. Lin, and A. Rajaraman, "Estimating topical volume in social media streams.", SAC, pp. 1096–1101, 2016.
Lamb, C., D. Brown, and C. Clarke, "Evaluating digital poetry: Insights from the CAT.", ICCC, pp. 60–67, 2016.
Oard, D., K. Shilton, and J. Lin, "Evaluating Search Among Secrets.", EVIA@NTCIR, 2016.
Milligan, I., J. Lin, J. Wiebe, and A. Zhou, "Exploring and Discovering Archive-It Collections with Warcbase.", DH, pp. 285–288, 2016.
Roegiest, A., and G. Cormack, "Impact of Review-Set Selection on Human Assessment for Text Classification.", SIGIR, pp. 861–864, 2016.
Trotman, A., and J. Lin, "In Vacuo and In Situ Evaluation of SIMD Codecs.", ADCS, pp. 1–8, 2016.
Farid, M., I. Ilyas, S. Whang, and C. Yu, "LONLIES: Estimating Property Values for Long Tail Entities.", SIGIR, pp. 1125–1128, 2016.
Smucker, M., and C. Clarke, "Modeling Optimal Switching Behavior.", CHIIR, pp. 317–320, 2016.
Zanibbi, R., K. Davila, A. Kane, and F. Tompa, "Multi-Stage Math Formula Search: Using Appearance-Based Similarity Metrics at Scale.", SIGIR, pp. 145–154, 2016.
Rao, J., H. He, and J. Lin, "Noise-Contrastive Estimation for Answer Selection with Deep Neural Networks.", CIKM, pp. 1913–1916, 2016.
Mior, M., K. Salem, A. Aboulnaga, and R. Liu, "NoSE: Schema design for NoSQL applications.", ICDE, pp. 181–192, 2016.
St. Jacques, J., D. Toman, and G. Weddell, "Object-Relational Queries over CFDI_nc Knowledge Bases: OBDA for the SQL-Literate (extended abstract).", Description Logics, 2016.
Jiang, Y., and L. Golab, "On Competition for Undergraduate Co-op Placements: A Graph Mining Approach.", EDM, pp. 394–399, 2016.
Toman, D., and G. Weddell, "On Partial Features in the DLF Family of Description Logics.", PRICAI, pp. 529–542, 2016.
Borgida, A., D. Toman, and G. Weddell, "On Referring Expressions in Information Systems Derived from Conceptual Modelling.", ER, pp. 183–197, 2016.
Borgida, A., D. Toman, and G. Weddell, "On Referring Expressions in Query Answering over First Order Knowledge Bases.", KR, pp. 319–328, 2016.
Toman, D., and G. Weddell, "Ontology Based Data Access with Referring Expressions for Logics with the Tree Model Property - (Extended Abstract).", Australasian Conference on Artificial Intelligence, pp. 353–361, 2016.
Baruah, G., H. Zhang, R. Guttikonda, J. Lin, M. Smucker, and O. Vechtomova, "Optimizing Nugget Annotations with Active Learning.", CIKM, pp. 2359–2364, 2016.
Bonenfant, M., B. Desai, D. Desai, B. Fung, M. Özsu, and J. Ullman, "Panel: The State of Data: Invited Paper from panelists.", IDEAS, pp. 2–11, 2016.
Yilmaz, E., and C. Clarke, "Preface.", EVIA@NTCIR, 2016.
Yang, G., I. Soboroff, L. Xiong, C. Clarke, and S. Garfinkel, "Privacy-Preserving IR 2016: Differential Privacy, Search, and Social Media.", SIGIR, pp. 1247–1248, 2016.
Lin, J., Z. Tu, M. Rose, and P. White, "Prizm: A Wireless Access Point for Proxy-Based Web Lifelogging.", LTA@MM, pp. 19–25, 2016.
Han, M., and K. Daudjee, "Providing Serializability for Pregel-like Graph Processing Systems.", EDBT, pp. 77–88, 2016.
Gebhard, L., L. Golab, S. Keshav, and H. de Meer, "Range prediction for electric bicycles.", e-Energy, pp. 21:1–21:11, 2016.
Elbagoury, A., M. Crane, and J. Lin, "Rank-at-a-Time Query Processing.", ICTIR, pp. 229–232, 2016.
Paik, J., and J. Lin, "Retrievability in API-Based “Evaluation as a Service”.", ICTIR, pp. 91–94, 2016.
Zhang, H., J. Lin, G. Cormack, and M. Smucker, "Sampling Strategies and Active Learning for Volume Estimation.", SIGIR, pp. 981–984, 2016.
Cormack, G., and M. Grossman, "Scalability of Continuous Active Learning for Reliable High-Recall Text Classification.", CIKM, pp. 1039–1048, 2016.
Murdock, V., C. Clarke, J. Kamps, and J. Karlgren, "Second Workshop on Search and Exploration of X-Rated Information (SEXI’16): WSDM Workshop Summary.", WSDM, pp. 697–698, 2016.
Moschitti, A., L. Màrquez, P. Nakov, E. Agichtein, C. Clarke, and I. Szpektor, "SIGIR 2016 Workshop WebQA II: Web Question Answering Beyond Factoids.", SIGIR, pp. 1251–1252, 2016.
Tan, L., A. Roegiest, C. Clarke, and J. Lin, "Simple Dynamic Emission Strategies for Microblog Filtering.", SIGIR, pp. 1009–1012, 2016.
Davila, K., R. Zanibbi, A. Kane, and F. Tompa, "Tangent-3 at the NTCIR-12 MathIR Task.", NTCIR, 2016.
Rao, J., and J. Lin, "Temporal Query Expansion Using a Continuous Hidden Markov Model.", ICTIR, pp. 295–298, 2016.
Clarke, C., G. Cormack, J. Lin, and A. Roegiest, "Total Recall: Blue Sky on Mars.", ICTIR, pp. 45–48, 2016.
Lin, J., M. Crane, A. Trotman, J. Callan, I. Chattopadhyaya, J. Foley, G. Ingersoll, C. MacDonald, and S. Vigna, "Toward Reproducible Baselines: The Open-Source IR Reproducibility Challenge.", ECIR, pp. 408–420, 2016.
Grossman, M., G. Cormack, and A. Roegiest, "TREC 2016 Total Recall Track Overview.", TREC, 2016.
He, H., J. Wieting, K. Gimpel, J. Rao, and J. Lin, "UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement.", SemEval@NAACL-HLT, pp. 1103–1108, 2016.
Radhakrishnan, S., B. Muscedere, and K. Daudjee, "V-Hadoop: Virtualized Hadoop using containers.", NCA, pp. 237–241, 2016.
Hartig, O., and M. Özsu, "Walking Without a Map: Ranking-Based Traversal for Querying Linked Data.", International Semantic Web Conference (1), pp. 305–324, 2016.
Yan, D., J. Cheng, M. Özsu, F. Yang, Y. Lu, J. Lui, Q. Zhang, and W. Ng, "A General-Purpose Query-Centric Framework for Querying Big Graphs.", PVLDB, vol. 9, no. 7, pp. 564–575, 2016.
Özsu, M., "A survey of RDF data management systems.", Frontiers Comput. Sci., vol. 10, no. 3, pp. 418–432, 2016.
Özsu, M., "A Survey of RDF Data Management Systems.", CoRR, vol. abs/1601.00707, 2016.
Gebaly, K., and J. Lin, "Afterburner: The Case for In-Browser Analytics.", CoRR, vol. abs/1605.04035, 2016.
Clarke, C., J. Culpepper, and A. Moffat, "Assessing efficiency-effectiveness tradeoffs in multi-stage retrieval systems without using relevance judgments.", Inf. Retr. Journal, vol. 19, no. 4, pp. 351–377, 2016.
Zihayat, M., A. An, L. Golab, M. Kargar, and J. Szlichta, "Authority-based Team Discovery in Social Networks.", CoRR, vol. abs/1611.02992, 2016.
Jiang, Y., S. Syed, and L. Golab, "Data Mining of Undergraduate Course Evaluations.", Informatics in Education, vol. 15, no. 1, pp. 85–102, 2016.
Bär, A., P. Casas, A. D’Alconzo, P. Fiadino, L. Golab, M. Mellia, and E. Schikuta, "DBStream: A holistic approach to large-scale network traffic monitoring and analysis.", Computer Networks, vol. 107, pp. 5–19, 2016.
Abedjan, Z., X. Chu, D. Deng, R. Fernandez, I. Ilyas, M. Ouzzani, P. Papotti, M. Stonebraker, and N. Tang, "Detecting Data Errors: Where are we and what needs to be done?", PVLDB, vol. 9, no. 12, pp. 993–1004, 2016.
Chu, X., I. Ilyas, and P. Koutris, "Distributed Data Deduplication.", PVLDB, vol. 9, no. 11, pp. 864–875, 2016.
Culpepper, J., C. Clarke, and J. Lin, "Dynamic Trade-Off Prediction in Multi-Stage Retrieval Systems.", CoRR, vol. abs/1610.02502, 2016.
Bizer, C., L. Dong, I. Ilyas, and M-E. Vidal, "Editorial: Special Issue on Web Data Quality.", J. Data and Information Quality, vol. 8, no. 1, pp. 1:1–1:3, 2016.
Szlichta, J., P. Godfrey, L. Golab, M. Kargar, and D. Srivastava, "Effective and Complete Discovery of Order Dependencies via Set-based Axiomatization.", CoRR, vol. abs/1608.06169, 2016.
Ilyas, I., "Effective Data Cleaning with Continuous Evaluation.", IEEE Data Eng. Bull., vol. 39, no. 2, pp. 38–46, 2016.
Clarke, C., and E. Yilmaz, "EVIA 2016: The Seventh International Workshop on Evaluating Information Access.", SIGIR Forum, vol. 50, no. 2, pp. 44–46, 2016.
Boncz, P., and K. Salem, "Front Matter.", PVLDB, vol. 10, no. 1, pp. i–vi, 2016.
Sharma, A., J. Jiang, P. Bommannavar, B. Larson, and J. Lin, "GraphJet: Real-Time Content Recommendations at Twitter.", PVLDB, vol. 9, no. 13, pp. 1281–1292, 2016.
Khabsa, M., A. Elmagarmid, I. Ilyas, H. Hammady, and M. Ouzzani, "Learning to identify relevant studies for systematic reviews using random forest and external information.", Machine Learning, vol. 102, no. 3, pp. 465–482, 2016.
Quamar, A., A. Deshpande, and J. Lin, "NScale: neighborhood-centric large-scale graph analytics in the cloud.", VLDB J., vol. 25, no. 2, pp. 125–150, 2016.
Drzadzewski, G., and F. Tompa, "Partial materialization for online analytical processing over multi-tagged document collections.", Knowl. Inf. Syst., vol. 47, no. 3, pp. 697–732, 2016.
Peng, P., L. Zou, M. Özsu, L. Chen, and D. Zhao, "Processing SPARQL queries over distributed RDF graphs.", VLDB J., vol. 25, no. 2, pp. 243–268, 2016.
Chu, X., and I. Ilyas, "Qualitative Data Cleaning.", PVLDB, vol. 9, no. 13, pp. 1605–1608, 2016.
Yan, D., J. Cheng, M. Özsu, F. Yang, Y. Lu, J. Lui, Q. Zhang, and W. Ng, "Quegel: A General-Purpose Query-Centric Framework for Querying Big Graphs.", CoRR, vol. abs/1601.06497, 2016.
El-Roby, A., K. Ammar, A. Aboulnaga, and J. Lin, "Sapphire: Querying RDF Data Made Simple.", PVLDB, vol. 9, no. 13, pp. 1481–1484, 2016.
Lin, J., C. Clarke, and G. Baruah, "Searching from Mars.", IEEE Internet Computing, vol. 20, no. 1, pp. 78–82, 2016.
Clarke, C., G. Cormack, J. Lin, and A. Roegiest, "Ten Blue Links on Mars.", CoRR, vol. abs/1610.06468, 2016.
Tan, L., J. Lin, A. Roegiest, and C. Clarke, "The Effects of Latency Penalties in Evaluating Push Notification Systems.", CoRR, vol. abs/1606.03066, 2016.
Lin, J., and K. Gebaly, "The Future of Big Data Is ... JavaScript?", IEEE Internet Computing, vol. 20, no. 5, pp. 82–88, 2016.

2015

Shen, X., L. Zou, M. Özsu, L. Chen, Y. Li, S. Han, and D. Zhao, "A graph-based RDF triple store.", ICDE, pp. 1508–1511, 2015.
Wu, J., T. Kinash, D. Toman, and G. Weddell, "Absorption for ABoxes and TBoxes with General Value Restrictions.", Australasian Conference on Artificial Intelligence, pp. 609–622, 2015.
Lin, J., and A. Trotman, "Anytime Ranking for Impact-Ordered Indexes.", ICTIR, pp. 301–304, 2015.
Wang, Y., G. Sherman, J. Lin, and M. Efron, "Assessor Differences and User Preferences in Tweet Timeline Generation.", SIGIR, pp. 615–624, 2015.
Liu, X., L. Golab, W. Golab, and I. Ilyas, "Benchmarking Smart Meter Data Analytics.", EDBT, pp. 385–396, 2015.
Khayyat, Z., I. Ilyas, A. Jindal, S. Madden, M. Ouzzani, P. Papotti, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "BigDansing: A System for Big Data Cleansing.", SIGMOD Conference, pp. 1215–1230, 2015.
Lin, J., "Building a Self-Contained Search Engine in the Browser.", ICTIR, pp. 309–312, 2015.
Bär, A., L. Golab, S. Ruehrup, M. Schiavone, and P. Casas, "Cache-oblivious scheduling of shared workloads.", ICDE, pp. 855–866, 2015.
Kiseleva, J., J. Kamps, and C. Clarke, "Contextual Search and Exploration.", RuSSIR, pp. 3–23, 2015.
Kim, J., K. Salem, K. Daudjee, A. Aboulnaga, and X. Pan, "Database high availability using SHADOW systems.", SoCC, pp. 209–221, 2015.
Morcos, J., Z. Abedjan, I. Ilyas, M. Ouzzani, P. Papotti, and M. Stonebraker, "DataXFormer: An Interactive Data Transformation Tool.", SIGMOD Conference, pp. 883–888, 2015.
Abedjan, Z., J. Morcos, M. Gubanov, I. Ilyas, M. Stonebraker, P. Papotti, and M. Ouzzani, "DataXFormer: Leveraging the Web for Semantic Transformations.", CIDR, 2015.
Saxena, H., and K. Salem, "EdgeX: Edge Replication for Web Applications.", CLOUD, pp. 1041–1044, 2015.
Drzadzewski, G., and F. Tompa, "Enhancing Exploration with a Faceted Browser through Summarization.", DocEng, pp. 61–64, 2015.
Baruah, G., M. Smucker, and C. Clarke, "Evaluating Streams of Evolving News Events.", SIGIR, pp. 675–684, 2015.
Aluç, G., M. Özsu, K. Daudjee, and O. Hartig, "Executing queries over schemaless RDF databases.", ICDE, pp. 807–818, 2015.
Bislimovska, B., G. Aluç, M. Özsu, and P. Fraternali, "Graph Search of Software Models Using Multidimensional Scaling.", EDBT/ICDT Workshops, pp. 163–170, 2015.
Petroni, F., L. Querzoni, K. Daudjee, S. Kamali, and G. Iacoboni, "HDRF: Stream-Based Partitioning for Power-Law Graphs.", CIKM, pp. 243–252, 2015.
Nicoara, D., S. Kamali, K. Daudjee, and L. Chen, "Hermes: Dynamic Partitioning for Distributed Social Network Graph Databases.", EDBT, pp. 25–36, 2015.
Lamb, C., D. Brown, and C. Clarke, "Human Competence in Creativity Evaluation.", ICCC, pp. 102–109, 2015.
Weissman, S., S. Ayhan, J. Bradley, and J. Lin, "Identifying Duplicate and Contradictory Information in Wikipedia.", JCDL, pp. 57–60, 2015.
Roegiest, A., G. Cormack, C. Clarke, and M. Grossman, "Impact of Surrogate Assessments on High-Recall Retrieval.", SIGIR, pp. 555–564, 2015.
Ge, C., M. Kaufmann, L. Golab, P. Fischer, and A. Goel, "Indexing bi-temporal windows.", SSDBM, pp. 19:1–19:12, 2015.
Clarke, C., M. Smucker, and E. Yilmaz, "IR Evaluation: Modeling User Behavior for Measuring Effectiveness.", SIGIR, pp. 1117–1120, 2015.
Chu, X., J. Morcos, I. Ilyas, M. Ouzzani, P. Papotti, N. Tang, and Y. Ye, "KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing.", SIGMOD Conference, pp. 1247–1261, 2015.
Tan, L., H. Zhang, C. Clarke, and M. Smucker, "Lexical Comparison Between Wikipedia and Twitter Corpora by Using Word Embeddings.", ACL (2), pp. 657–661, 2015.
Cormack, G., and M. Grossman, "Multi-Faceted Recall of Continuous Active Learning for Technology-Assisted Review.", SIGIR, pp. 763–766, 2015.
He, H., K. Gimpel, and J. Lin, "Multi-Perspective Sentence Similarity Modeling with Convolutional Neural Networks.", EMNLP, pp. 1576–1586, 2015.
Hudek, A., D. Toman, and G. Weddell, "On Enumerating Query Plans Using Analytic Tableau.", TABLEAUX, pp. 339–354, 2015.
Toman, D., and G. Weddell, "On the Krom Extension of CFDI^∀-_nc.", Australasian Conference on Artificial Intelligence, pp. 559–571, 2015.
Hashemi, S., C. Clarke, A. Dean-Hall, J. Kamps, and J. Kiseleva, "On the Reusability of Open Test Collections.", SIGIR, pp. 827–830, 2015.
Toman, D., and G. Weddell, "On the Utility of CFDI.", Description Logics, 2015.
Dean-Hall, A., C. Clarke, J. Kamps, and J. Kiseleva, "Online Evaluation of Point-Of-Interest Recommendation Systems.", SCST@ECIR, 2015.
Dean-Hall, A., C. Clarke, J. Kamps, J. Kiseleva, and E. Voorhees, "Overview of the TREC 2015 Contextual Suggestion Track.", TREC, 2015.
Fillottrani, P., C. Keet, and D. Toman, "Polynomial encoding of ORM conceptual models in CFDI.", Description Logics, 2015.
Baruah, G., A. Roegiest, and M. Smucker, "Pooling for User-Oriented Evaluation Measures.", ICTIR, pp. 341–344, 2015.
Rao, J., J. Lin, and M. Efron, "Reproducible Experiments on Lexical and Temporal Feedback for Tweet Search.", ECIR, pp. 755–767, 2015.
Lin, J., "Scaling Down Distributed Infrastructure on Wimpy Machines for Personal Web Archiving.", WWW (Companion Volume), pp. 1351–1355, 2015.
Arguello, J., F. Diaz, J. Lin, and A. Trotman, "SIGIR 2015 Workshop on Reproducibility, Inexplicability, and Generalizability of Results (RIGOR).", SIGIR, pp. 1147–1148, 2015.
Borgida, A., D. Toman, and G. Weddell, "Singular Referring Expressions in Conjunctive Query Answers: the case for a CFD DL Dialect.", Description Logics, 2015.
Golab, L., F. Korn, F. Li, B. Saha, and D. Srivastava, "Size-Constrained Weighted Set Cover.", ICDE, pp. 879–890, 2015.
Liu, X., L. Golab, and I. Ilyas, "SMAS: A smart meter data analytics system.", ICDE, pp. 1476–1479, 2015.
Wang, Y., and J. Lin, "The Feasibility of Brute Force Scans for Real-Time Tweet Search.", ICTIR, pp. 321–324, 2015.
Dean-Hall, A., and C. Clarke, "The Power of Contextual Suggestion.", ECIR, pp. 352–357, 2015.
Korkmaz, M., A. Karyakin, M. Karsten, and K. Salem, "Towards Dynamic Green-Sizing for Database Servers.", ADMS@VLDB, pp. 25–36, 2015.
Tan, L., A. Roegiest, and C. Clarke, "University of Waterloo at TREC 2015 Microblog Track.", TREC, 2015.
Ghenai, A., E. Khalilov, P. Valov, and C. Clarke, "WaterlooClarke: TREC 2015 Clinical Decision Support Track.", TREC, 2015.
Hoffmann, H., P. Addala, and C. Clarke, "WaterlooClarke: TREC 2015 Contextual Suggestion Track.", TREC, 2015.
Vtyurina, A., A. Dey, B. Sarrafzadeh, and C. Clarke, "WaterlooClarke: TREC 2015 LiveQA Track.", TREC, 2015.
Abualsaud, M., M. Ghaznavi, D. Recoskie, and C. Clarke, "WaterlooClarke: TREC 2015 Microblog Track.", TREC, 2015.
Raza, A., D. Rotondo, and C. Clarke, "WaterlooClarke: TREC 2015 Temporal Summarization Track.", TREC, 2015.
Zhang, H., W. Lin, Y. Wang, C. Clarke, and M. Smucker, "WaterlooClarke: TREC 2015 Total Recall Track.", TREC, 2015.
Agichtein, E., D. Carmel, C. Clarke, P. Paritosh, D. Pelleg, and I. Szpektor, "Web Question Answering: Beyond Factoids: SIGIR 2015 Workshop.", SIGIR, p. 1143, 2015.
Gao, P., L. Golab, and S. Keshav, "What’s Wrong with my Solar Panels: a Data-Driven Approach.", EDBT/ICDT Workshops, pp. 86–93, 2015.
Kim, J., K. Salem, and K. Daudjee, "Write Amplification: An Analysis of In-Memory Database Durability Techniques.", IMDM@VLDB, pp. 1:1–1:7, 2015.
Tan, L., and C. Clarke, "A Family of Rank Similarity Measures Based on Maximized Effectiveness Difference.", IEEE Trans. Knowl. Data Eng., vol. 27, no. 11, pp. 2865–2877, 2015.
Chowdhury, S., A. Roy, M. Shaikh, and K. Daudjee, "A taxonomy of decentralized online social networks.", Peer-to-Peer Networking and Applications, vol. 8, no. 3, pp. 367–383, 2015.
Agrawal, D., A. Abbadi, and K. Salem, "A Taxonomy of Partitioned Replicated Cloud-based Database Systems.", IEEE Data Eng. Bull., vol. 38, no. 1, pp. 4–9, 2015.
Clarke, C., J. Culpepper, and A. Moffat, "Assessing Efficiency-Effectiveness Tradeoffs in Multi-Stage Retrieval Systems Without Using Relevance Judgments.", CoRR, vol. abs/1506.00717, 2015.
Cormack, G., and M. Grossman, "Autonomy and Reliability of Continuous Active Learning for Technology-Assisted Review.", CoRR, vol. abs/1504.06868, 2015.
Aluç, G., M. Özsu, and K. Daudjee, "Clustering RDF Databases Using Tunable-LSH.", CoRR, vol. abs/1504.02523, 2015.
Kargar, M., L. Golab, and J. Szlichta, "Effective Keyword Search in Graphs.", CoRR, vol. abs/1512.06395, 2015.
Hanbury, A., H. Müller, K. Balog, T. Brodt, G. Cormack, I. Eggel, T. Gollub, F. Hopfgartner, J. Kalpathy-Cramer, N. Kando, et al., "Evaluation-as-a-Service: Overview and Outlook.", CoRR, vol. abs/1512.07454, 2015.
He, H., J. Lin, and A. Lopez, "Gappy Pattern Matching on GPUs for On-Demand Extraction of Hierarchical Translation Grammars.", TACL, vol. 3, pp. 87–100, 2015.
Han, M., and K. Daudjee, "Giraph Unchained: Barrierless Asynchronous Parallel Execution in Pregel-like Graph Processing Systems.", PVLDB, vol. 8, no. 9, pp. 950–961, 2015.
Lin, J., "Is Big Data a Transient Problem?", IEEE Internet Computing, vol. 19, no. 5, pp. 86–90, 2015.
Chu, X., M. Ouzzani, J. Morcos, I. Ilyas, P. Papotti, N. Tang, and Y. Ye, "KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing.", PVLDB, vol. 8, no. 12, pp. 1952–1955, 2015.
Buntain, C., J. Lin, and J. Golbeck, "Learning to Discover Key Moments in Social Media Streams.", CoRR, vol. abs/1508.00488, 2015.
Balkesen, C., J. Teubner, G. Alonso, and M. Özsu, "Main-Memory Hash Joins on Modern Processor Architectures.", IEEE Trans. Knowl. Data Eng., vol. 27, no. 7, pp. 1754–1766, 2015.
Abu-Khzam, F., K. Daudjee, A. Mouawad, and N. Nishimura, "On scalable parallel recursive backtracking.", J. Parallel Distrib. Comput., vol. 84, pp. 65–75, 2015.
Abedjan, Z., L. Golab, and F. Naumann, "Profiling relational data: a survey.", VLDB J., vol. 24, no. 4, pp. 557–581, 2015.
Hopfgartner, F., A. Hanbury, H. Müller, N. Kando, S. Mercer, J. Kalpathy-Cramer, M. Potthast, T. Gollub, A. Krithara, J. Lin, et al., "Report on the Evaluation-as-a-Service (EaaS) Expert Workshop.", SIGIR Forum, vol. 49, no. 1, pp. 57–65, 2015.
Arguello, J., M. Crane, F. Diaz, J. Lin, and A. Trotman, "Report on the SIGIR 2015 Workshop on Reproducibility, Inexplicability, and Generalizability of Results (RIGOR).", SIGIR Forum, vol. 49, no. 2, pp. 107–116, 2015.
Calvanese, D., M. Koubarakis, and D. Toman, "Special issue of the Journal of Web Semantics on ontology-based data access.", J. Web Semant., vol. 33, pp. 1–2, 2015.
Zanibbi, R., K. Davila, A. Kane, and F. Tompa, "The Tangent Search Engine: Improved Similarity Metrics and Scalability for Math Formula Search.", CoRR, vol. abs/1507.06235, 2015.
Ilyas, I., and X. Chu, "Trends in Cleaning Relational Data: Consistency and Deduplication.", Foundations and Trends in Databases, vol. 5, no. 4, pp. 281–393, 2015.

2014

Toman, D., and G. Weddell, "", PRICAI, pp. 587–599, 2014.
Al-Harbi, A., and M. Smucker, "A qualitative exploration of secondary assessor relevance judging behavior.", IIiX, pp. 195–204, 2014.
Dean-Hall, A., and C. Clarke, "Assessing Contextual Suggestion.", EVIA@NTCIR, 2014.
Mühleisen, H., T. Samar, J. Lin, and A. de Vries, "Column Stores as an IR Prototyping Tool.", ECIR, pp. 789–792, 2014.
Ardakanian, O., N. Koochakzadeh, R. Singh, L. Golab, and S. Keshav, "Computing Electricity Consumption Profiles from Household Smart Meter Data.", EDBT/ICDT Workshops, pp. 140–147, 2014.
Robinson, N., S. McIlraith, and D. Toman, "Cost-Based Query Optimization via AI Planning.", AAAI, pp. 2344–2351, 2014.
Gebremeskel, G., J. He, A. de Vries, and J. Lin, "Cumulative Citation Recommendation: A Feature-Aware Comparison of Approaches.", DEXA Workshops, pp. 193–197, 2014.
Syed, S., Y. Jiang, and L. Golab, "Data mining of undergraduate course evaluations.", EDM, pp. 347–348, 2014.
Golab, L., and T. Johnson, "Data stream warehousing.", ICDE, pp. 1290–1293, 2014.
Bär, A., P. Casas, L. Golab, and A. Finamore, "DBStream: An online aggregation, filtering and processing system for network traffic monitoring.", IWCMC, pp. 611–616, 2014.
Chalamalla, A., I. Ilyas, M. Ouzzani, and P. Papotti, "Descriptive and prescriptive data cleaning.", SIGMOD Conference, pp. 445–456, 2014.
Golab, L., M. Hadjieleftheriou, H. Karloff, and B. Saha, "Distributed data placement to minimize communication costs via graph partitioning.", SSDBM, pp. 20:1–20:12, 2014.
Aluç, G., O. Hartig, M. Özsu, and K. Daudjee, "Diversified Stress Testing of RDF Data Management Systems.", International Semantic Web Conference (1), pp. 197–212, 2014.
Said, A., A. Bellogín, J. Lin, and A. de Vries, "Do recommendations matter?: news recommendation in real life.", CSCW Companion, pp. 237–240, 2014.
Wu, G., and F. Tompa, "Effective and Efficient Bitmaps for Access Control.", DCC, p. 433, 2014.
Albakour, M-D., C. MacDonald, I. Ounis, C. Clarke, and V. Bicer, "Information Access in Smart Cities (i-ASC).", ECIR, pp. 810–814, 2014.
Myers, S., A. Sharma, P. Gupta, and J. Lin, "Information network or social network?: the structure of the Twitter follow graph.", WWW (Companion Volume), pp. 493–498, 2014.
Lin, J., M. Gholami, and J. Rao, "Infrastructure for supporting exploration and discovery in web archives.", WWW (Companion Volume), pp. 851–856, 2014.
Lin, J., and M. Efron, "Infrastructure support for evaluation as a service.", WWW (Companion Volume), pp. 79–82, 2014.
Carpenter, T., L. Golab, and S. Syed, "Is the grass greener?: mining electric vehicle opinions.", e-Energy, pp. 241–252, 2014.
Bär, A., A. Finamore, P. Casas, L. Golab, and M. Mellia, "Large-scale network traffic monitoring with DBStream, a system for rolling big data analysis.", BigData, pp. 165–170, 2014.
Avram, C-A., K. Salem, and B. Wong, "Latency Amplification: Characterizing the Impact of Web Page Content on Load Times.", SRDS Workshops, pp. 20–25, 2014.
Wang, L., J. Lin, D. Metzler, and J. Han, "Learning to efficiently rank on big data.", WWW (Companion Volume), pp. 209–210, 2014.
Hartig, O., and M. Özsu, "Linked Data query processing.", ICDE, pp. 1286–1289, 2014.
Singh, A., X. Cui, B. Cassell, B. Wong, and K. Daudjee, "MicroFuge: A Middleware Approach to Providing Performance Isolation in Cloud Storage Systems.", ICDCS, pp. 503–513, 2014.
Smucker, M., X. Guo, and A. Toulis, "Mouse movement during relevance judging: implications for determining user attention.", SIGIR, pp. 979–982, 2014.
Elmagarmid, A., I. Ilyas, M. Ouzzani, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "NADEEF/ER: generic and interactive entity resolution.", SIGMOD Conference, pp. 1071–1074, 2014.
Mühleisen, H., T. Samar, J. Lin, and A. de Vries, "Old dogs are great at new tricks: column stores for IR prototyping.", SIGIR, pp. 863–866, 2014.
Voorhees, E., J. Lin, and M. Efron, "On run diversity in Evaluation as a Service.", SIGIR, pp. 959–962, 2014.
Daudjee, K., S. Kamali, and A. López-Ortiz, "On the online fault-tolerant server consolidation problem.", SPAA, pp. 12–21, 2014.
Kumar, K., J. Gluck, A. Deshpande, and J. Lin, "Optimization Techniques for “Scaling Down” Hadoop on Multi-Core, Shared-Memory Systems.", EDBT, pp. 13–24, 2014.
Dean-Hall, A., C. Clarke, J. Kamps, P. Thomas, and E. Voorhees, "Overview of the TREC 2014 Contextual Suggestion Track.", TREC, 2014.
Rao, J., J. Lin, and H. Samet, "Partitioning strategies for spatio-textual similarity join.", BigSpatial@SIGSPATIAL, pp. 40–49, 2014.
Jiang, Y., R. Levman, L. Golab, and J. Nathwani, "Predicting peak-demand days in the Ontario peak reduction program for large consumers.", e-Energy, pp. 221–222, 2014.
Toman, D., and G. Weddell, "Pushing the CFDnc Envelope.", Description Logics, pp. 340–351, 2014.
Li, F., M. Özsu, G. Chen, and B. Ooi, "R-Store: A scalable distributed system for supporting real-time analytics.", ICDE, pp. 40–51, 2014.
Hartig, O., and M. Özsu, "Reachable subwebs for traversal-based query execution.", WWW (Companion Volume), pp. 541–546, 2014.
Chu, X., I. Ilyas, P. Papotti, and Y. Ye, "RuleMiner: Data quality rules discovery.", ICDE, pp. 1222–1225, 2014.
Kane, A., and F. Tompa, "Skewed partial bitvectors for list intersection.", SIGIR, pp. 263–272, 2014.
Tan, L., and C. Clarke, "Succinct Queries for Linking and Tracking News in Social Media.", CIKM, pp. 1883–1886, 2014.
Lin, J., K. Kraus, and R. Punzalan, "Supporting “Distant Reading” for Web Archives.", DH, 2014.
Efron, M., J. Lin, J. He, and A. de Vries, "Temporal feedback for tweet search with non-parametric density estimation.", SIGIR, pp. 33–42, 2014.
Baruah, G., A. Roegiest, and M. Smucker, "The effect of expanding relevance judgements with duplicates.", SIGIR, pp. 1159–1162, 2014.
Wang, Y., and J. Lin, "The Impact of Future Term Statistics in Real-Time Tweet Search.", ECIR, pp. 567–572, 2014.
Clarke, C., and M. Smucker, "Time well spent.", IIiX, pp. 205–214, 2014.
Li, L., and M. Smucker, "Tolerance of Effectiveness Measures to Relevance Judging Errors.", ECIR, pp. 148–159, 2014.
Xu, Z., D. Goldwasser, B. Bederson, and J. Lin, "Visual analytics of MOOCs at Maryland.", L@S, pp. 195–196, 2014.
Tan, L., and C. Clarke, "A Family of Rank Similarity Measures based on Maximized Effectiveness Difference.", CoRR, vol. abs/1408.3587, 2014.
Wu, J., A. Hudek, D. Toman, and G. Weddell, "Absorption for ABoxes.", J. Autom. Reasoning, vol. 53, no. 3, pp. 215–243, 2014.
Serafini, M., E. Mansour, A. Aboulnaga, K. Salem, T. Rafiq, and U. Minhas, "Accordion: Elastic Scalability for Database Systems Supporting Distributed Transactions.", PVLDB, vol. 7, no. 12, pp. 1035–1046, 2014.
Han, M., K. Daudjee, K. Ammar, M. Özsu, X. Wang, and T. Jin, "An Experimental Comparison of Pregel-like Graph Processing Systems.", PVLDB, vol. 7, no. 12, pp. 1047–1058, 2014.
Chairunnanda, P., K. Daudjee, and M. Özsu, "ConfluxDB: Multi-Master Replication for Partitioned Snapshot Isolation Databases.", PVLDB, vol. 7, no. 11, pp. 947–958, 2014.
Golab, L., H. Karloff, F. Korn, B. Saha, and D. Srivastava, "Discovering Conservation Rules.", IEEE Trans. Knowl. Data Eng., vol. 26, no. 6, pp. 1332–1348, 2014.
Li, F., B. Ooi, M. Özsu, and S. Wu, "Distributed data management using MapReduce.", ACM Comput. Surv., vol. 46, no. 3, pp. 31:1–31:42, 2014.
Türe, F., and J. Lin, "Exploiting Representations from Statistical Machine Translation for Cross-Language Information Retrieval.", ACM Trans. Inf. Syst., vol. 32, no. 4, pp. 19:1–19:32, 2014.
Zou, L., M. Özsu, L. Chen, X. Shen, R. Huang, and D. Zhao, "gStore: a graph-based SPARQL query engine.", VLDB J., vol. 23, no. 4, pp. 565–590, 2014.
Weissman, S., S. Ayhan, J. Bradley, and J. Lin, "Identifying Duplicate and Contradictory Information in Wikipedia.", CoRR, vol. abs/1406.1143, 2014.
Liu, X., and K. Salem, "Integrating SSD Caching into Database Systems.", IEEE Data Eng. Bull., vol. 37, no. 2, pp. 35–43, 2014.
Gebaly, K., P. Agrawal, L. Golab, F. Korn, and D. Srivastava, "Interpretable and Informative Explanations of Outcomes.", PVLDB, vol. 8, no. 1, pp. 61–72, 2014.
Ashkan, A., and C. Clarke, "Location- and Query-Aware Modeling of Browsing and Click Behavior in Sponsored Search.", ACM TIST, vol. 5, no. 4, pp. 59:1–59:31, 2014.
Quamar, A., A. Deshpande, and J. Lin, "NScale: Neighborhood-centric Analytics on Large Graphs.", PVLDB, vol. 7, no. 13, pp. 1673–1676, 2014.
Quamar, A., A. Deshpande, and J. Lin, "NScale: Neighborhood-centric Large-Scale Graph Analytics in the Cloud.", CoRR, vol. abs/1405.1499, 2014.
Peng, P., L. Zou, M. Özsu, L. Chen, and D. Zhao, "Processing SPARQL Queries Over Linked Data-A Distributed Graph-based Approach.", CoRR, vol. abs/1411.6763, 2014.
Gupta, P., V. Satuluri, A. Grewal, S. Gurumurthy, V. Zhabiuk, Q. Li, and J. Lin, "Real-Time Twitter Recommendation: Online Motif Detection in Large Dynamic Graphs.", PVLDB, vol. 7, no. 13, pp. 1379–1380, 2014.
Albakour, M-D., C. MacDonald, I. Ounis, C. Clarke, and V. Bicer, "Report on the 1st International Workshop on Information Access in Smart Cities (i-ASC 2014).", SIGIR Forum, vol. 48, no. 2, pp. 96–104, 2014.
Balog, K., D. Elsweiler, E. Kanoulas, L. Kelly, and M. Smucker, "Report on the CIKM workshop on living labs for information retrieval evaluation.", SIGIR Forum, vol. 48, no. 1, pp. 21–28, 2014.
Asadi, N., J. Lin, and A. de Vries, "Runtime Optimizations for Tree-Based Machine Learning Models.", IEEE Trans. Knowl. Data Eng., vol. 26, no. 9, pp. 2281–2292, 2014.
Beskales, G., I. Ilyas, L. Golab, and A. Galiullin, "Sampling from repairs of conditional functional dependency violations.", VLDB J., vol. 23, no. 1, pp. 103–128, 2014.
Boykin, P., S. Ritchie, I. O’Connell, and J. Lin, "Summingbird: A Framework for Integrating Batch and Online MapReduce Computations.", PVLDB, vol. 7, no. 13, pp. 1441–1451, 2014.
Dallachiesa, M., T. Palpanas, and I. Ilyas, "Top-k Nearest Neighbor Search In Uncertain Data Series.", PVLDB, vol. 8, no. 1, pp. 13–24, 2014.
Toman, D., and G. Weddell, "Undecidability of Finite Model Reasoning in DLFD.", CoRR, vol. abs/1408.4468, 2014.
Aluç, G., M. Özsu, and K. Daudjee, "Workload Matters: Why RDF Databases Need a New Design.", PVLDB, vol. 7, no. 10, pp. 837–840, 2014.
Ilyas, I., "Data unification at scale: data tamer.", Making Databases Work, pp. 269–277, 2014.
"Distributed and Parallel Database Systems.", Computing Handbook, 3rd ed. (2), pp. 13:1–24, 2014.

2013

Toman, D., and G. Weddell, "", Australasian Conference on Artificial Intelligence, pp. 350–361, 2013.
Said, A., J. Lin, A. Bellogín, and A. de Vries, "A month in the life of a production news recommender system.", LivingLab@CIKM, pp. 7–10, 2013.
Wu, J., T. Kinash, D. Toman, and G. Weddell, "Absorption for ABoxes with Local Universal Restrictions.", Description Logics, pp. 489–500, 2013.
Mehdad, Y., G. Carenini, F. Tompa, and R. Ng, "Abstractive Meeting Summarization with Entailment and Fusion.", ENLG, pp. 136–146, 2013.
Balkesen, C., N. Tatbul, and M. Özsu, "Adaptive input admission and management for parallel stream processing.", DEBS, pp. 15–26, 2013.
Deziel, M., D. Olawo, L. Truchon, and L. Golab, "Analyzing the Mental Health of Engineering Students using Classification and Regression.", EDM, pp. 228–231, 2013.
Toman, D., and G. Weddell, "CFDnc: A PTIME Description Logic with Functional Constraints and Disjointness.", Description Logics, pp. 451–463, 2013.
Balog, K., D. Elsweiler, E. Kanoulas, L. Kelly, and M. Smucker, "CIKM 2013 workshop on living labs for information retrieval evaluation.", CIKM, pp. 2557–2558, 2013.
Whissell, J., and C. Clarke, "Classification-Based Clustering Evaluation.", ICDM, pp. 1229–1234, 2013.
Bellogín, A., G. Gebremeskel, J. He, A. Said, T. Samar, A. de Vries, J. Lin, and J. Vuurens, "CWI and TU Delft Notebook TREC 2013: Contextual Suggestion, Federated Web Search, KBA, and Web Tracks.", TREC, 2013.
Stonebraker, M., D. Bruckner, I. Ilyas, G. Beskales, M. Cherniack, S. Zdonik, A. Pagan, and S. Xu, "Data Curation at Scale: The Data Tamer System.", CIDR, 2013.
Lei, B., I. Surya, S. Kamali, and K. Daudjee, "Data Partitioning for Video-on-Demand Services.", NCA, pp. 49–54, 2013.
Golab, L., and T. Johnson, "Data stream warehousing.", SIGMOD Conference, pp. 949–952, 2013.
Asadi, N., J. Lin, and M. Busch, "Dynamic memory allocation policies for postings in real-time Twitter search.", KDD, pp. 1186–1194, 2013.
Whissell, J., and C. Clarke, "Effective measures for inter-document similarity.", CIKM, pp. 1361–1370, 2013.
Dean-Hall, A., C. Clarke, J. Kamps, and P. Thomas, "Evaluating Contextual Suggestion.", EVIA@NTCIR, 2013.
Mishne, G., J. Dalton, Z. Li, A. Sharma, and J. Lin, "Fast data in the era of big data: Twitter’s real-time related query suggestion architecture.", SIGMOD Conference, pp. 1147–1158, 2013.
Konow, R., G. Navarro, C. Clarke, and A. López-Ortiz, "Faster and smaller inverted indices with treaps.", SIGIR, pp. 193–202, 2013.
Chu, X., I. Ilyas, and P. Papotti, "Holistic data cleaning: Putting violations into context.", ICDE, pp. 458–469, 2013.
Balkesen, C., J. Teubner, G. Alonso, and M. Özsu, "Main-memory hash joins on multi-core CPUs: Tuning to the underlying hardware.", ICDE, pp. 362–373, 2013.
Agrawal, D., A. Abbadi, H. Mahmoud, F. Nawab, and K. Salem, "Managing Geo-replicated Data in Multi-datacenters.", DNIS, pp. 23–43, 2013.
Jin, C., R. Liu, and K. Salem, "Materialized views for eventually consistent record stores.", ICDE Workshops, pp. 250–257, 2013.
Eidelman, V., K. Wu, F. Türe, P. Resnik, and J. Lin, "Mr. MIRA: Open-Source Large-Margin Structured Learning on MapReduce.", ACL (Conference System Demonstrations), pp. 199–204, 2013.
Dallachiesa, M., A. Ebaid, A. Eldawy, A. Elmagarmid, I. Ilyas, M. Ouzzani, and N. Tang, "NADEEF: a commodity data cleaning system.", SIGMOD Conference, pp. 541–552, 2013.
Clarke, C., "Nugget-Based Computation of Graded Relevance.", EVIA@NTCIR, 2013.
Beskales, G., I. Ilyas, L. Golab, and A. Galiullin, "On the relative trust between inconsistent data and inaccurate constraints.", ICDE, pp. 541–552, 2013.
Dean-Hall, A., C. Clarke, N. Simone, J. Kamps, P. Thomas, and E. Voorhees, "Overview of the TREC 2013 Contextual Suggestion Track.", TREC, 2013.
Smucker, M., G. Kazai, and M. Lease, "Overview of the TREC 2013 Crowdsourcing Track.", TREC, 2013.
Lin, J., and M. Efron, "Overview of the TREC-2013 Microblog Track.", TREC, 2013.
Northam, L., R. Smits, K. Daudjee, and J. Istead, "Ray tracing in the cloud using MapReduce.", HPCS, pp. 19–26, 2013.
Kamali, S., and F. Tompa, "Retrieving documents with mathematical content.", SIGIR, pp. 353–362, 2013.
Murdock, V., C. Clarke, J. Kamps, and J. Karlgren, "Search and exploration of X-Rated information (SEXI 2013).", WSDM, pp. 795–796, 2013.
Clarke, C., L. Freund, M. Smucker, and E. Yilmaz, "SIGIR 2013 workshop on modeling user behavior for information retrieval evaluation.", SIGIR, pp. 1134, 2013.
Kamali, S., and F. Tompa, "Structural Similarity Search for Mathematics Retrieval.", MKM/Calculemus/DML, pp. 246–262, 2013.
Lutz, C., I. Seylan, D. Toman, and F. Wolter, "The Combined Approach to OBDA: Taming Role Hierarchies Using Filters.", International Semantic Web Conference (1), pp. 314–330, 2013.
Sakai, T., Z. Dou, and C. Clarke, "The impact of intent selection on diversified search evaluation.", SIGIR, pp. 921–924, 2013.
Clarke, C., "Time-Biased Gain.", NTCIR, 2013.
Eidelman, V., K. Wu, F. Türe, P. Resnik, and J. Lin, "Towards Efficient Large-Scale Feature-Rich Statistical Machine Translation.", WMT@ACL, pp. 128–133, 2013.
Asadi, N., and J. Lin, "Training Efficient Tree-Based Models for Document Ranking.", ECIR, pp. 146–157, 2013.
Forsyth, S., and K. Daudjee, "Update Management in Decentralized Social Networks.", ICDCS Workshops, pp. 196–201, 2013.
Rios, M., and J. Lin, "Visualizing the “Pulse” of World Cities on Twitter.", ICWSM, 2013.
DeWitt, D., I. Ilyas, J. Naughton, and M. Stonebraker, "We are drowning in a sea of least publishable units (LPUs).", SIGMOD Conference, pp. 921–922, 2013.
Ammar, K., and M. Özsu, "WGB: Towards a Universal Graph Benchmark.", WBDB, pp. 58–72, 2013.
Gupta, P., A. Goel, J. Lin, A. Sharma, D. Wang, and R. Zadeh, "WTF: the who to follow service at Twitter.", WWW, pp. 505–514, 2013.
Özsu, M., "ACM books to launch.", Commun. ACM, vol. 56, no. 12, pp. 5, 2013.
Abu-Khzam, F., K. Daudjee, A. Mouawad, and N. Nishimura, "An Easy-to-use Scalable Framework for Parallel Recursive Backtracking.", CoRR, vol. abs/1312.7626, 2013.
Liu, R., A. Aboulnaga, and K. Salem, "DAX: A Widely Distributed Multi-tenant Storage Service for DBMS Hosting.", PVLDB, vol. 6, no. 4, pp. 253–264, 2013.
Chu, X., I. Ilyas, and P. Papotti, "Discovering Denial Constraints.", PVLDB, vol. 6, no. 13, pp. 1498–1509, 2013.
Golab, L., M. Hadjieleftheriou, H. Karloff, and B. Saha, "Distributed Data Placement via Graph Partitioning.", CoRR, vol. abs/1312.0285, 2013.
Asadi, N., and J. Lin, "Document vector representations for feature extraction in multi-stage document ranking.", Inf. Retr., vol. 16, no. 6, pp. 747–768, 2013.
Asadi, N., J. Lin, and M. Busch, "Dynamic Memory Allocation Policies for Postings in Real-Time Twitter Search", CoRR, vol. abs/1302.5302, 2013.
Lin, J., and M. Efron, "Evaluation as a service for information retrieval.", SIGIR Forum, vol. 47, no. 2, pp. 8–14, 2013.
Akinyemi, J., and C. Clarke, "Fast and effective soft links.", Softw., Pract. Exper., vol. 43, no. 5, pp. 577–593, 2013.
Asadi, N., and J. Lin, "Fast candidate generation for real-time tweet search with Bloom filter chains.", ACM Trans. Inf. Syst., vol. 31, no. 3, pp. 13, 2013.
Asadi, N., and J. Lin, "Fast, Incremental Inverted Indexing in Main Memory for Web-Scale Collections", CoRR, vol. abs/1305.0699, 2013.
Capra, R., L. Freund, C. Smith, M. Smucker, and R. White, "HCIR 2013: the seventh international symposium on human-computer interaction and information retrieval.", SIGIR Forum, vol. 47, no. 2, pp. 33–40, 2013.
Kumar, K., J. Gluck, A. Deshpande, and J. Lin, "Hone: “Scaling Down” Hadoop on Shared-Memory Systems.", PVLDB, vol. 6, no. 12, pp. 1354–1357, 2013.
Liu, X., and K. Salem, "Hybrid Storage Management for Database Systems.", PVLDB, vol. 6, no. 8, pp. 541–552, 2013.
Ashkan, A., and C. Clarke, "Impact of query intent and search context on clickthrough behavior in sponsored search.", Knowl. Inf. Syst., vol. 34, no. 2, pp. 425–452, 2013.
Golbus, P., J. Aslam, and C. Clarke, "Increasing evaluation sensitivity to diversity.", Inf. Retr., vol. 16, no. 4, pp. 530–555, 2013.
Balkesen, C., G. Alonso, J. Teubner, and M. Özsu, "Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited.", PVLDB, vol. 7, no. 1, pp. 85–96, 2013.
Ebaid, A., A. Elmagarmid, I. Ilyas, M. Ouzzani, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "NADEEF: A Generalized Data Cleaning System.", PVLDB, vol. 6, no. 12, pp. 1218–1221, 2013.
Chen, T., L. Chen, M. Özsu, and N. Xiao, "Optimizing Multi-Top-k Queries over Uncertain Data Streams.", IEEE Trans. Knowl. Data Eng., vol. 25, no. 8, pp. 1814–1829, 2013.
Chen, L., I. Ilyas, C. Ré, and X. Zhou, "Probabilistic Web Data Management.", World Wide Web, vol. 16, no. 3, pp. 271–272, 2013.
Minhas, U., S. Rajagopalan, B. Cully, A. Aboulnaga, K. Salem, and A. Warfield, "RemusDB: transparent high availability for database systems.", VLDB J., vol. 22, no. 1, pp. 29–45, 2013.
Clarke, C., L. Freund, M. Smucker, and E. Yilmaz, "Report on the SIGIR 2013 workshop on modeling user behavior for information retrieval evaluation (MUBE 2013).", SIGIR Forum, vol. 47, no. 2, pp. 84–95, 2013.
Murdock, V., C. Clarke, J. Kamps, and J. Karlgren, "Report on the workshop on search and exploration of x-rated information (SEXI 2013).", SIGIR Forum, vol. 47, no. 1, pp. 31–37, 2013.
Golab, L., "Data Warehouse Quality: Summary and Outlook.", Handbook of Data Quality, pp. 121–140, 2013.
Ng, R., P. Arocena, D. Barbosa, G. Carenini, L. Gomes, Jr., S. Jou, R. Leung, E. Milios, R. Miller, J. Mylopoulos, et al., "Perspectives on Business Intelligence", Perspectives on Business Intelligence, pp. 1–163, 2013.

2012

MacDonald, C., J. Wang, and C. Clarke, "2nd international workshop on diversity in document retrieval (DDR 2012).", WSDM, pp. 769–770, 2012.
Golab, L., T. Johnson, S. Sen, and J. Yates, "A Sequence-Oriented Stream Warehouse Paradigm for Network Monitoring Applications.", PAM, pp. 53–63, 2012.
Wu, J., A. Hudek, D. Toman, and G. Weddell, "Absorption for ABoxes.", Description Logics, 2012.
Wu, J., A. Hudek, D. Toman, and G. Weddell, "Assertion Absorption in Object Queries over Knowledge Bases.", KR, 2012.
Türe, F., J. Lin, and D. Oard, "Combining Statistical Translation Techniques for Cross-Language Information Retrieval.", COLING, pp. 2685–2702, 2012.
Golab, L., H. Karloff, F. Korn, B. Saha, and D. Srivastava, "Discovering Conservation Rules.", ICDE, pp. 738–749, 2012.
Busch, M., K. Gade, B. Larson, P. Lok, S. Luckenbill, and J. Lin, "Earlybird: Real-Time Search at Twitter.", ICDE, pp. 1360–1369, 2012.
Minhas, U., R. Liu, A. Aboulnaga, K. Salem, J. Ng, and S. Robertson, "Elastic Scale-Out for Partition-Based Database Systems.", ICDE Workshops, pp. 281–288, 2012.
McCullough, D., J. Lin, C. MacDonald, I. Ounis, and R. McCreadie, "Evaluating Real-Time Search over Tweets.", ICWSM, 2012.
Drzadzewski, G., and F. Tompa, "Exploring and analyzing documents with OLAP.", PIKM, pp. 33–40, 2012.
Chairunnanda, P., S. Forsyth, and K. Daudjee, "Graph data partition models for online social networks.", HT, pp. 175–180, 2012.
Smucker, M., J. Allan, and B. Dachev, "Human question answering performance using an interactive document retrieval system.", IIiX, pp. 35–44, 2012.
Pound, J., A. Hudek, I. Ilyas, and G. Weddell, "Interpreting keyword queries over web knowledge bases.", CIKM, pp. 305–314, 2012.
El-Helw, A., M. Farid, and I. Ilyas, "Just-in-time information extraction using extraction views.", SIGMOD Conference, pp. 613–616, 2012.
Lin, J., and A. Kolcz, "Large-scale machine learning at Twitter.", SIGMOD Conference, pp. 793–804, 2012.
Raveendran, G., and C. Clarke, "Lightweight contrastive summarization for news comment mining.", SIGIR, pp. 1103–1104, 2012.
Türe, F., J. Lin, and D. Oard, "Looking inside the box: context-sensitive translation for cross-language information retrieval.", SIGIR, pp. 1105–1106, 2012.
Ashkan, A., and C. Clarke, "Modeling browsing behavior for click analysis in sponsored search.", CIKM, pp. 2015–2019, 2012.
Smucker, M., and C. Clarke, "Modeling user variance in time-biased gain.", HCIR, pp. 3, 2012.
McCreadie, R., I. Soboroff, J. Lin, C. MacDonald, I. Ounis, and D. McCullough, "On building a reusable Twitter corpus.", SIGIR, pp. 1113–1114, 2012.
Dean-Hall, A., C. Clarke, J. Kamps, P. Thomas, and E. Voorhees, "Overview of the TREC 2012 Contextual Suggestion Track.", TREC, 2012.
Smucker, M., G. Kazai, and M. Lease, "Overview of the TREC 2012 Crowdsourcing Track.", TREC, 2012.
Clarke, C., N. Craswell, and E. Voorhees, "Overview of the TREC 2012 Web Track.", TREC, 2012.
Soboroff, I., I. Ounis, C. MacDonald, and J. Lin, "Overview of the TREC-2012 Microblog Track.", TREC, 2012.
Smucker, M., and C. Clarke, "Stochastic simulation of time-biased gain.", CIKM, pp. 2040–2044, 2012.
Lutz, C., I. Seylan, D. Toman, and F. Wolter, "The Combined Approach to OBDA: Taming Role Hierarchies using Filters.", SSWS+HPCSW@ISWC, pp. 16–31, 2012.
Smucker, M., and C. Jethani, "Time to judge relevance as an indicator of assessor error.", SIGIR, pp. 1153–1154, 2012.
Smucker, M., and C. Clarke, "Time-based calibration of effectiveness measures.", SIGIR, pp. 95–104, 2012.
Bär, A., and L. Golab, "Towards benchmarking stream data warehouses.", DOLAP, pp. 105–112, 2012.
Mishne, G., and J. Lin, "Twanchor text: a preliminary study of the value of tweets as anchor text.", SIGIR, pp. 1159–1160, 2012.
Lin, J., and G. Mishne, "A Study of “Churn” in Tweets and Real-Time Search Queries (Extended Version)", CoRR, vol. abs/1205.6855, 2012.
Zou, L., L. Chen, M. Özsu, and D. Zhao, "Answering pattern match queries in large graph databases via graph embedding.", VLDB J., vol. 21, no. 1, pp. 97–120, 2012.
Mishne, G., J. Dalton, Z. Li, A. Sharma, and J. Lin, "Fast Data in the Era of Big Data: Twitter’s Real-Time Related Query Suggestion Architecture", CoRR, vol. abs/1210.7350, 2012.
Beskales, G., I. Ilyas, L. Golab, and A. Galiullin, "On the Relative Trust between Inconsistent Data and Inaccurate Constraints", CoRR, vol. abs/1207.5226, 2012.
Trotman, A., C. Clarke, I. Ounis, J. Culpepper, M-A. Cartright, and S. Geva, "Open source information retrieval: a report on the SIGIR 2012 workshop.", SIGIR Forum, vol. 46, no. 2, pp. 95–101, 2012.
Asadi, N., J. Lin, and A. de Vries, "Runtime Optimizations for Prediction with Tree-Based Models", CoRR, vol. abs/1212.2287, 2012.
Golab, L., T. Johnson, and V. Shkapenyuk, "Scalable Scheduling of Updates in Streaming Data Warehouses.", IEEE Trans. Knowl. Data Eng., vol. 24, no. 6, pp. 1092–1105, 2012.
Lin, J., and D. Ryaboy, "Scaling big data mining infrastructure: the Twitter experience.", SIGKDD Explorations, vol. 14, no. 2, pp. 6–19, 2012.
Beskales, G., G. Das, A. Elmagarmid, I. Ilyas, F. Naumann, M. Ouzzani, P. Papotti, J-A. Quiané-Ruiz, and N. Tang, "The data analytics group at the Qatar Computing Research Institute.", SIGMOD Record, vol. 41, no. 4, pp. 33–38, 2012.
Lee, G., J. Lin, C. Liu, A. Lorek, and D. Ryaboy, "The Unified Logging Infrastructure for Data Analytics at Twitter", CoRR, vol. abs/1208.4171, 2012.
Lee, G., J. Lin, C. Liu, A. Lorek, and D. Ryaboy, "The Unified Logging Infrastructure for Data Analytics at Twitter.", PVLDB, vol. 5, no. 12, pp. 1771–1780, 2012.

2011

Wang, L., J. Lin, and D. Metzler, "A cascade ranking model for efficient ranked retrieval.", SIGIR, pp. 105–114, 2011.
Clarke, C., N. Craswell, I. Soboroff, and A. Ashkan, "A comparative analysis of cascade measures for novelty and diversity.", WSDM, pp. 75–84, 2011.
Pound, J., D. Toman, G. Weddell, and J. Wu, "An Assertion Retrieval Algebra for Object Queries over Knowledge Bases.", IJCAI, pp. 1051–1056, 2011.
Leibert, F., J. Mannix, J. Lin, and B. Hamadani, "Automatic management of partitioned, replicated search services.", SoCC, pp. 27, 2011.
Whissell, J., and C. Clarke, "Clustering for semi-supervised spam filtering.", CEAS, pp. 125–134, 2011.
Golab, L., and T. Johnson, "Consistency in a Stream Warehouse.", CIDR, pp. 114–122, 2011.
Asadi, N., D. Metzler, and J. Lin, "Cross-corpus relevance projection.", SIGIR, pp. 1163–1164, 2011.
"Distributed data management in 2020?", ICDE, pp. 1360, 2011.
Akinyemi, J., and C. Clarke, "Do Subtopic Judgments Reflect Diversity?", ICTIR, pp. 309–312, 2011.
Kamali, S., P. Ghodsnia, and K. Daudjee, "Dynamic data allocation with replication in distributed systems.", IPCCC, pp. 1–8, 2011.
Cheng, J., Y. Ke, S. Chu, and M. Özsu, "Efficient core decomposition in massive networks.", ICDE, pp. 51–62, 2011.
Franconi, E., and D. Toman, "Fixpoints in Temporal Description Logics.", IJCAI, pp. 875–880, 2011.
Kamali, S., and F. Tompa, "Grammar Inference for Web Documents.", WebDB, 2011.
Smucker, M., and C. Jethani, "Measuring assessor accuracy: a comparison of NIST assessors and user study participants.", SIGIR, pp. 1231–1232, 2011.
Türe, F., T. Elsayed, and J. Lin, "No free lunch: brute force vs. locality-sensitive hashing for cross-lingual pairwise similarity.", SIGIR, pp. 943–952, 2011.
Miller, R., F. Tompa, S. McIlraith, J. Slonim, and E. Yu, "NSERC business intelligence network: selected topics.", CASCON, pp. 313–315, 2011.
Ashkan, A., and C. Clarke, "On the informativeness of cascade and intent-aware effectiveness measures.", WWW, pp. 407–416, 2011.
Grossman, M., G. Cormack, B. Hedin, and D. Oard, "Overview of the TREC 2011 Legal Track.", TREC, 2011.
Clarke, C., N. Craswell, I. Soboroff, and E. Voorhees, "Overview of the TREC 2011 Web Track.", TREC, 2011.
Asadi, N., D. Metzler, T. Elsayed, and J. Lin, "Pseudo test collections for learning web search ranking functions.", SIGIR, pp. 1073–1082, 2011.
Soliman, M., I. Ilyas, D. Martinenghi, and M. Tagliasacchi, "Ranking with uncertain scoring functions: semantics and sensitivity measures.", SIGMOD Conference, pp. 805–816, 2011.
Lin, J., R. Snow, and W. Morgan, "Smoothing techniques for adaptive online language models: topic tracking in tweet streams.", KDD, pp. 422–429, 2011.
Kontchakov, R., C. Lutz, D. Toman, F. Wolter, and M. Zakharyaschev, "The Combined Approach to Ontology-Based Data Access.", IJCAI, pp. 2656–2661, 2011.
Itakura, K., C. Clarke, S. Geva, A. Trotman, and W. Huang, "Topical and Structural Linkage in Wikipedia.", ECIR, pp. 460–465, 2011.
Roegiest, A., and G. Cormack, "University of Waterloo at TREC 2011 Microblog Track.", TREC, 2011.
Akinyemi, J., and C. Clarke, "UWaterloo at NTCIR-9: Intent discovery with anchor text.", NTCIR, 2011.
Elsayed, T., J. Lin, and D. Metzler, "When close enough is good enough: approximate positional indexes for efficient ranked retrieval.", CIKM, pp. 1993–1996, 2011.
Chen, G., H. Vo, S. Wu, B. Ooi, and M. Özsu, "A Framework for Supporting DBMS-like Indexes in the Cloud.", PVLDB, vol. 4, no. 11, pp. 702–713, 2011.
Ataullah, A., and F. Tompa, "Business Policy Modeling and Enforcement in Databases.", PVLDB, vol. 4, no. 11, pp. 921–931, 2011.
Golab, L., F. Korn, and D. Srivastava, "Efficient and Effective Analysis of Data Quality using Pattern Tableaux.", IEEE Data Eng. Bull., vol. 34, no. 3, pp. 26–33, 2011.
Cormack, G., M. Smucker, and C. Clarke, "Efficient and effective spam filtering and re-ranking for large web datasets.", Inf. Retr., vol. 14, no. 5, pp. 441–465, 2011.
Zou, L., J. Mo, L. Chen, M. Özsu, and D. Zhao, "gStore: Answering SPARQL Queries via Subgraph Matching.", PVLDB, vol. 4, no. 8, pp. 482–493, 2011.
Yakout, M., A. Elmagarmid, J. Neville, M. Ouzzani, and I. Ilyas, "Guided Data Repair", CoRR, vol. abs/1103.3103, 2011.
Yakout, M., A. Elmagarmid, J. Neville, M. Ouzzani, and I. Ilyas, "Guided data repair.", PVLDB, vol. 4, no. 5, pp. 279–289, 2011.
Whissell, J., and C. Clarke, "Improving document clustering using Okapi BM25 feature weighting.", Inf. Retr., vol. 14, no. 5, pp. 466–487, 2011.
Kane, A., and F. Tompa, "Janus: the intertextuality search engine for the electronic Manipulus florum project.", LLC, vol. 26, no. 4, pp. 407–415, 2011.
Wong, R., M. Özsu, A. Fu, P. Yu, L. Liu, and Y. Liu, "Maximizing bichromatic reverse nearest neighbor for Lp-norm in two- and three-dimensional spaces.", VLDB J., vol. 20, no. 6, pp. 893–919, 2011.
Minhas, U., S. Rajagopalan, B. Cully, A. Aboulnaga, K. Salem, and A. Warfield, "RemusDB: Transparent High Availability for Database Systems.", PVLDB, vol. 4, no. 11, pp. 738–748, 2011.
Belkin, N., C. Clarke, N. Gao, J. Kamps, and J. Karlgren, "Report on the SIGIR workshop on “entertain me”: supporting complex search tasks.", SIGIR Forum, vol. 45, no. 2, pp. 51–59, 2011.
Kling, P., M. Özsu, and K. Daudjee, "Scaling XML query processing: distribution, localization and pruning.", Distributed and Parallel Databases, vol. 29, no. 5–6, pp. 445–490, 2011.
Bateni, MH., L. Golab, MT. Hajiaghayi, and H. Karloff, "Scheduling to Minimize Staleness and Stretch in Real-Time Data Warehouses.", Theory Comput. Syst., vol. 49, no. 4, pp. 757–780, 2011.
Chockler, G., E. Dekel, J. JáJá, and J. Lin, "Special Issue on Cloud Computing.", J. Parallel Distrib. Comput., vol. 71, no. 6, pp. 731, 2011.
MacDonald, C., C. Clarke, and J. Wang, "The 1st international workshop on diversity in document retrieval.", SIGIR Forum, vol. 45, no. 2, pp. 87–93, 2011.
Toman, D., and G. Weddell, "Fundamentals of Physical Design and Query Compilation", Fundamentals of Physical Design and Query Compilation, 2011.
Özsu, M., and P. Valduriez, Principles of Distributed Database Systems, Third Edition., pp. I–XIX, 1–845, 2011.
Ilyas, I., and M. Soliman, "Probabilistic Ranking Techniques in Relational Databases", Probabilistic Ranking Techniques in Relational Databases, 2011.

2010

Itakura, K., and C. Clarke, "A framework for BM25F-based XML retrieval.", SIGIR, pp. 843–844, 2010.
Kamali, S., and F. Tompa, "A new mathematics retrieval system.", CIKM, pp. 1413–1416, 2010.
Abouzour, M., K. Salem, and P. Bumbulis, "Automatic tuning of the multiprogramming level in Sybase SQL Anywhere.", ICDE Workshops, pp. 99–104, 2010.
Lafreniere, B., A. Bunt, J. Whissell, C. Clarke, and M. Terry, "Characterizing large-scale use of a direct manipulation application in the wild.", Graphics Interface, pp. 11–18, 2010.
Clarke, C., "ClueWeb09 and TREC Diversity.", NTCIR, pp. 13, 2010.
Lin, J., and C. Dyer, "Data-Intensive Text Processing with MapReduce.", NAACL (Tutorial Abstracts), pp. 1–2, 2010.
Lin, J., and M. Schatz, "Design patterns for efficient graph algorithms in MapReduce.", MLG@KDD, pp. 78–85, 2010.
Savinov, S., and K. Daudjee, "Dynamic database replica provisioning through virtualization.", CloudDB, pp. 41–46, 2010.
Zou, L., L. Chen, M. Özsu, and D. Zhao, "Dynamic Skyline Queries in Large Graphs.", DASFAA (2), pp. 62–78, 2010.
Tao, Y., and M. Özsu, "Efficient Decision Tree Re-alignment for Clustering Time-Changing Data Streams.", From Active Data Management to Event-Based Systems and More, pp. 20–43, 2010.
Pound, J., I. Ilyas, and G. Weddell, "Expressive and flexible access to web-extracted data: a keyword-based structured query language.", SIGMOD Conference, pp. 423–434, 2010.
Smucker, M., and C. Jethani, "Human performance and retrieval precision revisited.", SIGIR, pp. 595–602, 2010.
Wang, L., J. Lin, and D. Metzler, "Learning to efficiently rank.", SIGIR, pp. 138–145, 2010.
Soliman, M., M. Saleeb, and I. Ilyas, "MashRank: Towards uncertainty-aware and rank-aware mashups.", ICDE, pp. 1137–1140, 2010.
Dolman, L., F. Tompa, I. Kiringa, R. Pottinger, and J. Mylopoulos, "Next generation business intelligence (BI) tools.", CASCON, pp. 352–354, 2010.
Stanchev, L., and G. Weddell, "On Building an Index Advisor for Semantic Web Queries.", FOIS, pp. 147–157, 2010.
Borgida, A., J. de Bruijn, E. Franconi, I. Seylan, U. Straccia, D. Toman, and G. Weddell, "On Finding Query Rewritings under Expressive Constraints.", SEBD, pp. 426–437, 2010.
Lunn, D., M. Bernstein, C. Marshall, J. Matias, J. Nyce, and F. Tompa, "Past visions of hypertext and their influence on us today.", HT, pp. 315, 2010.
Beskales, G., M. Soliman, I. Ilyas, S. Ben-David, and Y. Kim, "ProbClean: A probabilistic duplicate detection system.", ICDE, pp. 1193–1196, 2010.
Lin, J., N. Madnani, and B. Dorr, "Putting the User in the Loop: Interactive Maximal Marginal Relevance for Query-Focused Summarization.", HLT-NAACL, pp. 305–308, 2010.
Pound, J., D. Toman, G. Weddell, and J. Wu, "Query Algebra and Query Optimization for Concept Assertion Retrieval.", Description Logics, 2010.
Wang, L., D. Metzler, and J. Lin, "Ranking under temporal constraints.", CIKM, pp. 79–88, 2010.
Mojdeh, M., and G. Cormack, "Semi-supervised spam filtering using aggressive consistency learning.", SIGIR, pp. 751–752, 2010.
Kontchakov, R., C. Lutz, D. Toman, F. Wolter, and M. Zakharyaschev, "The Combined Approach to Query Answering in DL-Lite.", KR, 2010.
Akinyemi, J., C. Clarke, and M. Kolla, "Towards a collection-based results diversification.", RIAO, pp. 202–205, 2010.
Ilyas, I., D. Martinenghi, N. Polyzotis, and M. Tagliasacchi, "Trends in Rank Join.", SeCO Workshop, pp. 135–137, 2010.
Elsayed, T., N. Asadi, L. Wang, J. Lin, and D. Metzler, "UMD and USC/ISI: TREC 2010 Web Track Experiments with Ivory.", TREC, 2010.
Ilyas, I., "Uncertainty in Rank Join.", SeCO Workshop, pp. 128–134, 2010.
Smucker, M., C. Clarke, G. Cormack, and O. Vechtomova, "University of Waterloo at TREC 2010: Legal Interactive.", TREC, 2010.
Ozmen, O., K. Salem, J. Schindler, and S. Daniel, "Workload-aware storage layout for database systems.", SIGMOD Conference, pp. 939–950, 2010.
Lo, E., C. Binnig, D. Kossmann, M. Özsu, and W-K. Hon, "A framework for testing DBMS features.", VLDB J., vol. 19, no. 2, pp. 203–230, 2010.
Soror, A., U. Minhas, A. Aboulnaga, K. Salem, P. Kokosielis, and S. Kamath, "Automatic virtual machine configuration for database workloads.", ACM Trans. Database Syst., vol. 35, no. 1, pp. 7:1–7:47, 2010.
Soliman, M., I. Ilyas, and M. Saleeb, "Building Ranked Mashups of Unstructured Sources with Uncertain Information.", PVLDB, vol. 3, no. 1, pp. 826–837, 2010.
Golab, L., H. Karloff, F. Korn, and D. Srivastava, "Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux.", PVLDB, vol. 3, no. 2, 2010.
Cormack, G., M. Smucker, and C. Clarke, "Efficient and Effective Spam Filtering and Re-ranking for Large Web Datasets", CoRR, vol. abs/1004.5168, 2010.
Srivastava, D., L. Golab, R. Greer, T. Johnson, J. Seidel, V. Shkapenyuk, O. Spatscheck, and J. Yates, "Enabling Real Time Data Analysis.", PVLDB, vol. 3, no. 1, pp. 1–2, 2010.
Kling, P., M. Özsu, and K. Daudjee, "Generating Efficient Execution Plans for Vertically Partitioned XML Databases.", PVLDB, vol. 4, no. 1, pp. 1–11, 2010.
Ben-David, S., R. Trefler, and G. Weddell, "Model Checking Using Description Logic.", J. Log. Comput., vol. 20, no. 1, pp. 111–131, 2010.
Wang, Q., K. Daudjee, and M. Özsu, "Popularity-aware prefetch in P2P range caching.", Peer-to-Peer Networking and Applications, vol. 3, no. 2, pp. 145–160, 2010.
Pound, J., I. Ilyas, and G. Weddell, "QUICK: Expressive and Flexible Search over Knowledge Bases and Text Collections.", PVLDB, vol. 3, no. 2, 2010.
Azzopardi, L., K. Järvelin, J. Kamps, and M. Smucker, "Report on the SIGIR 2010 workshop on the simulation of interaction.", SIGIR Forum, vol. 44, no. 2, pp. 35–47, 2010.
Beskales, G., I. Ilyas, and L. Golab, "Sampling the Repairs of Functional Dependency Violations under Hard Constraints.", PVLDB, vol. 3, no. 1, pp. 197–207, 2010.
Stanchev, L., and G. Weddell, "Saving space and time using index merging.", Data Knowl. Eng., vol. 69, no. 10, pp. 1062–1080, 2010.
Soliman, M., I. Ilyas, and S. Ben-David, "Supporting ranking queries on uncertain and incomplete data.", VLDB J., vol. 19, no. 4, pp. 477–501, 2010.
Ailamaki, A., L. Haas, H. Jagadish, D. Maier, M. Özsu, and M. Winslett, "Time for Our Field to Grow Up.", PVLDB, vol. 3, no. 2, pp. 1658, 2010.
Golab, L., and M. Özsu, "Data Stream Management", Data Stream Management, 2010.
Lin, J., and C. Dyer, "Data-Intensive Text Processing with MapReduce", Data-Intensive Text Processing with MapReduce, 2010.
Büttcher, S., C. Clarke, and G. Cormack, Information Retrieval - Implementing and Evaluating Search Engines., pp. I–XXIV, 1–606, 2010.

2009

Fiser, P., and D. Toman, "A Fast SOP Minimizer for Logic Functions Described by Many Product Terms.", DSD, pp. 757–764, 2009.
Smucker, M., and J. Allan, "A New Measure of the Cluster Hypothesis.", ICTIR, pp. 281–288, 2009.
Qasim, U., V. Oria, Y-F. Wu, M. Houle, and M. Özsu, "A partial-order based active cache for recommender systems.", RecSys, pp. 209–212, 2009.
Smucker, M., J. Allan, and B. Carterette, "Agreement among statistical significance tests for information retrieval evaluation at varying sample sizes.", SIGIR, pp. 630–631, 2009.
Clarke, C., M. Kolla, and O. Vechtomova, "An Effectiveness Measure for Ambiguous and Underspecified Queries.", ICTIR, pp. 188–199, 2009.
Toman, D., and G. Weddell, "Applications and Extensions of PTIME Description Logics with Functional Constraints.", IJCAI, pp. 948–954, 2009.
Ashkan, A., and C. Clarke, "Characterizing commercial intent.", CIKM, pp. 67–76, 2009.
Ashkan, A., C. Clarke, E. Agichtein, and Q. Guo, "Classifying and Characterizing Query Intent.", ECIR, pp. 578–586, 2009.
Liu, X., A. Aboulnaga, K. Salem, and X. Li, "CLIC: CLient-Informed Caching for Storage Servers.", FAST, pp. 297–310, 2009.
Whissell, J., C. Clarke, and A. Ashkan, "Clustering web queries.", CIKM, pp. 899–908, 2009.
Kontchakov, R., C. Lutz, D. Toman, F. Wolter, and M. Zakharyaschev, "Combined FO Rewritability for Conjunctive Query Answering in DL-Lite.", Description Logics, 2009.
Pound, J., D. Toman, G. Weddell, and J. Wu, "Concept Projection in Algebras for Computing Certain Answer Descriptions.", Description Logics, 2009.
Lutz, C., D. Toman, and F. Wolter, "Conjunctive Query Answering in the Description Logic EL Using a Relational Database System.", IJCAI, pp. 2070–2075, 2009.
Lin, J., and C. Dyer, "Data Intensive Text Processing with MapReduce.", HLT-NAACL (Tutorial Abstracts), pp. 1–2, 2009.
Özsu, M., "Distributed XML Processing.", APWeb/WAIM, pp. 1, 2009.
Tao, Y., and M. Özsu, "Efficient decision tree construction for mining time-varying data streams.", CASCON, pp. 43–57, 2009.
Chan, E., and J. Zhang, "Efficient Evaluation of Static and Dynamic Optimal Route Queries.", SSTD, pp. 386–391, 2009.
Henry, K., C. Swanson, Q. Xie, and K. Daudjee, "Efficient Hierarchical Quorums in Unstructured Peer-to-Peer Networks.", OTM Conferences (1), pp. 183–200, 2009.
Ashkan, A., C. Clarke, E. Agichtein, and Q. Guo, "Estimating Ad Clickthrough Rate through Query Intent Analysis.", Web Intelligence, pp. 222–229, 2009.
Cormode, G., L. Golab, F. Korn, A. McGregor, D. Srivastava, and X. Zhang, "Estimating the confidence of conditional functional dependencies.", SIGMOD Conference, pp. 469–482, 2009.
Smucker, M., C. Clarke, and G. Cormack, "Experiments with ClueWeb09: Relevance Feedback and Web Tracks.", TREC, 2009.
Ben-David, S., J. Pound, R. Trefler, D. Tsarkov, and G. Weddell, "Fair Cycle Detection using Description Logic Reasoning.", Description Logics, 2009.
Kolcz, A., and G. Cormack, "Genre-based decomposition of email class noise.", KDD, pp. 427–436, 2009.
Guo, Q., E. Agichtein, C. Clarke, and A. Ashkan, "In the Mood to Click? Towards Inferring Receptiveness to Search Advertising.", Web Intelligence, pp. 319–324, 2009.
Tang, N., J. Yu, H. Tang, M. Özsu, and P. Boncz, "Materialized View Selection in XML Databases.", DASFAA, pp. 616–630, 2009.
Tao, Y., and M. Özsu, "Mining data streams with periodically changing distributions.", CIKM, pp. 887–896, 2009.
Tao, Y., and M. Özsu, "Mining frequent itemsets in time-varying data streams.", CIKM, pp. 1521–1524, 2009.
Lin, J., T. Elsayed, L. Wang, and D. Metzler, "Of Ivory and Smurfs: Loxodontan MapReduce Experiments for Web Search.", TREC, 2009.
Cormack, G., and J-M. da Cruz, "On the relative age of spam and ham training samples for email filtering.", SIGIR, pp. 744–745, 2009.
Clarke, C., N. Craswell, and I. Soboroff, "Overview of the TREC 2009 Web Track.", TREC, 2009.
Zhang, H., I. Ilyas, and K. Salem, "PSALM: Cardinality Estimation in the Presence of Fine-Grained Access Controls.", ICDE, pp. 505–516, 2009.
Ilyas, I., D. Martinenghi, and M. Tagliasacchi, "Rank-Join Algorithms for Search Computing.", SeCO Workshop, pp. 211–224, 2009.
Soliman, M., and I. Ilyas, "Ranking with Uncertain Scores.", ICDE, pp. 317–328, 2009.
Cormack, G., C. Clarke, and S. Büttcher, "Reciprocal rank fusion outperforms Condorcet and individual rank learning methods.", SIGIR, pp. 758–759, 2009.
Bateni, MH., L. Golab, M. Hajiaghayi, and H. Karloff, "Scheduling to minimize staleness and stretch in real-time data warehouses.", SPAA, pp. 29–38, 2009.
Golab, L., T. Johnson, and V. Shkapenyuk, "Scheduling Updates in a Real-Time Stream Warehouse.", ICDE, pp. 1207–1210, 2009.
Cormack, G., and A. Kolcz, "Spam filter evaluation with imprecise ground truth.", SIGIR, pp. 604–611, 2009.
Golab, L., T. Johnson, J. Seidel, and V. Shkapenyuk, "Stream warehousing with DataDepot.", SIGMOD Conference, pp. 847–854, 2009.
Ashkan, A., and C. Clarke, "Term-based commercial intent analysis.", SIGIR, pp. 800–801, 2009.
Murray, G., J. Lin, W. Wilbur, and Z. Lu, "Users’ adjustments to unsuccessful queries in biomedical search.", JCDL, pp. 433–434, 2009.
Itakura, K., and C. Clarke, "Using dynamic Markov compression to detect vandalism in the Wikipedia.", SIGIR, pp. 822–823, 2009.
Lin, J., G. Murray, B. Dorr, J. Hajic, and P. Pecina, "A cost-effective lexical acquisition process for large-scale thesaurus translation.", Language Resources and Evaluation, vol. 43, no. 1, pp. 27–40, 2009.
Klavans, J., C. Sheffield, E. Abels, J. Lin, R. Passonneau, T. Sidhu, and D. Soergel, "Computational linguistics for metadata building (CLiMB): using text mining for the automatic identification, categorization, and disambiguation of subject terms for image metadata.", Multimedia Tools Appl., vol. 42, no. 1, pp. 115–138, 2009.
Wan, Q., R. Wong, I. Ilyas, M. Özsu, and Y. Peng, "Creating Competitive Products.", PVLDB, vol. 2, no. 1, pp. 898–909, 2009.
Aboulnaga, A., K. Salem, A. Soror, U. Minhas, P. Kokosielis, and S. Kamath, "Deploying Database Appliances in the Cloud.", IEEE Data Eng. Bull., vol. 32, no. 1, pp. 13–20, 2009.
Haas, P., I. Ilyas, G. Lohman, and V. Markl, "Discovering and Exploiting Statistical Properties for Query Optimization in Relational Databases: A Survey.", Statistical Analysis and Data Mining, vol. 1, no. 4, pp. 223–250, 2009.
Zou, L., L. Chen, and M. Özsu, "DistanceJoin: Pattern Match Query In a Large Graph Database.", PVLDB, vol. 2, no. 1, pp. 886–897, 2009.
Wong, R., M. Özsu, P. Yu, A. Fu, and L. Liu, "Efficient Method for Maximizing Bichromatic Reverse Nearest Neighbor.", PVLDB, vol. 2, no. 1, pp. 1126–1137, 2009.
Hawes, T., J. Lin, and P. Resnik, "Elements of a computational model for multi-party discourse: The turn-taking behavior of Supreme Court justices.", JASIST, vol. 60, no. 8, pp. 1607–1615, 2009.
Ilyas, I., "Guest editorial: special issue on ranking in databases.", Distributed and Parallel Databases, vol. 26, no. 1, pp. 1–2, 2009.
Lin, J., "Is searching full text more effective than searching abstracts?", BMC Bioinformatics, vol. 10, 2009.
Zou, L., L. Chen, and M. Özsu, "K-Automorphism: A General Framework For Privacy Preserving Network Publication.", PVLDB, vol. 2, no. 1, pp. 946–957, 2009.
Lin, J., and W. Wilbur, "Modeling actions of PubMed users with n-gram language models.", Inf. Retr., vol. 12, no. 4, pp. 487–503, 2009.
Beskales, G., M. Soliman, I. Ilyas, and S. Ben-David, "Modeling and Querying Possible Repairs in Duplicate Detection.", PVLDB, vol. 2, no. 1, pp. 598–609, 2009.
Aboulnaga, A., and K. Salem, "Report: 4th Int’l Workshop on Self-Managing Database Systems (SMDB 2009).", IEEE Data Eng. Bull., vol. 32, no. 4, pp. 2–5, 2009.
Golab, L., H. Karloff, F. Korn, A. Saha, and D. Srivastava, "Sequential Dependencies.", PVLDB, vol. 2, no. 1, pp. 574–585, 2009.
Chan, E., and Y. Yang, "Shortest Path Tree Computation in Dynamic Graphs.", IEEE Trans. Computers, vol. 58, no. 4, pp. 541–557, 2009.
Chockler, G., E. Dekel, J. JáJá, and J. Lin, "Special Issue of the Journal of Parallel and Distributed Computing: Cloud Computing.", J. Parallel Distrib. Comput., vol. 69, no. 9, pp. 813, 2009.
El-Helw, A., I. Ilyas, and C. Zuzarte, "StatAdvisor: Recommending Statistical Views.", PVLDB, vol. 2, no. 2, pp. 1306–1317, 2009.
Clarke, C., G. Cormack, T. Lynam, C. Buckley, and D. Harman, "Swapping documents and terms.", Inf. Retr., vol. 12, no. 6, pp. 680–694, 2009.
Jaeger, P., J. Lin, J. Grimes, and S. Simmons, "Where is the Cloud? Geography, Economics, Environment, and Jurisdiction in Cloud Computing.", First Monday, vol. 14, no. 5, 2009.
Li, Y., M. Özsu, and K-L. Tan, "XCube: Processing XPath queries in a hypercube overlay network.", Peer-to-Peer Networking and Applications, vol. 2, no. 2, pp. 128–145, 2009.
Chomicki, J., and D. Toman, "Abstract Versus Concrete Temporal Query Languages.", Encyclopedia of Database Systems, pp. 1–6, 2009.
Özsu, M., "Client-Server DBMS.", Encyclopedia of Database Systems, pp. 342–344, 2009.
Golab, L., "Data Stream.", Encyclopedia of Database Systems, pp. 638, 2009.
Tompa, F., "Document Databases.", Encyclopedia of Database Systems, pp. 938–939, 2009.
Tompa, F., "Enterprise Content Management.", Encyclopedia of Database Systems, pp. 997, 2009.
Tompa, F., "Hypertexts.", Encyclopedia of Database Systems, pp. 1331–1332, 2009.
Toman, D., "Point-Stamped Temporal Models.", Encyclopedia of Database Systems, pp. 2119–2123, 2009.
Salem, K., "Sagas.", Encyclopedia of Database Systems, pp. 2466–2467, 2009.
Golab, L., "Stream Models.", Encyclopedia of Database Systems, pp. 2834–2836, 2009.
Lin, J., "Summarization.", Encyclopedia of Database Systems, pp. 2884–2889, 2009.
Chomicki, J., and D. Toman, "Temporal Logic in Database Query Languages.", Encyclopedia of Database Systems, pp. 2987–2991, 2009.
Chomicki, J., and D. Toman, "Temporal Relational Calculus.", Encyclopedia of Database Systems, pp. 3015–3016, 2009.
Roddick, J., and D. Toman, "Temporal Vacuuming.", Encyclopedia of Database Systems, pp. 3023–3027, 2009.
Clarke, C., "Web Question Answering.", Encyclopedia of Database Systems, pp. 3485–3490, 2009.

2008

Sarma, A., A. de Keijzer, A. Deshpande, P. Haas, I. Ilyas, C. Koch, T. Neumann, D. Olteanu, M. Theobald, and V. Vassalos, "08421 Working Group: Classification, Representation and Modeling.", Uncertainty Management in Information Systems, 2008.
Sarma, A., A. Deshpande, T. Hubauer, I. Ilyas, B. König-Ries, M. Renz, and M. Theobald, "08421 Working Group: Lineage/Provenance.", Uncertainty Management in Information Systems, 2008.
Soror, A., U. Minhas, A. Aboulnaga, K. Salem, P. Kokosielis, and S. Kamath, "Automatic virtual machine configuration for database workloads.", SIGMOD Conference, pp. 953–966, 2008.
Klavans, J., C. Sheffield, J. Lin, and T. Sidhu, "Computational linguistics for metadata building.", JCDL, pp. 427, 2008.
Lutz, C., D. Toman, and F. Wolter, "Conjunctive Query Answering in EL using a Database System.", OWLED, 2008.
Minhas, U., J. Yadav, A. Aboulnaga, and K. Salem, "Database systems on virtual machines: How much do you lose?", ICDE Workshops, pp. 35–41, 2008.
Artale, A., and D. Toman, "Decidable Reasoning over Timestamped Conceptual Models.", Description Logics, 2008.
Artale, A., and D. Toman, "Decidable Reasoning over Timestamped Conceptual Models.", SEBD, pp. 168–178, 2008.
Dyer, C., A. Cordova, A. Mont, and J. Lin, "Fast, Easy, and Cheap: Construction of Statistical Machine Translation Models with MapReduce.", WMT@ACL, pp. 199–207, 2008.
Sculley, D., and G. Cormack, "Filtering Email Spam in the Presence of Noisy User Feedback.", CEAS, 2008.
Tang, N., J. Yu, M. Özsu, and K-F. Wong, "Hierarchical Indexing Approach to Support XPath Queries.", ICDE, pp. 1510–1512, 2008.
Toman, D., and G. Weddell, "Identifying Objects Over Time with Description Logics.", Description Logics, 2008.
Toman, D., and G. Weddell, "Identifying Objects Over Time with Description Logics.", KR, pp. 724–732, 2008.
Reznik-Zellen, R., B. Stevens, M. Thorn, J. Morse, M. Smucker, J. Allan, D. Mimno, A. McCallum, and M. Tuominen, "InterNano: e-Science for the Nanomanufacturing Community.", eScience, pp. 382–383, 2008.
Özsu, M., "Internet-Scale Data Distribution: Some Research Problems.", ADBIS (local proceedings), pp. 3, 2008.
Hristidis, V., and I. Ilyas, "Message from the DBRANK’08 program co-chairs.", ICDE Workshops, pp. 538, 2008.
Tang, N., J. Yu, M. Özsu, B. Choi, and K-F. Wong, "Multiple Materialized View Selection for XPath Query Rewriting.", ICDE, pp. 873–882, 2008.
Lynam, T., and G. Cormack, "MultiText Legal Experiments at TREC 2008.", TREC, 2008.
Clarke, C., M. Kolla, G. Cormack, O. Vechtomova, A. Ashkan, S. Büttcher, and I. MacKinnon, "Novelty and diversity in information retrieval evaluation.", SIGIR, pp. 659–666, 2008.
Pound, J., L. Stanchev, D. Toman, and G. Weddell, "On Ordering and Indexing Metadata for the Semantic Web.", Description Logics, 2008.
Golab, L., T. Johnson, N. Koudas, D. Srivastava, and D. Toman, "Optimizing away joins on data streams.", SSPS, pp. 48–57, 2008.
Elsayed, T., J. Lin, and D. Oard, "Pairwise Document Similarity in Large Collections with MapReduce.", ACL (Short Papers), pp. 265–268, 2008.
Wang, Q., K. Daudjee, and M. Özsu, "Popularity-Aware Prefetch in P2P Range Caching.", Peer-to-Peer Computing, pp. 53–62, 2008.
Voigt, H., W. Lehner, and K. Salem, "Poster session: Constrained dynamic physical database design.", ICDE Workshops, pp. 63–70, 2008.
Wang, W., M. Sharaf, S. Guo, and M. Özsu, "Potential-driven load distribution for distributed data stream processing.", SSPS, pp. 13–22, 2008.
Golab, L., T. Johnson, and O. Spatscheck, "Prefilter: predicate pushdown at streaming speeds.", SSPS, pp. 29–37, 2008.
Ataullah, A., A. Aboulnaga, and F. Tompa, "Records retention in relational database systems.", CIKM, pp. 873–882, 2008.
Mojdeh, M., and G. Cormack, "Semi-supervised spam filtering: does it work?", SIGIR, pp. 745–746, 2008.
Wang, Q., R. Li, L. Chen, J. Lian, and M. Özsu, "Speed up semantic search in p2p networks.", CIKM, pp. 1341–1342, 2008.
Itakura, K., and C. Clarke, "University of Waterloo at INEX 2008: Adhoc, Book, and Link-the-Wiki Tracks.", INEX, pp. 132–139, 2008.
Aboulnaga, A., C. Amza, and K. Salem, "Virtualization and databases: state of the art and research challenges.", EDBT, pp. 746–747, 2008.
Ilyas, I., G. Beskales, and M. Soliman, "A survey of top-k query processing techniques in relational database systems.", ACM Comput. Surv., vol. 40, no. 4, pp. 11:1–11:58, 2008.
Beskales, G., M. Soliman, and I. Ilyas, "Efficient search for the top-k probable nearest neighbors in uncertain databases.", PVLDB, vol. 1, no. 1, pp. 326–339, 2008.
Plattner, C., G. Alonso, and M. Özsu, "Extending DBMSs with satellite databases.", VLDB J., vol. 17, no. 4, pp. 657–682, 2008.
Büttcher, S., and C. Clarke, "Hybrid index maintenance for contiguous inverted lists.", Inf. Retr., vol. 11, no. 3, pp. 175–207, 2008.
Lin, J., M. DiCuccio, V. Grigoryan, and W. Wilbur, "Navigating information spaces: A case study of related article search in PubMed.", Inf. Process. Manage., vol. 44, no. 5, pp. 1771–1783, 2008.
Golab, L., H. Karloff, F. Korn, D. Srivastava, and B. Yu, "On generating near-optimal tableaux for conditional functional dependencies.", PVLDB, vol. 1, no. 1, pp. 376–390, 2008.
Toman, D., and G. Weddell, "On Keys and Functional Dependencies as First-Class Citizens in Description Logics.", J. Autom. Reasoning, vol. 40, no. 2–3, pp. 117–132, 2008.
Korth, H., P. Bernstein, M. Fernández, L. Gruenwald, P. Kolaitis, K. McKinley, and M. Özsu, "Paper and proposal reviews: is the process flawed?", SIGMOD Record, vol. 37, no. 3, pp. 36–39, 2008.
Soliman, M., I. Ilyas, and K. Chang, "Probabilistic top-k and ranking-aggregate queries.", ACM Trans. Database Syst., vol. 33, no. 3, pp. 13:1–13:54, 2008.
Ailamaki, A., S. Babu, P. Furtado, S. Lightstone, G. Lohman, P. Martin, V. Narasayya, G. Pauley, K. Salem, K-U. Sattler, et al., "Report: 3rd Int’l Workshop on Self-Managing Database Systems (SMDB 2008).", IEEE Data Eng. Bull., vol. 31, no. 4, pp. 2–5, 2008.
Zajic, D., B. Dorr, and J. Lin, "Single-document and multi-document summarization techniques for email threads using sentence compression.", Inf. Process. Manage., vol. 44, no. 4, pp. 1600–1610, 2008.
Lin, J., P. Wu, and E. Abels, "Toward automatic facet analysis and need negotiation: Lessons from mediated search.", ACM Trans. Inf. Syst., vol. 27, no. 1, pp. 6:1–6:42, 2008.