You are here

Publications

Sort by: Author Type [Year]

2018

Zhang, H., M. Abualsaud, and M. Smucker, "A Study of Immediate Requery Behavior in Search.", CHIIR, pp. 181–190, 2018.
Abualsaud, M., N. Ghelani, H. Zhang, M. Smucker, G. Cormack, and M. Grossman, "A System for Efficient High-Recall Retrieval.", SIGIR, pp. 1317–1320, 2018.
Koutris, P., S. Salihoglu, and D. Suciu, "Algorithmic Aspects of Parallel Query Processing.", SIGMOD Conference, pp. 1659–1664, 2018.
Begoli, E., J. Camacho-Rodríguez, J. Hyde, M. Mior, and D. Lemire, "Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources.", SIGMOD Conference, pp. 221–230, 2018.
Glasbergen, B., M. Abebe, K. Daudjee, S. Foggo, and A. Pacaci, "Apollo: Learning Query Correlations for Predictive Caching in Geo-Distributed Systems.", EDBT, pp. 253–264, 2018.
Cormack, G., and M. Grossman, "Beyond Pooling.", SIGIR, pp. 1169–1172, 2018.
Gao, J., Y. Wang, X. Chu, Y. He, and Z. Mao, "CAPED: Context-Aware Powerlet-Based Energy Disaggregation.", PAKDD (1), pp. 236–247, 2018.
Yan, X., L. Yang, H. Zhang, X. Lin, B. Wong, K. Salem, and T. Brecht, "Carousel: Low-Latency Transaction Processing for Globally-Distributed Data.", SIGMOD Conference, pp. 231–243, 2018.
Liang, Y., Z. Tu, L. Huang, and J. Lin, "CNNs for NLP in the Browser: Client-Side Deployment and Visualization Opportunities.", NAACL-HTL (Demonstrations), pp. 61–65, 2018.
Langouri, M., Z. Zheng, F. Chiang, L. Golab, and J. Szlichta, "Contextual Data Cleaning.", ICDE Workshops, pp. 21–24, 2018.
Chopra, S., Y. Jiang, A. Toulis, and L. Golab, "Data Analytics to Improve Co-Operative Education.", EDBT/ICDT Workshops, pp. 16–21, 2018.
Pacaci, A., and M.. Özsu, "Distribution-Aware Stream Partitioning for Distributed Stream Processing Systems.", BeyondMR@SIGMOD, pp. 6:1-6:10, 2018.
Abebe, M., K. Daudjee, B. Glasbergen, and Y. Tian, "EC-Store: Bridging the Gap between Storage and Latency in Distributed Erasure Coded Systems.", ICDCS, pp. 255–266, 2018.
Zihayat, M., A. An, L. Golab, M. Kargar, and J. Szlichta, "Effective Team Formation in Expert Networks.", AMW, 2018.
Zheng, Z., M. Alipour, Z. Qu, I. Currie, F. Chiang, L. Golab, and J. Szlichta, "FastOFD: Contextual Data Cleaning with Ontology Functional Dependencies.", EDBT, pp. 694–697, 2018.
Baruah, G., and M. Kolla, "Klick Labs at CL-SciSumm 2018.", BIRNDL@SIGIR, pp. 134–141, 2018.
Peng, P., L. Zou, M.. Özsu, and D. Zhao, "Multi-query Optimization in Federated RDF Systems.", DASFAA (1), pp. 745–765, 2018.
Tu, Z., M. Li, and J. Lin, "Pay-Per-Request Deployment of Neural Network Models Using Serverless Architectures.", NAACL-HTL (Demonstrations), pp. 6–10, 2018.
Mackenzie, J., J.. Culpepper, R. Blanco, M. Crane, C. Clarke, and J. Lin, "Query Driven Algorithm Selection in Early Stage Retrieval.", WSDM, pp. 396–404, 2018.
Memon, B., X. Lin, A. Mufti, A. Wesley, T. Brecht, K. Salem, B. Wong, and B. Cassell, "RaMP: A Lightweight RDMA Abstraction for Loosely Coupled Applications.", HotCloud, 2018.
Grewal, A., J. Jiang, G. Lam, T. Jung, L. Vuddemarri, Q. Li, A. Landge, and J. Lin, "RecService: Distributed Real-Time Graph Processing at Twitter.", HotCloud, 2018.
Ghelani, N., G. Cormack, and M. Smucker, "Refresh Strategies in Continuous Active Learning.", ProfS/KG4IR/Data:Search@SIGIR, pp. 18–23, 2018.
Yang, P., S. Thiagarajan, and J. Lin, "Robust, Scalable, Real-Time Event Time Series Aggregation at Twitter.", SIGMOD Conference, pp. 595–599, 2018.
Kane, A., and F. Tompa, "Split-Lists and Initial Thresholds for WAND-based Search.", SIGIR, pp. 877–880, 2018.
Gao, L., L. Golab, M.. Özsu, and G. Aluç, "Stream WatDiv: A Streaming RDF Benchmark.", SBD@SIGMOD, pp. 3:1-3:6, 2018.
Mohammed, S., P. Shi, and J. Lin, "Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks.", NAACL-HLT (2), pp. 291–296, 2018.
Grewal, A., and J. Lin, "The Evolution of Content Analysis for Personalized Recommendations at Twitter.", SIGIR, pp. 1355–1356, 2018.
He, Y., K. Ganjam, K. Lee, Y. Wang, V. Narasayya, S. Chaudhuri, X. Chu, and Y. Zheng, "Transform-Data-by-Example (TDE): Extensible Data Transformation in Excel.", SIGMOD Conference, pp. 1785–1788, 2018.
Lin, J., S. Mohammed, R. Sequiera, and L. Tan, "Update Delivery Mechanisms for Prospective Information Needs: An Analysis of Attention in Mobile Users.", SIGIR, pp. 785–794, 2018.
Rao, J., F. Türe, and J. Lin, "What Do Viewers Say to Their TVs?: An Analysis of Voice Queries to Entertainment Systems.", SIGIR, pp. 1213–1216, 2018.
Korkmaz, M., M. Karsten, K. Salem, and S. Salihoglu, "Workload-Aware CPU Performance Scaling for Transactional Database Systems.", SIGMOD Conference, pp. 291–306, 2018.
De Sa, C., I. Ilyas, B. Kimelfeld, C. Ré, and T. Rekatsinas, "A Formal Framework For Probabilistic Unclean Databases.", CoRR, vol. abs/1801.06750, 2018.
Ren, Y., M. Tomko, F. Salim, J. Chan, C. Clarke, and M. Sanderson, "A Location-Query-Browse Graph for Contextual Recommendation.", IEEE Trans. Knowl. Data Eng., vol. 30, no. 2, pp. 204–218, 2018.
Koutris, P., S. Salihoglu, and D. Suciu, "Algorithmic Aspects of Parallel Data Processing.", Foundations and Trends in Databases, vol. 8, no. 4, pp. 239–370, 2018.
Begoli, E., J. Camacho-Rodríguez, J. Hyde, M. Mior, and D. Lemire, "Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources.", CoRR, vol. abs/1802.10233, 2018.
Tang, G., S. Keshav, L. Golab, and K. Wu, "Bikeshare Pool Sizing for Bike-and-Ride Multimodal Transit.", IEEE Trans. Intelligent Transportation Systems, vol. 19, no. 7, pp. 2279–2289, 2018.
Stonebraker, M., and I. Ilyas, "Data Integration: The Current Status and the Way Forward.", IEEE Data Eng. Bull., vol. 41, no. 2, pp. 3–9, 2018.
Ammar, K., F. McSherry, S. Salihoglu, and M. Joglekar, "Distributed Evaluation of Subgraph Queries Using Worst-case Optimal and Low-Memory Dataflows.", PVLDB, vol. 11, no. 6, pp. 691–704, 2018.
Ammar, K., F. McSherry, S. Salihoglu, and M. Joglekar, "Distributed Evaluation of Subgraph Queries Using Worstcase Optimal LowMemory Dataflows.", CoRR, vol. abs/1802.03760, 2018.
Szlichta, J., P. Godfrey, L. Golab, M. Kargar, and D. Srivastava, "Effective and complete discovery of bidirectional order dependencies via set-based axioms.", VLDB J., vol. 27, no. 4, pp. 573–591, 2018.
Lamb, C., D. Brown, and C. Clarke, "Evaluating Computational Creativity: An Interdisciplinary Tutorial.", ACM Comput. Surv., vol. 51, no. 2, pp. 28:1-28:34, 2018.
Zhang, H., G. Cormack, M. Grossman, and M. Smucker, "Evaluating Sentence-Level Relevance Feedback for High-Recall Information Retrieval.", CoRR, vol. abs/1803.08988, 2018.
Ammar, K., and M.. Özsu, "Experimental Analysis of Distributed Graph Systems.", PVLDB, vol. 11, no. 10, pp. 1151–1164, 2018.
Ammar, K., and M.. Özsu, "Experimental Analysis of Distributed Graph Systems.", CoRR, vol. abs/1806.08082, 2018.
Gebaly, K., and J. Lin, "In-Browser Split-Execution Support for Interactive Analytics in the Cloud.", CoRR, vol. abs/1804.08822, 2018.
Rao, J., W. Yang, Y. Zhang, F. Türe, and J. Lin, "Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search.", CoRR, vol. abs/1805.08159, 2018.
Lin, J., and P. Yang, "Repeatability Corner Cases in Document Ranking: The Impact of Score Ties.", CoRR, vol. abs/1807.05798, 2018.
El-Roby, A., K. Ammar, A. Aboulnaga, and J. Lin, "Sapphire: Querying RDF Data Made Simple.", CoRR, vol. abs/1805.11728, 2018.
Lin, J., "Scale Up or Scale Out for Graph Processing?", IEEE Internet Computing, vol. 22, no. 3, pp. 72–78, 2018.
Kim, Y., and J. Lin, "Serverless Data Analytics with Flint.", CoRR, vol. abs/1803.06354, 2018.
Li, Y., L. Zou, M.. Özsu, and D. Zhao, "Time Constrained Continuous Subgraph Search over Streaming Graphs.", CoRR, vol. abs/1801.09240, 2018.
He, Y., X. Chu, K. Ganjam, Y. Zheng, V. Narasayya, and S. Chaudhuri, "Transform-Data-by-Example (TDE): An Extensible Search Engine for Data Transformations.", PVLDB, vol. 11, no. 10, pp. 1165–1177, 2018.

2017

Crane, M., J.. Culpepper, J. Lin, J. Mackenzie, and A. Trotman, "A Comparison of Document-at-a-Time and Score-at-a-Time Query Evaluation.", WSDM, pp. 201–210, 2017.
Baruah, G., R. McCreadie, and J. Lin, "A Comparison of Nuggets and Clusters for Evaluating Timeline Summaries.", CIKM, pp. 67–76, 2017.
Fernandez, R., D. Deng, E. Mansour, A. Qahtan, W. Tao, Z. Abedjan, A. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, et al., "A Demo of the Data Civilizer System.", SIGMOD Conference, pp. 1639–1642, 2017.
Karyakin, A., and K. Salem, "An analysis of memory power consumption in database systems.", DaMoN, pp. 2:1-2:9, 2017.
Crane, M., and J. Lin, "An Exploration of Serverless Architectures for Information Retrieval.", ICTIR, pp. 241–244, 2017.
He, H., K. Ganjam, N. Jain, J. Lundin, R. White, and J. Lin, "An Insight Extraction System on BioMedical Literature with Deep Neural Networks.", EMNLP, pp. 2691–2701, 2017.
Yang, P., H. Fang, and J. Lin, "Anserini: Enabling the Use of Lucene for Information Retrieval Research.", SIGIR, pp. 1253–1256, 2017.
Zihayat, M., A. An, L. Golab, M. Kargar, and J. Szlichta, "Authority-based Team Discovery in Social Networks.", EDBT, pp. 498–501, 2017.
Grossman, M., G. Cormack, and A. Roegiest, "Automatic and Semi-Automatic Document Selection for Technology-Assisted Review.", SIGIR, pp. 905–908, 2017.
Zhang, H., J. Rao, J. Lin, and M. Smucker, "Automatically Extracting High-Quality Negative Examples for Answer Selection in Question Answering.", SIGIR, pp. 797–800, 2017.
Sargent, G., A. Cassedy, H. Zhang, T. Aspiras, A. Morgan, E. Romstadt, A. Van Camp, V. Dicillo, A. D’Arcy, and V. Asari, "Brain machine interface for useful human interaction via extreme learning machine and state machine design.", SSCI, pp. 1–5, 2017.
Borgida, A., D. Toman, and G. Weddell, "Concerning Referring Expressions in Query Answers.", IJCAI, pp. 4791–4795, 2017.
Abedjan, Z., L. Golab, and F. Naumann, "Data Profiling: A Tutorial.", SIGMOD Conference, pp. 1747–1751, 2017.
Pacaci, A., A. Zhou, J. Lin, and M.. Özsu, "Do We Need Specialized Graph Databases?: Benchmarking Real-Time Social Networking Applications.", GRADES@SIGMOD/PODS, pp. 12:1-12:7, 2017.
Baskaran, S., A. Keller, F. Chiang, L. Golab, and J. Szlichta, "Efficient Discovery of Ontology Functional Dependencies.", CIKM, pp. 1847–1856, 2017.
Rijhwani, S., R. Sequiera, M. Choudhury, K. Bali, and C. Maddila, "Estimating Code-Switching on Twitter with a Novel Generalized Word-Level Language Detection Technique.", ACL (1), pp. 1971–1982, 2017.
Ghelani, N., S. Mohammed, S. Wang, and J. Lin, "Event Detection on Curated Tweet Streams.", SIGIR, pp. 1325–1328, 2017.
Rao, J., H. He, and J. Lin, "Experiments with Convolutional Neural Network Models for Answer Selection.", SIGIR, pp. 1217–1220, 2017.
Vtyurina, A., D. Savenkov, E. Agichtein, and C. Clarke, "Exploring Conversational Search With Humans, Assistants, and Wizards.", CHI Extended Abstracts, pp. 2187–2193, 2017.
Sequiera, R., and J. Lin, "Finally, a Downloadable Test Collection of Tweets.", SIGIR, pp. 1225–1228, 2017.
Toulis, A., and L. Golab, "Graph Mining to Characterize Competition for Employment.", NDA@SIGMOD, pp. 3:1-3:7, 2017.
Kankanamge, C., S. Sahu, A. Mhedbhi, J. Chen, and S. Salihoglu, "Graphflow: An Active Graph Database.", SIGMOD Conference, pp. 1695–1698, 2017.
Afrati, F., M. Joglekar, C. Ré, S. Salihoglu, and J. Ullman, "GYM: A Multiround Distributed Join Algorithm.", ICDT, pp. 4:1-4:18, 2017.
Ghenai, A., "Health Misinformation in Search and Social Media.", DH, pp. 235–236, 2017.
Ghenai, A., "Health Misinformation in Search and Social Media.", SIGIR, pp. 1371, 2017.
Fink, S., L. Golab, S. Keshav, and H. de Meer, "How Similar is the Usage of Electric Cars and Electric Bicycles?", e-Energy, pp. 334–340, 2017.
Sarrafzadeh, B., and E. Lank, "Improving Exploratory Search Experience through Hierarchical Knowledge Graphs.", SIGIR, pp. 145–154, 2017.
Gebaly, K., and J. Lin, "In-Browser Interactive SQL Analytics with Afterburner.", SIGMOD Conference, pp. 1623–1626, 2017.
Gorenflo, C., L. Golab, and S. Keshav, "Managing Sensor Data Streams: Lessons Learned from the WeBike Project.", SSDBM, pp. 1:1-1:11, 2017.
Rao, J., F. Türe, X. Niu, and J. Lin, "Mining the Temporal Statistics of Query Terms for Searching Social Media Posts.", ICTIR, pp. 133–140, 2017.
Cui, X., M. Mior, B. Wong, K. Daudjee, and S. Rizvi, "Netstore: leveraging network optimizations to improve distributed transaction processing performance.", ACTIVE@Middleware, pp. 1–10, 2017.
Roegiest, A., L. Tan, and J. Lin, "Online In-Situ Interleaved Evaluation of Real-Time Push Notification Systems.", SIGIR, pp. 415–424, 2017.
Meng, X., and L. Golab, "Optimal reducer placement to minimize data transfer in MapReduce-style processing.", BigData, pp. 339–346, 2017.
Lin, J., S. Mohammed, R. Sequiera, L. Tan, N. Ghelani, M. Abualsaud, R. McCreadie, D. Milajevs, and E. Voorhees, "Overview of the TREC 2017 Real-Time Summarization Track.", TREC, 2017.
Mohammed, S., M. Crane, and J. Lin, "Quantization in Append-Only Collections.", ICTIR, pp. 265–268, 2017.
Mate, J., K. Daudjee, and S. Kamali, "Robust Multi-Tenant Server Consolidation in the Cloud for Data Analytics Workloads", Proc. 37th IEEE International Conference on Distributed Computing Systems (ICDCS), 2017.
Mate, J., K. Daudjee, and S. Kamali, "Robust Multi-tenant Server Consolidation in the Cloud for Data Analytics Workloads.", ICDCS, pp. 2111–2118, 2017.
Feng, G., L. Golab, and D. Srivastava, "Scalable Informative Rule Mining.", ICDE, pp. 437–448, 2017.
Kane, A., and F. Tompa, "Small-Term Distribution for Disk-Based Search.", DocEng, pp. 49–58, 2017.
Toulis, A., and L. Golab, "Social Media Mining to Understand Public Mental Health.", DMAH@VLDB, pp. 55–70, 2017.
Rao, J., F. Türe, H. He, O. Jojic, and J. Lin, "Talking to Your TV: Context-Aware Voice Search with Hierarchical Recurrent Neural Networks.", CIKM, pp. 557–566, 2017.
Dong, J., X. Meng, M. Chen, Z. Wang, and L. Tang, "Template Protection Based on Chaotic Map for Face Recognition.", ICPCSEE (1), pp. 242–250, 2017.
Dong, J., X. Meng, M. Chen, and Z. Wang, "Template protection based on DNA coding for multimodal biometric recognition.", ICSAI, pp. 1738–1742, 2017.
Clarke, C., G. Cormack, J. Lin, and A. Roegiest, "Ten Blue Links on Mars.", WWW, pp. 273–281, 2017.
Deng, D., R. Fernandez, Z. Abedjan, S. Wang, M. Stonebraker, A. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, and N. Tang, "The Data Civilizer System.", CIDR, 2017.
Azzopardi, L., M. Crane, H. Fang, G. Ingersoll, J. Lin, Y. Moshfeghi, H. Scells, P. Yang, and G. Zuccon, "The Lucene for Information Access and Retrieval Research (LIARR) Workshop at SIGIR 2017.", SIGIR, pp. 1429–1430, 2017.
Pogacar, F., A. Ghenai, M. Smucker, and C. Clarke, "The Positive and Negative Influence of Search Results on People’s Decisions about the Efficacy of Medical Treatments.", ICTIR, pp. 209–216, 2017.
El-Roby, A., and A. Aboulnaga, "UFeed: Refining Web Data Integration Based on User Feedback.", CIKM, pp. 187–196, 2017.
Zhang, H., M. Abualsaud, N. Ghelani, A. Ghosh, M. Smucker, G. Cormack, and M. Grossman, "UWaterlooMDS at the TREC 2017 Common Core Track.", TREC, 2017.
Zou, L., and T. M. Özsu, "Graph-based RDF Data Management", Data Science and Engineering, 2017.
Liu, X., L. Golab, W. Golab, I. F. Ilyas, and S. Jin, "Smart Meter Data Analytics: Systems, Algorithms, and Benchmarking", ACM Transactions on Database Systems, vol. 42, issue 1, 2017.
Tang, R., W. Wang, Z. Tu, and J. Lin, "An Experimental Analysis of the Power Consumption of Convolutional Neural Networks for Keyword Spotting.", CoRR, vol. abs/1711.00333, 2017.
Tu, Z., M. Crane, R. Sequiera, J. Zhang, and J. Lin, "An Exploration of Approaches to Integrating Neural Reranking Models in Multi-Stage Ranking Architectures.", CoRR, vol. abs/1707.08275, 2017.
Abdelaziz, I., R. Harbi, S. Salihoglu, and P. Kalnis, "Combining Vertex-Centric Graph Processing with SPARQL for Large-Scale RDF Data Analytics.", IEEE Trans. Parallel Distrib. Syst., vol. 28, no. 12, pp. 3374–3388, 2017.
Shen, C., T. Shen, and J. Lin, "Comparative Assessment of Alignment Algorithms for NGS Data: Features, Considerations, Implementations, and Future.", Algorithms for Next-Generation Sequencing Data, pp. 187–202, 2017.
Sadiq, S., T. Dasu, X. Dong, J. Freire, I. Ilyas, S. Link, M. Miller, F. Naumann, X. Zhou, and D. Srivastava, "Data Quality: The Role of Empiricism.", SIGMOD Record, vol. 46, no. 4, pp. 35–43, 2017.
Kassaie, B., "De-identification In practice.", CoRR, vol. abs/1701.03129, 2017.
Tang, R., and J. Lin, "Deep Residual Learning for Small-Footprint Keyword Spotting.", CoRR, vol. abs/1710.10361, 2017.
Mohammed, S., N. Ghelani, and J. Lin, "Distant Supervision for Topic Classification of Tweets in Curated Streams.", CoRR, vol. abs/1704.06726, 2017.
Szlichta, J., P. Godfrey, L. Golab, M. Kargar, and D. Srivastava, "Effective and Complete Discovery of Order Dependencies via Set-based Axiomatization.", PVLDB, vol. 10, no. 7, pp. 721–732, 2017.
Mackenzie, J., J.. Culpepper, R. Blanco, M. Crane, C. Clarke, and J. Lin, "Efficient and Effective Tail Latency Minimization in Multi-Stage Retrieval Systems.", CoRR, vol. abs/1704.03970, 2017.
Deng, D., W. Tao, Z. Abedjan, A. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, M. Stonebraker, and N. Tang, "Entity Consolidation: The Golden Record Problem.", CoRR, vol. abs/1709.10436, 2017.
Sequiera, R., G. Baruah, Z. Tu, S. Mohammed, J. Rao, H. Zhang, and J. Lin, "Exploring the Effectiveness of Convolutional Neural Networks for Answer Selection in End-to-End Question Answering.", CoRR, vol. abs/1707.07804, 2017.
Yan, D., H. Chen, J. Cheng, M.. Özsu, Q. Zhang, and J. Lui, "G-thinker: Big Graph Mining Made Easier and Faster.", CoRR, vol. abs/1709.03110, 2017.
Zou, L., and M.. Özsu, "Graph-Based RDF Data Management.", Data Science and Engineering, vol. 2, no. 1, pp. 56–70, 2017.
Rekatsinas, T., X. Chu, I. Ilyas, and C. Ré, "HoloClean: Holistic Data Repairs with Probabilistic Inference.", PVLDB, vol. 10, no. 11, pp. 1190–1201, 2017.
Rekatsinas, T., X. Chu, I. Ilyas, and C. Ré, "HoloClean: Holistic Data Repairs with Probabilistic Inference.", CoRR, vol. abs/1702.00820, 2017.
Vadehra, A., M. Grossman, and G. Cormack, "Impact of Feature Selection on Micro-Text Classification.", CoRR, vol. abs/1708.08123, 2017.
Lin, J., "In Defense of MapReduce.", IEEE Internet Computing, vol. 21, no. 3, pp. 94–98, 2017.
Rao, J., H. He, H. Zhang, F. Türe, R. Sequiera, S. Mohammed, and J. Lin, "Integrating Lexical and Temporal Signals in Neural Ranking Models for Searching Social Media Streams.", CoRR, vol. abs/1707.07792, 2017.
Konow, R., G. Navarro, C. Clarke, and A. López-Ortiz, "Inverted Treaps.", ACM Trans. Inf. Syst., vol. 35, no. 3, pp. 22:1-22:45, 2017.
Kassaie, B., "Job Detection in Twitter.", CoRR, vol. abs/1701.03092, 2017.
Ünel, G., and D. Toman, "Logic programming approach to automata-based decision procedures.", J. Log. Algebr. Meth. Program., vol. 86, no. 1, pp. 391–407, 2017.
He, Y., X. Chu, J. Peng, J. Gao, and Y. Wang, "Motif-based Rule Discovery for Predicting Real-valued Time Series.", CoRR, vol. abs/1709.04763, 2017.
Mior, M., K. Salem, A. Aboulnaga, and R. Liu, "NoSE: Schema Design for NoSQL Applications.", IEEE Trans. Knowl. Data Eng., vol. 29, no. 10, pp. 2275–2289, 2017.
Allan, J., N. Belkin, P. Bennett, J. Callan, C. Clarke, F. Diaz, S. Dumais, N. Ferro, D. Harman, D. Hiemstra, et al., "Overview of Special Issue.", SIGIR Forum, vol. 51, no. 2, pp. 1–25, 2017.
Ge, C., I. Ilyas, X. He, and A. Machanavajjhala, "Private Exploration Primitives for Data Cleaning.", CoRR, vol. abs/1712.10266, 2017.
Zhang, H., R. Ayoub, and S. Sundaram, "Sensor selection for Kalman filtering of linear dynamical systems: Complexity, limitations and greedy algorithms.", Automatica, vol. 78, pp. 202–210, 2017.
Liu, X., L. Golab, W. Golab, I. Ilyas, and S. Jin, "Smart Meter Data Analytics: Systems, Algorithms, and Benchmarking.", ACM Trans. Database Syst., vol. 42, no. 1, pp. 2:1-2:39, 2017.
Kassaie, B., "SPARQL over GraphX.", CoRR, vol. abs/1701.03091, 2017.
Mohammed, S., P. Shi, and J. Lin, "Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks.", CoRR, vol. abs/1712.01969, 2017.
Rao, J., F. Türe, H. He, O. Jojic, and J. Lin, "Talking to Your TV: Context-Aware Voice Search with Hierarchical Recurrent Neural Networks.", CoRR, vol. abs/1705.04892, 2017.
Lin, J., "The Lambda and the Kappa.", IEEE Internet Computing, vol. 21, no. 5, pp. 60–66, 2017.
Lin, J., and A. Trotman, "The role of index compression in score-at-a-time query evaluation.", Inf. Retr. Journal, vol. 20, no. 3, pp. 199–220, 2017.
Sahu, S., A. Mhedhbi, S. Salihoglu, J. Lin, and M.. Özsu, "The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing.", PVLDB, vol. 11, no. 4, pp. 420–431, 2017.
Sahu, S., A. Mhedhbi, S. Salihoglu, J. Lin, and M.. Özsu, "The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing: A User Survey.", CoRR, vol. abs/1709.03188, 2017.
Yang, Y., L. Golab, and M.. Özsu, "ViewDF: Declarative incremental view maintenance for streaming data.", Inf. Syst., vol. 71, pp. 55–67, 2017.
Lin, J., I. Milligan, J. Wiebe, and A. Zhou, "Warcbase: Scalable Analytics Infrastructure for Exploring Web Archives.", JOCCH, vol. 10, no. 4, pp. 22:1-22:30, 2017.

2016

Jacques, J. St., D. Toman, and G. Weddell, "", IJCAI, pp. 1258–1264, 2016.
Agrawal, S., and K. Daudjee, "A Performance Comparison of Algorithms for Byzantine Agreement in Distributed Systems.", EDCC, pp. 249–260, 2016.
Roegiest, A., L. Tan, J. Lin, and C. Clarke, "A Platform for Streaming Push Notifications to Mobile Assessors.", SIGIR, pp. 1077–1080, 2016.
Wu, G., and F. Tompa, "A Space-Efficient Data Structure for Fast Access Control in ECM Systems.", SACMAT, pp. 191–201, 2016.
El-Roby, A., and A. Aboulnaga, "ALEX: Automatic Link Exploration in Linked Data.", ICDE, pp. 1322–1325, 2016.
Roegiest, A., and G. Cormack, "An Architecture for Privacy-Preserving and Replicable High-Recall Retrieval Experiments.", SIGIR, pp. 1085–1088, 2016.
Hashemi, S., C. Clarke, A. Dean-Hall, J. Kamps, and J. Kiseleva, "An Easter Egg Hunting Approach to Test Collection Building in Dynamic Domains.", EVIA@NTCIR, 2016.
Tan, L., A. Roegiest, J. Lin, and C. Clarke, "An Exploration of Evaluation Metrics for Mobile Push Notifications.", SIGIR, pp. 741–744, 2016.
Al-Harbi, A., and M. Smucker, "Are Secondary Assessors Uncertain When They Disagree About Relevance Judgements?", CHIIR, pp. 233–236, 2016.
Farid, M., A. Roatis, I. Ilyas, H-F. Hoffmann, and X. Chu, "CLAMS: Bringing Quality to Data Lakes.", SIGMOD Conference, pp. 2089–2092, 2016.
Rao, J., X. Niu, and J. Lin, "Compressing and Decoding Term Statistics Time Series.", ECIR, pp. 675–681, 2016.
Milligan, I., N. Ruest, and J. Lin, "Content Selection and Curation for Web Archiving: The Gatekeepers vs. the Masses.", JCDL, pp. 107–110, 2016.
Cafarella, M., I. Ilyas, M. Kornacker, T. Kraska, and C. Ré, "Dark Data: Are we solving the right problems?", ICDE, pp. 1444–1445, 2016.
Chu, X., I. Ilyas, S. Krishnan, and J. Wang, "Data Cleaning: Overview and Emerging Challenges.", SIGMOD Conference, pp. 2201–2206, 2016.
Abedjan, Z., L. Golab, and F. Naumann, "Data profiling.", ICDE, pp. 1432–1435, 2016.
Abedjan, Z., J. Morcos, I. Ilyas, M. Ouzzani, P. Papotti, and M. Stonebraker, "DataXFormer: A robust transformation discovery system.", ICDE, pp. 1134–1145, 2016.
Jackson, A., J. Lin, I. Milligan, and N. Ruest, "Desiderata for Exploratory Search Interfaces to Web Archives in Support of Scholarly Activities.", JCDL, pp. 103–106, 2016.
Buntain, C., J. Lin, and J. Golbeck, "Discovering key moments in social media streams.", CCNC, pp. 366–374, 2016.
Culpepper, J.., C. Clarke, and J. Lin, "Dynamic Cutoff Prediction in Multi-Stage Retrieval Systems.", ADCS, pp. 17–24, 2016.
Kargar, M., L. Golab, and J. Szlichta, "eGraphSearch: Effective Keyword Search in Graphs.", CIKM, pp. 2461–2464, 2016.
Cormack, G., and M. Grossman, "Engineering Quality and Reliability in Technology-Assisted Review.", SIGIR, pp. 75–84, 2016.
Bommannavar, P., J. Lin, and A. Rajaraman, "Estimating topical volume in social media streams.", SAC, pp. 1096–1101, 2016.
Lamb, C., D. Brown, and C. Clarke, "Evaluating digital poetry: Insights from the CAT.", ICCC, pp. 60–67, 2016.
Oard, D., K. Shilton, and J. Lin, "Evaluating Search Among Secrets.", EVIA@NTCIR, 2016.
Milligan, I., J. Lin, J. Wiebe, and A. Zhou, "Exploring and Discovering Archive-It Collections with Warcbase.", DH, pp. 285–288, 2016.
Roegiest, A., and G. Cormack, "Impact of Review-Set Selection on Human Assessment for Text Classification.", SIGIR, pp. 861–864, 2016.
Trotman, A., and J. Lin, "In Vacuo and In Situ Evaluation of SIMD Codecs.", ADCS, pp. 1–8, 2016.
Sarrafzadeh, B., A. Vtyurina, E. Lank, and O. Vechtomova, "Knowledge Graphs versus Hierarchies: An Analysis of User Behaviours and Perspectives in Information Seeking.", CHIIR, pp. 91–100, 2016.
Farid, M., I. Ilyas, S. Whang, and C. Yu, "LONLIES: Estimating Property Values for Long Tail Entities.", SIGIR, pp. 1125–1128, 2016.
Smucker, M., and C. Clarke, "Modeling Optimal Switching Behavior.", CHIIR, pp. 317–320, 2016.
Zanibbi, R., K. Davila, A. Kane, and F. Tompa, "Multi-Stage Math Formula Search: Using Appearance-Based Similarity Metrics at Scale.", SIGIR, pp. 145–154, 2016.
Rao, J., H. He, and J. Lin, "Noise-Contrastive Estimation for Answer Selection with Deep Neural Networks.", CIKM, pp. 1913–1916, 2016.
Mior, M., K. Salem, A. Aboulnaga, and R. Liu, "NoSE: Schema design for NoSQL applications.", ICDE, pp. 181–192, 2016.
Jacques, J. St., D. Toman, and G. Weddell, "Object-Relational Queries over CFDI_nc Knowledge Bases: OBDA for the SQL-Literate (extended abstract).", Description Logics, 2016.
Jiang, Y., and L. Golab, "On Competition for Undergraduate Co-op Placements: A Graph Mining Approach.", EDM, pp. 394–399, 2016.
Toman, D., and G. Weddell, "On Partial Features in the D L F Family of Description Logics", 2016 Pacific Rim International Conference on Artificial Intelligence, pp. 529–542, 2016.
Borgida, A., D. Toman, and G. Weddell, "On Referring Expressions in Information Systems Derived from Conceptual Modelling.", ER, pp. 183–197, 2016.
Borgida, A., D. Toman, and G. Weddell, "On Referring Expressions in Query Answering over First Order Knowledge Bases", Proc. 15th International Conference on Principles of Knowledge Representation and Reasoning, 2016.
Borgida, A., D. Toman, and G. Weddell, "On Referring Expressions in Query Answering over First Order Knowledge Bases", 15th International Conference on Principles of Knowledge Representation and Reasoning, 2016.
Borgida, A., D. Toman, and G. Weddell, "On Referring Expressions in Query Answering over First Order Knowledge Bases.", KR, pp. 319–328, 2016.
Toman, D., and G. Weddell, "Ontology Based Data Access with Referring Expressions for Logics with the Tree Model Property - (Extended Abstract).", Australasian Conference on Artificial Intelligence, pp. 353–361, 2016.
Baruah, G., H. Zhang, R. Guttikonda, J. Lin, M. Smucker, and O. Vechtomova, "Optimizing Nugget Annotations with Active Learning.", CIKM, pp. 2359–2364, 2016.
Bonenfant, M., B. Desai, D. Desai, B. Fung, M.. Özsu, and J. Ullman, "Panel: The State of Data: Invited Paper from panelists.", IDEAS, pp. 2–11, 2016.
Yilmaz, E., and C. Clarke, "Preface.", EVIA@NTCIR, 2016.
Yang, G., I. Soboroff, L. Xiong, C. Clarke, and S. Garfinkel, "Privacy-Preserving IR 2016: Differential Privacy, Search, and Social Media.", SIGIR, pp. 1247–1248, 2016.
Lin, J., Z. Tu, M. Rose, and P. White, "Prizm: A Wireless Access Point for Proxy-Based Web Lifelogging.", LTA@MM, pp. 19–25, 2016.
Han, M., and K. Daudjee, "Providing Serializability for Pregel-like Graph Processing Systems.", EDBT, pp. 77–88, 2016.
Gebhard, L., L. Golab, S. Keshav, and H. de Meer, "Range prediction for electric bicycles.", e-Energy, pp. 21:1-21:11, 2016.
Elbagoury, A., M. Crane, and J. Lin, "Rank-at-a-Time Query Processing.", ICTIR, pp. 229–232, 2016.
Paik, J., and J. Lin, "Retrievability in API-Based “Evaluation as a Service”.", ICTIR, pp. 91–94, 2016.
Zhang, H., J. Lin, G. Cormack, and M. Smucker, "Sampling Strategies and Active Learning for Volume Estimation.", SIGIR, pp. 981–984, 2016.
Zhang, H., A. Chakrabarty, R. Ayoub, G. Buzzard, and S. Sundaram, "Sampling-based explicit nonlinear model predictive control for output tracking.", CDC, pp. 4722–4727, 2016.
Cormack, G., and M. Grossman, "Scalability of Continuous Active Learning for Reliable High-Recall Text Classification.", CIKM, pp. 1039–1048, 2016.
Murdock, V., C. Clarke, J. Kamps, and J. Karlgren, "Second Workshop on Search and Exploration of X-Rated Information (SEXI’16): WSDM Workshop Summary.", WSDM, pp. 697–698, 2016.
Moschitti, A., L. Màrquez, P. Nakov, E. Agichtein, C. Clarke, and I. Szpektor, "SIGIR 2016 Workshop WebQA II: Web Question Answering Beyond Factoids.", SIGIR, pp. 1251–1252, 2016.
Tan, L., A. Roegiest, C. Clarke, and J. Lin, "Simple Dynamic Emission Strategies for Microblog Filtering.", SIGIR, pp. 1009–1012, 2016.
Davila, K., R. Zanibbi, A. Kane, and F. Tompa, "Tangent-3 at the NTCIR-12 MathIR Task.", NTCIR, 2016.
Ammar, K., "Techniques and Systems for Large Dynamic Graphs.", SIGMOD PhD Symposium, pp. 7–11, 2016.
Rao, J., and J. Lin, "Temporal Query Expansion Using a Continuous Hidden Markov Model.", ICTIR, pp. 295–298, 2016.
Clarke, C., G. Cormack, J. Lin, and A. Roegiest, "Total Recall: Blue Sky on Mars.", ICTIR, pp. 45–48, 2016.
Lin, J., M. Crane, A. Trotman, J. Callan, I. Chattopadhyaya, J. Foley, G. Ingersoll, C. MacDonald, and S. Vigna, "Toward Reproducible Baselines: The Open-Source IR Reproducibility Challenge.", ECIR, pp. 408–420, 2016.
Grossman, M., G. Cormack, and A. Roegiest, "TREC 2016 Total Recall Track Overview.", TREC, 2016.
He, H., J. Wieting, K. Gimpel, J. Rao, and J. Lin, "UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement.", SemEval@NAACL-HLT, pp. 1103–1108, 2016.
El-Roby, A., "Utilizing user feedback to improve data integration systems.", ICDE Workshops, pp. 206–210, 2016.
Radhakrishnan, S., B. Muscedere, and K. Daudjee, "V-Hadoop: Virtualized Hadoop using containers.", NCA, pp. 237–241, 2016.
Hartig, O., and M.. Özsu, "Walking Without a Map: Ranking-Based Traversal for Querying Linked Data.", International Semantic Web Conference (1), pp. 305–324, 2016.
Yan, D., J. Cheng, T. M. Özsu, F. Yang, Y. Lu, J. C. S. Liu, Q. Zhang, and W. Ng, "A General-Purpose Query-Centric Framework for Querying Big Graphs", Proc. VLDB Endowment, vol. 9, issue 7, pp. 564 – 575, 2016.
Wu, G. Zhiping, and F. Tompa, "A Space-Efficient Data Structure for Fast Access Control in ECM Systems", Proc. 21st ACM Symposium on Access Control Models and Technologies, pp. 191-201, 2016.
Tan, L., A. Roegiest, J. Lin, and C. L. A. Clarke, "An Exploration of Evaluation Metrics for Mobile Push Notifications", Proc. 39th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 741-744, 2016.
Buntain, C., and J. Lin, "Burst Detection in Social Media Streams for Tracking Interest Profiles in Real Time", Proc. 39th International ACM SIGIR Conference on Research & Development in Information Retrieval, 2016.
Chu, X., I. F. Ilyas, S. Krishnan, and J. Wang, "Data Cleaning: Overview and Emerging Challenges", Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 2201-2206, 2016.
Abedjan, Z., X. Chu, D. Deng, R. Castro Fernandez, I. F. Ilyas, M. Ouzzani, P. Papotti, M. Stonebraker, and N. Tang, "Detecting Data Errors: Where are we and what needs to be done?", Proc. VLDB Endowment, vol. 9, issue 12, 2016.
Abedjan, Z., X. Chu, D. Deng, R. Castro Fernandez, I. F. Ilyas, M. Ouzzani, P. Papotti, M. Stonebraker, and N. Tang, "Detecting Data Errors: Where are we and what needs to be done?", VLDB Endowment, vol. 9, issue 12, pp. 1004, 2016.
Chu, X., I. F. Ilyas, and P. Koutris, "Distributed Data Deduplication", Proc. VLDB Endowment, vol. 9, issue 11, pp. 864-875, 2016.
Cormack, G. V., and M. R. Grossman, "Engineering Quality and Reliability in Technology-Assisted Review", Proc. 39th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 75-84, 2016.
Roegiest, A., and G. V. Cormack, "Impact of Review-Set Selection on Human Assessment for Text Classification", Proc. 39th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 861-864, 2016.
"Interleaved Evaluation for Retrospective Summarization and Prospective Notification on Document Streams", Proc. 39th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 175-184, 2016.
Zanibbi, R., K. Davila, A. Kane, and F. Tompa, "Multi-Stage Math Formula Search: Using Appearance-Based Similarity Metrics at Scale", Proc. 39th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 145-154, 2016.
Mior, M. J., K. Salem, A. Aboulnaga, and R. Liu, "NoSE: Schema Design for NoSQL Applications", Proc. International Conference on Data Engineering, pp. 181-192, 2016.
Jacques, J. St., D. Toman, and G. Weddell, "Object-Relational Queries over CFDI_nc Knowledge Bases: OBDA for the SQL-Literate", Proc. 25th International Joint Conference on Artificial Intelligence, 2016.
Drzadzewski, G., and F. Tompa, "Partial Materialization for Online Analytical Processing over Multi-Tagged Document Collections", Knowledge and Information Systems, vol. 47, issue 3, pp. 697-732, 2016.
Peng, P., L. Zou, T. M. Özsu, L. Chen, and D. Zhao, "Processing SPARQL Queries Over Distributed RDF Graphs", VLDB Journal, vol. 25, issue 2, pp. 243–268, 2016.
Han, M., and K. Daudjee, "Providing Serializability for Pregel-like Graph Processing Systems", Proc. International Conference on Extending Database Technology, pp. 77-88, 2016.
Chu, X., and I. F. Ilyas, "Qualitative Data Cleaning", Proc. VLDB Endowment, vol. 9, issue 13, pp. 1605-1608, 2016.
Zhang, H., J. Lin, G. V. Cormack, and M. Smucker, "Sampling Strategies and Active Learning for Volume Estimation", Proc. 39th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 981-984, 2016.
Tan, L., A. Roegiest, C. L. A. Clarke, and J. Lin, "Simple Dynamic Emission Strategies for Microblog Filtering", Proc. 39th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 1009-1012, 2016.
Liu, X., L. Golab, W. Golab, I. F. Ilyas, and S. Jin, "Smart Meter Analytics: Systems, Algorithms and Benchmarking", TODS, vol. 42, issue 1, pp. 1-39, 2016.
Davila, K., R. Zanibbi, A. Kane, and F. Tompa, "Tangent-3 at the NTCIR-12 MathIR Task", Proc. 12th NTCIR Conference on Evaluation of Information Access Technologies, pp. 338-345, 2016.
Ehsan, N., F. Tompa, and A. Shakery, "Using a Dictionary and n-gram Alignment to Improve Fine-grained Cross-Language Plagiarism Detection", Proc. ACM Symposium on Document Engineering (DocEng), pp. 59-68, 2016.
El-Roby, A., "Utilizing User Feedback to Improve Data Integration Systems", Proc. 32nd IEEE International Conference on Data Engineering Workshops, pp. 206-210, 2016.
Hartig, O., and T. M. Özsu, "Walking without a Map: Ranking-Based Traversal for Querying Linked Data", Proc. 15th International Semantic Web Conference, pp. 305-324, 2016.
Yan, D., J. Cheng, M.. Özsu, F. Yang, Y. Lu, J. Lui, Q. Zhang, and W. Ng, "A General-Purpose Query-Centric Framework for Querying Big Graphs.", PVLDB, vol. 9, no. 7, pp. 564–575, 2016.
Özsu, M.., "A survey of RDF data management systems.", Frontiers Comput. Sci., vol. 10, no. 3, pp. 418–432, 2016.
Özsu, M.., "A Survey of RDF Data Management Systems.", CoRR, vol. abs/1601.00707, 2016.
Gebaly, K., and J. Lin, "Afterburner: The Case for In-Browser Analytics.", CoRR, vol. abs/1605.04035, 2016.
Clarke, C., J.. Culpepper, and A. Moffat, "Assessing efficiency-effectiveness tradeoffs in multi-stage retrieval systems without using relevance judgments.", Inf. Retr. Journal, vol. 19, no. 4, pp. 351–377, 2016.
Zihayat, M., A. An, L. Golab, M. Kargar, and J. Szlichta, "Authority-based Team Discovery in Social Networks.", CoRR, vol. abs/1611.02992, 2016.
Jiang, Y., S. Syed, and L. Golab, "Data Mining of Undergraduate Course Evaluations.", Informatics in Education, vol. 15, no. 1, pp. 85–102, 2016.
Bär, A., P. Casas, A. D’Alconzo, P. Fiadino, L. Golab, M. Mellia, and E. Schikuta, "DBStream: A holistic approach to large-scale network traffic monitoring and analysis.", Computer Networks, vol. 107, pp. 5–19, 2016.
Abedjan, Z., X. Chu, D. Deng, R. Fernandez, I. Ilyas, M. Ouzzani, P. Papotti, M. Stonebraker, and N. Tang, "Detecting Data Errors: Where are we and what needs to be done?", PVLDB, vol. 9, no. 12, pp. 993–1004, 2016.
Chu, X., I. Ilyas, and P. Koutris, "Distributed Data Deduplication.", PVLDB, vol. 9, no. 11, pp. 864–875, 2016.
Culpepper, J.., C. Clarke, and J. Lin, "Dynamic Trade-Off Prediction in Multi-Stage Retrieval Systems.", CoRR, vol. abs/1610.02502, 2016.
Bizer, C., L. Dong, I. Ilyas, and M-E. Vidal, "Editorial: Special Issue on Web Data Quality.", J. Data and Information Quality, vol. 8, no. 1, pp. 1:1-1:3, 2016.
Szlichta, J., P. Godfrey, L. Golab, M. Kargar, and D. Srivastava, "Effective and Complete Discovery of Order Dependencies via Set-based Axiomatization.", CoRR, vol. abs/1608.06169, 2016.
Ilyas, I., "Effective Data Cleaning with Continuous Evaluation.", IEEE Data Eng. Bull., vol. 39, no. 2, pp. 38–46, 2016.
Clarke, C., and E. Yilmaz, "EVIA 2016: The Seventh International Workshop on Evaluating Information Access.", SIGIR Forum, vol. 50, no. 2, pp. 44–46, 2016.
Ghenai, A., and M. Ghanem, "Exploring Trust-Aware Neighbourhood in Trust-based Recommendation.", CoRR, vol. abs/1608.05380, 2016.
Boncz, P., and K. Salem, "Front Matter.", PVLDB, vol. 10, no. 1, pp. i–vi, 2016.
Sharma, A., J. Jiang, P. Bommannavar, B. Larson, and J. Lin, "GraphJet: Real-Time Content Recommendations at Twitter.", PVLDB, vol. 9, no. 13, pp. 1281–1292, 2016.
Crane, M., "Improved Indexing & Searching Throughput.", SIGIR Forum, vol. 50, no. 1, pp. 87, 2016.
Khabsa, M., A. Elmagarmid, I. Ilyas, H. Hammady, and M. Ouzzani, "Learning to identify relevant studies for systematic reviews using random forest and external information.", Machine Learning, vol. 102, no. 3, pp. 465–482, 2016.
Quamar, A., A. Deshpande, and J. Lin, "NScale: neighborhood-centric large-scale graph analytics in the cloud.", VLDB J., vol. 25, no. 2, pp. 125–150, 2016.
Drzadzewski, G., and F. Tompa, "Partial materialization for online analytical processing over multi-tagged document collections.", Knowl. Inf. Syst., vol. 47, no. 3, pp. 697–732, 2016.
Peng, P., L. Zou, M.. Özsu, L. Chen, and D. Zhao, "Processing SPARQL queries over distributed RDF graphs.", VLDB J., vol. 25, no. 2, pp. 243–268, 2016.
Chu, X., and I. Ilyas, "Qualitative Data Cleaning.", PVLDB, vol. 9, no. 13, pp. 1605–1608, 2016.
Yan, D., J. Cheng, M.. Özsu, F. Yang, Y. Lu, J. Lui, Q. Zhang, and W. Ng, "Quegel: A General-Purpose Query-Centric Framework for Querying Big Graphs.", CoRR, vol. abs/1601.06497, 2016.
El-Roby, A., K. Ammar, A. Aboulnaga, and J. Lin, "Sapphire: Querying RDF Data Made Simple.", PVLDB, vol. 9, no. 13, pp. 1481–1484, 2016.
Lin, J., C. Clarke, and G. Baruah, "Searching from Mars.", IEEE Internet Computing, vol. 20, no. 1, pp. 78–82, 2016.
Clarke, C., G. Cormack, J. Lin, and A. Roegiest, "Ten Blue Links on Mars.", CoRR, vol. abs/1610.06468, 2016.
Tan, L., J. Lin, A. Roegiest, and C. Clarke, "The Effects of Latency Penalties in Evaluating Push Notification Systems.", CoRR, vol. abs/1606.03066, 2016.
Lin, J., and K. Gebaly, "The Future of Big Data Is ... JavaScript?", IEEE Internet Computing, vol. 20, no. 5, pp. 82–88, 2016.

2015

Fillottrani, P., C.. Keet, and D. Toman, "", Description Logics, 2015.
Toman, D., and G. Weddell, "", Australasian Conference on Artificial Intelligence, pp. 559–571, 2015.
Toman, D., and G. Weddell, "", Description Logics, 2015.
Shen, X., L. Zou, M.. Özsu, L. Chen, Y. Li, S. Han, and D. Zhao, "A graph-based RDF triple store.", ICDE, pp. 1508–1511, 2015.
Sabri, M., "A Hybrid Framework for Online Execution of Linked Data Queries.", WWW (Companion Volume), pp. 515–519, 2015.
Wu, J., T. Kinash, D. Toman, and G. Weddell, "Absorption for ABoxes and TBoxes with General Value Restrictions.", Australasian Conference on Artificial Intelligence, pp. 609–622, 2015.
Hamdaqa, M., M. Sabri, A. Singh, and L. Tahvildari, "Adoop: MapReduce for ad-hoc cloud computing.", CASCON, pp. 26–34, 2015.
El-Roby, A., and A. Aboulnaga, "ALEX: Automatic Link Exploration in Linked Data.", SIGMOD Conference, pp. 1839–1853, 2015.
Lin, J., and A. Trotman, "Anytime Ranking for Impact-Ordered Indexes.", ICTIR, pp. 301–304, 2015.
Wang, Y., G. Sherman, J. Lin, and M. Efron, "Assessor Differences and User Preferences in Tweet Timeline Generation.", SIGIR, pp. 615–624, 2015.
Liu, X., L. Golab, W. Golab, and I. Ilyas, "Benchmarking Smart Meter Data Analytics.", EDBT, pp. 385–396, 2015.
Khayyat, Z., I. Ilyas, A. Jindal, S. Madden, M. Ouzzani, P. Papotti, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "BigDansing: A System for Big Data Cleansing.", SIGMOD Conference, pp. 1215–1230, 2015.
Lin, J., "Building a Self-Contained Search Engine in the Browser.", ICTIR, pp. 309–312, 2015.
Ammar, K., A. Elsayed, M. Sabri, and M. Terry, "BusMate: Understanding Mobility Behavior for Trajectory-Based Advertising.", MDM (2), pp. 74–79, 2015.
Bär, A., L. Golab, S. Ruehrup, M. Schiavone, and P. Casas, "Cache-oblivious scheduling of shared workloads.", ICDE, pp. 855–866, 2015.
Kiseleva, J., J. Kamps, and C. Clarke, "Contextual Search and Exploration.", RuSSIR, pp. 3–23, 2015.
Ammar, K., and M. Nascimento, "Continuous Median Queries in Wireless Sensor Networks.", MDM (1), pp. 203–212, 2015.
Kim, J., K. Salem, K. Daudjee, A. Aboulnaga, and X. Pan, "Database high availability using SHADOW systems.", SoCC, pp. 209–221, 2015.
Morcos, J., Z. Abedjan, I. Ilyas, M. Ouzzani, P. Papotti, and M. Stonebraker, "DataXFormer: An Interactive Data Transformation Tool.", SIGMOD Conference, pp. 883–888, 2015.
Abedjan, Z., J. Morcos, M. Gubanov, I. F. Ilyas, M. Stonebraker, P. Papotti, and M. Ouzzani, "DataXFormer: Leveraging the Web for Semantic Transformations", Proc of The Biennial Conference on Innovative Data Systems Research, 2015.
Abedjan, Z., J. Morcos, M. Gubanov, I. Ilyas, M. Stonebraker, P. Papotti, and M. Ouzzani, "Dataxformer: Leveraging the Web for Semantic Transformations.", CIDR, 2015.
Feng, G., X. Meng, and K. Ammar, "DISTINGER: A distributed graph data structure for massive dynamic graph processing.", Big Data, pp. 1814–1822, 2015.
Saxena, H., and K. Salem, "EdgeX: Edge Replication for Web Applications.", CLOUD, pp. 1041–1044, 2015.
Drzadzewski, G., and F. Tompa, "Enhancing Exploration with a Faceted Browser through Summarization.", DocEng, pp. 61–64, 2015.
Baruah, G., M. Smucker, and C. Clarke, "Evaluating Streams of Evolving News Events.", SIGIR, pp. 675–684, 2015.
Aluç, G., M.. Özsu, K. Daudjee, and O. Hartig, "Executing queries over schemaless RDF databases.", ICDE, pp. 807–818, 2015.
Bislimovska, B., G. Aluç, M.. Özsu, and P. Fraternali, "Graph Search of Software Models Using Multidimensional Scaling.", EDBT/ICDT Workshops, pp. 163–170, 2015.
Petroni, F., L. Querzoni, K. Daudjee, S. Kamali, and G. Iacoboni, "HDRF: Stream-Based Partitioning for Power-Law Graphs.", CIKM, pp. 243–252, 2015.
Nicoara, D., S. Kamali, K. Daudjee, and L. Chen, "Hermes: Dynamic Partitioning for Distributed Social Network Graph Databases.", EDBT, pp. 25–36, 2015.
Lamb, C., D. Brown, and C. Clarke, "Human Competence in Creativity Evaluation.", ICCC, pp. 102–109, 2015.
Weissman, S., S. Ayhan, J. Bradley, and J. Lin, "Identifying Duplicate and Contradictory Information in Wikipedia.", JCDL, pp. 57–60, 2015.
Roegiest, A., G. Cormack, C. Clarke, and M. Grossman, "Impact of Surrogate Assessments on High-Recall Retrieval.", SIGIR, pp. 555–564, 2015.
Meng, X., M. Chen, and Z. Wang, "Improved Locality Preserving Projections for Multimodal Biometrics.", RVSP, pp. 228–231, 2015.
Ge, C., M. Kaufmann, L. Golab, P. Fischer, and A. Goel, "Indexing Bi-Temporal Windows", SSDBM 2015, 2015.
Ge, C., M. Kaufmann, L. Golab, P. Fischer, and A. Goel, "Indexing bi-temporal windows.", SSDBM, pp. 19:1-19:12, 2015.
Clarke, C., M. Smucker, and E. Yilmaz, "IR Evaluation: Modeling User Behavior for Measuring Effectiveness.", SIGIR, pp. 1117–1120, 2015.
Chu, X., J. Morcos, I. Ilyas, M. Ouzzani, P. Papotti, N. Tang, and Y. Ye, "KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing.", SIGMOD Conference, pp. 1247–1261, 2015.
Tan, L., H. Zhang, C. Clarke, and M. Smucker, "Lexical Comparison Between Wikipedia and Twitter Corpora by Using Word Embeddings.", ACL (2), pp. 657–661, 2015.
Cormack, G., and M. Grossman, "Multi-Faceted Recall of Continuous Active Learning for Technology-Assisted Review.", SIGIR, pp. 763–766, 2015.
He, H., K. Gimpel, and J. Lin, "Multi-Perspective Sentence Similarity Modeling with Convolutional Neural Networks.", EMNLP, pp. 1576–1586, 2015.
Hudek, A., D. Toman, and G. Weddell, "On Enumerating Query Plans Using Analytic Tableau.", TABLEAUX, pp. 339–354, 2015.
Hashemi, S., C. Clarke, A. Dean-Hall, J. Kamps, and J. Kiseleva, "On the Reusability of Open Test Collections.", SIGIR, pp. 827–830, 2015.
Dean-Hall, A., C. Clarke, J. Kamps, and J. Kiseleva, "Online Evaluation of Point-Of-Interest Recommendation Systems.", SCST@ECIR, 2015.
Sequiera, R., M. Choudhury, P. Gupta, P. Rosso, S. Kumar, S. Banerjee, S. Naskar, S. Bandyopadhyay, G. Chittaranjan, A. Das, et al., "Overview of FIRE-2015 Shared Task on Mixed Script Information Retrieval.", FIRE Workshops, pp. 19–25, 2015.
Dean-Hall, A., C. Clarke, J. Kamps, J. Kiseleva, and E. Voorhees, "Overview of the TREC 2015 Contextual Suggestion Track.", TREC, 2015.
Baruah, G., A. Roegiest, and M. Smucker, "Pooling for User-Oriented Evaluation Measures.", ICTIR, pp. 341–344, 2015.
Sequiera, R., M. Choudhury, and K. Bali, "POS Tagging of Hindi-English Code Mixed Text from Social Media: Some Machine Learning Experiments.", ICON, pp. 237–246, 2015.
Chen, M., X. Meng, and Z. Wang, "Quaternion Fisher Discriminant Analysis for Bimodal Multi-feature Fusion.", ECC, pp. 479–487, 2015.
Chen, M., C. Wang, X. Meng, and Z. Wang, "Quaternion Principal Component Analysis for Multi-modal Fusion.", ICGEC (2), pp. 11–19, 2015.
Rao, J., J. Lin, and M. Efron, "Reproducible Experiments on Lexical and Temporal Feedback for Tweet Search.", ECIR, pp. 755–767, 2015.
Lin, J., "Scaling Down Distributed Infrastructure on Wimpy Machines for Personal Web Archiving.", WWW (Companion Volume), pp. 1351–1355, 2015.
Zhang, H., R. Ayoub, and S. Sundaram, "Sensor selection for optimal filtering of linear dynamical systems: Complexity and approximation.", CDC, pp. 5002–5007, 2015.
Arguello, J., F. Diaz, J. Lin, and A. Trotman, "SIGIR 2015 Workshop on Reproducibility, Inexplicability, and Generalizability of Results (RIGOR).", SIGIR, pp. 1147–1148, 2015.
Borgida, A., D. Toman, and G. Weddell, "Singular Referring Expressions in Conjunctive Query Answers: the case for a CFD DL Dialect.", Description Logics, 2015.
Golab, L., F. Korn, F. Li, B. Saha, and D. Srivastava, "Size-Constrained Weighted Set Cover.", ICDE, pp. 879–890, 2015.
Liu, X., L. Golab, and I. Ilyas, "SMAS: A smart meter data analytics system.", ICDE, pp. 1476–1479, 2015.
Wang, Y., and J. Lin, "The Feasibility of Brute Force Scans for Real-Time Tweet Search.", ICTIR, pp. 321–324, 2015.
Dean-Hall, A., and C. Clarke, "The Power of Contextual Suggestion.", ECIR, pp. 352–357, 2015.
Korkmaz, M., A. Karyakin, M. Karsten, and K. Salem, "Towards Dynamic Green-Sizing for Database Servers.", ADMS@VLDB, pp. 25–36, 2015.
Tan, L., A. Roegiest, and C. Clarke, "University of Waterloo at TREC 2015 Microblog Track.", TREC, 2015.
Ghenai, A., E. Khalilov, P. Valov, and C. Clarke, "WaterlooClarke: TREC 2015 Clinical Decision Support Track.", TREC, 2015.
Hoffmann, H., P. Addala, and C. Clarke, "WaterlooClarke: TREC 2015 Contextual Suggestion Track.", TREC, 2015.
Vtyurina, A., A. Dey, B. Sarrafzadeh, and C. Clarke, "WaterlooClarke: TREC 2015 LiveQA Track.", TREC, 2015.
Abualsaud, M., M. Ghaznavi, D. Recoskie, and C. Clarke, "WaterlooClarke: TREC 2015 Microblog Track.", TREC, 2015.
Raza, A., D. Rotondo, and C. Clarke, "WaterlooClarke: TREC 2015 Temporal Summarization Track.", TREC, 2015.
Zhang, H., W. Lin, Y. Wang, C. Clarke, and M. Smucker, "WaterlooClarke: TREC 2015 Total Recall Track.", TREC, 2015.
Agichtein, E., D. Carmel, C. Clarke, P. Paritosh, D. Pelleg, and I. Szpektor, "Web Question Answering: Beyond Factoids: SIGIR 2015 Workshop.", SIGIR, pp. 1143, 2015.
Gao, P., L. Golab, and S. Keshav, "What’s Wrong with my Solar Panels: a Data-Driven Approach.", EDBT/ICDT Workshops, pp. 86–93, 2015.
Kim, J., K. Salem, and K. Daudjee, "Write Amplification: An Analysis of In-Memory Database Durability Techniques.", IMDM@VLDB, pp. 1:1-1:7, 2015.
Liu, X., L. Golab, W. Golab, and I. F. Ilyas, "Benchmarking Smart Meter Data Analytics", Proc. 18th International Conference on Extending Database Technology, pp. 385-396, 2015.
Nicoara, D., S. Kamali, K. Daudjee, and L. Chen, "Hermes: Dynamic Partitioning for Distributed Social Network Graph Databases", Proc. 18th International Conference on Extending Database Technology, pp. 25-36, 2015.
Yang, Y., L. Golab, and T. Ozsu, "ViewDF: Declarative Incremental View Maintenance For Streaming Data", BIRTE 2015, 2015.
El-Roby, A., and A. Aboulnaga, "ALEX: Automatic Link Exploration in Linked Data", Proc. ACM SIGMOD International Conference on Management of Data, pp. 1839-1853, 2015.
Khayyat, Z., I. F. Ilyas, A. Jindal, S. Madden, M. Ouzzani, P. Papotti, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "BigDansing: A System for Big Data Cleansing", Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 1215-1230, 2015.
Baer, A., L. Golab, S. Ruehrup, M. Schiavone, and P. Casas, "Cache-Oblivious Scheduling of Shared Workloads", Proc. 31st International Conference on Data Engineering, pp. 855-866, 2015.
Aluç, G., T. M. Özsu, K. Daudjee, and O. Hartig, "Executing Queries over Schemaless RDF Databases", Proc. 31st International Conference on Data Engineering, pp. 807-818, 2015.
Han, M., and K. Daudjee, "Giraph Unchained: Barrierless Asynchronous Parallel Execution in Pregel-like Graph Processing Systems", Proc. VLDB Endowment, vol. 8, issue 9, pp. 950-961, 2015.
Bislimovska, B., G. Aluç, T. M. Özsu, and P. Fraternali, "Graph Search of Software Models Using Multidimensional Scaling", Proc. EDBT/ICDE Joint Conference Workshops (4th International Workshop on Querying Graph Structured Data), pp. 163-170, 2015.
Chu, X., J. Morcos, I. F. Ilyas, M. Ouzzani, P. Papotti, N. Tang, and Y. Ye, "KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing", Knowledge and Information Systems, pp. 1-36, 2015.
Balkesen, C., J. Teubner, G. Alonso, and T. M. Özsu, "Main-memory hash joins on modern processor architectures", IEEE Trans. Knowl. and Data Eng., vol. 27, issue 7, pp. 1754-1766, 2015.
Abedjan, Z., L. Golab, and F. Naumann, "Profiling relational data: a survey", VLDB Journal, vol. 24, issue 4, pp. 557-581, 2015.
Golab, L., F. Korn, F. Li, B. Saha, and D. Srivastava, "Size-Constrained Weighted Set Cover", Proc. 31st International Conference on Data Engineering, pp. 879-890, 2015.
Chu, X., Y. He, K. Chakrabarti, and K. Ganjam, "TEGRA: Table Extraction by Global Record Alignment", Proc. of the 2015 ACM SIGMOD Int. Conf. on Management of Data, pp. 1713-1728, 2015.
Ilyas, I. F., and X. Chu, "Trends in Cleaning Relational Data: Consistency and Deduplication", Foundations and Trends in Databases, vol. 5, issue 4, pp. 281-393, 2015.
Tan, L., and C. Clarke, "A Family of Rank Similarity Measures Based on Maximized Effectiveness Difference.", IEEE Trans. Knowl. Data Eng., vol. 27, no. 11, pp. 2865–2877, 2015.
Zhang, H., E. Fata, and S. Sundaram, "A Notion of Robustness in Complex Networks.", IEEE Trans. Control of Network Systems, vol. 2, no. 3, pp. 310–320, 2015.
Chowdhury, S., A. Roy, M. Shaikh, and K. Daudjee, "A taxonomy of decentralized online social networks.", Peer-to-Peer Networking and Applications, vol. 8, no. 3, pp. 367–383, 2015.
Agrawal, D., A. Abbadi, and K. Salem, "A Taxonomy of Partitioned Replicated Cloud-based Database Systems.", IEEE Data Eng. Bull., vol. 38, no. 1, pp. 4–9, 2015.
Clarke, C., J.. Culpepper, and A. Moffat, "Assessing Efficiency-Effectiveness Tradeoffs in Multi-Stage Retrieval Systems Without Using Relevance Judgments.", CoRR, vol. abs/1506.00717, 2015.
Cormack, G., and M. Grossman, "Autonomy and Reliability of Continuous Active Learning for Technology-Assisted Review.", CoRR, vol. abs/1504.06868, 2015.
Aluç, G., M.. Özsu, and K. Daudjee, "Clustering RDF Databases Using Tunable-LSH.", CoRR, vol. abs/1504.02523, 2015.
Kargar, M., L. Golab, and J. Szlichta, "Effective Keyword Search in Graphs.", CoRR, vol. abs/1512.06395, 2015.
Hanbury, A., H. Müller, K. Balog, T. Brodt, G. Cormack, I. Eggel, T. Gollub, F. Hopfgartner, J. Kalpathy-Cramer, N. Kando, et al., "Evaluation-as-a-Service: Overview and Outlook.", CoRR, vol. abs/1512.07454, 2015.
He, H., J. Lin, and A. Lopez, "Gappy Pattern Matching on GPUs for On-Demand Extraction of Hierarchical Translation Grammars.", TACL, vol. 3, pp. 87–100, 2015.
Han, M., and K. Daudjee, "Giraph Unchained: Barrierless Asynchronous Parallel Execution in Pregel-like Graph Processing Systems.", PVLDB, vol. 8, no. 9, pp. 950–961, 2015.
Lin, J., "Is Big Data a Transient Problem?", IEEE Internet Computing, vol. 19, no. 5, pp. 86–90, 2015.
Chu, X., M. Ouzzani, J. Morcos, I. Ilyas, P. Papotti, N. Tang, and Y. Ye, "KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing.", PVLDB, vol. 8, no. 12, pp. 1952–1955, 2015.
Buntain, C., J. Lin, and J. Golbeck, "Learning to Discover Key Moments in Social Media Streams.", CoRR, vol. abs/1508.00488, 2015.
Balkesen, C., J. Teubner, G. Alonso, and M.. Özsu, "Main-Memory Hash Joins on Modern Processor Architectures.", IEEE Trans. Knowl. Data Eng., vol. 27, no. 7, pp. 1754–1766, 2015.
Abu-Khzam, F., K. Daudjee, A. Mouawad, and N. Nishimura, "On scalable parallel recursive backtracking.", J. Parallel Distrib. Comput., vol. 84, pp. 65–75, 2015.
Abedjan, Z., L. Golab, and F. Naumann, "Profiling relational data: a survey.", VLDB J., vol. 24, no. 4, pp. 557–581, 2015.
Hopfgartner, F., A. Hanbury, H. Müller, N. Kando, S. Mercer, J. Kalpathy-Cramer, M. Potthast, T. Gollub, A. Krithara, J. Lin, et al., "Report on the Evaluation-as-a-Service (EaaS) Expert Workshop.", SIGIR Forum, vol. 49, no. 1, pp. 57–65, 2015.
Arguello, J., M. Crane, F. Diaz, J. Lin, and A. Trotman, "Report on the SIGIR 2015 Workshop on Reproducibility, Inexplicability, and Generalizability of Results (RIGOR).", SIGIR Forum, vol. 49, no. 2, pp. 107–116, 2015.
Calvanese, D., M. Koubarakis, and D. Toman, "Special issue of the Journal of Web Semantics on ontology-based data access.", J. Web Sem., vol. 33, pp. 1–2, 2015.
Zanibbi, R., K. Davila, A. Kane, and F. Tompa, "The Tangent Search Engine: Improved Similarity Metrics and Scalability for Math Formula Search.", CoRR, vol. abs/1507.06235, 2015.
Ilyas, I., and X. Chu, "Trends in Cleaning Relational Data: Consistency and Deduplication.", Foundations and Trends in Databases, vol. 5, no. 4, pp. 281–393, 2015.

2014

Toman, D., and G. Weddell, "", PRICAI, pp. 587–599, 2014.
Al-Harbi, A., and M. Smucker, "A qualitative exploration of secondary assessor relevance judging behavior.", IIiX, pp. 195–204, 2014.
Diamantini, C., D. Potena, E. Storti, and H. Zhang, "An Ontology-Based Data Exploration Tool for Key Performance Indicators.", OTM Conferences, pp. 727–744, 2014.
Dean-Hall, A., and C. Clarke, "Assessing Contextual Suggestion.", EVIA@NTCIR, 2014.
Mior, M., "Automated schema design for NoSQL databases.", SIGMOD PhD Symposium, pp. 41–45, 2014.
Mühleisen, H., T. Samar, J. Lin, and A. de Vries, "Column Stores as an IR Prototyping Tool.", ECIR, pp. 789–792, 2014.
Sarrafzadeh, B., and O. Vechtomova, "Combining document retrieval with knowledge graphs for exploratory search.", IIiX, pp. 345–347, 2014.
Ardakanian, O., N. Koochakzadeh, R. Singh, L. Golab, and S. Keshav, "Computing Electricity Consumption Profiles from Household Smart Meter Data.", EDBT/ICDT Workshops, pp. 140–147, 2014.
Robinson, N., S. McIlraith, and D. Toman, "Cost-Based Query Optimization via AI Planning.", AAAI, pp. 2344–2351, 2014.
Gebremeskel, G., J. He, A. de Vries, and J. Lin, "Cumulative Citation Recommendation: A Feature-Aware Comparison of Approaches.", DEXA Workshops, pp. 193–197, 2014.
Syed, S., Y. Jiang, and L. Golab, "Data mining of undergraduate course evaluations.", EDM, pp. 347–348, 2014.
Golab, L., and T. Johnson, "Data stream warehousing.", ICDE, pp. 1290–1293, 2014.
Bär, A., P. Casas, L. Golab, and A. Finamore, "DBStream: An online aggregation, filtering and processing system for network traffic monitoring.", IWCMC, pp. 611–616, 2014.
Chalamalla, A., I. Ilyas, M. Ouzzani, and P. Papotti, "Descriptive and prescriptive data cleaning.", SIGMOD Conference, pp. 445–456, 2014.
Golab, L., M. Hadjieleftheriou, H. Karloff, and B. Saha, "Distributed data placement to minimize communication costs via graph partitioning.", SSDBM, pp. 20:1-20:12, 2014.
Aluç, G., O. Hartig, M.. Özsu, and K. Daudjee, "Diversified Stress Testing of RDF Data Management Systems.", International Semantic Web Conference (1), pp. 197–212, 2014.
Said, A., A. Bellogín, J. Lin, and A. de Vries, "Do recommendations matter?: news recommendation in real life.", CSCW Companion, pp. 237–240, 2014.
Wu, G., and F. Tompa, "Effective and Efficient Bitmaps for Access Control.", DCC, pp. 433, 2014.
Sarrafzadeh, B., O. Vechtomova, and V. Jokic, "Exploring knowledge graphs for exploratory search.", IIiX, pp. 135–144, 2014.
Albakour, M-D., C. MacDonald, I. Ounis, C. Clarke, and V. Bicer, "Information Access in Smart Cities (i-ASC).", ECIR, pp. 810–814, 2014.
Myers, S., A. Sharma, P. Gupta, and J. Lin, "Information network or social network?: the structure of the twitter follow graph.", WWW (Companion Volume), pp. 493–498, 2014.
Lin, J., M. Gholami, and J. Rao, "Infrastructure for supporting exploration and discovery in web archives.", WWW (Companion Volume), pp. 851–856, 2014.
Lin, J., and M. Efron, "Infrastructure support for evaluation as a service.", WWW (Companion Volume), pp. 79–82, 2014.
Carpenter, T., L. Golab, and S. Syed, "Is the grass greener?: mining electric vehicle opinions.", e-Energy, pp. 241–252, 2014.
Bär, A., A. Finamore, P. Casas, L. Golab, and M. Mellia, "Large-scale network traffic monitoring with DBStream, a system for rolling big data analysis.", BigData Conference, pp. 165–170, 2014.
Avram, C-A., K. Salem, and B. Wong, "Latency Amplification: Characterizing the Impact of Web Page Content on Load Times.", SRDS Workshops, pp. 20–25, 2014.
Wang, L., J. Lin, D. Metzler, and J. Han, "Learning to efficiently rank on big data.", WWW (Companion Volume), pp. 209–210, 2014.
Hartig, O., and M.. Özsu, "Linked Data query processing.", ICDE, pp. 1286–1289, 2014.
Singh, A., X. Cui, B. Cassell, B. Wong, and K. Daudjee, "MicroFuge: A Middleware Approach to Providing Performance Isolation in Cloud Storage Systems.", ICDCS, pp. 503–513, 2014.
Smucker, M., X. Guo, and A. Toulis, "Mouse movement during relevance judging: implications for determining user attention.", SIGIR, pp. 979–982, 2014.
Elmagarmid, A., I. Ilyas, M. Ouzzani, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "NADEEF/ER: generic and interactive entity resolution.", SIGMOD Conference, pp. 1071–1074, 2014.
Mühleisen, H., T. Samar, J. Lin, and A. de Vries, "Old dogs are great at new tricks: column stores for ir prototyping.", SIGIR, pp. 863–866, 2014.
Voorhees, E., J. Lin, and M. Efron, "On run diversity in Evaluation as a Service.", SIGIR, pp. 959–962, 2014.
Daudjee, K., S. Kamali, and A. López-Ortiz, "On the online fault-tolerant server consolidation problem.", SPAA, pp. 12–21, 2014.
Kumar, K.., J. Gluck, A. Deshpande, and J. Lin, "Optimization Techniques for “Scaling Down” Hadoop on Multi-Core, Shared-Memory Systems.", EDBT, pp. 13–24, 2014.
Dean-Hall, A., C. Clarke, J. Kamps, P. Thomas, and E. Voorhees, "Overview of the TREC 2014 Contextual Suggestion Track.", TREC, 2014.
Ghodsnia, P., I. Bowman, and A. Nica, "Parallel I/O aware query optimization.", SIGMOD Conference, pp. 349–360, 2014.
Rao, J., J. Lin, and H. Samet, "Partitioning strategies for spatio-textual similarity join.", BigSpatial@SIGSPATIAL, pp. 40–49, 2014.
Jiang, Y., R. Levman, L. Golab, and J. Nathwani, "Predicting peak-demand days in the ontario peak reduction program for large consumers.", e-Energy, pp. 221–222, 2014.
Toman, D., and G. Weddell, "Pushing the CFDnc Envelope.", Description Logics, pp. 340–351, 2014.
Li, F., M.. Özsu, G. Chen, and B. Ooi, "R-Store: A scalable distributed system for supporting real-time analytics.", ICDE, pp. 40–51, 2014.
Hartig, O., and M.. Özsu, "Reachable subwebs for traversal-based query execution.", WWW (Companion Volume), pp. 541–546, 2014.
Chu, X., I. Ilyas, P. Papotti, and Y. Ye, "RuleMiner: Data quality rules discovery.", ICDE, pp. 1222–1225, 2014.
Kane, A., and F. Tompa, "Skewed partial bitvectors for list intersection.", SIGIR, pp. 263–272, 2014.
Tan, L., and C. Clarke, "Succinct Queries for Linking and Tracking News in Social Media.", CIKM, pp. 1883–1886, 2014.
Lin, J., K. Kraus, and R. Punzalan, "Supporting “Distant Reading” for Web Archives.", DH, 2014.
Efron, M., J. Lin, J. He, and A. de Vries, "Temporal feedback for tweet search with non-parametric density estimation.", SIGIR, pp. 33–42, 2014.
Baruah, G., A. Roegiest, and M. Smucker, "The effect of expanding relevance judgements with duplicates.", SIGIR, pp. 1159–1162, 2014.
Wang, Y., and J. Lin, "The Impact of Future Term Statistics in Real-Time Tweet Search.", ECIR, pp. 567–572, 2014.
Clarke, C., and M. Smucker, "Time well spent.", IIiX, pp. 205–214, 2014.
Li, L., and M. Smucker, "Tolerance of Effectiveness Measures to Relevance Judging Errors.", ECIR, pp. 148–159, 2014.
Xu, Z., D. Goldwasser, B. Bederson, and J. Lin, "Visual analytics of MOOCs at maryland.", L@S, pp. 195–196, 2014.
Serafini, M., E. Mansour, A. Aboulnaga, K. Salem, T. Rafiq, and U. Farooq Minhas, "Accordion: Elastic Scalability for Database Systems Supporting Distributed Transactions", Proc. VLDB Endowment, 12, vol. 7, pp. 1035-1046, 2014.
Han, M., K. Daudjee, K. Ammar, T. M. Özsu, X. Wang, and T. Jin, "An Experimental Comparison of Pregel-like Graph Processing Systems", Proc. VLDB Endowment, 12, vol. 7, pp. 1047-1058, 2014.
Baer, A., P. Casas, L. Golab, and A. Finamore, "DBStream: an Online Aggregation, Filtering and Processing System for Network Traffic Monitoring", Proc. 5th Int. Workshop on Traffic Analysis and Characterization, pp. 611-616, 2014.
Chalamalla, A., I. F. Ilyas, M. Ouzzani, and P. Papotti, "Descriptive and Prescriptive Data Cleaning", Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 445-456, 2014.
Golab, L., M. Hadjieleftheriou, H. Karloff, and B. Saha, "Distributed Data Placement to Minimize Communication Costs via Graph Partitioning", Proc. 26th International Conference on Scientific and Statistical Data Management, pp. 20-31, 2014.
Gebaly, K. El, P. Agrawal, L. Golab, F. Korn, and D. Srivastava, "Interpretable and Informative Explanations of Outcomes", Proc. Int. Conf. on Extending Database Technology (EDBT '14), 1, vol. 8, pp. 61-72, 2014.
Baer, A., A. Finamore, P. Casas, L. Golab, and M. Mellia, "Large-Scale Network Traffic Monitoring with DBStream, a System for Rolling Big Data Analysis", Proc. IEEE International Conference on Big Data, pp. 165-170, 2014.
Singh, A., X.. Cui, B. Cassell, B. Wong, and K. Daudjee, "MicroFuge: A Middleware Approach to Providing Performance Isolation in Cloud Storage Systems", Proc. 34th IEEE Inte, 2014.
Daudjee, K., S. Kamali, and A. López-Ortiz, "On the Online Fault-Tolerant Server Consolidation Problem", Proc. 26th ACM Symposium on Parallelism in Algorithms and Architectures, pp. 12-21, 2014.
Daudjee, K., S. Kamali, and A. López-Ortiz, "On the Online Fault-Tolerant Server Consolidation Problem", Proc. 26th ACM Symposium on Parallelism in Algorithms and Architectures,, pp. 12-21, 2014.
Ghodsnia, P., I. T. Bowman, and A. Nica, "Parallel I/O aware query optimization", Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 349-360, 2014.
Ead, M., H. Herodotou, A. Aboulnaga, and S. Babu, "PStorM: Profile Storage and Matching for Feedback-Based Tuning of MapReduce Jobs", Proc. Int. Conf. on Extending Database Technology (EDBT '14), pp. 1-12, 2014.
Li, F., T. M. Özsu, G. Chen, and B. Chin Ooi, "R-Store: A Scalable Distributed System for Supporting Real-time Analytics", Proc. 30th IEEE International Conference on Data Engineering, pp. 40-51, 2014.
Hartig, O., and T. M. Özsu, "Reachable Subwebs for Traversal-Based Query Execution", Proc. 23rd International World Wide Web Conference, pp. 541–546, 2014.
Xiang, J., H. Meng, and A. Aboulnaga, "Scalable Matrix Inversion Using MapReduce", Proc. Int. Symp. on High-Performance Parallel and Distributed Computing, pp. 177-190, 2014.
Kane, A., and F. Wm. Tompa, "Skewed partial bitvectors for list intersection", Proc. 37th International ACM SIGIR Conference on Research & Development in Information Retrieval, pp. 263-272, 2014.
"Top-k Nearest Neighbor Search In Uncertain Data Series", Proc. VLDB Endowment, 1, vol. 8, pp. 13-24, 2014.
Chowdhury, S., A. Roy, M. Shaikh, and K. Daudjee, "A Taxonomy of Decentralized Online Social Networks", Peer-to-Peer Networking and Applications, Springer, 2014.
Chairunnanda, P., K. Daudjee, and T. M. Özsu, "ConfluxDB: Multi-Master Replication for Partitioned Snapshot Isolation Databases", Proc. VLDB Endowment, vol. 7, issue 11, pp. 947-958, 2014.
Golab, L., H. Karloff, F. Korn, B. Saha, and D. Srivastava, "Discovering conservation rules", IEEE Trans. Knowl. and Data Eng., vol. 26, issue 6, pp. 1332-1348, 2014.
Li, F., B. Chin Ooi, T. M. Özsu, and S. Wu, "Distributed Data Management Using MapReduce", ACM Computing Surveys, vol. 46, issue 3, 2014.
Kane, A., and F. Wm. Tompa, "Distribution by Document Size", 11th Int. Workshop on Large-Scale and Distributed Systems for Information Retrieval, 2014.
Aluç, G., O. Hartig, T. M. Özsu, and K. Daudjee, "Diversified Stress Testing of RDF Data Management Systems", Proc. 13th International Semantic Web Conference, pp. 197-212, 2014.
Zou, L., T. M. Özsu, L. Chen, X. Sheng, R. Huang, and D. Zhao, "gStore: A Graph-based SPARQL Query Engine", VLDB Journal, vol. 24, issue 3, pp. 565-590, 2014.
Beskales, G., I. F. Ilyas, L. Golab, and A. Galiullin, "Sampling from repairs of conditional functional dependency violations", VLDB Journal, vol. 23, issue 1, pp. 103-128, 2014.
Aluç, G., T. M. Özsu, and K. Daudjee, "Workload Matters: Why RDF Databases Need a New Design", Proc. VLDB Endowment, vol. 7, issue 10, pp. 837 - 840, 2014.
Tan, L., and C. Clarke, "A Family of Rank Similarity Measures based on Maximized Effectiveness Difference.", CoRR, vol. abs/1408.3587, 2014.
Wu, J., A. Hudek, D. Toman, and G. Weddell, "Absorption for ABoxes.", J. Autom. Reasoning, vol. 53, no. 3, pp. 215–243, 2014.
Serafini, M., E. Mansour, A. Aboulnaga, K. Salem, T. Rafiq, and U. Minhas, "Accordion: Elastic Scalability for Database Systems Supporting Distributed Transactions.", PVLDB, vol. 7, no. 12, pp. 1035–1046, 2014.
Han, M., K. Daudjee, K. Ammar, M.. Özsu, X. Wang, and T. Jin, "An Experimental Comparison of Pregel-like Graph Processing Systems.", PVLDB, vol. 7, no. 12, pp. 1047–1058, 2014.
Chairunnanda, P., K. Daudjee, and M.. Özsu, "ConfluxDB: Multi-Master Replication for Partitioned Snapshot Isolation Databases.", PVLDB, vol. 7, no. 11, pp. 947–958, 2014.
Golab, L., H. Karloff, F. Korn, B. Saha, and D. Srivastava, "Discovering Conservation Rules.", IEEE Trans. Knowl. Data Eng., vol. 26, no. 6, pp. 1332–1348, 2014.
Özsu, M.., and P. Valduriez, "Distributed and Parallel Database Systems.", Computing Handbook, 3rd ed. (2), pp. 13: 1-24, 2014.
Li, F., B. Ooi, M.. Özsu, and S. Wu, "Distributed data management using MapReduce.", ACM Comput. Surv., vol. 46, no. 3, pp. 31:1-31:42, 2014.
Chen, T., X. Zhang, S. Jin, and O. Kim, "Efficient classification using parallel and scalable compressed model and its application on intrusion detection.", Expert Syst. Appl., vol. 41, no. 13, pp. 5972–5983, 2014.
Türe, F., and J. Lin, "Exploiting Representations from Statistical Machine Translation for Cross-Language Information Retrieval.", ACM Trans. Inf. Syst., vol. 32, no. 4, pp. 19:1-19:32, 2014.
Zou, L., M.. Özsu, L. Chen, X. Shen, R. Huang, and D. Zhao, "gStore: a graph-based SPARQL query engine.", VLDB J., vol. 23, no. 4, pp. 565–590, 2014.
Weissman, S., S. Ayhan, J. Bradley, and J. Lin, "Identifying Duplicate and Contradictory Information in Wikipedia.", CoRR, vol. abs/1406.1143, 2014.
Liu, X., and K. Salem, "Integrating SSD Caching into Database Systems.", IEEE Data Eng. Bull., vol. 37, no. 2, pp. 35–43, 2014.
Gebaly, K., P. Agrawal, L. Golab, F. Korn, and D. Srivastava, "Interpretable and Informative Explanations of Outcomes.", PVLDB, vol. 8, no. 1, pp. 61–72, 2014.
Ashkan, A., and C. Clarke, "Location- and Query-Aware Modeling of Browsing and Click Behavior in Sponsored Search.", ACM TIST, vol. 5, no. 4, pp. 59:1-59:31, 2014.
Quamar, A., A. Deshpande, and J. Lin, "NScale: Neighborhood-centric Analytics on Large Graphs.", PVLDB, vol. 7, no. 13, pp. 1673–1676, 2014.
Quamar, A., A. Deshpande, and J. Lin, "NScale: Neighborhood-centric Large-Scale Graph Analytics in the Cloud.", CoRR, vol. abs/1405.1499, 2014.
Peng, P., L. Zou, M.. Özsu, L. Chen, and D. Zhao, "Processing SPARQL Queries Over Linked Data-A Distributed Graph-based Approach.", CoRR, vol. abs/1411.6763, 2014.
Gupta, P., V. Satuluri, A. Grewal, S. Gurumurthy, V. Zhabiuk, Q. Li, and J. Lin, "Real-Time Twitter Recommendation: Online Motif Detection in Large Dynamic Graphs.", PVLDB, vol. 7, no. 13, pp. 1379–1380, 2014.
Albakour, M-D., C. MacDonald, I. Ounis, C. Clarke, and V. Bicer, "Report on the 1st International Workshop on Information Access in Smart Cities (i-ASC 2014).", SIGIR Forum, vol. 48, no. 2, pp. 96–104, 2014.
Balog, K., D. Elsweiler, E. Kanoulas, L. Kelly, and M. Smucker, "Report on the CIKM workshop on living labs for information retrieval evaluation.", SIGIR Forum, vol. 48, no. 1, pp. 21–28, 2014.
Asadi, N., J. Lin, and A. de Vries, "Runtime Optimizations for Tree-Based Machine Learning Models.", IEEE Trans. Knowl. Data Eng., vol. 26, no. 9, pp. 2281–2292, 2014.
Beskales, G., I. Ilyas, L. Golab, and A. Galiullin, "Sampling from repairs of conditional functional dependency violations.", VLDB J., vol. 23, no. 1, pp. 103–128, 2014.
Boykin, P.., S. Ritchie, I. O’Connell, and J. Lin, "Summingbird: A Framework for Integrating Batch and Online MapReduce Computations.", PVLDB, vol. 7, no. 13, pp. 1441–1451, 2014.
Dallachiesa, M., T. Palpanas, and I. Ilyas, "Top-k Nearest Neighbor Search In Uncertain Data Series.", PVLDB, vol. 8, no. 1, pp. 13–24, 2014.
Toman, D., and G. Weddell, "Undecidability of Finite Model Reasoning in DLFD.", CoRR, vol. abs/1408.4468, 2014.
Aluç, G., M.. Özsu, and K. Daudjee, "Workload Matters: Why RDF Databases Need a New Design.", PVLDB, vol. 7, no. 10, pp. 837–840, 2014.

2013

Toman, D., and G. Weddell, "", Australasian Conference on Artificial Intelligence, pp. 350–361, 2013.
Said, A., J. Lin, A. Bellogín, and A. de Vries, "A month in the life of a production news recommender system.", LivingLab@CIKM, pp. 7–10, 2013.
Wu, J., T. Kinash, D. Toman, and G. Weddell, "Absorption for ABoxes with Local Universal Restrictions.", Description Logics, pp. 489–500, 2013.
Mehdad, Y., G. Carenini, F. Tompa, and R. Ng, "Abstractive Meeting Summarization with Entailment and Fusion.", ENLG, pp. 136–146, 2013.
Jin, S., O. Kim, and W. Feng, "Accelerating Metric Space Similarity Joins with Multi-core and Many-core Processors.", ICCSA (5), pp. 166–180, 2013.
Balkesen, C., N. Tatbul, and M.. Özsu, "Adaptive input admission and management for parallel stream processing.", DEBS, pp. 15–26, 2013.
Deziel, M., D. Olawo, L. Truchon, and L. Golab, "Analyzing the Mental Health of Engineering Students using Classification and Regression.", EDM, pp. 228–231, 2013.
Toman, D., and G. Weddell, "CFDnc: A PTIME Description Logic with Functional Constraints and Disjointness.", Description Logics, pp. 451–463, 2013.
Balog, K., D. Elsweiler, E. Kanoulas, L. Kelly, and M. Smucker, "CIKM 2013 workshop on living labs for information retrieval evaluation.", CIKM, pp. 2557–2558, 2013.
Whissell, J., and C. Clarke, "Classification-Based Clustering Evaluation.", ICDM, pp. 1229–1234, 2013.
Niedermayer, J., M. Nascimento, M. Renz, P. Kröger, K. Ammar, and H-P. Kriegel, "Cost-Based Quantile Query Processing in Wireless Sensor Networks.", MDM (1), pp. 237–246, 2013.
Bellogín, A., G. Gebremeskel, J. He, A. Said, T. Samar, A. de Vries, J. Lin, and J. Vuurens, "CWI and TU Delft Notebook TREC 2013: Contextual Suggestion, Federated Web Search, KBA, and Web Tracks.", TREC, 2013.
Stonebraker, M., D. Bruckner, I. Ilyas, G. Beskales, M. Cherniack, S. Zdonik, A. Pagan, and S. Xu, "Data Curation at Scale: The Data Tamer System.", CIDR, 2013.
Lei, B., I. Surya, S. Kamali, and K. Daudjee, "Data Partitioning for Video-on-Demand Services.", NCA, pp. 49–54, 2013.
Golab, L., and T. Johnson, "Data stream warehousing.", SIGMOD Conference, pp. 949–952, 2013.
Asadi, N., J. Lin, and M. Busch, "Dynamic memory allocation policies for postings in real-time Twitter search.", KDD, pp. 1186–1194, 2013.
Whissell, J., and C. Clarke, "Effective measures for inter-document similarity.", CIKM, pp. 1361–1370, 2013.
Jin, S., O. Kim, and T. Chen, "Efficient Attack Detection Based on a Compressed Model.", ISPEC, pp. 248–262, 2013.
Dean-Hall, A., C. Clarke, J. Kamps, and P. Thomas, "Evaluating Contextual Suggestion.", EVIA@NTCIR, 2013.
Mishne, G., J. Dalton, Z. Li, A. Sharma, and J. Lin, "Fast data in the era of big data: Twitter’s real-time related query suggestion architecture.", SIGMOD Conference, pp. 1147–1158, 2013.
Konow, R., G. Navarro, C. Clarke, and A. López-Ortiz, "Faster and smaller inverted indices with treaps.", SIGIR, pp. 193–202, 2013.
Chu, X., I. Ilyas, and P. Papotti, "Holistic data cleaning: Putting violations into context.", ICDE, pp. 458–469, 2013.
Duan, X., H. Zhang, W. Li, Y. Liu, and L. Zhang, "Load balancing performance of dynamic SCell measurement period relaxing in LTE-A.", CCNC, pp. 769–772, 2013.
Duan, X., H. Zhang, W. Li, Y. Liu, and L. Zhang, "Load balancing performance of dynamic SCell measurement period relaxing in LTE-A.", CCNC, pp. 773–777, 2013.
Balkesen, C., J. Teubner, G. Alonso, and M.. Özsu, "Main-memory hash joins on multi-core CPUs: Tuning to the underlying hardware.", ICDE, pp. 362–373, 2013.
Agrawal, D., A. Abbadi, H. Mahmoud, F. Nawab, and K. Salem, "Managing Geo-replicated Data in Multi-datacenters.", DNIS, pp. 23–43, 2013.
Jin, C., R. Liu, and K. Salem, "Materialized views for eventually consistent record stores.", ICDE Workshops, pp. 250–257, 2013.
Eidelman, V., K. Wu, F. Türe, P. Resnik, and J. Lin, "Mr. MIRA: Open-Source Large-Margin Structured Learning on MapReduce.", ACL (Conference System Demonstrations), pp. 199–204, 2013.
Jin, S., O. Kim, and W. Feng, "MX-tree: A Double Hierarchical Metric Index with Overlap Reduction.", ICCSA (5), pp. 574–589, 2013.
Dallachiesa, M., A. Ebaid, A. Eldawy, A. Elmagarmid, I. Ilyas, M. Ouzzani, and N. Tang, "NADEEF: a commodity data cleaning system.", SIGMOD Conference, pp. 541–552, 2013.
Clarke, C., "Nugget-Based Computation of Graded Relevance.", EVIA@NTCIR, 2013.
Beskales, G., I. Ilyas, L. Golab, and A. Galiullin, "On the relative trust between inconsistent data and inaccurate constraints.", ICDE, pp. 541–552, 2013.
Dean-Hall, A., C. Clarke, N. Simone, J. Kamps, P. Thomas, and E. Voorhees, "Overview of the TREC 2013 Contextual Suggestion Track.", TREC, 2013.
Smucker, M., G. Kazai, and M. Lease, "Overview of the TREC 2013 Crowdsourcing Track.", TREC, 2013.
Lin, J., and M. Efron, "Overview of the TREC-2013 Microblog Track.", TREC, 2013.
Northam, L., R. Smits, K. Daudjee, and J. Istead, "Ray tracing in the cloud using MapReduce.", HPCS, pp. 19–26, 2013.
LeBlanc, H., H. Zhang, S. Sundaram, and X. Koutsoukos, "Resilient continuous-time consensus in fractional robust networks.", ACC, pp. 1237–1242, 2013.
Kamali, S., and F. Tompa, "Retrieving documents with mathematical content.", SIGIR, pp. 353–362, 2013.
Murdock, V., C. Clarke, J. Kamps, and J. Karlgren, "Search and exploration of X-Rated information (SEXI 2013).", WSDM, pp. 795–796, 2013.
Clarke, C., L. Freund, M. Smucker, and E. Yilmaz, "SIGIR 2013 workshop on modeling user behavior for information retrieval evaluation.", SIGIR, pp. 1134, 2013.
Kamali, S., and F. Tompa, "Structural Similarity Search for Mathematics Retrieval.", MKM/Calculemus/DML, pp. 246–262, 2013.
Lutz, C., I. Seylan, D. Toman, and F. Wolter, "The Combined Approach to OBDA: Taming Role Hierarchies Using Filters.", International Semantic Web Conference (1), pp. 314–330, 2013.
Sakai, T., Z. Dou, and C. Clarke, "The impact of intent selection on diversified search evaluation.", SIGIR, pp. 921–924, 2013.
Clarke, C., "Time-Biased Gain.", NTCIR, 2013.
Eidelman, V., K. Wu, F. Türe, P. Resnik, and J. Lin, "Towards Efficient Large-Scale Feature-Rich Statistical Machine Translation.", WMT@ACL, pp. 128–133, 2013.
Asadi, N., and J. Lin, "Training Efficient Tree-Based Models for Document Ranking.", ECIR, pp. 146–157, 2013.
Baruah, G., R. Guttikonda, A. Roegiest, and O. Vechtomova, "University of Waterloo at the TREC 2013 Temporal Summarization Track.", TREC, 2013.
Forsyth, S., and K. Daudjee, "Update Management in Decentralized Social Networks.", ICDCS Workshops, pp. 196–201, 2013.
Rios, M., and J. Lin, "Visualizing the “Pulse” of World Cities on Twitter.", ICWSM, 2013.
DeWitt, D., I. Ilyas, J. Naughton, and M. Stonebraker, "We are drowning in a sea of least publishable units (LPUs).", SIGMOD Conference, pp. 921–922, 2013.
Ammar, K., and M.. Özsu, "WGB: Towards a Universal Graph Benchmark.", WBDB, pp. 58–72, 2013.
Gupta, P., A. Goel, J. Lin, A. Sharma, D. Wang, and R. Zadeh, "WTF: the who to follow service at Twitter.", WWW, pp. 505–514, 2013.
Ghodsnia, P., K. Tirdad, J.. Munro, and A. López-Ortiz, "A novel approach for leveraging co-occurrence to improve the false positive error in signature files.", J. Discrete Algorithms, vol. 18, pp. 63–74, 2013.
Özsu, M.., "ACM books to launch.", Commun. ACM, vol. 56, no. 12, pp. 5, 2013.
Abu-Khzam, F., K. Daudjee, A. Mouawad, and N. Nishimura, "An Easy-to-use Scalable Framework for Parallel Recursive Backtracking.", CoRR, vol. abs/1312.7626, 2013.
Golab, L., "Data Warehouse Quality: Summary and Outlook.", Handbook of Data Quality, pp. 121–140, 2013.
Liu, R., A. Aboulnaga, and K. Salem, "DAX: A Widely Distributed Multi-tenant Storage Service for DBMS Hosting.", PVLDB, vol. 6, no. 4, pp. 253–264, 2013.
Chu, X., I. Ilyas, and P. Papotti, "Discovering Denial Constraints.", PVLDB, vol. 6, no. 13, pp. 1498–1509, 2013.
Golab, L., M. Hadjieleftheriou, H. Karloff, and B. Saha, "Distributed Data Placement via Graph Partitioning.", CoRR, vol. abs/1312.0285, 2013.
Asadi, N., and J. Lin, "Document vector representations for feature extraction in multi-stage document ranking.", Inf. Retr., vol. 16, no. 6, pp. 747–768, 2013.
Asadi, N., J. Lin, and M. Busch, "Dynamic Memory Allocation Policies for Postings in Real-Time Twitter Search", CoRR, vol. abs/1302.5302, 2013.
Lin, J., and M. Efron, "Evaluation as a service for information retrieval.", SIGIR Forum, vol. 47, no. 2, pp. 8–14, 2013.
Akinyemi, J., and C. Clarke, "Fast and effective soft links.", Softw., Pract. Exper., vol. 43, no. 5, pp. 577–593, 2013.
Asadi, N., and J. Lin, "Fast candidate generation for real-time tweet search with bloom filter chains.", ACM Trans. Inf. Syst., vol. 31, no. 3, pp. 13, 2013.
Asadi, N., and J. Lin, "Fast, Incremental Inverted Indexing in Main Memory for Web-Scale Collections", CoRR, vol. abs/1305.0699, 2013.
Capra, R., L. Freund, C. Smith, M. Smucker, and R. White, "HCIR 2013: the seventh international symposium on human-computer interaction and information retrieval.", SIGIR Forum, vol. 47, no. 2, pp. 33–40, 2013.
Kumar, K.., J. Gluck, A. Deshpande, and J. Lin, "Hone: “Scaling Down” Hadoop on Shared-Memory Systems.", PVLDB, vol. 6, no. 12, pp. 1354–1357, 2013.
Liu, X., and K. Salem, "Hybrid Storage Management for Database Systems.", PVLDB, vol. 6, no. 8, pp. 541–552, 2013.
Ashkan, A., and C. Clarke, "Impact of query intent and search context on clickthrough behavior in sponsored search.", Knowl. Inf. Syst., vol. 34, no. 2, pp. 425–452, 2013.
Golbus, P., J. Aslam, and C. Clarke, "Increasing evaluation sensitivity to diversity.", Inf. Retr., vol. 16, no. 4, pp. 530–555, 2013.
Balkesen, C., G. Alonso, J. Teubner, and M.. Özsu, "Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited.", PVLDB, vol. 7, no. 1, pp. 85–96, 2013.
Ebaid, A., A. Elmagarmid, I. Ilyas, M. Ouzzani, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "NADEEF: A Generalized Data Cleaning System.", PVLDB, vol. 6, no. 12, pp. 1218–1221, 2013.
Chen, T., L. Chen, M.. Özsu, and N. Xiao, "Optimizing Multi-Top-k Queries over Uncertain Data Streams.", IEEE Trans. Knowl. Data Eng., vol. 25, no. 8, pp. 1814–1829, 2013.
Ng, R., P. Arocena, D. Barbosa, G. Carenini, L. Gomes, Jr., S. Jou, R. Leung, E. Milios, R. Miller, J. Mylopoulos, et al., "Perspectives on Business Intelligence", Perspectives on Business Intelligence, pp. 1–163, 2013.
Chen, L., I. Ilyas, C. Ré, and X. Zhou, "Probabilistic Web Data Management.", World Wide Web, vol. 16, no. 3, pp. 271–272, 2013.
Mansour, E., A. El-Roby, P. Kalnis, A. Ahmadia, and A. Aboulnaga, "RACE: A Scalable and Elastic Parallel System for Discovering Repeats in Very Long Sequences.", PVLDB, vol. 6, no. 10, pp. 865–876, 2013.
Minhas, U., S. Rajagopalan, B. Cully, A. Aboulnaga, K. Salem, and A. Warfield, "RemusDB: transparent high availability for database systems.", VLDB J., vol. 22, no. 1, pp. 29–45, 2013.
Clarke, C., L. Freund, M. Smucker, and E. Yilmaz, "Report on the SIGIR 2013 workshop on modeling user behavior for information retrieval evaluation (MUBE 2013).", SIGIR Forum, vol. 47, no. 2, pp. 84–95, 2013.
Murdock, V., C. Clarke, J. Kamps, and J. Karlgren, "Report on the workshop on search and exploration of x-rated information (SEXI 2013).", SIGIR Forum, vol. 47, no. 1, pp. 31–37, 2013.
LeBlanc, H., H. Zhang, X. Koutsoukos, and S. Sundaram, "Resilient Asymptotic Consensus in Robust Networks.", IEEE Journal on Selected Areas in Communications, vol. 31, no. 4, pp. 766–781, 2013.
LeBlanc, H., H. Zhang, S. Sundaram, and X. Koutsoukos, "Resilient Continuous-Time Consensus in Fractional Robust Networks", CoRR, vol. abs/1303.2709, 2013.

2012

MacDonald, C., J. Wang, and C. Clarke, "2nd international workshop on diversity in document retrieval (DDR 2012).", WSDM, pp. 769–770, 2012.
Golab, L., T. Johnson, S. Sen, and J. Yates, "A Sequence-Oriented Stream Warehouse Paradigm for Network Monitoring Applications.", PAM, pp. 53–63, 2012.
Zhang, H., and S. Sundaram, "A simple median-based resilient consensus algorithm.", Allerton Conference, pp. 1734–1741, 2012.
Wu, J., A. Hudek, D. Toman, and G. Weddell, "Absorption for ABoxes.", Description Logics, 2012.
Wu, J., A. Hudek, D. Toman, and G. Weddell, "Assertion Absorption in Object Queries over Knowledge Bases.", KR, 2012.
Türe, F., J. Lin, and D. Oard, "Combining Statistical Translation Techniques for Cross-Language Information Retrieval.", COLING, pp. 2685–2702, 2012.
LeBlanc, H., H. Zhang, S. Sundaram, and X. Koutsoukos, "Consensus of multi-agent networks in the presence of adversaries using only local information.", HiCoNS, pp. 1–10, 2012.
Golab, L., H. Karloff, F. Korn, B. Saha, and D. Srivastava, "Discovering Conservation Rules.", ICDE, pp. 738–749, 2012.
Busch, M., K. Gade, B. Larson, P. Lok, S. Luckenbill, and J. Lin, "Earlybird: Real-Time Search at Twitter.", ICDE, pp. 1360–1369, 2012.
Minhas, U., R. Liu, A. Aboulnaga, K. Salem, J. Ng, and S. Robertson, "Elastic Scale-Out for Partition-Based Database Systems.", ICDE Workshops, pp. 281–288, 2012.
McCullough, D., J. Lin, C. MacDonald, I. Ounis, and R. McCreadie, "Evaluating Real-Time Search over Tweets.", ICWSM, 2012.
Drzadzewski, G., and F. Tompa, "Exploring and analyzing documents with OLAP.", PIKM, pp. 33–40, 2012.
Chairunnanda, P., S. Forsyth, and K. Daudjee, "Graph data partition models for online social networks.", HT, pp. 175–180, 2012.
Smucker, M., J. Allan, and B. Dachev, "Human question answering performance using an interactive document retrieval system.", IIiX, pp. 35–44, 2012.
Pound, J., A. Hudek, I. Ilyas, and G. Weddell, "Interpreting keyword queries over web knowledge bases.", CIKM, pp. 305–314, 2012.
El-Helw, A., M. Farid, and I. Ilyas, "Just-in-time information extraction using extraction views.", SIGMOD Conference, pp. 613–616, 2012.
Lin, J., and A. Kolcz, "Large-scale machine learning at twitter.", SIGMOD Conference, pp. 793–804, 2012.
Raveendran, G., and C. Clarke, "Lightweight contrastive summarization for news comment mining.", SIGIR, pp. 1103–1104, 2012.
Türe, F., J. Lin, and D. Oard, "Looking inside the box: context-sensitive translation for cross-language information retrieval.", SIGIR, pp. 1105–1106, 2012.
Amer-Yahia, S., S. Anjum, A. Ghenai, A. Siddique, S. Abbar, S. Madden, A. Marcus, and M. El-Haddad, "MAQSA: a system for social analytics on news.", SIGMOD Conference, pp. 653–656, 2012.
Ashkan, A., and C. Clarke, "Modeling browsing behavior for click analysis in sponsored search.", CIKM, pp. 2015–2019, 2012.
Smucker, M., and C. Clarke, "Modeling user variance in time-biased gain.", HCIR, pp. 3, 2012.
McCreadie, R., I. Soboroff, J. Lin, C. MacDonald, I. Ounis, and D. McCullough, "On building a reusable Twitter corpus.", SIGIR, pp. 1113–1114, 2012.
Dean-Hall, A., C. Clarke, J. Kamps, P. Thomas, and E. Voorhees, "Overview of the TREC 2012 Contextual Suggestion Track.", TREC, 2012.
Smucker, M., G. Kazai, and M. Lease, "Overview of the TREC 2012 Crowdsourcing Track.", TREC, 2012.
Clarke, C., N. Craswell, and E. Voorhees, "Overview of the TREC 2012 Web Track.", TREC, 2012.
Soboroff, I., I. Ounis, C. MacDonald, and J. Lin, "Overview of the TREC-2012 Microblog Track.", TREC, 2012.
Zhang, H., and S. Sundaram, "Robustness of complex networks with implications for consensus and contagion.", CDC, pp. 3426–3432, 2012.
Zhang, H., and S. Sundaram, "Robustness of information diffusion algorithms to locally bounded adversaries.", ACC, pp. 5855–5861, 2012.
Smucker, M., and C. Clarke, "Stochastic simulation of time-biased gain.", CIKM, pp. 2040–2044, 2012.
Smucker, M., and C. Jethani, "Time to judge relevance as an indicator of assessor error.", SIGIR, pp. 1153–1154, 2012.
Smucker, M., and C. Clarke, "Time-based calibration of effectiveness measures.", SIGIR, pp. 95–104, 2012.
Bär, A., and L. Golab, "Towards benchmarking stream data warehouses.", DOLAP, pp. 105–112, 2012.
Mishne, G., and J. Lin, "Twanchor text: a preliminary study of the value of tweets as anchor text.", SIGIR, pp. 1159–1160, 2012.
Lin, J., and G. Mishne, "A Study of “Churn” in Tweets and Real-Time Search Queries (Extended Version)", CoRR, vol. abs/1205.6855, 2012.
Zou, L., L. Chen, M.. Özsu, and D. Zhao, "Answering pattern match queries in large graph databases via graph embedding.", VLDB J., vol. 21, no. 1, pp. 97–120, 2012.
LeBlanc, H., H. Zhang, S. Sundaram, and X. Koutsoukos, "Consensus of Multi-Agent Networks in the Presence of Adversaries Using Only Local Information", CoRR, vol. abs/1205.3676, 2012.
Mishne, G., J. Dalton, Z. Li, A. Sharma, and J. Lin, "Fast Data in the Era of Big Data: Twitter’s Real-Time Related Query Suggestion Architecture", CoRR, vol. abs/1210.7350, 2012.
Beskales, G., I. Ilyas, L. Golab, and A. Galiullin, "On the Relative Trust between Inconsistent Data and Inaccurate Constraints", CoRR, vol. abs/1207.5226, 2012.
Trotman, A., C. Clarke, I. Ounis, J.. Culpepper, M-A. Cartright, and S. Geva, "Open source information petrieval: a report on the SIGIR 2012 workshop.", SIGIR Forum, vol. 46, no. 2, pp. 95–101, 2012.
Zhang, H., and S. Sundaram, "Robustness of Complex Networks: Reaching Consensus Despite Adversaries", CoRR, vol. abs/1203.6119, 2012.
Asadi, N., J. Lin, and A. de Vries, "Runtime Optimizations for Prediction with Tree-Based Models", CoRR, vol. abs/1212.2287, 2012.
Golab, L., T. Johnson, and V. Shkapenyuk, "Scalable Scheduling of Updates in Streaming Data Warehouses.", IEEE Trans. Knowl. Data Eng., vol. 24, no. 6, pp. 1092–1105, 2012.
Lin, J., and D. Ryaboy, "Scaling big data mining infrastructure: the twitter experience.", SIGKDD Explorations, vol. 14, no. 2, pp. 6–19, 2012.
Beskales, G., G. Das, A. Elmagarmid, I. Ilyas, F. Naumann, M. Ouzzani, P. Papotti, J-A. Quiané-Ruiz, and N. Tang, "The data analytics group at the qatar computing research institute.", SIGMOD Record, vol. 41, no. 4, pp. 33–38, 2012.
Lee, G., J. Lin, C. Liu, A. Lorek, and D. Ryaboy, "The Unified Logging Infrastructure for Data Analytics at Twitter", CoRR, vol. abs/1208.4171, 2012.
Lee, G., J. Lin, C. Liu, A. Lorek, and D. Ryaboy, "The Unified Logging Infrastructure for Data Analytics at Twitter.", PVLDB, vol. 5, no. 12, pp. 1771–1780, 2012.

2011

Wang, L., J. Lin, and D. Metzler, "A cascade ranking model for efficient ranked retrieval.", SIGIR, pp. 105–114, 2011.
Clarke, C., N. Craswell, I. Soboroff, and A. Ashkan, "A comparative analysis of cascade measures for novelty and diversity.", WSDM, pp. 75–84, 2011.
Ammar, K., M. Nascimento, and J. Niedermayer, "An adaptive refinement-based algorithm for median queries in wireless sensor networks.", MobiDE, pp. 9–16, 2011.
Pound, J., D. Toman, G. Weddell, and J. Wu, "An Assertion Retrieval Algebra for Object Queries over Knowledge Bases.", IJCAI, pp. 1051–1056, 2011.
Leibert, F., J. Mannix, J. Lin, and B. Hamadani, "Automatic management of partitioned, replicated search services.", SoCC, pp. 27, 2011.
Whissell, J., and C. Clarke, "Clustering for semi-supervised spam filtering.", CEAS, pp. 125–134, 2011.
Tirdad, K., P. Ghodsnia, J.. Munro, and A. López-Ortiz, "COCA Filters: Co-occurrence Aware Bloom Filters.", SPIRE, pp. 313–325, 2011.
Golab, L., and T. Johnson, "Consistency in a Stream Warehouse.", CIDR, pp. 114–122, 2011.
Asadi, N., D. Metzler, and J. Lin, "Cross-corpus relevance projection.", SIGIR, pp. 1163–1164, 2011.
Sarrafzadeh, B., N. Yakovets, N. Cercone, and A. An, "Cross-Lingual Word Sense Disambiguation for Languages with Scarce Resources.", Canadian Conference on AI, pp. 347–358, 2011.
Özsu, M.., P. Valduriez, S. Abiteboul, B. Kemme, R. Jiménez-Peris, and B. Ooi, "Distributed data management in 2020?", ICDE, pp. 1360, 2011.
Akinyemi, J., and C. Clarke, "Do Subtopic Judgments Reflect Diversity?", ICTIR, pp. 309–312, 2011.
Kamali, S., P. Ghodsnia, and K. Daudjee, "Dynamic data allocation with replication in distributed systems.", IPCCC, pp. 1–8, 2011.
Cheng, J., Y. Ke, S. Chu, and M.. Özsu, "Efficient core decomposition in massive networks.", ICDE, pp. 51–62, 2011.
Franconi, E., and D. Toman, "Fixpoints in Temporal Description Logics.", IJCAI, pp. 875–880, 2011.
Kamali, S., and F. Tompa, "Grammar Inference for Web Documents.", WebDB, 2011.
Ammar, K., and M. Nascimento, "Histogram and Other Aggregate Queries in Wireless Sensor Networks.", SSDBM, pp. 527–536, 2011.
Smucker, M., and C. Jethani, "Measuring assessor accuracy: a comparison of nist assessors and user study participants.", SIGIR, pp. 1231–1232, 2011.
Türe, F., T. Elsayed, and J. Lin, "No free lunch: brute force vs. locality-sensitive hashing for cross-lingual pairwise similarity.", SIGIR, pp. 943–952, 2011.
Miller, R., F. Tompa, S. McIlraith, J. Slonim, and E. Yu, "NSERC business intelligence network: selected topics.", CASCON, pp. 313–315, 2011.
Ashkan, A., and C. Clarke, "On the informativeness of cascade and intent-aware effectiveness measures.", WWW, pp. 407–416, 2011.
Grossman, M., G. Cormack, B. Hedin, and D. Oard, "Overview of the TREC 2011 Legal Track.", TREC, 2011.
Clarke, C., N. Craswell, I. Soboroff, and E. Voorhees, "Overview of the TREC 2011 Web Track.", TREC, 2011.
Asadi, N., D. Metzler, T. Elsayed, and J. Lin, "Pseudo test collections for learning web search ranking functions.", SIGIR, pp. 1073–1082, 2011.
Soliman, M., I. Ilyas, D. Martinenghi, and M. Tagliasacchi, "Ranking with uncertain scoring functions: semantics and sensitivity measures.", SIGMOD Conference, pp. 805–816, 2011.
Lin, J., R. Snow, and W. Morgan, "Smoothing techniques for adaptive online language models: topic tracking in tweet streams.", KDD, pp. 422–429, 2011.
Kontchakov, R., C. Lutz, D. Toman, F. Wolter, and M. Zakharyaschev, "The Combined Approach to Ontology-Based Data Access.", IJCAI, pp. 2656–2661, 2011.
Itakura, K., C. Clarke, S. Geva, A. Trotman, and W. Huang, "Topical and Structural Linkage in Wikipedia.", ECIR, pp. 460–465, 2011.
Sarrafzadeh, B., N. Yakovets, N. Cercone, and A. An, "Towards Automatic Acquisition of a Fully Sense Tagged Corpus for Persian.", ISMIS, pp. 449–455, 2011.
Roegiest, A., and G. Cormack, "University of Waterloo at TREC 2011 Microblog Track.", TREC, 2011.
Akinyemi, J., and C. Clarke, "UWaterloo at NTCIR-9: Intent discovery with anchor text.", NTCIR, 2011.
Elsayed, T., J. Lin, and D. Metzler, "When close enough is good enough: approximate positional indexes for efficient ranked retrieval.", CIKM, pp. 1993–1996, 2011.
Kane, A., and F. Tompa, "", LLC, vol. 26, no. 4, pp. 407–415, 2011.
Chen, G., H. Vo, S. Wu, B. Ooi, and M.. Özsu, "A Framework for Supporting DBMS-like Indexes in the Cloud.", PVLDB, vol. 4, no. 11, pp. 702–713, 2011.
Ataullah, A., and F. Tompa, "Business Policy Modeling and Enforcement in Databases.", PVLDB, vol. 4, no. 11, pp. 921–931, 2011.
Golab, L., F. Korn, and D. Srivastava, "Efficient and Effective Analysis of Data Quality using Pattern Tableaux.", IEEE Data Eng. Bull., vol. 34, no. 3, pp. 26–33, 2011.
Cormack, G., M. Smucker, and C. Clarke, "Efficient and effective spam filtering and re-ranking for large web datasets.", Inf. Retr., vol. 14, no. 5, pp. 441–465, 2011.
Toman, D., and G. Weddell, "Fundamentals of Physical Design and Query Compilation", Fundamentals of Physical Design and Query Compilation, 2011.
Zou, L., J. Mo, L. Chen, M.. Özsu, and D. Zhao, "gStore: Answering SPARQL Queries via Subgraph Matching.", PVLDB, vol. 4, no. 8, pp. 482–493, 2011.
Yakout, M., A. Elmagarmid, J. Neville, M. Ouzzani, and I. Ilyas, "Guided Data Repair", CoRR, vol. abs/1103.3103, 2011.
Yakout, M., A. Elmagarmid, J. Neville, M. Ouzzani, and I. Ilyas, "Guided data repair.", PVLDB, vol. 4, no. 5, pp. 279–289, 2011.
Whissell, J., and C. Clarke, "Improving document clustering using Okapi BM25 feature weighting.", Inf. Retr., vol. 14, no. 5, pp. 466–487, 2011.
Wong, R., M.. Özsu, A. Fu, P. Yu, L. Liu, and Y. Liu, "Maximizing bichromatic reverse nearest neighbor for L p -norm in two- and three-dimensional spaces.", VLDB J., vol. 20, no. 6, pp. 893–919, 2011.
Özsu, M.., and P. Valduriez, Principles of Distributed Database Systems, Third Edition., , pp. I–XIX, 1-845, 2011.
Ilyas, I., and M. Soliman, "Probabilistic Ranking Techniques in Relational Databases", Probabilistic Ranking Techniques in Relational Databases, 2011.
Minhas, U., S. Rajagopalan, B. Cully, A. Aboulnaga, K. Salem, and A. Warfield, "RemusDB: Transparent High Availability for Database Systems.", PVLDB, vol. 4, no. 11, pp. 738–748, 2011.
Belkin, N., C. Clarke, N. Gao, J. Kamps, and J. Karlgren, "Report on the SIGIR workshop on “entertain me”: supporting complex search tasks.", SIGIR Forum, vol. 45, no. 2, pp. 51–59, 2011.
Zhang, H., and S. Sundaram, "Robustness of Information Diffusion Algorithms to Locally Bounded Adversaries", CoRR, vol. abs/1110.3843, 2011.
Kling, P., M.. Özsu, and K. Daudjee, "Scaling XML query processing: distribution, localization and pruning.", Distributed and Parallel Databases, vol. 29, no. 5–6, pp. 445–490, 2011.
Bateni, MH., L. Golab, MT. Hajiaghayi, and H. Karloff, "Scheduling to Minimize Staleness and Stretch in Real-Time Data Warehouses.", Theory Comput. Syst., vol. 49, no. 4, pp. 757–780, 2011.
Chockler, G., E. Dekel, J. JáJá, and J. Lin, "Special Issue on Cloud Computing.", J. Parallel Distrib. Comput., vol. 71, no. 6, pp. 731, 2011.
MacDonald, C., C. Clarke, and J. Wang, "The 1st international workshop on diversity in document retrieval.", SIGIR Forum, vol. 45, no. 2, pp. 87–93, 2011.

2010

Itakura, K., and C. Clarke, "A framework for BM25F-based XML retrieval.", SIGIR, pp. 843–844, 2010.
Kamali, S., and F. Tompa, "A new mathematics retrieval system.", CIKM, pp. 1413–1416, 2010.
Abouzour, M., K. Salem, and P. Bumbulis, "Automatic tuning of the multiprogramming level in Sybase SQL Anywhere.", ICDE Workshops, pp. 99–104, 2010.
Lafreniere, B., A. Bunt, J. Whissell, C. Clarke, and M. Terry, "Characterizing large-scale use of a direct manipulation application in the wild.", Graphics Interface, pp. 11–18, 2010.
Clarke, C., "ClueWeb09 and TREC Diversity.", NTCIR, pp. 13, 2010.
Lin, J., and C. Dyer, "Data-Intensive Text Processing with MapReduce.", NAACL (Tutorial Abstracts), pp. 1–2, 2010.
Lin, J., and M. Schatz, "Design patterns for efficient graph algorithms in MapReduce.", MLG@KDD, pp. 78–85, 2010.
Özsu, M.., and P. Kling, "Distributed XML Query Processing - (Extended Abstract).", XSym, pp. 1–2, 2010.
Savinov, S., and K. Daudjee, "Dynamic database replica provisioning through virtualization.", CloudDB, pp. 41–46, 2010.
Zou, L., L. Chen, M.. Özsu, and D. Zhao, "Dynamic Skyline Queries in Large Graphs.", DASFAA (2), pp. 62–78, 2010.
Tao, Y., and M.. Özsu, "Efficient Decision Tree Re-alignment for Clustering Time-Changing Data Streams.", From Active Data Management to Event-Based Systems and More, pp. 20–43, 2010.
Pound, J., I. Ilyas, and G. Weddell, "Expressive and flexible access to web-extracted data: a keyword-based structured query language.", SIGMOD Conference, pp. 423–434, 2010.
Smucker, M., and C. Jethani, "Human performance and retrieval precision revisited.", SIGIR, pp. 595–602, 2010.
Wang, L., J. Lin, and D. Metzler, "Learning to efficiently rank.", SIGIR, pp. 138–145, 2010.
Soliman, M., M. Saleeb, and I. Ilyas, "MashRank: Towards uncertainty-aware and rank-aware mashups.", ICDE, pp. 1137–1140, 2010.
Dolman, L., F. Tompa, I. Kiringa, R. Pottinger, and J. Mylopoulos, "Next generation business intelligence (BI) tools.", CASCON, pp. 352–354, 2010.
Stanchev, L., and G. Weddell, "On Building an Index Advisor for Semantic Web Queries.", FOIS, pp. 147–157, 2010.
Borgida, A., J. de Bruijn, E. Franconi, I. Seylan, U. Straccia, D. Toman, and G. Weddell, "On Finding Query Rewritings under Expressive Constraints.", SEBD, pp. 426–437, 2010.
Lunn, D., M. Bernstein, C. Marshall, J.. Matias, J. Nyce, and F. Tompa, "Past visions of hypertext and their influence on us today.", HT, pp. 315, 2010.
Beskales, G., M. Soliman, I. Ilyas, S. Ben-David, and Y. Kim, "ProbClean: A probabilistic duplicate detection system.", ICDE, pp. 1193–1196, 2010.
Lin, J., N. Madnani, and B. Dorr, "Putting the User in the Loop: Interactive Maximal Marginal Relevance for Query-Focused Summarization.", HLT-NAACL, pp. 305–308, 2010.
Pound, J., D. Toman, G. Weddell, and J. Wu, "Query Algebra and Query Optimization for Concept Assertion Retrieval.", Description Logics, 2010.
Wang, L., D. Metzler, and J. Lin, "Ranking under temporal constraints.", CIKM, pp. 79–88, 2010.
Mojdeh, M., and G. Cormack, "Semi-supervised spam filtering using aggressive consistency learning.", SIGIR, pp. 751–752, 2010.
Kontchakov, R., C. Lutz, D. Toman, F. Wolter, and M. Zakharyaschev, "The Combined Approach to Query Answering in DL-Lite.", KR, 2010.
Akinyemi, J., C. Clarke, and M. Kolla, "Towards a collection-based results diversification.", RIAO, pp. 202–205, 2010.
Ilyas, I., D. Martinenghi, N. Polyzotis, and M. Tagliasacchi, "Trends in Rank Join.", SeCO Workshop, pp. 135–137, 2010.
Elsayed, T., N. Asadi, L. Wang, J. Lin, and D. Metzler, "UMD and USC/ISI: TREC 2010 Web Track Experiments with Ivory.", TREC, 2010.
Ilyas, I., "Uncertainty in Rank Join.", SeCO Workshop, pp. 128–134, 2010.
Smucker, M., C. Clarke, G. Cormack, and O. Vechtomova, "University of Waterloo at TREC 2010: Legal Interactive.", TREC, 2010.
Ozmen, O., K. Salem, J. Schindler, and S. Daniel, "Workload-aware storage layout for database systems.", SIGMOD Conference, pp. 939–950, 2010.
Lo, E., C. Binnig, D. Kossmann, M.. Özsu, and W-K. Hon, "A framework for testing DBMS features.", VLDB J., vol. 19, no. 2, pp. 203–230, 2010.
Bidoki, A., P. Ghodsnia, N. Yazdani, and F. Oroumchian, "A3CRank: An adaptive ranking method based on connectivity, content and click-through data.", Inf. Process. Manage., vol. 46, no. 2, pp. 159–169, 2010.
Soror, A., U. Minhas, A. Aboulnaga, K. Salem, P. Kokosielis, and S. Kamath, "Automatic virtual machine configuration for database workloads.", ACM Trans. Database Syst., vol. 35, no. 1, pp. 7:1-7:47, 2010.
Soliman, M., I. Ilyas, and M. Saleeb, "Building Ranked Mashups of Unstructured Sources with Uncertain Information.", PVLDB, vol. 3, no. 1, pp. 826–837, 2010.
Golab, L., H. Karloff, F. Korn, and D. Srivastava, "Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux.", PVLDB, vol. 3, no. 2, pp. 1641–1644, 2010.
Golab, L., and M.. Özsu, "Data Stream Management", Data Stream Management, 2010.
Lin, J., and C. Dyer, "Data-Intensive Text Processing with MapReduce", Data-Intensive Text Processing with MapReduce, 2010.
Cormack, G., M. Smucker, and C. Clarke, "Efficient and Effective Spam Filtering and Re-ranking for Large Web Datasets", CoRR, vol. abs/1004.5168, 2010.
Srivastava, D., L. Golab, R. Greer, T. Johnson, J. Seidel, V. Shkapenyuk, O. Spatscheck, and J. Yates, "Enabling Real Time Data Analysis.", PVLDB, vol. 3, no. 1, pp. 1–2, 2010.
Kling, P., M.. Özsu, and K. Daudjee, "Generating Efficient Execution Plans for Vertically Partitioned XML Databases.", PVLDB, vol. 4, no. 1, pp. 1–11, 2010.
Büttcher, S., C. Clarke, and G. Cormack, Information Retrieval - Implementing and Evaluating Search Engines., , pp. I–XXIV, 1-606, 2010.
Ben-David, S., R. Trefler, and G. Weddell, "Model Checking Using Description Logic.", J. Log. Comput., vol. 20, no. 1, pp. 111–131, 2010.
Wang, Q., K. Daudjee, and M.. Özsu, "Popularity-aware prefetch in P2P range caching.", Peer-to-Peer Networking and Applications, vol. 3, no. 2, pp. 145–160, 2010.
Pound, J., I. Ilyas, and G. Weddell, "QUICK: Expressive and Flexible Search over Knowledge Bases and Text Collections.", PVLDB, vol. 3, no. 2, pp. 1573–1576, 2010.
Azzopardi, L., K. Järvelin, J. Kamps, and M. Smucker, "Report on the SIGIR 2010 workshop on the simulation of interaction.", SIGIR Forum, vol. 44, no. 2, pp. 35–47, 2010.
Beskales, G., I. Ilyas, and L. Golab, "Sampling the Repairs of Functional Dependency Violations under Hard Constraints.", PVLDB, vol. 3, no. 1, pp. 197–207, 2010.
Stanchev, L., and G. Weddell, "Saving space and time using index merging.", Data Knowl. Eng., vol. 69, no. 10, pp. 1062–1080, 2010.
Soliman, M., I. Ilyas, and S. Ben-David, "Supporting ranking queries on uncertain and incomplete data.", VLDB J., vol. 19, no. 4, pp. 477–501, 2010.
Ailamaki, A., L. Haas, H.. Jagadish, D. Maier, M.. Özsu, and M. Winslett, "Time for Our Field to Grow Up.", PVLDB, vol. 3, no. 2, pp. 1658, 2010.

2009

Fiser, P., and D. Toman, "A Fast SOP Minimizer for Logic Funcions Described by Many Product Terms.", DSD, pp. 757–764, 2009.
Smucker, M., and J. Allan, "A New Measure of the Cluster Hypothesis.", ICTIR, pp. 281–288, 2009.
Qasim, U., V. Oria, Y-fang. Wu, M. Houle, and M.. Özsu, "A partial-order based active cache for recommender systems.", RecSys, pp. 209–212, 2009.
Smucker, M., J. Allan, and B. Carterette, "Agreement among statistical significance tests for information retrieval evaluation at varying sample sizes.", SIGIR, pp. 630–631, 2009.
Clarke, C., M. Kolla, and O. Vechtomova, "An Effectiveness Measure for Ambiguous and Underspecified Queries.", ICTIR, pp. 188–199, 2009.
Toman, D., and G. Weddell, "Applications and Extensions of PTIME Description Logics with Functional Constraints.", IJCAI, pp. 948–954, 2009.
Ashkan, A., and C. Clarke, "Characterizing commercial intent.", CIKM, pp. 67–76, 2009.
Ashkan, A., C. Clarke, E. Agichtein, and Q. Guo, "Classifying and Characterizing Query Intent.", ECIR, pp. 578–586, 2009.
Liu, X., A. Aboulnaga, K. Salem, and X. Li, "CLIC: CLient-Informed Caching for Storage Servers.", FAST, pp. 297–310, 2009.
Whissell, J., C. Clarke, and A. Ashkan, "Clustering web queries.", CIKM, pp. 899–908, 2009.
Kontchakov, R., C. Lutz, D. Toman, F. Wolter, and M. Zakharyaschev, "Combined FO Rewritability for Conjunctive Query Answering in DL-Lite.", Description Logics, 2009.
Pound, J., D. Toman, G. Weddell, and J. Wu, "Concept Projection in Algebras for Computing Certain Answer Descriptions.", Description Logics, 2009.
Lutz, C., D. Toman, and F. Wolter, "Conjunctive Query Answering in the Description Logic EL Using a Relational Database System.", IJCAI, pp. 2070–2075, 2009.
Lin, J., and C. Dyer, "Data Intensive Text Processing with MapReduce.", HLT-NAACL (Tutorial Abstracts), pp. 1–2, 2009.
Özsu, M.., "Distributed XML Processing.", APWeb/WAIM, pp. 1, 2009.
Tao, Y., and M.. Özsu, "Efficient decision tree construction for mining time-varying data streams.", CASCON, pp. 43–57, 2009.
Chan, E., and J. Zhang, "Efficient Evaluation of Static and Dynamic Optimal Route Queries.", SSTD, pp. 386–391, 2009.
Henry, K., C. Swanson, Q. Xie, and K. Daudjee, "Efficient Hierarchical Quorums in Unstructured Peer-to-Peer Networks.", OTM Conferences (1), pp. 183–200, 2009.
Ashkan, A., C. Clarke, E. Agichtein, and Q. Guo, "Estimating Ad Clickthrough Rate through Query Intent Analysis.", Web Intelligence, pp. 222–229, 2009.
Cormode, G., L. Golab, F. Korn, A. McGregor, D. Srivastava, and X. Zhang, "Estimating the confidence of conditional functional dependencies.", SIGMOD Conference, pp. 469–482, 2009.
Smucker, M., C. Clarke, and G. Cormack, "Experiments with ClueWeb09: Relevance Feedback and Web Tracks.", TREC, 2009.
Ben-David, S., J. Pound, R. Trefler, D. Tsarkov, and G. Weddell, "Fair Cycle Detection using Description Logic Reasoning.", Description Logics, 2009.
Kolcz, A., and G. Cormack, "Genre-based decomposition of email class noise.", KDD, pp. 427–436, 2009.
Guo, Q., E. Agichtein, C. Clarke, and A. Ashkan, "In the Mood to Click? Towards Inferring Receptiveness to Search Advertising.", Web Intelligence, pp. 319–324, 2009.
Tang, N., J. Yu, H. Tang, M.. Özsu, and P. Boncz, "Materialized View Selection in XML Databases.", DASFAA, pp. 616–630, 2009.
Tao, Y., and M.. Özsu, "Mining data streams with periodically changing distributions.", CIKM, pp. 887–896, 2009.
Tao, Y., and M.. Özsu, "Mining frequent itemsets in time-varying data streams.", CIKM, pp. 1521–1524, 2009.
Lin, J., T. Elsayed, L. Wang, and D. Metzler, "Of Ivory and Smurfs: Loxodontan MapReduce Experiments for Web Search.", TREC, 2009.
Cormack, G., and J-M. da Cruz, "On the relative age of spam and ham training samples for email filtering.", SIGIR, pp. 744–745, 2009.
Clarke, C., N. Craswell, and I. Soboroff, "Overview of the TREC 2009 Web Track.", TREC, 2009.
Zhang, H., I. Ilyas, and K. Salem, "PSALM: Cardinality Estimation inthe Presence of Fine-Grained Access Controls.", ICDE, pp. 505–516, 2009.
Ilyas, I., D. Martinenghi, and M. Tagliasacchi, "Rank-Join Algorithms for Search Computing.", SeCO Workshop, pp. 211–224, 2009.
Soliman, M., and I. Ilyas, "Ranking with Uncertain Scores.", ICDE, pp. 317–328, 2009.
Cormack, G., C. Clarke, and S. Büttcher, "Reciprocal rank fusion outperforms condorcet and individual rank learning methods.", SIGIR, pp. 758–759, 2009.
Bateni, MH., L. Golab, M. Hajiaghayi, and H. Karloff, "Scheduling to minimize staleness and stretch in real-time data warehouses.", SPAA, pp. 29–38, 2009.
Golab, L., T. Johnson, and V. Shkapenyuk, "Scheduling Updates in a Real-Time Stream Warehouse.", ICDE, pp. 1207–1210, 2009.
Cormack, G., and A. Kolcz, "Spam filter evaluation with imprecise ground truth.", SIGIR, pp. 604–611, 2009.
Golab, L., T. Johnson, J.. Seidel, and V. Shkapenyuk, "Stream warehousing with DataDepot.", SIGMOD Conference, pp. 847–854, 2009.
Ashkan, A., and C. Clarke, "Term-based commercial intent analysis.", SIGIR, pp. 800–801, 2009.
Murray, G.., J. Lin, W.. Wilbur, and Z. Lu, "Users’ adjustments to unsuccessful queries in biomedical search.", JCDL, pp. 433–434, 2009.
Itakura, K., and C. Clarke, "Using dynamic markov compression to detect vandalism in the wikipedia.", SIGIR, pp. 822–823, 2009.
Lin, J., and W.. Wilbur, "", Inf. Retr., vol. 12, no. 4, pp. 487–503, 2009.
Lin, J., G.. Murray, B. Dorr, J. Hajic, and P. Pecina, "A cost-effective lexical acquisition process for large-scale thesaurus translation.", Language Resources and Evaluation, vol. 43, no. 1, pp. 27–40, 2009.
Chomicki, J., and D. Toman, "Abstract Versus Concrete Temporal Query Languages.", Encyclopedia of Database Systems, pp. 1–6, 2009.
Özsu, M.., "Client-Server DBMS.", Encyclopedia of Database Systems, pp. 342–344, 2009.
Klavans, J., C. Sheffield, E. Abels, J. Lin, R. Passonneau, T. Sidhu, and D. Soergel, "Computational linguistics for metadata building (CLiMB): using text mining for the automatic identification, categorization, and disambiguation of subject terms for image metadata.", Multimedia Tools Appl., vol. 42, no. 1, pp. 115–138, 2009.
Wan, Q., R. Wong, I. Ilyas, M.. Özsu, and Y. Peng, "Creating Competitive Products.", PVLDB, vol. 2, no. 1, pp. 898–909, 2009.
Golab, L., "Data Stream.", Encyclopedia of Database Systems, pp. 638, 2009.
Aboulnaga, A., K. Salem, A. Soror, U. Minhas, P. Kokosielis, and S. Kamath, "Deploying Database Appliances in the Cloud.", IEEE Data Eng. Bull., vol. 32, no. 1, pp. 13–20, 2009.
Haas, P., I. Ilyas, G. Lohman, and V. Markl, "Discovering and Exploiting Statistical Properties for Query Optimization in Relational Databases: A Survey.", Statistical Analysis and Data Mining, vol. 1, no. 4, pp. 223–250, 2009.
Zou, L., L. Chen, and M.. Özsu, "DistanceJoin: Pattern Match Query In a Large Graph Database.", PVLDB, vol. 2, no. 1, pp. 886–897, 2009.
Tompa, F., "Document Databases.", Encyclopedia of Database Systems, pp. 938–939, 2009.