2021
"The proper care and feeding of CAMELS: How limited training data affects streamflow prediction.", Environ. Model. Softw., vol. 135, pp. 104926, 2021.
,
2020
"A Framework for Extracted View Maintenance.", DocEng, pp. 16:1-16:4, 2020.
,
"A Lightweight Environment for Learning Experimental IR Research Practices.", SIGIR, pp. 2113–2116, 2020.
,
"A Mixed-Method Analysis of Text and Audio Search Interfaces with Varying Task Complexity.", ICTIR, pp. 61-68, 2020.
,
"A Think-Aloud Study to Understand Factors Affecting Online Health Search.", CHIIR, pp. 273–282, 2020.
,
"An Open-Source Interface to the Canadian Surface Prediction Archive.", JCDL, pp. 529–530, 2020.
,
"Approximate Nearest Neighbor Search and Lightweight Dense Vector Reranking in Multi-Stage Retrieval Architectures.", ICTIR, pp. 97–100, 2020.
,
"Attention-based Learning for Missing Data Imputation in HoloClean.", MLSys, 2020.
,
"Building community at distance: a datathon during COVID-19.", Digit. Libr. Perspect., vol. 36, pp. 415-428, 2020.
,
"Capreolus: A Toolkit for End-to-End Neural Ad Hoc Retrieval.", WSDM, pp. 861–864, 2020.
,
"ChronoCache: Predictive and Adaptive Mid-Tier Query Result Caching.", SIGMOD Conference, pp. 2391–2406, 2020.
,
"Consentio: Managing Consent to Data Access using Permissioned Blockchains.", IEEE ICBC, pp. 1-9, 2020.
,
"Content-Based Exploration of Archival Images Using Neural Networks.", JCDL, pp. 489–490, 2020.
,
"Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset.", SDP@EMNLP, pp. 31-41, 2020.
,
"Cross-Lingual Training of Neural Models for Document Ranking.", EMNLP (Findings), pp. 2768–2773, 2020.
,
"DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference.", ACL, pp. 2246–2251, 2020.
,
"Designing Templates for Eliciting Commonsense Knowledge from Pretrained Sequence-to-Sequence Models.", COLING, pp. 3449–3453, 2020.
,
"Distant Supervision for Multi-Stage Fine-Tuning in Retrieval-Based Question Answering.", WWW, pp. 2934-2940, 2020.
,
"Document Ranking with a Pretrained Sequence-to-Sequence Model.", EMNLP (Findings), pp. 708–718, 2020.
,
"Dowsing for Math Answers with Tangent-L.", CLEF (Working Notes), 2020.
,
"DynaMast: Adaptive Dynamic Mastering for Replicated Systems.", ICDE, pp. 1381–1392, 2020.
,
"ELite: Cost-effective Approximation of Exploration-based Graph Analysis.", GRADES-NDA@SIGMOD, pp. 6:1-6:10, 2020.
,
"Erratum for Discovering Order Dependencies through Order Compatibility (EDBT 2019).", EDBT, pp. 659–663, 2020.
,
"Evaluating Pretrained Transformer Models for Citation Recommendation.", BIR@ECIR, pp. 89–100, 2020.
,
"Exploring the Limits of Simple Learners in Knowledge Distillation for Document Classification with DocBERT.", RepL4NLP@ACL, pp. 72–77, 2020.
,
"First Order Rewritability for Ontology Mediated Querying in Horn-DLFD.", Description Logics, 2020.
,
"Flexible IR Pipelines with Capreolus.", CIKM, pp. 3181–3188, 2020.
,
"G-thinker: A Distributed Framework for Mining Subgraphs in a Big Graph.", ICDE, pp. 1369–1380, 2020.
,
"Generalized and Scalable Optimal Sparse Decision Trees.", ICML, pp. 6150–6160, 2020.
,
"GSI: GPU-friendly Subgraph Isomorphism.", ICDE, pp. 1249–1260, 2020.
,
"Howl: A Deployed, Open-Source Wake Word Detection System.", CoRR, vol. abs/2008.09606, 2020.
,
"Inserting Information Bottleneck for Attribution in Transformers.", EMNLP (Findings), pp. 3850–3857, 2020.
,
"Iterative Edit-Based Unsupervised Sentence Simplification.", ACL, pp. 7918–7928, 2020.
,
"Leaving stragglers at the window: low-latency stream sampling with accuracy guarantees.", DEBS, pp. 15-26, 2020.
,
"Made to Measure: A Workshop on Human-centred metrics for information seeking.", CHIIR, pp. 484–487, 2020.
,
"Message from the General Chairs of DSC 2020.", DSC, pp. 1, 2020.
,
"Offline Evaluation by Maximum Similarity to an Ideal Ranking.", CIKM, pp. 225–234, 2020.
,
"Offline Evaluation without Gain.", ICTIR, pp. 185–192, 2020.
,
"Parallel Scheduling of Data-Intensive Tasks.", Euro-Par, pp. 117–133, 2020.
,
"Reddit Mining to Understand Gendered Movements.", EDBT/ICDT Workshops, 2020.
,
"Reddit Mining to Understand Women’s Issues in STEM.", EDBT/ICDT Workshops, 2020.
,
"Regular Path Query Evaluation on Streaming Graphs.", SIGMOD Conference, pp. 1415–1430, 2020.
,
"Reproducibility is a Process, Not an Achievement: The Replicability of IR Reproducibility Experiments.", ECIR (2), pp. 43-49, 2020.
,
"Research challenges in deep reinforcement learning-based join query optimization.", aiDM@SIGMOD, pp. 3:1-3:6, 2020.
,
"Sentinel: Understanding Data Systems.", SIGMOD Conference, pp. 2729–2732, 2020.
,
"Showing Your Work Doesn’t Always Work.", ACL, pp. 2766–2772, 2020.
,
"SimClusters: Community-Based Representations for Heterogeneous Recommendations at Twitter.", KDD, pp. 3183-3193, 2020.
,
"Social Media Mining to Understand the Impact of Co-operative Education on Mental Health.", EDM, 2020.
,
"Streaming graph processing and analytics.", DEBS, pp. 1, 2020.
,
"Supporting Interoperability Between Open-Source Search Engines with the Common Index File Format.", SIGIR, pp. 2149–2152, 2020.
,
"The Archives Unleashed Project: Technology, Process, and Community to Improve Scholarly Access to Web Archives.", JCDL, pp. 157-166, 2020.
,
"Two Birds, One Stone: A Simple, Unified Model for Text Generation from Structured and Unstructured Data.", ACL, 2020.
,
"Update Delivery Mechanisms for Prospective Information Needs: A Reproducibility Study.", CHIIR, pp. 308–312, 2020.
,
"We Could, but Should We?: Ethical Considerations for Providing Access to GeoCities and Other Historical Digital Collections.", CHIIR, pp. 135–144, 2020.
,
"Which BM25 Do You Mean? A Large-Scale Reproducibility Study of Scoring Variants.", ECIR (2), pp. 28–34, 2020.
,
"XOX Fabric: A hybrid approach to blockchain transaction execution.", IEEE ICBC, pp. 1-9, 2020.
,
"A Data Scientist’s Guide to Streamflow Prediction.", CoRR, vol. abs/2006.12975, 2020.
,
"A Prototype of Serverless Lucene.", CoRR, vol. abs/2002.01447, 2020.
,
"A+ Indexes: Lightweight and Highly Flexible Adjacency Lists for Graph Database Management Systems.", CoRR, vol. abs/2004.00130, 2020.
,
"aeSpTV: An Adaptive and Efficient Framework for Sparse Tensor-Vector Product Kernel on a High-Performance Computing Platform.", IEEE Trans. Parallel Distributed Syst., vol. 31, pp. 2329–2345, 2020.
,
"Approximate Denial Constraints.", Proc. VLDB Endow., vol. 13, pp. 1682–1695, 2020.
,
"Approximate Denial Constraints.", CoRR, vol. abs/2005.08540, 2020.
,
"Assessing top-k preferences.", CoRR, vol. abs/2007.11682, 2020.
,
"Batchwise Probabilistic Incremental Data Cleaning.", CoRR, vol. abs/2011.04730, 2020.
,
"Compact group discovery in attributed graphs and social networks.", Inf. Process. Manag., vol. 57, pp. 102054, 2020.
,
"Conversational Question Reformulation via Sequence-to-Sequence Architectures and Pretrained Language Models.", CoRR, vol. abs/2004.01909, 2020.
,
"Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset.", CoRR, vol. abs/2007.07846, 2020.
,
"Cydex: Neural Search Infrastructure for the Scholarly Literature.", SDP@EMNLP, pp. 168–173, 2020.
,
"DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference.", CoRR, vol. abs/2004.12993, 2020.
,
"Detecting Opportunities for Differential Maintenance of Extracted Views.", CoRR, vol. abs/2007.01973, 2020.
,
"Discovering Domain Orders through Order Dependencies.", CoRR, vol. abs/2005.14068, 2020.
,
"Distilling Dense Representations for Ranking using Tightly-Coupled Teachers.", CoRR, vol. abs/2010.11386, 2020.
,
"Document Ranking with a Pretrained Sequence-to-Sequence Model.", CoRR, vol. abs/2003.06713, 2020.
,
"Evaluating sentence-level relevance feedback for high-recall information retrieval.", Inf. Retr. J., vol. 23, pp. 1-26, 2020.
,
"FastFabric: Scaling hyperledger fabric to 20 000 transactions per second.", Int. J. Netw. Manag., vol. 30, 2020.
,
"Generalized Optimal Sparse Decision Trees.", CoRR, vol. abs/2006.08690, 2020.
,
"Graphsurge: Graph Analytics on View Collections Using Differential Computation.", CoRR, vol. abs/2004.05297, 2020.
,
"Inserting Information Bottlenecks for Attribution in Transformers.", CoRR, vol. abs/2012.13838, 2020.
,
"Introduction to the special issue on Self-managing and Hardware-Optimized Database Systems 2019.", Distributed Parallel Databases, vol. 38, pp. 767–769, 2020.
,
"Iterative Edit-Based Unsupervised Sentence Simplification.", CoRR, vol. abs/2006.09639, 2020.
,
"Kamino: Constraint-Aware Differentially Private Data Synthesis.", CoRR, vol. abs/2012.15713, 2020.
,
"Latte-Mix: Measuring Sentence Semantic Similarity with Latent Categorical Mixtures.", CoRR, vol. abs/2010.11351, 2020.
,
"Locating Influential Agents in Social Networks: Budget-Constrained Seed Set Selection.", Canadian Conference on AI, pp. 15–28, 2020.
,
"Micro-journal mining to understand mood triggers.", Computing, vol. 102, pp. 1227–1244, 2020.
,
"MorphoSys: Automatic Physical Design Metamorphosis for Distributed Database Systems.", Proc. VLDB Endow., vol. 13, pp. 3573–3587, 2020.
,
"Navigation-based candidate expansion and pretrained language models for citation recommendation.", Scientometrics, vol. 125, pp. 3001–3016, 2020.
,
"Navigation-Based Candidate Expansion and Pretrained Language Models for Citation Recommendation.", CoRR, vol. abs/2001.08687, 2020.
,
"On sampling from data with duplicate records.", CoRR, vol. abs/2008.10549, 2020.
,
"Participation in TREC 2020 COVID Track Using Continuous Active Learning.", CoRR, vol. abs/2011.01453, 2020.
,
"Pretrained Transformers for Text Ranking: BERT and Beyond.", CoRR, vol. abs/2010.06467, 2020.
,
"Rainfall-Runoff Prediction at Multiple Timescales with a Single Long Short-Term Memory Network.", CoRR, vol. abs/2010.07921, 2020.
,
"Rapid Adaptation of BERT for Information Extraction on Domain-Specific Business Documents.", CoRR, vol. abs/2002.01861, 2020.
,
"Rapidly Bootstrapping a Question Answering Dataset for COVID-19.", CoRR, vol. abs/2004.11339, 2020.
,
"Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research Dataset: Preliminary Thoughts and Lessons Learned.", CoRR, vol. abs/2004.05125, 2020.
,
"Record fusion: A learning approach.", CoRR, vol. abs/2006.10208, 2020.
,
"Regular Path Query Evaluation on Streaming Graphs.", CoRR, vol. abs/2004.02012, 2020.
,
Robust keyword search in large attributed graphs., vol. 23, pp. 502-524, 2020.
,
"Scalable Mining of Maximal Quasi-Cliques: An Algorithm-System Codesign Approach.", CoRR, vol. abs/2005.00081, 2020.
,
"Scientific Claim Verification with VERT5ERINI.", CoRR, vol. abs/2010.11930, 2020.
,
"SegaBERT: Pre-training of Segment-aware BERT for Language Understanding.", CoRR, vol. abs/2004.14996, 2020.
,
"Semantics of the Unwritten.", CoRR, vol. abs/2004.02251, 2020.
,
"Sentinel: Universal Analysis and Insight for Data Systems.", Proc. VLDB Endow., vol. 13, pp. 2720–2733, 2020.
,
"Showing Your Work Doesn’t Always Work.", CoRR, vol. abs/2004.13705, 2020.
,
"Special issue on best papers of DaMoN 2018.", VLDB J., vol. 29, pp. 755, 2020.
,
"Special issue on best papers of VLDB 2017.", VLDB J., vol. 29, 2020.
,
"Supporting Interoperability Between Open-Source Search Engines with the Common Index File Format.", CoRR, vol. abs/2003.08276, 2020.
,
"The Archives Unleashed Project: Technology, Process, and Community to Improve Scholarly Access to Web Archives.", CoRR, vol. abs/2001.05399, 2020.
,
"The Future is Big Graphs! A Community View on Graph Processing Systems.", CoRR, vol. abs/2012.06171, 2020.
,
"The ubiquity of large graphs and surprising challenges of graph processing: extended survey.", VLDB J., vol. 29, pp. 595–618, 2020.
,
"To Paraphrase or Not To Paraphrase: User-Controllable Selective Paraphrase Generation.", CoRR, vol. abs/2008.09290, 2020.
,
"TTTTTackling WinoGrande Schemas.", CoRR, vol. abs/2003.08380, 2020.
,
"Using Feature-Based Description Logics to avoid Duplicate Elimination in Object-Relational Query Languages.", K\"unstliche Intell., vol. 34, pp. 355–363, 2020.
,
2019
Data Cleaning, pp. 285, 2019.
,
"A Formal Framework for Probabilistic Unclean Databases.", ICDT, 2019.
,
"A Semi-Supervised Framework of Clustering Selection for De-Duplication.", ICDE, pp. 208–219, 2019.
,
"Aligning Cross-Lingual Entities with Multi-Aspect Information.", EMNLP/IJCNLP (1), pp. 4430–4440, 2019.
,
"APEx: Accuracy-Aware Differentially Private Data Exploration.", SIGMOD Conference, pp. 177–194, 2019.
,
"Applying BERT to Document Retrieval with Birch.", EMNLP/IJCNLP (3), pp. 19–24, 2019.
,
"Approximate Inference in Structured Instances with Noisy Categorical Observations.", UAI, pp. 152, 2019.
,
"Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling.", EMNLP/IJCNLP (1), pp. 5369–5380, 2019.
,
"Bring Order to Data.", AMW, 2019.
,
"Building Community and Tools for Analyzing Web Archives Through Datathons.", JCDL, pp. 265–268, 2019.
,
"Building Scalable Machine Learning Solutions for Data Cleaning.", BTW, pp. 27–28, 2019.
,
"Challenges and Opportunities in Understanding Spoken Queries Directed at Modern Entertainment Platforms.", SIGIR, pp. 1375–1376, 2019.
,
"Critically Examining the “Neural Hype”: Weak Baselines and the Additivity of Effectiveness Gains from Neural Ranking Models.", SIGIR, pp. 1129–1132, 2019.
,
"Cross-Domain Modeling of Sentence-Level Evidence for Document Retrieval.", EMNLP/IJCNLP (1), pp. 3488–3494, 2019.
,
"DaMoN 19: The 15th International Workshop on Data Management on New Hardware", SIGMOD Conference, pp. 2070–2071, 2019.
,
"Detecting Customer Complaint Escalation with Recurrent Neural Networks and Manually-Engineered Features.", NAACL-HLT (2), pp. 56–63, 2019.
,
"Detecting Customer Complaint Escalation with Recurrent Neural Networks and Manually-Engineered Features.", NAACL-HLT (2), pp. 56–63, 2019.
,
"Distributed Discovery of Functional Dependencies.", ICDE, pp. 1590–1593, 2019.
,
"DPI: The Data Processing Interface for Modern Networks.", CIDR, 2019.
,
"Dynamic Sampling Meets Pooling.", SIGIR, pp. 1217–1220, 2019.
,
"End-to-End Open-Domain Question Answering with BERTserini.", NAACL-HLT (Demonstrations), pp. 72–77, 2019.
,
"Exhaustive Query Answering via Referring Expressions.", Description Logics, 2019.
,
"Exhaustive Query Answering via Referring Expressions.", Description Logics, 2019.
,
"Experimental Analysis of Streaming Algorithms for Graph Partitioning.", SIGMOD Conference, pp. 1375–1392, 2019.
,
"ExplIQuE: Interactive Databases Exploration with SQL.", CIKM, pp. 2877–2880, 2019.
,
"FastFabric: Scaling Hyperledger Fabric to 20, 000 Transactions per Second.", IEEE ICBC, pp. 455–463, 2019.
,
"Finding ALL Answers to OBDA Queries Using Referring Expressions.", Australasian Conference on Artificial Intelligence, pp. 117–129, 2019.
,
"From MAXSCORE to Block-Max Wand: The Story of How Lucene Significantly Improved Query Evaluation Performance.", ECIR (2), pp. 20-27, 2019.
,
"FunDL - A Family of Feature-Based Description Logics, with Applications in Querying Structured Data Sources.", Description Logic, Theory Combination, and All That, pp. 404–430, 2019.
,
"Gender Differences in Science and Engineering: A Data Mining Approach.", EDBT/ICDT Workshops, 2019.
,
"Gender Differences in Work-Integrated Learning Assessments.", EDM, 2019.
,
"GraphWrangler: An Interactive Graph View on Relational Data.", SIGMOD Conference, pp. 1865–1868, 2019.
,
"HoloDetect: Few-Shot Learning for Error Detection.", SIGMOD Conference, pp. 829–846, 2019.
,
"Honkling: In-Browser Personalization for Ubiquitous Keyword Spotting.", EMNLP/IJCNLP (3), pp. 91–96, 2019.
,
"Identification and Ranking of Biomedical Informatics Researcher Citation Statistics through a Google Scholar Scraper.", AMIA, 2019.
,
"Identity Resolution in Ontology Based Data Access to Structured Data Sources.", PRICAI (1), pp. 473–485, 2019.
,
"Incorporating Contextual and Syntactic Structures Improves Semantic Similarity Modeling.", EMNLP/IJCNLP (1), pp. 1204–1209, 2019.
,
"Information Retrieval Meets Scalable Text Analytics: Solr Integration with Spark.", SIGIR, pp. 1313–1316, 2019.
,
"Informative Summarization of Numeric Data.", SSDBM, pp. 97–108, 2019.
,
"Mitigating Trust Issues in Electric Vehicle Charging using a Blockchain.", e-Energy, pp. 160–164, 2019.
,
"Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search.", AAAI, pp. 232–240, 2019.
,
"Natural Language Generation for Effective Knowledge Distillation.", DeepLo@EMNLP-IJCNLP, pp. 202–208, 2019.
,
"On Limited Conjunctions and Partial Features in Parameter-Tractable Feature Logics.", AAAI, pp. 2995–3002, 2019.
,
"On Special Description Logics for Processes and Plans.", Description Logics, 2019.
,
"Online abuse detection: the value of preprocessing and neural attention models.", WASSA@NAACL-HLT, pp. 16–24, 2019.
,
"Overview of the 2019 Open-Source IR Replicability Challenge (OSIRRC 2019).", OSIRRC@SIGIR, pp. 1–7, 2019.
,
"Overview of the TREC 2016 Real-Time Summarization Track.", TREC, 2019.
,
"Patterns of Search Result Examination: Query to First Action.", CIKM, pp. 1833–1842, 2019.
,
"Predictable and Consistent Information Extraction.", DocEng, pp. 14:1-14:10, 2019.
,
"Quantifying Bias and Variance of System Rankings.", SIGIR, pp. 1089–1092, 2019.
,
"Query and Answer Expansion from Conversation History.", TREC, 2019.
,
"Rethinking Complex Neural Network Architectures for Document Classification.", NAACL-HLT (1), pp. 4046–4051, 2019.
,
"Scalable Content-Based Analysis of Images in Web Archives with TensorFlow and the Archives Unleashed Toolkit.", JCDL, pp. 436–437, 2019.
,
"Semi-supervised clustering for de-duplication.", AISTATS, pp. 1659–1667, 2019.
,
"Sift: resource-efficient consensus with RDMA.", CoNEXT, pp. 260–271, 2019.
,
"Simple Attention-Based Representation Learning for Ranking Short Social Media Posts.", NAACL-HLT (1), pp. 2212–2217, 2019.
,
"Simple Techniques for Cross-Collection Relevance Feedback.", ECIR (1), 2019.
,
"Solr Integration in the Anserini Information Retrieval Toolkit.", SIGIR, pp. 1285–1288, 2019.
,
"T-thinker: a task-centric distributed framework for compute-intensive divide-and-conquer algorithms.", PPoPP, pp. 411-412, 2019.
,
"The Archives Unleashed Notebook: Madlibs for Jumpstarting Scholarly Exploration of Web Archives.", JCDL, pp. 337–338, 2019.
,
"The Cost of a WARC: Analyzing Web Archives in the Cloud.", JCDL, pp. 261–264, 2019.
,
"The Impact of Score Ties on Repeatability in Document Ranking.", SIGIR, pp. 1125–1128, 2019.
,
"The SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019).", SIGIR, pp. 1432–1434, 2019.
,
"Time Constrained Continuous Subgraph Search Over Streaming Graphs.", ICDE, pp. 1082–1093, 2019.
,
"Time-Limits and Summaries for Faster Relevance Assessing.", SIGIR, pp. 901–904, 2019.
,
"Two Birds, One Stone: A Simple, Unified Model for Text Generation from Structured and Unstructured Data.", CoRR, vol. abs/1909.10158, 2019.
,
"Unbiased Low-Variance Estimators for Precision and Related Information Retrieval Effectiveness Measures.", SIGIR, pp. 945–948, 2019.
,
"Universal voice-enabled user interfaces using JavaScript.", IUI Companion, pp. 81-82, 2019.
,
"University of Waterloo Docker Images for OSIRRC at SIGIR 2019.", OSIRRC@SIGIR, pp. 36, 2019.
,
"Unsupervised String Transformation Learning for Entity Consolidation.", ICDE, pp. 196–207, 2019.
,
"UWaterlooMDS at the TREC 2019 Decision Track.", TREC, 2019.
,
"Warclight: A Rails Engine for Web Archive Discovery.", JCDL, pp. 442–443, 2019.
,
"WatDFS: A Project for Understanding Distributed Systems in the Undergraduate Curriculum.", SIGCSE, pp. 920-926, 2019.
,
"WaterlooClarke at the TREC 2019 Conversational Assistant Track.", TREC, 2019.
,
"What Part of the Neural Network Does This? Understanding LSTMs by Measuring and Dissecting Neurons.", EMNLP/IJCNLP (1), pp. 5822–5829, 2019.
,
"Yelling at Your TV: An Analysis of Speech Recognition Errors and Subsequent User Behavior on Entertainment Systems.", SIGIR, pp. 853–856, 2019.
,
"Aligning Cross-Lingual Entities with Multi-Aspect Information.", CoRR, vol. abs/1910.06575, 2019.
,
"Approximate Inference in Structured Instances with Noisy Categorical Observations.", CoRR, vol. abs/1907.00141, 2019.
,
"Attentive Student Meets Multi-Task Teacher: Improved Knowledge Distillation for Pretrained Models.", CoRR, vol. abs/1911.03588, 2019.
,
"Box Covers and Domain Orderings for Beyond Worst-Case Join Processing.", CoRR, vol. abs/1909.12102, 2019.
,
"Building self-clustering RDF databases using Tunable-LSH.", VLDB J., vol. 28, issue 2, 2019.
,
"Consentio: Managing Consent to Data Access using Permissioned Blockchains.", CoRR, vol. abs/1910.07110, 2019.
,
"Correlation Constraint Shortest Path over Large Multi-Relation Graphs.", Proc. VLDB Endow., vol. 12, pp. 488–501, 2019.
,
"Critically Examining the “Neural Hype”: Weak Baselines and the Additivity of Effectiveness Gains from Neural Ranking Models.", CoRR, vol. abs/1904.09171, 2019.
,
"Cross-Lingual Relevance Transfer for Document Retrieval.", CoRR, vol. abs/1911.02989, 2019.
,
"Cross-lingual text alignment for fine-grained plagiarism detection.", J. Inf. Sci., vol. 45, issue 4, 2019.
,
"Data Augmentation for BERT Fine-Tuning in Open-Domain Question Answering.", CoRR, vol. abs/1904.06652, 2019.
,
"Data unification at scale: data tamer.", Making Databases Work, 2019.
,
"Distilling Task-Specific Knowledge from BERT into Simple Neural Networks.", CoRR, vol. abs/1903.12136, 2019.
,
"Distributed Dependency Discovery.", CoRR, vol. abs/1903.05228, 2019.
,
"Distributed Implementations of Dependency Discovery Algorithms.", PVLDB, vol. 12, pp. 1624–1636, 2019.
,
"DocBERT: BERT for Document Classification.", CoRR, vol. abs/1904.08398, 2019.
,
"Document Expansion by Query Prediction.", CoRR, vol. abs/1904.08375, 2019.
,
"End-to-End Open-Domain Question Answering with BERTserini.", CoRR, vol. abs/1902.01718, 2019.
,
"Errata Note: Discovering Order Dependencies through Order Compatibility.", CoRR, vol. abs/1905.02010, 2019.
,
"Explicit Pairwise Word Interaction Modeling Improves Pretrained Transformers for English Semantic Similarity Tasks.", CoRR, vol. abs/1911.02847, 2019.
,
"Exploiting Token and Path-based Representations of Code for Identifying Security-Relevant Commits.", CoRR, vol. abs/1911.07620, 2019.
,
"FastFabric: Scaling Hyperledger Fabric to 20, 000 Transactions per Second.", CoRR, vol. abs/1901.00910, 2019.
,
"Graph Query Processing.", Encyclopedia of Big Data Technologies, 2019.
,
"GSI: GPU-friendly Subgraph Isomorphism.", CoRR, vol. abs/1906.03420, 2019.
,
"HoloDetect: Few-Shot Learning for Error Detection.", CoRR, vol. abs/1904.02285, 2019.
,
"Lucene for Approximate Nearest-Neighbors Search on Arbitrary Dense Vectors.", CoRR, vol. abs/1910.10208, 2019.
,
"Matching Entities Across Different Knowledge Graphs with Graph Embeddings.", CoRR, vol. abs/1903.06607, 2019.
,
"Multi-Stage Document Ranking with BERT.", CoRR, vol. abs/1910.14424, 2019.
,
"Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins.", PVLDB, vol. 12, issue 11, 2019.
,
"Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins.", CoRR, vol. abs/1903.02076, 2019.
,
"Principles of Progress Indicators for Database Repairing.", CoRR, vol. abs/1904.06492, 2019.
,
"Query Reformulation using Query History for Passage Retrieval in Conversational Search.", CoRR, vol. abs/2005.02230, 2019.
,
"Secure Multi-Party Functional Dependency Discovery.", Proc. VLDB Endow., vol. 13, pp. 184–196, 2019.
,
"Simple Applications of BERT for Ad Hoc Document Retrieval.", CoRR, vol. abs/1903.10972, 2019.
,
"Simple BERT Models for Relation Extraction and Semantic Role Labeling.", CoRR, vol. abs/1904.05255, 2019.
,
"Technical Report: Optimizing Human Involvement for Entity Matching and Consolidation.", CoRR, vol. abs/1906.06574, 2019.
,
"The Performance Envelope of Inverted Indexing on Modern Hardware.", CoRR, vol. abs/1910.11028, 2019.
,
"The Proper Care and Feeding of CAMELS: How Limited Training Data Affects Streamflow Prediction.", CoRR, vol. abs/1911.07249, 2019.
,
"The Simplest Thing That Can Possibly Work: Pseudo-Relevance Feedback Using Text Classification.", CoRR, vol. abs/1904.08861, 2019.
,
"Types of Stream Processing Algorithms.", Encyclopedia of Big Data Technologies, 2019.
,
"What Would Elsa Do? Freezing Layers During Transformer Fine-Tuning.", CoRR, vol. abs/1911.03090, 2019.
,
"XOX Fabric: A hybrid approach to transaction execution.", CoRR, vol. abs/1906.11229, 2019.
,
"Foreword.", Making Databases Work, 2019.
2018
"A Study of Immediate Requery Behavior in Search.", CHIIR, pp. 181–190, 2018.
,
"A System for Efficient High-Recall Retrieval.", SIGIR, pp. 1317–1320, 2018.
,
"Algorithmic Aspects of Parallel Query Processing.", SIGMOD Conference, pp. 1659–1664, 2018.
,
"An Experimental Analysis of the Power Consumption of Convolutional Neural Networks for Keyword Spotting.", ICASSP, pp. 5479–5483, 2018.
,
"Apollo: Learning Query Correlations for Predictive Caching in Geo-Distributed Systems.", EDBT, pp. 253–264, 2018.
,
"Beyond Pooling.", SIGIR, pp. 1169–1172, 2018.
,
"Building Data Civilizer Pipelines with an Advanced Workflow Engine.", ICDE, pp. 1593–1596, 2018.
,
"Carousel: Low-Latency Transaction Processing for Globally-Distributed Data.", SIGMOD Conference, pp. 231–243, 2018.
,
"Choosing Math Features for BM25 Ranking with Tangent-L.", DocEng, pp. 17:1-17:10, 2018.
,
"Contextual Data Cleaning.", ICDE Workshops, pp. 21–24, 2018.
,
"Data Analytics to Improve Co-Operative Education.", EDBT/ICDT Workshops, pp. 16–21, 2018.
,
"Deep Residual Learning for Small-Footprint Keyword Spotting.", ICASSP, pp. 5484–5488, 2018.
,
"Distribution-Aware Stream Partitioning for Distributed Stream Processing Systems.", BeyondMR@SIGMOD, pp. 6:1-6:10, 2018.
,
"EC-Store: Bridging the Gap between Storage and Latency in Distributed Erasure Coded Systems.", ICDCS, pp. 255–266, 2018.
,
"Effective Team Formation in Expert Networks.", AMW, 2018.
,
"Effective User Interaction for High-Recall Retrieval: Less is More.", CIKM, pp. 187–196, 2018.
,
"Farewell Freebase: Migrating the SimpleQuestions Dataset to DBpedia.", COLING, pp. 2093–2103, 2018.
,
"FASTOD: Bringing Order to Data.", ICDE, pp. 1561–1564, 2018.
,
"FastOFD: Contextual Data Cleaning with Ontology Functional Dependencies.", EDBT, pp. 694–697, 2018.
,
"Identity Resolution in Conjunctive Querying over DL-Based Knowledge Bases.", Description Logics, 2018.
,
"Job Description Mining to Understand Work-Integrated Learning.", EDM, 2018.
,
"Multi-query Optimization in Federated RDF Systems.", DASFAA (1), pp. 745–765, 2018.
,
"Multi-Task Learning with Neural Networks for Voice Query Understanding on an Entertainment Platform.", KDD, pp. 636–645, 2018.
,
"On Limited Conjunctions in Polynomial Feature Logics, with Applications in OBDA.", KR, pp. 655–656, 2018.
,
"Query Driven Algorithm Selection in Early Stage Retrieval.", WSDM, pp. 396–404, 2018.
,
"RaMP: A Lightweight RDMA Abstraction for Loosely Coupled Applications.", HotCloud, 2018.
,
"RecService: Distributed Real-Time Graph Processing at Twitter.", HotCloud, 2018.
,
"Refresh Strategies in Continuous Active Learning.", ProfS/KG4IR/Data:Search@SIGIR, pp. 18–23, 2018.
,
"Renormalization of NoSQL Database Schemas.", ER, pp. 479–487, 2018.
,
"Robust, Scalable, Real-Time Event Time Series Aggregation at Twitter.", SIGMOD Conference, pp. 595–599, 2018.
,
"Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery.", ICDE, pp. 989–1000, 2018.
,
"Serverless Data Analytics with Flint.", IEEE CLOUD, pp. 451–455, 2018.
,
"Spectral Measures of Distortion for Change Detection in Dynamic Graphs.", COMPLEX NETWORKS (2), pp. 54–66, 2018.
,
"Split-Lists and Initial Thresholds for WAND-based Search.", SIGIR, pp. 877–880, 2018.
,
"Stream WatDiv: A Streaming RDF Benchmark.", SBD@SIGMOD, pp. 3:1-3:6, 2018.
,
"Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks.", NAACL-HLT (2), pp. 291–296, 2018.
,
"Technology-Assisted Review in Empirical Medicine: Waterloo Participation in CLEF eHealth 2018.", CLEF (Working Notes), 2018.
,
"The Evolution of Content Analysis for Personalized Recommendations at Twitter.", SIGIR, pp. 1355–1356, 2018.
,
"The Quest for Total Recall.", DocEng, pp. 6:1-6:2, 2018.
,
"The Utility of the Abstract Relational Model and Attribute Paths in SQL.", EKAW, pp. 195–211, 2018.
,
"Tutorial: Adaptive Replication and Partitioning in Data Systems.", Middleware (Tutorials), pp. 1:1-1:5, 2018.
,
"Update Delivery Mechanisms for Prospective Information Needs: An Analysis of Attention in Mobile Users.", SIGIR, pp. 785–794, 2018.
,
"What Do Viewers Say to Their TVs?: An Analysis of Voice Queries to Entertainment Systems.", SIGIR, pp. 1213–1216, 2018.
,
"Workload-Aware CPU Performance Scaling for Transactional Database Systems.", SIGMOD Conference, pp. 291–306, 2018.
,
"CNNs for NLP in the Browser: Client-Side Deployment and Visualization Opportunities.", NAACL-HLT (Demonstrations), pp. 61-65, 2018.
,
"CNNs for NLP in the Browser: Client-Side Deployment and Visualization Opportunities.", NAACL-HLT (Demonstrations), pp. 61-65, 2018.
,
"Computing without Servers, V8, Rocket Ships, and Other Batsh*t Crazy Ideas in Data Systems.", DESIRES, pp. 3-6, 2018.
,
Fashioning a Search Engine to Support Humanities Research., vol. abs/1901.00910: DocEng, pp. 32:1-32:10, 2018.
,
"H2oloo at TREC 2018: Cross-Collection Relevance Transfer for the Common Core Track.", TREC, 2018.
,
"Hypertexts.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"MRG_UWaterloo Participation in the TREC 2018 Common Core Track.", TREC, 2018.
,
"Overview of the TREC 2018 Real-Time Summarization Track.", TREC, 2018.
,
"Pay-Per-Request Deployment of Neural Network Models Using Serverless Architectures.", NAACL-HLT (Demonstrations), pp. 6-10, 2018.
,
"Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks.", NAACL-HLT (2), pp. 291–296, 2018.
,
"UWaterlooMDS at the TREC 2018 Common Core Track.", TREC, 2018.
,
"A Formal Framework For Probabilistic Unclean Databases.", CoRR, vol. abs/1801.06750, 2018.
,
"A Location-Query-Browse Graph for Contextual Recommendation.", IEEE Trans. Knowl. Data Eng., vol. 30, no. 2, pp. 204–218, 2018.
,
"Abstract Versus Concrete Temporal Query Languages.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Adaptive Pruning of Neural Language Models for Mobile Devices.", CoRR, vol. abs/1809.10282, 2018.
,
"Algorithmic Aspects of Parallel Data Processing.", Found. Trends Databases, vol. 8, no. 4, pp. 239–370, 2018.
,
"Anserini: Reproducible Ranking Baselines Using Lucene.", ACM J. Data Inf. Qual., vol. 10, no. 4, pp. 16:1-16:20, 2018.
,
"Bikeshare Pool Sizing for Bike-and-Ride Multimodal Transit.", IEEE Trans. Intelligent Transportation Systems, vol. 19, no. 7, pp. 2279–2289, 2018.
,
"Client-Server Architecture.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Data Integration: The Current Status and the Way Forward.", IEEE Data Eng. Bull., vol. 41, no. 2, pp. 3–9, 2018.
,
"Data Manipulation Language (DML).", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Data Stream.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Database Administrator (DBA).", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Database.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Distributed Evaluation of Subgraph Queries Using Worst-case Optimal and Low-Memory Dataflows.", PVLDB, vol. 11, no. 6, pp. 691–704, 2018.
,
"Distributed Evaluation of Subgraph Queries Using Worstcase Optimal LowMemory Dataflows.", CoRR, vol. abs/1802.03760, 2018.
,
"Document Databases.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Effective and complete discovery of bidirectional order dependencies via set-based axioms.", VLDB J., vol. 27, no. 4, pp. 573–591, 2018.
,
"Enterprise Content Management.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Evaluating Computational Creativity: An Interdisciplinary Tutorial.", ACM Comput. Surv., vol. 51, no. 2, pp. 28:1-28:34, 2018.
,
"Evaluating Sentence-Level Relevance Feedback for High-Recall Information Retrieval.", CoRR, vol. abs/1803.08988, 2018.
,
"Evaluation-as-a-Service for the Computational Sciences: Overview and Outlook.", J. Data and Information Quality, vol. 10, no. 4, pp. 15:1-15:32, 2018.
,
"Experimental Analysis of Distributed Graph Systems.", PVLDB, vol. 11, no. 10, pp. 1151–1164, 2018.
,
"Experimental Analysis of Distributed Graph Systems.", CoRR, vol. abs/1806.08082, 2018.
,
"Explanation Tables.", IEEE Data Eng. Bull., vol. 41, no. 3, pp. 43–51, 2018.
,
"FLOPs as a Direct Optimization Objective for Learning Sparse Neural Networks.", CoRR, vol. abs/1811.03060, 2018.
,
"In-Browser Split-Execution Support for Interactive Analytics in the Cloud.", CoRR, vol. abs/1804.08822, 2018.
,
"JavaScript Convolutional Neural Networks for Keyword Spotting in the Browser: An Experimental Analysis.", CoRR, vol. abs/1810.12859, 2018.
,
"Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search.", CoRR, vol. abs/1805.08159, 2018.
,
"Point-Stamped Temporal Models.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Progress and Tradeoffs in Neural Language Models.", CoRR, vol. abs/1811.00942, 2018.
,
"Questionable Answers in Question Answering Research: Reproducibility and Variability of Published Results.", TACL, vol. 6, pp. 241–252, 2018.
,
"Rank-Aware Query Processing.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Rank-Join.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Repeatability Corner Cases in Document Ranking: The Impact of Score Ties.", CoRR, vol. abs/1807.05798, 2018.
,
"Report on NTCIR-13: The Thirteenth Round of NII Testbeds and Community for Information Access Research.", SIGIR Forum, vol. 52, no. 1, pp. 102–110, 2018.
,
"Research Frontiers in Information Retrieval: Report from the Third Strategic Workshop on Information Retrieval in Lorne (SWIRL 2018).", SIGIR Forum, vol. 52, no. 1, pp. 34–90, 2018.
,
"Response to “Scale Up or Scale Out for Graph Processing”.", IEEE Internet Computing, vol. 22, no. 5, pp. 18–24, 2018.
,
"Sagas.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Sapphire: Querying RDF Data Made Simple.", CoRR, vol. abs/1805.11728, 2018.
,
"Scale Up or Scale Out for Graph Processing?", IEEE Internet Computing, vol. 22, no. 3, pp. 72–78, 2018.
,
"Semi-supervised clustering for de-duplication.", CoRR, vol. abs/1810.04361, 2018.
,
"Serverless Data Analytics with Flint.", CoRR, vol. abs/1803.06354, 2018.
,
"Simple Attention-Based Representation Learning for Ranking Short Social Media Posts.", CoRR, vol. abs/1811.01013, 2018.
,
"Stream Models.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks.", CoRR, vol. abs/1812.07754, 2018.
,
"Temporal Logic in Database Query Languages.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Temporal Relational Calculus.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Temporal Vacuuming.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"The Neural Hype and Comparisons Against Weak Baselines.", SIGIR Forum, vol. 52, issue 2, pp. 40–51, 2018.
,
"Time Constrained Continuous Subgraph Search over Streaming Graphs.", CoRR, vol. abs/1801.09240, 2018.
,
"Top-k Queries.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
"Web Question Answering.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
Data Profiling, 2018.
,
"Summarization.", Encyclopedia of Database Systems (2nd ed.), 2018.
,
2017
"A Comparison of Document-at-a-Time and Score-at-a-Time Query Evaluation.", WSDM, pp. 201–210, 2017.
,
"A Comparison of Nuggets and Clusters for Evaluating Timeline Summaries.", CIKM, pp. 67–76, 2017.
,
"A Demo of the Data Civilizer System.", SIGMOD Conference, pp. 1639–1642, 2017.
,
"An analysis of memory power consumption in database systems.", DaMoN, pp. 2:1-2:9, 2017.
,
"An Exploration of Serverless Architectures for Information Retrieval.", ICTIR, pp. 241–244, 2017.
,
"An Interpolation-based Compiler and Optimizer for Relational Queries (System design Report).", IWIL@LPAR, 2017.
,
"Anserini: Enabling the Use of Lucene for Information Retrieval Research.", SIGIR, pp. 1253–1256, 2017.
,
"Authority-based Team Discovery in Social Networks.", EDBT, pp. 498–501, 2017.
,
"Automatic and Semi-Automatic Document Selection for Technology-Assisted Review.", SIGIR, pp. 905–908, 2017.
,
"Automatically Extracting High-Quality Negative Examples for Answer Selection in Question Answering.", SIGIR, pp. 797–800, 2017.
,
"Concerning Referring Expressions in Query Answers.", IJCAI, pp. 4791–4795, 2017.
,
"Data Profiling: A Tutorial.", SIGMOD Conference, pp. 1747–1751, 2017.
,
"Do We Need Specialized Graph Databases?: Benchmarking Real-Time Social Networking Applications.", GRADES@SIGMOD/PODS, pp. 12:1-12:7, 2017.
,
"Efficient Discovery of Ontology Functional Dependencies.", CIKM, pp. 1847–1856, 2017.
,
"Event Detection on Curated Tweet Streams.", SIGIR, pp. 1325–1328, 2017.
,
"Experiments with Convolutional Neural Network Models for Answer Selection.", SIGIR, pp. 1217–1220, 2017.
,
"Exploring Conversational Search With Humans, Assistants, and Wizards.", CHI Extended Abstracts, pp. 2187–2193, 2017.
,
"Finally, a Downloadable Test Collection of Tweets.", SIGIR, pp. 1225–1228, 2017.
,
"Graph Mining to Characterize Competition for Employment.", NDA@SIGMOD, pp. 3:1-3:7, 2017.
,
"Graphflow: An Active Graph Database.", SIGMOD Conference, pp. 1695–1698, 2017.
,
"GYM: A Multiround Distributed Join Algorithm.", ICDT, pp. 4:1-4:18, 2017.
,
"How Similar is the Usage of Electric Cars and Electric Bicycles?", e-Energy, pp. 334–340, 2017.
,
"In-Browser Interactive SQL Analytics with Afterburner.", SIGMOD Conference, pp. 1623–1626, 2017.
,
"Incorporating novelty, meaning, reaction and craft into computational poetry: a negative experimental result.", ICCC, pp. 183–188, 2017.
,
"Managing Sensor Data Streams: Lessons Learned from the WeBike Project.", SSDBM, pp. 1:1-1:11, 2017.
,
"Mining the Temporal Statistics of Query Terms for Searching Social Media Posts.", ICTIR, pp. 133–140, 2017.
,
"MRG_UWaterloo and WaterlooCormack Participation in the TREC 2017 Common Core Track.", TREC, 2017.
,
"Navigating Imprecision in Relevance Assessments on the Road to Total Recall: Roger and Me.", SIGIR, pp. 5–14, 2017.
,
"Netstore: leveraging network optimizations to improve distributed transaction processing performance.", ACTIVE@Middleware, pp. 1–10, 2017.
,
"On Partial Features in the DLF Dialects of Description Logic with Inverse Features.", Description Logics, 2017.
,
"On the Reusability of “Living Labs” Test Collections: : A Case Study of Real-Time Summarization.", SIGIR, pp. 793–796, 2017.
,
"Online In-Situ Interleaved Evaluation of Real-Time Push Notification Systems.", SIGIR, pp. 415–424, 2017.
,
"Optimal reducer placement to minimize data transfer in MapReduce-style processing.", BigData, pp. 339–346, 2017.
,
"Overview of the TREC 2017 Real-Time Summarization Track.", TREC, 2017.
,
"Partitioning and Segment Organization Strategies for Real-Time Selective Search on Document Streams.", WSDM, pp. 221–230, 2017.
,
"Quantization in Append-Only Collections.", ICTIR, pp. 265–268, 2017.
,
"Robust Multi-tenant Server Consolidation in the Cloud for Data Analytics Workloads.", ICDCS, pp. 2111–2118, 2017.
,
"Scalable Informative Rule Mining.", ICDE, pp. 437–448, 2017.
,
"Small-Term Distribution for Disk-Based Search.", DocEng, pp. 49–58, 2017.
,
"Social Media Mining to Understand Public Mental Health.", DMAH@VLDB, pp. 55–70, 2017.
,
"Talking to Your TV: Context-Aware Voice Search with Hierarchical Recurrent Neural Networks.", CIKM, pp. 557–566, 2017.
,
"Technology-Assisted Review in Empirical Medicine: Waterloo Participation in CLEF eHealth 2017.", CLEF (Working Notes), 2017.
,
"Ten Blue Links on Mars.", WWW, pp. 273–281, 2017.
,
"The Data Civilizer System.", CIDR, 2017.
,
"The Lucene for Information Access and Retrieval Research (LIARR) Workshop at SIGIR 2017.", SIGIR, pp. 1429–1430, 2017.
,
"The Pareto Frontier of Utility Models as a Framework for Evaluating Push Notification Systems.", ICTIR, pp. 253–256, 2017.
,
"The Positive and Negative Influence of Search Results on People’s Decisions about the Efficacy of Medical Treatments.", ICTIR, pp. 209–216, 2017.
,
"UWaterlooMDS at the TREC 2017 Common Core Track.", TREC, 2017.
,
"An Insight Extraction System on BioMedical Literature with Deep Neural Networks.", EMNLP, pp. 2691–2701, 2017.
,
"Partitioning and Segment Organization Strategies for Real-Time Selective Search on Document Streams.", WSDM, pp. 221–230, 2017.
,
"The Data Civilizer System.", CIDR, 2017.
,
"An Experimental Analysis of the Power Consumption of Convolutional Neural Networks for Keyword Spotting.", CoRR, vol. abs/1711.00333, 2017.
,
"An Exploration of Approaches to Integrating Neural Reranking Models in Multi-Stage Ranking Architectures.", CoRR, vol. abs/1707.08275, 2017.
,
"Combining Vertex-Centric Graph Processing with SPARQL for Large-Scale RDF Data Analytics.", IEEE Trans. Parallel Distrib. Syst., vol. 28, no. 12, pp. 3374–3388, 2017.
,
"Data Quality: The Role of Empiricism.", SIGMOD Rec., vol. 46, no. 4, pp. 35–43, 2017.
,
"Deep Residual Learning for Small-Footprint Keyword Spotting.", CoRR, vol. abs/1710.10361, 2017.
,
"Distant Supervision for Topic Classification of Tweets in Curated Streams.", CoRR, vol. abs/1704.06726, 2017.
,
"Effective and Complete Discovery of Order Dependencies via Set-based Axiomatization.", PVLDB, vol. 10, no. 7, pp. 721–732, 2017.
,
"Efficient and Effective Tail Latency Minimization in Multi-Stage Retrieval Systems.", CoRR, vol. abs/1704.03970, 2017.
,
"Entity Consolidation: The Golden Record Problem.", CoRR, vol. abs/1709.10436, 2017.
,
"Exploring the Effectiveness of Convolutional Neural Networks for Answer Selection in End-to-End Question Answering.", CoRR, vol. abs/1707.07804, 2017.
,
"G-thinker: Big Graph Mining Made Easier and Faster.", CoRR, vol. abs/1709.03110, 2017.
,
"Graph-Based RDF Data Management.", Data Science and Engineering, vol. 2, no. 1, pp. 56–70, 2017.
,
"HoloClean: Holistic Data Repairs with Probabilistic Inference.", PVLDB, vol. 10, no. 11, pp. 1190–1201, 2017.
,
"HoloClean: Holistic Data Repairs with Probabilistic Inference.", CoRR, vol. abs/1702.00820, 2017.
,
"Honk: A PyTorch Reimplementation of Convolutional Neural Networks for Keyword Spotting.", CoRR, vol. abs/1710.06554, 2017.
,
"Impact of Feature Selection on Micro-Text Classification.", CoRR, vol. abs/1708.08123, 2017.
,
"In Defense of MapReduce.", IEEE Internet Computing, vol. 21, no. 3, pp. 94–98, 2017.
,
"Integrating Lexical and Temporal Signals in Neural Ranking Models for Searching Social Media Streams.", CoRR, vol. abs/1707.07792, 2017.
,
"Inverted Treaps.", ACM Trans. Inf. Syst., vol. 35, no. 3, pp. 22:1-22:45, 2017.
,
"Logic programming approach to automata-based decision procedures.", J. Log. Algebraic Methods Program., vol. 86, no. 1, pp. 391–407, 2017.
"NoSE: Schema Design for NoSQL Applications.", IEEE Trans. Knowl. Data Eng., vol. 29, no. 10, pp. 2275–2289, 2017.
,
"Overview of Special Issue.", SIGIR Forum, vol. 51, no. 2, pp. 1–25, 2017.
,
"Private Exploration Primitives for Data Cleaning.", CoRR, vol. abs/1712.10266, 2017.
,
"Smart Meter Data Analytics: Systems, Algorithms, and Benchmarking.", ACM Trans. Database Syst., vol. 42, no. 1, pp. 2:1-2:39, 2017.
,
"Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks.", CoRR, vol. abs/1712.01969, 2017.
,
"Talking to Your TV: Context-Aware Voice Search with Hierarchical Recurrent Neural Networks.", CoRR, vol. abs/1705.04892, 2017.
,
"The Lambda and the Kappa.", IEEE Internet Computing, vol. 21, no. 5, pp. 60–66, 2017.
,
"The role of index compression in score-at-a-time query evaluation.", Inf. Retr. Journal, vol. 20, no. 3, pp. 199–220, 2017.
,
"The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing.", PVLDB, vol. 11, no. 4, pp. 420–431, 2017.
,
"The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing: A User Survey.", CoRR, vol. abs/1709.03188, 2017.
,
"ViewDF: Declarative incremental view maintenance for streaming data.", Inf. Syst., vol. 71, pp. 55–67, 2017.
,
"Warcbase: Scalable Analytics Infrastructure for Exploring Web Archives.", JOCCH, vol. 10, no. 4, pp. 22:1-22:30, 2017.
,
"Comparative Assessment of Alignment Algorithms for NGS Data: Features, Considerations, Implementations, and Future.", Algorithms for Next-Generation Sequencing Data, pp. 187–202, 2017.
,
2016
"A Performance Comparison of Algorithms for Byzantine Agreement in Distributed Systems.", EDCC, pp. 249–260, 2016.
,
"A Platform for Streaming Push Notifications to Mobile Assessors.", SIGIR, pp. 1077–1080, 2016.
,
"A Space-Efficient Data Structure for Fast Access Control in ECM Systems.", SACMAT, pp. 191–201, 2016.
,
"An Architecture for Privacy-Preserving and Replicable High-Recall Retrieval Experiments.", SIGIR, pp. 1085–1088, 2016.
,
"An Easter Egg Hunting Approach to Test Collection Building in Dynamic Domains.", EVIA@NTCIR, 2016.
,
"An Exploration of Evaluation Metrics for Mobile Push Notifications.", SIGIR, pp. 741–744, 2016.
,
"Are Secondary Assessors Uncertain When They Disagree About Relevance Judgements?", CHIIR, pp. 233–236, 2016.
,
"Burst Detection in Social Media Streams for Tracking Interest Profiles in Real Time.", SIGIR, pp. 777–780, 2016.
,
"CLAMS: Bringing Quality to Data Lakes.", SIGMOD Conference, pp. 2089–2092, 2016.
,
"Compressing and Decoding Term Statistics Time Series.", ECIR, pp. 675–681, 2016.
,
"Content Selection and Curation for Web Archiving: The Gatekeepers vs. the Masses.", JCDL, pp. 107–110, 2016.
,
"Dark Data: Are we solving the right problems?", ICDE, pp. 1444–1445, 2016.
,
"Data Cleaning: Overview and Emerging Challenges.", SIGMOD Conference, pp. 2201–2206, 2016.
,
"Data profiling.", ICDE, pp. 1432–1435, 2016.
,
"DataXFormer: A robust transformation discovery system.", ICDE, pp. 1134–1145, 2016.
,
"Desiderata for Exploratory Search Interfaces to Web Archives in Support of Scholarly Activities.", JCDL, pp. 103–106, 2016.
,
"Discovering key moments in social media streams.", CCNC, pp. 366–374, 2016.
,
"Dynamic Cutoff Prediction in Multi-Stage Retrieval Systems.", ADCS, pp. 17–24, 2016.
,
"eGraphSearch: Effective Keyword Search in Graphs.", CIKM, pp. 2461–2464, 2016.
,
"Engineering Quality and Reliability in Technology-Assisted Review.", SIGIR, pp. 75–84, 2016.
,
"Estimating topical volume in social media streams.", SAC, pp. 1096–1101, 2016.
,
"Evaluating digital poetry: Insights from the CAT.", ICCC, pp. 60–67, 2016.
,
"Evaluating Search Among Secrets.", EVIA@NTCIR, 2016.
,
"Exploring and Discovering Archive-It Collections with Warcbase.", DH, pp. 285–288, 2016.
,
"Impact of Review-Set Selection on Human Assessment for Text Classification.", SIGIR, pp. 861–864, 2016.
,
"In Vacuo and In Situ Evaluation of SIMD Codecs.", ADCS, pp. 1–8, 2016.
,
"Interleaved Evaluation for Retrospective Summarization and Prospective Notification on Document Streams.", SIGIR, pp. 175–184, 2016.
,
"LONLIES: Estimating Property Values for Long Tail Entities.", SIGIR, pp. 1125–1128, 2016.
,
"Modeling Optimal Switching Behavior.", CHIIR, pp. 317–320, 2016.
,
"Multi-Stage Math Formula Search: Using Appearance-Based Similarity Metrics at Scale.", SIGIR, pp. 145–154, 2016.
,
"Noise-Contrastive Estimation for Answer Selection with Deep Neural Networks.", CIKM, pp. 1913–1916, 2016.
,
"NoSE: Schema design for NoSQL applications.", ICDE, pp. 181–192, 2016.
,
"Object-Relational Queries over CFDI_nc Knowledge Bases: OBDA for the SQL-Literate (extended abstract).", Description Logics, 2016.
,
"On Competition for Undergraduate Co-op Placements: A Graph Mining Approach.", EDM, pp. 394–399, 2016.
,
"On Partial Features in the DLF Family of Description Logics.", PRICAI, pp. 529–542, 2016.
,
"On Referring Expressions in Information Systems Derived from Conceptual Modelling.", ER, pp. 183–197, 2016.
,
"On Referring Expressions in Query Answering over First Order Knowledge Bases.", KR, pp. 319–328, 2016.
,
"Ontology Based Data Access with Referring Expressions for Logics with the Tree Model Property - (Extended Abstract).", Australasian Conference on Artificial Intelligence, pp. 353–361, 2016.
,
"Optimizing Nugget Annotations with Active Learning.", CIKM, pp. 2359–2364, 2016.
,
"Pairwise Word Interaction Modeling with Deep Neural Networks for Semantic Similarity Measurement.", HLT-NAACL, pp. 937–948, 2016.
,
"Panel: The State of Data: Invited Paper from panelists.", IDEAS, pp. 2–11, 2016.
,
"Preface.", EVIA@NTCIR, 2016.
,
"Privacy-Preserving IR 2016: Differential Privacy, Search, and Social Media.", SIGIR, pp. 1247–1248, 2016.
,
"Prizm: A Wireless Access Point for Proxy-Based Web Lifelogging.", LTA@MM, pp. 19–25, 2016.
,
"Providing Serializability for Pregel-like Graph Processing Systems.", EDBT, pp. 77–88, 2016.
,
"Range prediction for electric bicycles.", e-Energy, pp. 21:1-21:11, 2016.
,
"Rank-at-a-Time Query Processing.", ICTIR, pp. 229–232, 2016.
,
"Retrievability in API-Based “Evaluation as a Service”.", ICTIR, pp. 91–94, 2016.
,
"Sampling Strategies and Active Learning for Volume Estimation.", SIGIR, pp. 981–984, 2016.
,
"Scalability of Continuous Active Learning for Reliable High-Recall Text Classification.", CIKM, pp. 1039–1048, 2016.
,
"Second Workshop on Search and Exploration of X-Rated Information (SEXI’16): WSDM Workshop Summary.", WSDM, pp. 697–698, 2016.
,
"SIGIR 2016 Workshop WebQA II: Web Question Answering Beyond Factoids.", SIGIR, pp. 1251–1252, 2016.
,
"Simple Dynamic Emission Strategies for Microblog Filtering.", SIGIR, pp. 1009–1012, 2016.
,
"Tangent-3 at the NTCIR-12 MathIR Task.", NTCIR, 2016.
,
"Temporal Query Expansion Using a Continuous Hidden Markov Model.", ICTIR, pp. 295–298, 2016.
,
"Total Recall: Blue Sky on Mars.", ICTIR, pp. 45–48, 2016.
,
"Toward Reproducible Baselines: The Open-Source IR Reproducibility Challenge.", ECIR, pp. 408–420, 2016.
,
"TREC 2016 Total Recall Track Overview.", TREC, 2016.
,
"Using a Dictionary and n-gram Alignment to Improve Fine-grained Cross-Language Plagiarism Detection.", DocEng, pp. 59–68, 2016.
,
"V-Hadoop: Virtualized Hadoop using containers.", NCA, pp. 237–241, 2016.
,
"Walking Without a Map: Ranking-Based Traversal for Querying Linked Data.", International Semantic Web Conference (1), pp. 305–324, 2016.
,
"Web Data Management in the RDF Age: Keynote talk abstract.", IDEAS, pp. 1, 2016.
,
"“When to Stop” Waterloo (Cormack) Participation in the TREC 2016 Total Recall Track.", TREC, 2016.
,
"Overview of the TREC 2016 Contextual Suggestion Track.", TREC, 2016.
,
"UMD-TTIC-UW at SemEval-2016 Task 1: Attention-Based Multi-Perspective Convolutional Neural Networks for Textual Similarity Measurement.", SemEval@NAACL-HLT, pp. 1103–1108, 2016.
,
"A General-Purpose Query-Centric Framework for Querying Big Graphs.", PVLDB, vol. 9, no. 7, pp. 564–575, 2016.
,
"A survey of RDF data management systems.", Frontiers Comput. Sci., vol. 10, no. 3, pp. 418–432, 2016.
,
"A Survey of RDF Data Management Systems.", CoRR, vol. abs/1601.00707, 2016.
,
"Afterburner: The Case for In-Browser Analytics.", CoRR, vol. abs/1605.04035, 2016.
,
"Assessing efficiency-effectiveness tradeoffs in multi-stage retrieval systems without using relevance judgments.", Inf. Retr. Journal, vol. 19, no. 4, pp. 351–377, 2016.
,
"Authority-based Team Discovery in Social Networks.", CoRR, vol. abs/1611.02992, 2016.
,
"Data Mining of Undergraduate Course Evaluations.", Informatics in Education, vol. 15, no. 1, pp. 85–102, 2016.
,
"DBStream: A holistic approach to large-scale network traffic monitoring and analysis.", Comput. Networks, vol. 107, pp. 5–19, 2016.
,
"Detecting Data Errors: Where are we and what needs to be done?", PVLDB, vol. 9, no. 12, pp. 993–1004, 2016.
,
"Distributed Data Deduplication.", PVLDB, vol. 9, no. 11, pp. 864–875, 2016.
,
"Dynamic Trade-Off Prediction in Multi-Stage Retrieval Systems.", CoRR, vol. abs/1610.02502, 2016.
,
"Editorial: Special Issue on Web Data Quality.", J. Data and Information Quality, vol. 8, no. 1, pp. 1:1-1:3, 2016.
,
"Effective and Complete Discovery of Order Dependencies via Set-based Axiomatization.", CoRR, vol. abs/1608.06169, 2016.
,
"Effective Data Cleaning with Continuous Evaluation.", IEEE Data Eng. Bull., vol. 39, no. 2, pp. 38–46, 2016.
,
"EVIA 2016: The Seventh International Workshop on Evaluating Information Access.", SIGIR Forum, vol. 50, no. 2, pp. 44–46, 2016.
,
"Front Matter.", PVLDB, vol. 10, no. 1, pp. i–vi, 2016.
,
"GraphJet: Real-Time Content Recommendations at Twitter.", PVLDB, vol. 9, no. 13, pp. 1281–1292, 2016.
,
"Learning to identify relevant studies for systematic reviews using random forest and external information.", Machine Learning, vol. 102, no. 3, pp. 465–482, 2016.
,
"NScale: neighborhood-centric large-scale graph analytics in the cloud.", VLDB J., vol. 25, no. 2, pp. 125–150, 2016.
,
"Partial materialization for online analytical processing over multi-tagged document collections.", Knowl. Inf. Syst., vol. 47, no. 3, pp. 697–732, 2016.
,
"Processing SPARQL queries over distributed RDF graphs.", VLDB J., vol. 25, no. 2, pp. 243–268, 2016.
,
"Qualitative Data Cleaning.", PVLDB, vol. 9, no. 13, pp. 1605–1608, 2016.
,
"Quegel: A General-Purpose Query-Centric Framework for Querying Big Graphs.", CoRR, vol. abs/1601.06497, 2016.
,
"Sapphire: Querying RDF Data Made Simple.", PVLDB, vol. 9, no. 13, pp. 1481–1484, 2016.
,
"Searching from Mars.", IEEE Internet Computing, vol. 20, no. 1, pp. 78–82, 2016.
,
"Ten Blue Links on Mars.", CoRR, vol. abs/1610.06468, 2016.
,
"The Effects of Latency Penalties in Evaluating Push Notification Systems.", CoRR, vol. abs/1606.03066, 2016.
,
"The Future of Big Data Is ... JavaScript?", IEEE Internet Computing, vol. 20, no. 5, pp. 82–88, 2016.
,
"Walking without a Map: Optimizing Response Times of Traversal-Based Linked Data Queries (Extended Version).", CoRR, vol. abs/1607.01046, 2016.
,
2015
"A graph-based RDF triple store.", ICDE, pp. 1508–1511, 2015.
,
"Absorption for ABoxes and TBoxes with General Value Restrictions.", Australasian Conference on Artificial Intelligence, pp. 609–622, 2015.
,
"Anytime Ranking for Impact-Ordered Indexes.", ICTIR, pp. 301–304, 2015.
,
"Assessor Differences and User Preferences in Tweet Timeline Generation.", SIGIR, pp. 615–624, 2015.
,
"Benchmarking Smart Meter Data Analytics.", EDBT, pp. 385–396, 2015.
,
"BigDansing: A System for Big Data Cleansing.", SIGMOD Conference, pp. 1215–1230, 2015.
,
"Building a Self-Contained Search Engine in the Browser.", ICTIR, pp. 309–312, 2015.
,
"Burst Detection in Social Media Streams for Tracking Interest Profiles in Real Time.", TREC, 2015.
,
"Cache-oblivious scheduling of shared workloads.", ICDE, pp. 855–866, 2015.
,
"Contextual Search and Exploration.", RuSSIR, pp. 3–23, 2015.
,
"Database high availability using SHADOW systems.", SoCC, pp. 209–221, 2015.
,
"DataXFormer: An Interactive Data Transformation Tool.", SIGMOD Conference, pp. 883–888, 2015.
,
"Dataxformer: Leveraging the Web for Semantic Transformations.", CIDR, 2015.
,
"Developing an Open-Source Bibliometric Ranking Website Using Google Scholar Citation Profiles for Researchers in the Field of Biomedical Informatics.", MedInfo, pp. 1004, 2015.
,
"EdgeX: Edge Replication for Web Applications.", CLOUD, pp. 1041–1044, 2015.
,
"Enhancing Exploration with a Faceted Browser through Summarization.", DocEng, pp. 61–64, 2015.
,
"Evaluating Streams of Evolving News Events.", SIGIR, pp. 675–684, 2015.
,
"Executing queries over schemaless RDF databases.", ICDE, pp. 807–818, 2015.
,
"Graph Search of Software Models Using Multidimensional Scaling.", EDBT/ICDT Workshops, pp. 163–170, 2015.
,
"HDRF: Stream-Based Partitioning for Power-Law Graphs.", CIKM, pp. 243–252, 2015.
,
"Hermes: Dynamic Partitioning for Distributed Social Network Graph Databases.", EDBT, pp. 25–36, 2015.
,
"Human Competence in Creativity Evaluation.", ICCC, pp. 102–109, 2015.
,
"Identifying Duplicate and Contradictory Information in Wikipedia.", JCDL, pp. 57–60, 2015.
,
"Impact of Surrogate Assessments on High-Recall Retrieval.", SIGIR, pp. 555–564, 2015.
,
"Indexing bi-temporal windows.", SSDBM, pp. 19:1-19:12, 2015.
,
"IR Evaluation: Modeling User Behavior for Measuring Effectiveness.", SIGIR, pp. 1117–1120, 2015.
,
"KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing.", SIGMOD Conference, pp. 1247–1261, 2015.
,
"Lexical Comparison Between Wikipedia and Twitter Corpora by Using Word Embeddings.", ACL (2), pp. 657–661, 2015.
,
"Multi-Faceted Recall of Continuous Active Learning for Technology-Assisted Review.", SIGIR, pp. 763–766, 2015.
,
"On Axiomatization and Inference Complexity over a Hierarchy of Functional Dependencies.", AMW, 2015.
,
"On Enumerating Query Plans Using Analytic Tableau.", TABLEAUX, pp. 339–354, 2015.
,
"On the Krom Extension of CFDI^∀ -_nc.", Australasian Conference on Artificial Intelligence, pp. 559–571, 2015.
,
"On the Reusability of Open Test Collections.", SIGIR, pp. 827–830, 2015.
,
"On the Utility of CFDI.", Description Logics, 2015.
,
"Online Evaluation of Point-Of-Interest Recommendation Systems.", SCST@ECIR, 2015.
,
"Overview of the TREC 2015 Contextual Suggestion Track.", TREC, 2015.
,
"Polynomial encoding of ORM conceptual models in CFDI.", Description Logics, 2015.
,
"Pooling for User-Oriented Evaluation Measures.", ICTIR, pp. 341–344, 2015.
,
"Reproducible Experiments on Lexical and Temporal Feedback for Tweet Search.", ECIR, pp. 755–767, 2015.
,
"Scaling Down Distributed Infrastructure on Wimpy Machines for Personal Web Archiving.", WWW (Companion Volume), pp. 1351–1355, 2015.
,
"SIGIR 2015 Workshop on Reproducibility, Inexplicability, and Generalizability of Results (RIGOR).", SIGIR, pp. 1147–1148, 2015.
,
"Singular Referring Expressions in Conjunctive Query Answers: the case for a CFD DL Dialect.", Description Logics, 2015.
,
"Size-Constrained Weighted Set Cover.", ICDE, pp. 879–890, 2015.
,
"SMAS: A smart meter data analytics system.", ICDE, pp. 1476–1479, 2015.
,
"The Feasibility of Brute Force Scans for Real-Time Tweet Search.", ICTIR, pp. 321–324, 2015.
,
"The Power of Contextual Suggestion.", ECIR, pp. 352–357, 2015.
,
"The Sum of All Human Knowledge in Your Pocket: Full-Text Searchable Wikipedia on a Raspberry Pi.", JCDL, pp. 85–86, 2015.
,
"Towards Dynamic Green-Sizing for Database Servers.", ADMS@VLDB, pp. 25–36, 2015.
,
"University of Waterloo at TREC 2015 Microblog Track.", TREC, 2015.
,
"WaterlooClarke: TREC 2015 Clinical Decision Support Track.", TREC, 2015.
,
"WaterlooClarke: TREC 2015 Contextual Suggestion Track.", TREC, 2015.
,
"WaterlooClarke: TREC 2015 LiveQA Track.", TREC, 2015.
,
"WaterlooClarke: TREC 2015 Microblog Track.", TREC, 2015.
,
"WaterlooClarke: TREC 2015 Temporal Summarization Track.", TREC, 2015.
,
"WaterlooClarke: TREC 2015 Total Recall Track.", TREC, 2015.
,
"Web Question Answering: Beyond Factoids: SIGIR 2015 Workshop.", SIGIR, pp. 1143, 2015.
,
"What’s Wrong with my Solar Panels: a Data-Driven Approach.", EDBT/ICDT Workshops, pp. 86–93, 2015.
,
"Write Amplification: An Analysis of In-Memory Database Durability Techniques.", IMDM@VLDB, pp. 1:1-1:7, 2015.
,
"Multi-Perspective Sentence Similarity Modeling with Convolutional Neural Networks.", EMNLP, pp. 1576–1586, 2015.
,
"TREC 2015 Total Recall Track Overview.", TREC, 2015.
,
"A Family of Rank Similarity Measures Based on Maximized Effectiveness Difference.", IEEE Trans. Knowl. Data Eng., vol. 27, no. 11, pp. 2865–2877, 2015.
,
"A taxonomy of decentralized online social networks.", Peer Peer Netw. Appl., vol. 8, no. 3, pp. 367–383, 2015.
,
"A Taxonomy of Partitioned Replicated Cloud-based Database Systems.", IEEE Data Eng. Bull., vol. 38, no. 1, pp. 4–9, 2015.
,
"Assessing Efficiency-Effectiveness Tradeoffs in Multi-Stage Retrieval Systems Without Using Relevance Judgments.", CoRR, vol. abs/1506.00717, 2015.
,
"Autonomy and Reliability of Continuous Active Learning for Technology-Assisted Review.", CoRR, vol. abs/1504.06868, 2015.
,
"Clustering RDF Databases Using Tunable-LSH.", CoRR, vol. abs/1504.02523, 2015.
,
"Effective Keyword Search in Graphs.", CoRR, vol. abs/1512.06395, 2015.
,
"Evaluation-as-a-Service: Overview and Outlook.", CoRR, vol. abs/1512.07454, 2015.
,
"Gappy Pattern Matching on GPUs for On-Demand Extraction of Hierarchical Translation Grammars.", Trans. Assoc. Comput. Linguistics, vol. 3, pp. 87–100, 2015.
,
"Giraph Unchained: Barrierless Asynchronous Parallel Execution in Pregel-like Graph Processing Systems.", PVLDB, vol. 8, no. 9, pp. 950–961, 2015.
,
"Is Big Data a Transient Problem?", IEEE Internet Comput., vol. 19, no. 5, pp. 86–90, 2015.
,
"KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing.", PVLDB, vol. 8, no. 12, pp. 1952–1955, 2015.
,
"Learning to Discover Key Moments in Social Media Streams.", CoRR, vol. abs/1508.00488, 2015.
,
"Main-Memory Hash Joins on Modern Processor Architectures.", IEEE Trans. Knowl. Data Eng., vol. 27, no. 7, pp. 1754–1766, 2015.
,
"On scalable parallel recursive backtracking.", J. Parallel Distrib. Comput., vol. 84, pp. 65–75, 2015.
,
"Profiling relational data: a survey.", VLDB J., vol. 24, no. 4, pp. 557–581, 2015.
,
"Report on the Evaluation-as-a-Service (EaaS) Expert Workshop.", SIGIR Forum, vol. 49, no. 1, pp. 57–65, 2015.
,
"Report on the SIGIR 2015 Workshop on Reproducibility, Inexplicability, and Generalizability of Results (RIGOR).", SIGIR Forum, vol. 49, no. 2, pp. 107–116, 2015.
,
"Special issue of the Journal of Web Semantics on ontology-based data access.", J. Web Semant., vol. 33, pp. 1-2, 2015.
,
"The Tangent Search Engine: Improved Similarity Metrics and Scalability for Math Formula Search.", CoRR, vol. abs/1507.06235, 2015.
,
"Trends in Cleaning Relational Data: Consistency and Deduplication.", Foundations and Trends in Databases, vol. 5, no. 4, pp. 281–393, 2015.
,
2014
"A qualitative exploration of secondary assessor relevance judging behavior.", IIiX, pp. 195–204, 2014.
,
"Assessing Contextual Suggestion.", EVIA@NTCIR, 2014.
,
"Column Stores as an IR Prototyping Tool.", ECIR, pp. 789–792, 2014.
,
"Computing Electricity Consumption Profiles from Household Smart Meter Data.", EDBT/ICDT Workshops, pp. 140–147, 2014.
,
"Cost-Based Query Optimization via AI Planning.", AAAI, pp. 2344–2351, 2014.
,
"Cumulative Citation Recommendation: A Feature-Aware Comparison of Approaches.", DEXA Workshops, pp. 193–197, 2014.
,
"Data mining of undergraduate course evaluations.", EDM, pp. 347–348, 2014.
,
"Data stream warehousing.", ICDE, pp. 1290–1293, 2014.
,
"DBStream: An online aggregation, filtering and processing system for network traffic monitoring.", IWCMC, pp. 611–616, 2014.
,
"Descriptive and prescriptive data cleaning.", SIGMOD Conference, pp. 445–456, 2014.
,
"Distributed data placement to minimize communication costs via graph partitioning.", SSDBM, pp. 20:1-20:12, 2014.
,
"Diversified Stress Testing of RDF Data Management Systems.", International Semantic Web Conference (1), pp. 197–212, 2014.
,
"Do recommendations matter?: news recommendation in real life.", CSCW Companion, pp. 237–240, 2014.
,
"Effective and Efficient Bitmaps for Access Control.", DCC, pp. 433, 2014.
,
"Evaluation of machine-learning protocols for technology-assisted review in electronic discovery.", SIGIR, pp. 153–162, 2014.
,
"Information Access in Smart Cities (i-ASC).", ECIR, pp. 810–814, 2014.
,
"Information network or social network?: the structure of the twitter follow graph.", WWW (Companion Volume), pp. 493–498, 2014.
,
"Infrastructure for supporting exploration and discovery in web archives.", WWW (Companion Volume), pp. 851–856, 2014.
,
"Infrastructure support for evaluation as a service.", WWW (Companion Volume), pp. 79–82, 2014.
,
"Is the grass greener?: mining electric vehicle opinions.", e-Energy, pp. 241–252, 2014.
,
"Large-scale network traffic monitoring with DBStream, a system for rolling big data analysis.", BigData, pp. 165–170, 2014.
,
"Latency Amplification: Characterizing the Impact of Web Page Content on Load Times.", SRDS Workshops, pp. 20–25, 2014.
,
"Learning to efficiently rank on big data.", WWW (Companion Volume), pp. 209–210, 2014.
,
"Linked Data query processing.", ICDE, pp. 1286–1289, 2014.
,
"MicroFuge: A Middleware Approach to Providing Performance Isolation in Cloud Storage Systems.", ICDCS, pp. 503–513, 2014.
,
"Mouse movement during relevance judging: implications for determining user attention.", SIGIR, pp. 979–982, 2014.
,
"NADEEF/ER: generic and interactive entity resolution.", SIGMOD Conference, pp. 1071–1074, 2014.
,
"Old dogs are great at new tricks: column stores for ir prototyping.", SIGIR, pp. 863–866, 2014.
,
"On run diversity in Evaluation as a Service.", SIGIR, pp. 959–962, 2014.
,
"On the online fault-tolerant server consolidation problem.", SPAA, pp. 12–21, 2014.
,
"Optimization Techniques for “Scaling Down” Hadoop on Multi-Core, Shared-Memory Systems.", EDBT, pp. 13–24, 2014.
,
"Overview of the TREC 2014 Contextual Suggestion Track.", TREC, 2014.
,
"Partitioning strategies for spatio-textual similarity join.", BigSpatial@SIGSPATIAL, pp. 40–49, 2014.
,
"Predicting peak-demand days in the ontario peak reduction program for large consumers.", e-Energy, pp. 221–222, 2014.
,
"Pushing the CFDnc Envelope.", Description Logics, pp. 340–351, 2014.
,
"R-Store: A scalable distributed system for supporting real-time analytics.", ICDE, pp. 40–51, 2014.
,
"Reachable subwebs for traversal-based query execution.", WWW (Companion Volume), pp. 541–546, 2014.
,
"RuleMiner: Data quality rules discovery.", ICDE, pp. 1222–1225, 2014.
,
"Skewed partial bitvectors for list intersection.", SIGIR, pp. 263–272, 2014.
,
"Succinct Queries for Linking and Tracking News in Social Media.", CIKM, pp. 1883–1886, 2014.
,
"Supporting “Distant Reading” for Web Archives.", DH, 2014.
,
"Temporal feedback for tweet search with non-parametric density estimation.", SIGIR, pp. 33–42, 2014.
,
"The effect of expanding relevance judgements with duplicates.", SIGIR, pp. 1159–1162, 2014.
,
"The Impact of Future Term Statistics in Real-Time Tweet Search.", ECIR, pp. 567–572, 2014.
,
"Time well spent.", IIiX, pp. 205–214, 2014.
,
"Tolerance of Effectiveness Measures to Relevance Judging Errors.", ECIR, pp. 148–159, 2014.
,
"University of Waterloo at TREC 2014 Contextual Suggestion: Experiments with suggestion clustering.", TREC, 2014.
,
"Using visualizations to monitor changes and harvest insights from a global-scale logging infrastructure at Twitter.", IEEE VAST, pp. 113–122, 2014.
,
"Visual analytics of MOOCs at maryland.", L@S, pp. 195–196, 2014.
,
"Overview of the TREC-2014 Microblog Track.", TREC, 2014.
,
"A Family of Rank Similarity Measures based on Maximized Effectiveness Difference.", CoRR, vol. abs/1408.3587, 2014.
,
"Absorption for ABoxes.", J. Autom. Reasoning, vol. 53, no. 3, pp. 215–243, 2014.
,
"Accordion: Elastic Scalability for Database Systems Supporting Distributed Transactions.", PVLDB, vol. 7, no. 12, pp. 1035–1046, 2014.
,
"An Experimental Comparison of Pregel-like Graph Processing Systems.", PVLDB, vol. 7, no. 12, pp. 1047–1058, 2014.
,
"ConfluxDB: Multi-Master Replication for Partitioned Snapshot Isolation Databases.", PVLDB, vol. 7, no. 11, pp. 947–958, 2014.
,
"Discovering Conservation Rules.", IEEE Trans. Knowl. Data Eng., vol. 26, no. 6, pp. 1332–1348, 2014.
,
"Distributed data management using MapReduce.", ACM Comput. Surv., vol. 46, no. 3, pp. 31:1-31:42, 2014.
,
"Exploiting Representations from Statistical Machine Translation for Cross-Language Information Retrieval.", ACM Trans. Inf. Syst., vol. 32, no. 4, pp. 19:1-19:32, 2014.
,
"gStore: a graph-based SPARQL query engine.", VLDB J., vol. 23, no. 4, pp. 565–590, 2014.
,
"Identifying Duplicate and Contradictory Information in Wikipedia.", CoRR, vol. abs/1406.1143, 2014.
,
"Integrating SSD Caching into Database Systems.", IEEE Data Eng. Bull., vol. 37, no. 2, pp. 35–43, 2014.
,
"Interpretable and Informative Explanations of Outcomes.", PVLDB, vol. 8, no. 1, pp. 61–72, 2014.
,
"Location- and Query-Aware Modeling of Browsing and Click Behavior in Sponsored Search.", ACM TIST, vol. 5, no. 4, pp. 59:1-59:31, 2014.
,
"NScale: Neighborhood-centric Analytics on Large Graphs.", PVLDB, vol. 7, no. 13, pp. 1673–1676, 2014.
,
"NScale: Neighborhood-centric Large-Scale Graph Analytics in the Cloud.", CoRR, vol. abs/1405.1499, 2014.
,
"On the Feasibility and Implications of Self-Contained Search Engines in the Browser.", CoRR, vol. abs/1410.4500, 2014.
,
"Processing SPARQL Queries Over Linked Data-A Distributed Graph-based Approach.", CoRR, vol. abs/1411.6763, 2014.
,
"Real-Time Twitter Recommendation: Online Motif Detection in Large Dynamic Graphs.", PVLDB, vol. 7, no. 13, pp. 1379–1380, 2014.
,
"Report on the 1st International Workshop on Information Access in Smart Cities (i-ASC 2014).", SIGIR Forum, vol. 48, no. 2, pp. 96–104, 2014.
,
"Report on the CIKM workshop on living labs for information retrieval evaluation.", SIGIR Forum, vol. 48, no. 1, pp. 21–28, 2014.
,
"Runtime Optimizations for Tree-Based Machine Learning Models.", IEEE Trans. Knowl. Data Eng., vol. 26, no. 9, pp. 2281–2292, 2014.
,
"Sampling from repairs of conditional functional dependency violations.", VLDB J., vol. 23, no. 1, pp. 103–128, 2014.
,
"Summingbird: A Framework for Integrating Batch and Online MapReduce Computations.", PVLDB, vol. 7, no. 13, pp. 1441–1451, 2014.
,
"Top-k Nearest Neighbor Search In Uncertain Data Series.", PVLDB, vol. 8, no. 1, pp. 13–24, 2014.
,
"Undecidability of Finite Model Reasoning in DLFD.", CoRR, vol. abs/1408.4468, 2014.
,
"Workload Matters: Why RDF Databases Need a New Design.", PVLDB, vol. 7, no. 10, pp. 837–840, 2014.
,
"Data unification at scale: data tamer.", Making Databases Work, pp. 269–277, 2014.
,
"Distributed and Parallel Database Systems.", Computing Handbook, 3rd ed. (2), pp. 13: 1-24, 2014.
2013
"A month in the life of a production news recommender system.", LivingLab@CIKM, pp. 7–10, 2013.
,
"Absorption for ABoxes with Local Universal Restrictions.", Description Logics, pp. 489–500, 2013.
,
"Adaptive input admission and management for parallel stream processing.", DEBS, pp. 15–26, 2013.
,
"Analyzing the Mental Health of Engineering Students using Classification and Regression.", EDM, pp. 228–231, 2013.
,
"CFDnc: A PTIME Description Logic with Functional Constraints and Disjointness.", Description Logics, pp. 451–463, 2013.
,
"CIKM 2013 workshop on living labs for information retrieval evaluation.", CIKM, pp. 2557–2558, 2013.
,
"Classification-Based Clustering Evaluation.", ICDM, pp. 1229–1234, 2013.
,
"CWI and TU Delft Notebook TREC 2013: Contextual Suggestion, Federated Web Search, KBA, and Web Tracks.", TREC, 2013.
,
"Data Curation at Scale: The Data Tamer System.", CIDR, 2013.
,
"Data Partitioning for Video-on-Demand Services.", NCA, pp. 49–54, 2013.
,
"Data stream warehousing.", SIGMOD Conference, pp. 949–952, 2013.
,
"Dynamic memory allocation policies for postings in real-time Twitter search.", KDD, pp. 1186–1194, 2013.
,
"Effective measures for inter-document similarity.", CIKM, pp. 1361–1370, 2013.
,
"Effectiveness/efficiency tradeoffs for candidate generation in multi-stage retrieval architectures.", SIGIR, pp. 997–1000, 2013.
,
"Evaluating Contextual Suggestion.", EVIA@NTCIR, 2013.
,
"Fast data in the era of big data: Twitter’s real-time related query suggestion architecture.", SIGMOD Conference, pp. 1147–1158, 2013.
,
"Faster and smaller inverted indices with treaps.", SIGIR, pp. 193–202, 2013.
,
"Flat vs. hierarchical phrase-based translation models for cross-language information retrieval.", SIGIR, pp. 813–816, 2013.
,
"Holistic data cleaning: Putting violations into context.", ICDE, pp. 458–469, 2013.
,
"Lazy data structure maintenance for main-memory analytics over sliding windows.", DOLAP, pp. 33–38, 2013.
,
"Main-memory hash joins on multi-core CPUs: Tuning to the underlying hardware.", ICDE, pp. 362–373, 2013.
,
"Managing Geo-replicated Data in Multi-datacenters.", DNIS, pp. 23–43, 2013.
,
"Massively Parallel Suffix Array Queries and On-Demand Phrase Extraction for Statistical Machine Translation Using GPUs.", HLT-NAACL, pp. 325–334, 2013.
,
"Materialized views for eventually consistent record stores.", ICDE Workshops, pp. 250–257, 2013.
,
"Mr. MIRA: Open-Source Large-Margin Structured Learning on MapReduce.", ACL (Conference System Demonstrations), pp. 199–204, 2013.
,
"NADEEF: a commodity data cleaning system.", SIGMOD Conference, pp. 541–552, 2013.
,
"Nugget-Based Computation of Graded Relevance.", EVIA@NTCIR, 2013.
,
"On the relative trust between inconsistent data and inaccurate constraints.", ICDE, pp. 541–552, 2013.
,
"Overview of the TREC 2013 Contextual Suggestion Track.", TREC, 2013.
,
"Overview of the TREC 2013 Crowdsourcing Track.", TREC, 2013.
,
"Overview of the TREC-2013 Microblog Track.", TREC, 2013.
,
"Ray tracing in the cloud using MapReduce.", HPCS, pp. 19–26, 2013.
,
"Retrieving documents with mathematical content.", SIGIR, pp. 353–362, 2013.
,
"Search and exploration of X-Rated information (SEXI 2013).", WSDM, pp. 795–796, 2013.
,
"SIGIR 2013 workshop on modeling user behavior for information retrieval evaluation.", SIGIR, pp. 1134, 2013.
,
"Structural Similarity Search for Mathematics Retrieval.", MKM/Calculemus/DML, pp. 246–262, 2013.
,
"The Combined Approach to OBDA: Taming Role Hierarchies Using Filters.", International Semantic Web Conference (1), pp. 314–330, 2013.
,
"The impact of intent selection on diversified search evaluation.", SIGIR, pp. 921–924, 2013.
,
"Time-Biased Gain.", NTCIR, 2013.
,
"Training Efficient Tree-Based Models for Document Ranking.", ECIR, pp. 146–157, 2013.
,
"Update Management in Decentralized Social Networks.", ICDCS Workshops, pp. 196–201, 2013.
,
"Visualizing the “Pulse” of World Cities on Twitter.", ICWSM, 2013.
,
"We are drowning in a sea of least publishable units (LPUs).", SIGMOD Conference, pp. 921–922, 2013.
,
"WGB: Towards a Universal Graph Benchmark.", WBDB, pp. 58–72, 2013.
,
"WTF: the who to follow service at Twitter.", WWW, pp. 505–514, 2013.
,
"Abstractive Meeting Summarization with Entailment and Fusion.", ENLG, pp. 136–146, 2013.
,
"Towards Efficient Large-Scale Feature-Rich Statistical Machine Translation.", WMT@ACL, pp. 128–133, 2013.
,
"ACM books to launch.", Commun. ACM, vol. 56, no. 12, pp. 5, 2013.
,
"An Easy-to-use Scalable Framework for Parallel Recursive Backtracking.", CoRR, vol. abs/1312.7626, 2013.
,
"DAX: A Widely Distributed Multi-tenant Storage Service for DBMS Hosting.", PVLDB, vol. 6, no. 4, pp. 253–264, 2013.
,
"Discovering Denial Constraints.", PVLDB, vol. 6, no. 13, pp. 1498–1509, 2013.
,
"Distributed Data Placement via Graph Partitioning.", CoRR, vol. abs/1312.0285, 2013.
,
"Document vector representations for feature extraction in multi-stage document ranking.", Inf. Retr., vol. 16, no. 6, pp. 747–768, 2013.
,
"Dynamic Memory Allocation Policies for Postings in Real-Time Twitter Search", CoRR, vol. abs/1302.5302, 2013.
,
"Evaluation as a service for information retrieval.", SIGIR Forum, vol. 47, no. 2, pp. 8–14, 2013.
,
"Fast and effective soft links.", Softw., Pract. Exper., vol. 43, no. 5, pp. 577–593, 2013.
,
"Fast candidate generation for real-time tweet search with bloom filter chains.", ACM Trans. Inf. Syst., vol. 31, no. 3, pp. 13, 2013.
,
"Fast, Incremental Inverted Indexing in Main Memory for Web-Scale Collections", CoRR, vol. abs/1305.0699, 2013.
,
"HCIR 2013: the seventh international symposium on human-computer interaction and information retrieval.", SIGIR Forum, vol. 47, no. 2, pp. 33–40, 2013.
,
"Hone: “Scaling Down” Hadoop on Shared-Memory Systems.", PVLDB, vol. 6, no. 12, pp. 1354–1357, 2013.
,
"Hybrid Storage Management for Database Systems.", Proc. VLDB Endow., vol. 6, no. 8, pp. 541–552, 2013.
,
"Impact of query intent and search context on clickthrough behavior in sponsored search.", Knowl. Inf. Syst., vol. 34, no. 2, pp. 425–452, 2013.
,
"Increasing evaluation sensitivity to diversity.", Inf. Retr., vol. 16, no. 4, pp. 530–555, 2013.
,
"Mapreduce is Good Enough?If All You Have is a Hammer, Throw Away Everything That’s Not a Nail!", Big Data, vol. 1, pp. 28–37, 2013.
,
"Monoidify! Monoids as a Design Principle for Efficient MapReduce Algorithms", CoRR, vol. abs/1304.7544, 2013.
,
"Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited.", PVLDB, vol. 7, no. 1, pp. 85–96, 2013.
,
"NADEEF: A Generalized Data Cleaning System.", PVLDB, vol. 6, no. 12, pp. 1218–1221, 2013.
,
"Optimizing Multi-Top-k Queries over Uncertain Data Streams.", IEEE Trans. Knowl. Data Eng., vol. 25, no. 8, pp. 1814–1829, 2013.
,
"Probabilistic Web Data Management.", World Wide Web, vol. 16, no. 3, pp. 271–272, 2013.
,
"RemusDB: transparent high availability for database systems.", VLDB J., vol. 22, no. 1, pp. 29–45, 2013.
,
"Report on the SIGIR 2013 workshop on modeling user behavior for information retrieval evaluation (MUBE 2013).", SIGIR Forum, vol. 47, no. 2, pp. 84–95, 2013.
,
"Report on the workshop on search and exploration of x-rated information (SEXI 2013).", SIGIR Forum, vol. 47, no. 1, pp. 31–37, 2013.
,
"Data Warehouse Quality: Summary and Outlook.", Handbook of Data Quality, pp. 121–140, 2013.
,
"Perspectives on Business Intelligence", Perspectives on Business Intelligence, pp. 1–163, 2013.
,
2012
"2nd international workshop on diversity in document retrieval (DDR 2012).", WSDM, pp. 769–770, 2012.
,
"A Sequence-Oriented Stream Warehouse Paradigm for Network Monitoring Applications.", PAM, pp. 53–63, 2012.
,
"A Study of “Churn” in Tweets and Real-Time Search Queries.", ICWSM, 2012.
,
"Absorption for ABoxes.", Description Logics, 2012.
,
"Assertion Absorption in Object Queries over Knowledge Bases.", KR, 2012.
,
"Combining Statistical Translation Techniques for Cross-Language Information Retrieval.", COLING, pp. 2685–2702, 2012.
,
"Discovering Conservation Rules.", ICDE, pp. 738–749, 2012.
,
"Earlybird: Real-Time Search at Twitter.", ICDE, pp. 1360–1369, 2012.
,
"Elastic Scale-Out for Partition-Based Database Systems.", ICDE Workshops, pp. 281–288, 2012.
,
"Evaluating Real-Time Search over Tweets.", ICWSM, 2012.
,
"Exploring and analyzing documents with OLAP.", PIKM, pp. 33–40, 2012.
,
"Fast candidate generation for two-phase document ranking: postings list intersection with bloom filters.", CIKM, pp. 2419–2422, 2012.
,
"Graph data partition models for online social networks.", HT, pp. 175–180, 2012.
,
"Human question answering performance using an interactive document retrieval system.", IIiX, pp. 35–44, 2012.
,
"Interpreting keyword queries over web knowledge bases.", CIKM, pp. 305–314, 2012.
,
"Just-in-time information extraction using extraction views.", SIGMOD Conference, pp. 613–616, 2012.
,
"Large-scale machine learning at twitter.", SIGMOD Conference, pp. 793–804, 2012.
,
"Lightweight contrastive summarization for news comment mining.", SIGIR, pp. 1103–1104, 2012.
,
"Looking inside the box: context-sensitive translation for cross-language information retrieval.", SIGIR, pp. 1105–1106, 2012.
,
"Modeling browsing behavior for click analysis in sponsored search.", CIKM, pp. 2015–2019, 2012.
,
"Modeling user variance in time-biased gain.", HCIR, pp. 3, 2012.
,
"On building a reusable Twitter corpus.", SIGIR, pp. 1113–1114, 2012.
,
"Overview of the TREC 2012 Contextual Suggestion Track.", TREC, 2012.
,
"Overview of the TREC 2012 Crowdsourcing Track.", TREC, 2012.
,
"Overview of the TREC 2012 Web Track.", TREC, 2012.
,
"Overview of the TREC-2012 Microblog Track.", TREC, 2012.
,
"Stochastic simulation of time-biased gain.", CIKM, pp. 2040–2044, 2012.
,
"The Combined Approach to OBDA: Taming Role Hierarchies using Filters.", SSWS+HPCSW@ISWC, pp. 16–31, 2012.
,
"The Fault, Dear Researchers, is not in Cranfield, But in our Metrics, that they are Unrealistic.", EuroHCIR, pp. 11–12, 2012.
,
"Time to judge relevance as an indicator of assessor error.", SIGIR, pp. 1153–1154, 2012.
,
"Time-based calibration of effectiveness measures.", SIGIR, pp. 95–104, 2012.
,
"Towards benchmarking stream data warehouses.", DOLAP, pp. 105–112, 2012.
,
"Twanchor text: a preliminary study of the value of tweets as anchor text.", SIGIR, pp. 1159–1160, 2012.
,
"University of Waterloo: Logistic Regression and Reciprocal Rank Fusion at the Microblog Track.", TREC, 2012.
,
"Why Not Grab a Free Lunch? Mining Large Corpora for Parallel Sentences to Improve Translation Modeling.", HLT-NAACL, pp. 626–630, 2012.
,
"A Study of “Churn” in Tweets and Real-Time Search Queries (Extended Version)", CoRR, vol. abs/1205.6855, 2012.
,
"Answering pattern match queries in large graph databases via graph embedding.", VLDB J., vol. 21, no. 1, pp. 97–120, 2012.
,
"Fast Data in the Era of Big Data: Twitter’s Real-Time Related Query Suggestion Architecture", CoRR, vol. abs/1210.7350, 2012.
,
"MapReduce is Good Enough? If All You Have is a Hammer, Throw Away Everything That’s Not a Nail!", CoRR, vol. abs/1209.2191, 2012.
,
"On the Relative Trust between Inconsistent Data and Inaccurate Constraints", CoRR, vol. abs/1207.5226, 2012.
,
"Open source information petrieval: a report on the SIGIR 2012 workshop.", SIGIR Forum, vol. 46, no. 2, pp. 95–101, 2012.
,
"Runtime Optimizations for Prediction with Tree-Based Models", CoRR, vol. abs/1212.2287, 2012.
,
"Scalable Scheduling of Updates in Streaming Data Warehouses.", IEEE Trans. Knowl. Data Eng., vol. 24, no. 6, pp. 1092–1105, 2012.
,
"Scaling big data mining infrastructure: the twitter experience.", SIGKDD Explorations, vol. 14, no. 2, pp. 6–19, 2012.
,
"The data analytics group at the qatar computing research institute.", SIGMOD Record, vol. 41, no. 4, pp. 33–38, 2012.
,
"The Unified Logging Infrastructure for Data Analytics at Twitter", CoRR, vol. abs/1208.4171, 2012.
,
"The Unified Logging Infrastructure for Data Analytics at Twitter.", PVLDB, vol. 5, no. 12, pp. 1771–1780, 2012.
,
2011
"A cascade ranking model for efficient ranked retrieval.", SIGIR, pp. 105–114, 2011.
,
"A comparative analysis of cascade measures for novelty and diversity.", WSDM, pp. 75–84, 2011.
,
"An Assertion Retrieval Algebra for Object Queries over Knowledge Bases.", IJCAI, pp. 1051–1056, 2011.
,
"Automatic management of partitioned, replicated search services.", SoCC, pp. 27, 2011.
,
"Clustering for semi-supervised spam filtering.", CEAS, pp. 125–134, 2011.
,
"Consistency in a Stream Warehouse.", CIDR, pp. 114–122, 2011.
,
"Cross-corpus relevance projection.", SIGIR, pp. 1163–1164, 2011.
,
"Crowdsourcing with a Crowd of One and Other TREC 2011 Crowdsourcing and Web Track Experiments.", TREC, 2011.
,
"Distributed data management in 2020?", ICDE, pp. 1360, 2011.
"Do Subtopic Judgments Reflect Diversity?", ICTIR, pp. 309–312, 2011.
,
"Dynamic data allocation with replication in distributed systems.", IPCCC, pp. 1–8, 2011.
,
"Efficient core decomposition in massive networks.", ICDE, pp. 51–62, 2011.
,
"Fixpoints in Temporal Description Logics.", IJCAI, pp. 875–880, 2011.
,
"Grammar Inference for Web Documents.", WebDB, 2011.
,
"In-depth accounts and passing mentions in the news: connecting readers to the context of a news event.", iConference, pp. 790–791, 2011.
,
"Lifecycle Management of Relational Records for External Auditing and Regulatory Compliance.", POLICY, pp. 73–80, 2011.
,
"Measuring assessor accuracy: a comparison of nist assessors and user study participants.", SIGIR, pp. 1231–1232, 2011.
,
"No free lunch: brute force vs. locality-sensitive hashing for cross-lingual pairwise similarity.", SIGIR, pp. 943–952, 2011.
,