Publications

Sort by: Author Type Year

Thesis

He, X., Policy Driven Data Sharing With Provable Privacy Guarantees: Duke University, Durham, NC, USA, 2018.
Salihoglu, S., Massive-Scale Processing of Record-Oriented and Graph Data: Stanford University, USA, 2015.
Golab, L., Sliding Window Query Processing Over Data Streams: University of Waterloo, Ontario, Canada, 2006.
Lin, J., Event Structure and the Encoding of Arguments: The Syntax of the Mandarin And English Verb Phrase: Massachusetts Institute of Technology, Cambridge, MA, USA, 2004.
Ilyas, I., Rank-Aware Query Processing and Optimization: Purdue University, USA, 2004.

Journal Article

Arabzadeh, N., and C. Clarke, "A Comparison of Methods for Evaluating Generative IR", ArXiv, vol. abs/2404.04044, 2024.
Arabzadeh, N., A. Bigdeli, and C. Clarke, "Adapting Standard Retrieval Benchmarks to Evaluate Generated Answers", ArXiv, vol. abs/2401.04842, 2024.
Arabzadeh, N., S. Huo, N. Mehta, Q. Wu, C. Wang, A. Awadallah, C. Clarke, and J. Kiseleva, "Assessing and Verifying Task Utility in LLM-Powered Applications", ArXiv, vol. abs/2405.02178, 2024.
Golzadeh, K., L. Golab, and J. Szlichta, "Explaining Expert Search and Team Formation Systems With ExES", ArXiv, vol. abs/2405.12881, 2024.
Lin, S-C., L. Gao, B. Oguz, W. Xiong, J. Lin, W-tau. Yih, and X. Chen, "FLAME: Factuality-Aware Alignment for Large Language Models", ArXiv, vol. abs/2405.01525, 2024.
Arabzadeh, N., and C. Clarke, "Fréchet Distance for Offline Evaluation of Information Retrieval Systems With Sparse Labels", ArXiv, vol. abs/2401.17543, 2024.
Alaofi, M., N. Arabzadeh, C. Clarke, and M. Sanderson, "Generative Information Retrieval Evaluation", ArXiv, vol. abs/2404.08137, 2024.
Lin, J., J. Li, J. Gao, W. Ma, and Y. Liu, "Jointly Modeling Spatio-Temporal Features of Tactile Signals for Action Classification", ArXiv, vol. abs/2404.15279, 2024.
Upadhyay, S., E. Kamalloo, and J. Lin, "LLMs Can Patch Up Missing Relevance Judgments in Evaluation", ArXiv, vol. abs/2405.04727, 2024.
Li, M., X. Chen, A. Holtzman, B. Chen, J. Lin, W-tau. Yih, and X. Victoria Lin, "Nearest Neighbor Speculative Decoding for LLM Generation and Attribution", ArXiv, vol. abs/2405.19325, 2024.
Zhuang, S., X. Ma, B. Koopman, J. Lin, and G. Zuccon, "PromptReps: Prompting Large Language Models to Generate Dense And Sparse Representations for Zero-Shot Document Retrieval", ArXiv, vol. abs/2404.18424, 2024.
Rorseth, J., P. Godfrey, L. Golab, D. Srivastava, and J. Szlichta, "RAGE Against the Machine: Retrieval-Augmented LLM Explanations", ArXiv, vol. abs/2405.13000, 2024.
Shehata, D., R. Cohen, and C. Clarke, "Rumour Evaluation With Very Large Language Models", ArXiv, vol. abs/2404.16859, 2024.
He, X., "Technical Perspective: Synthetic Data Needs a Reproducibility Benchmark", SIGMOD Record, vol. 53, issue 1, pp. 64, 2024.
Zhang, X., K. Ogueji, X. Ma, and J. Lin, "Toward Best Practices for Training Multilingual Dense Retrieval Models", ACM Transactions on Information Systems (TOIS), vol. 42, issue 2, pp. 39:1--39:33, 2024.
Arabzadeh, N., J. Kiseleva, Q. Wu, C. Wang, A. Awadallah, V. Dibia, A. Fourney, and C. Clarke, "Towards Better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications", ArXiv, vol. abs/2402.09015, 2024.
Sharifymoghaddam, S., S. Upadhyay, W. Chen, and J. Lin, "UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models", ArXiv, vol. abs/2405.10311, 2024.
Faggioli, G., L. Dietz, C. Clarke, G. Demartini, M. Hagen, C. Hauff, N. Kando, E. Kanoulas, M. Potthast, B. Stein, et al., "Who Determines What Is Relevant? Humans or AI? Why Not Both?", Communications of the ACM, vol. 67, issue 4, pp. 31--34, 2024.
Lin, S-C., and J. Lin, "A Dense Representation Framework for Lexical and Semantic Matching", ACM Transactions on Information Systems (TOIS), vol. 41, issue 4, pp. 110:1--110:29, 2023.
Chen, J., Y. Huang, M. Wang, S. Salihoglu, and K. Salem, "Accurate Summary-Based Cardinality Estimation Through the Lens Of Cardinality Estimation Graphs", SIGMOD Record, vol. 52, issue 1, pp. 94--102, 2023.
Lin, S-C., M. Li, and J. Lin, "Aggretriever: A Simple Approach to Aggregate Textual Representations For Robust Dense Passage Retrieval", Transactions of the Association for Computational Linguistics, vol. 11, pp. 436--452, 2023.
Ma, X., T. Teofili, and J. Lin, "Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes", ArXiv, vol. abs/2304.12139, 2023.
Huang, C., Y. Xie, Z. Jiang, J. Lin, and M. Li, "Approximating Human-Like Few-Shot Learning With GPT-based Compression", ArXiv, vol. abs/2308.06942, 2023.
Yang, J-H., C. Lassance, R. Sampaio de Rezende, K. Srinivasan, M. Redi, S. Clinchant, and J. Lin, "AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation", ArXiv, vol. abs/2304.01961, 2023.
Kassaie, B., and F. Tompa, "Autonomously Computable Information Extraction", Proceedings of the VLDB Endowment (PVLDB), vol. 16, issue 10, pp. 2431--2443, 2023.
Hildred, J., M. Abebe, and K. Daudjee, "Caerus: Low-Latency Distributed Transactions for Geo-Replicated Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 17, issue 3, pp. 469--482, 2023.
Mousavi, A., X. Zhan, H. Bai, P. Shi, T. Rekatsinas, B. Han, Y. Li, J. Pound, J. M. Susskind, N. Schluter, et al., "Construction of Paired Knowledge Graph-Text Datasets Informed by Cyclic Evaluation", ArXiv, vol. abs/2309.11669, 2023.
Rorseth, J., P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "CREDENCE: Counterfactual Explanations for Document Ranking", ArXiv, vol. abs/2302.04983, 2023.
Ozsu, T., "Data Science - A Systematic Treatment", Communications of the ACM, vol. 66, issue 7, pp. 106--116, 2023.
Ozsu, T., "Data Science: A Systematic Treatment", ArXiv, vol. abs/2301.13761, 2023.
Mohapatra, S., J. Zong, F. Kerschbaum, and X. He, "Differentially Private Data Generation With Missing Data", ArXiv, vol. abs/2310.11548, 2023.
Zhang, S., and X. He, "DProvDB: Differentially Private Query Processing With Multi-Analyst Provenance", ArXiv, vol. abs/2309.10240, 2023.
Mackenzie, J., A. Trotman, and J. Lin, "Efficient Document-at-a-Time and Score-at-a-Time Query Evaluation For Learned Sparse Representations", ACM Transactions on Information Systems (TOIS), vol. 41, issue 4, pp. 96:1--96:28, 2023.
Zou, L., Y. Pang, T. Ozsu, and J. Chen, "Efficient Execution of SPARQL Queries With OPTIONAL and UNION Expressions", ArXiv, vol. abs/2303.13844, 2023.
Chen, H., C. Lassance, and J. Lin, "End-to-End Retrieval With Learned Dense and Sparse Representations Using Lucene", ArXiv, vol. abs/2311.18503, 2023.
Kamalloo, E., X. Zhang, O. Ogundepo, N. Thakur, D. Alfonso-Hermelo, M. Rezagholizadeh, and J. Lin, "Evaluating Embedding APIs for Information Retrieval", ArXiv, vol. abs/2305.06300, 2023.
Kamalloo, E., N. Dziri, C. Clarke, and D. Rafiei, "Evaluating Open-Domain Question Answering in the Era of Large Language Models", ArXiv, vol. abs/2305.06984, 2023.
Ren, H., A. Mousavi, A. Pacaci, S. Rahman Chowdhury, J. Mohoney, I. Ilyas, Y. Li, and T. Rekatsinas, "Fact Ranking Over Large-Scale Knowledge Graphs With Reasoning Embedding Models", IEEE Data Engineering Bulletin, vol. 46, issue 2, pp. 126--139, 2023.
Ma, X., L. Wang, N. Yang, F. Wei, and J. Lin, "Fine-Tuning LLaMA for Multi-Stage Text Retrieval", ArXiv, vol. abs/2310.08319, 2023.
Bayat, F. Fatahi, K. Qian, B. Han, Y. Sang, A. Belyi, S. Khorshidi, F. Wu, I. Ilyas, and Y. Li, "FLEEK: Factual Error Detection and Correction With Evidence Retrieved From External Knowledge", ArXiv, vol. abs/2310.17119, 2023.
Tang, R., X. Zhang, X. Ma, J. Lin, and F. Türe, "Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models", ArXiv, vol. abs/2310.07712, 2023.
Piktus, A., O. Ogundepo, C. Akiki, A. Oladipo, X. Zhang, H. Schoelkopf, S. Biderman, M. Potthast, and J. Lin, "GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration", ArXiv, vol. abs/2306.01481, 2023.
Li, M., H. Zhuang, K. Hui, Z. Qin, J. Lin, R. Jagerman, X. Wang, and M. Bendersky, "Generate, Filter, and Fuse: Query Expansion via Multi-Step Keyword Generation for Zero-Shot Neural Rankers", ArXiv, vol. abs/2311.09175, 2023.
Ilyas, I., J. P. Lacerda, Y. Li, U. Farooq Minhas, A. Mousavi, J. Pound, T. Rekatsinas, and C. Sumanth, "Growing and Serving Large Open-Domain Knowledge Graphs", ArXiv, vol. abs/2305.09464, 2023.
Kamalloo, E., A. Jafari, X. Zhang, N. Thakur, and J. Lin, "HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking With Attribution", ArXiv, vol. abs/2307.16883, 2023.
Mohoney, J., A. Pacaci, S. Rahman Chowdhury, A. Mousavi, I. Ilyas, U. Farooq Minhas, J. Pound, and T. Rekatsinas, "High-Throughput Vector Similarity Search in Knowledge Graphs", ArXiv, vol. abs/2304.01926, 2023.
Mohoney, J., A. Pacaci, S. Rahman Chowdhury, A. Mousavi, I. Ilyas, U. Farooq Minhas, J. Pound, and T. Rekatsinas, "High-Throughput Vector Similarity Search in Knowledge Graphs", Proceedings of the ACM on Management of Data, vol. 1, issue 2, pp. 197:1--197:25, 2023.
Pradeep, R., K. Hui, J. Gupta, Á. Dániel Lelkes, H. Zhuang, J. Lin, D. Metzler, and V. Q. Tran, "How Does Generative Retrieval Scale to Millions of Passages?", ArXiv, vol. abs/2305.11841, 2023.
Lin, S-C., A. Asai, M. Li, B. Oguz, J. Lin, Y. Mehdad, W-tau. Yih, and X. Chen, "How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval", ArXiv, vol. abs/2302.07452, 2023.
Conia, S., M. Li, D. Lee, U. Farooq Minhas, I. Ilyas, and Y. Li, "Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs", ArXiv, vol. abs/2311.15781, 2023.
Zhang, C., A. Bonifati, and T. Ozsu, "Indexing Techniques for Graph Reachability Queries", ArXiv, vol. abs/2311.03542, 2023.
Salihoglu, S., "Kùzu: A Database Management System for "Beyond Relational" Workloads", SIGMOD Record, vol. 52, issue 3, pp. 39--40, 2023.
Thakur, N., J. Ni, G. Hernández Ábrego, J. Wieting, J. Lin, and D. Cer, "Leveraging LLMs for Synthesizing Training Data Across Many Languages In Multilingual Dense Retrieval", ArXiv, vol. abs/2311.05800, 2023.
Zhang, X., N. Thakur, O. Ogundepo, E. Kamalloo, D. Alfonso-Hermelo, X. Li, Q. Liu, M. Rezagholizadeh, and J. Lin, "MIRACL: A Multilingual Retrieval Dataset Covering 18 Diverse Languages", Transactions of the Association for Computational Linguistics, vol. 11, pp. 1114--1131, 2023.
Kamphuis, C., A. Lin, S. Yang, J. Lin, A. P. de Vries, and F. Hasibi, "MMEAD: MS MARCO Entity Annotations and Disambiguations", ArXiv, vol. abs/2309.07574, 2023.
Hebert, L., G. Sahu, N. Kishore Sreenivas, L. Golab, and R. Cohen, "Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media", ArXiv, vol. abs/2307.09312, 2023.
Thakur, N., L. Bonifacio, X. Zhang, O. Ogundepo, E. Kamalloo, D. Alfonso-Hermelo, X. Li, Q. Liu, B. Chen, M. Rezagholizadeh, et al., "NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation", ArXiv, vol. abs/2312.11361, 2023.
Qian, K., A. Belyi, F. Wu, S. Khorshidi, A. Nikfarjam, R. Khot, Y. Sang, K. Luna, X. Chu, E. Choi, et al., "Open Domain Knowledge Extraction for Knowledge Graphs", ArXiv, vol. abs/2312.09424, 2023.
Faggioli, G., L. Dietz, C. Clarke, G. Demartini, M. Hagen, C. Hauff, N. Kando, E. Kanoulas, M. Potthast, B. Stein, et al., "Perspectives on Large Language Models for Relevance Judgment", ArXiv, vol. abs/2304.09161, 2023.
Dadvar, V., L. Golab, and D. Srivastava, "POEM: Pattern-Oriented Explanations of Convolutional Neural Networks", Proceedings of the VLDB Endowment (PVLDB), vol. 16, issue 11, pp. 3192--3200, 2023.
Hebert, L., L. Golab, and R. Cohen, "Predicting Hateful Discussions on Reddit Using Graph Transformer Networks And Communal Context", ArXiv, vol. abs/2301.04248, 2023.
Hebert, L., H. Yi Chen, R. Cohen, and L. Golab, "Qualitative Analysis of a Graph Transformer Approach to Addressing Hate Speech: Adapting to Dynamically Changing Content", ArXiv, vol. abs/2301.10871, 2023.
Zhang, X., S. Hofstätter, P. Lewis, R. Tang, and J. Lin, "Rank-Without-Gpt: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models", ArXiv, vol. abs/2312.02969, 2023.
Pradeep, R., S. Sharifymoghaddam, and J. Lin, "RankVicuna: Zero-Shot Listwise Document Reranking With Open-Source Large Language Models", ArXiv, vol. abs/2309.15088, 2023.
Pradeep, R., S. Sharifymoghaddam, and J. Lin, "RankZephyr: Effective and Robust Zero-Shot Listwise Reranking Is A Breeze!", ArXiv, vol. abs/2312.02724, 2023.
Liao, V., S. Shariyar Murtaza, Y. Nie, and J. Lin, "Regex-Augmented Domain Transfer Topic Classification Based on a Pre-Trained Language Model: An Application in Financial Domain", ArXiv, vol. abs/2305.18324, 2023.
Bauer, C., B. Carterette, N. Ferro, N. Fuhr, J. Beel, T. Breuer, C. Clarke, A. Crescenzi, G. Demartini, G. Maria Di Nunzio, et al., "Report on the Dagstuhl Seminar on Frontiers of Information Access Experimentation for Research and Education", SIGIR Forum, vol. 57, issue 1, pp. 7:1--7:28, 2023.
Kamalloo, E., N. Thakur, C. Lassance, X. Ma, J-H. Yang, and J. Lin, "Resources for Brewing BEIR: Reproducible Reference Models and An Official Leaderboard", ArXiv, vol. abs/2306.07471, 2023.
Huo, S., N. Arabzadeh, and C. Clarke, "Retrieving Supporting Evidence for Generative Question Answering", ArXiv, vol. abs/2309.11392, 2023.
Huo, S., N. Arabzadeh, and C. Clarke, "Retrieving Supporting Evidence for LLMs Generated Answers", ArXiv, vol. abs/2306.13781, 2023.
Tamber, M. Singh, R. Pradeep, and J. Lin, "Scaling Down, LiTting Up: Efficient Zero-Shot Listwise Reranking With Seq2seq Encoder-Decoder Models", ArXiv, vol. abs/2312.16098, 2023.
Lin, J., and T. Teofili, "Searching Dense Representations With Inverted Indexes", ArXiv, vol. abs/2312.01556, 2023.
Sheshbolouki, A., and T. Ozsu, "sGrow: Explaining the Scale-Invariant Strength Assortativity of Streaming Butterflies", ACM Transactions on the Web, vol. 17, issue 3, pp. 24:1--24:46, 2023.
Zeng, L., L. Zou, and T. Ozsu, "SGSI - A Scalable GPU-Friendly Subgraph Isomorphism Algorithm", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 35, issue 11, pp. 11899--11916, 2023.
Lin, J., D. Alfonso-Hermelo, V. Jeronymo, E. Kamalloo, C. Lassance, R. Frassetto Nogueira, O. Ogundepo, M. Rezagholizadeh, N. Thakur, J-H. Yang, et al., "Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval", ArXiv, vol. abs/2304.01019, 2023.
Li, M., S-C. Lin, X. Ma, and J. Lin, "SLIM: Sparsified Late Interaction for Multi-Vector Retrieval With Inverted Indexes", ArXiv, vol. abs/2302.06587, 2023.
Seltzer, J., J. Pan, K. Cheng, Y. Sun, S. Kolagati, J. Lin, and S. Zong, "SmartProbe: A Virtual Moderator for Market Research Surveys", ArXiv, vol. abs/2305.08271, 2023.
Akiki, C., O. Ogundepo, A. Piktus, X. Zhang, A. Oladipo, J. Lin, and M. Potthast, "Spacerini: Plug-and-Play Search Engines With Pyserini and Hugging Face", ArXiv, vol. abs/2302.14534, 2023.
Thakur, N., K. Wang, I. Gurevych, and J. Lin, "SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-Shot Neural Sparse Retrieval", ArXiv, vol. abs/2307.10488, 2023.
Salem, K., "TECHNICAL PERSPECTIVE: Ad Hoc Transactions: What They Are And Why We Should Care", SIGMOD Record, vol. 52, issue 1, pp. 6, 2023.
Wu, Z., A. Anand Deshmukh, Y. Wu, J. Lin, and L. Mou, "Unsupervised Chunking With Hierarchical RNN", ArXiv, vol. abs/2309.04919, 2023.
Lin, J., R. Pradeep, T. Teofili, and J. Xian, "Vector Search With OpenAI Embeddings: Lucene Is All You Need", ArXiv, vol. abs/2308.14963, 2023.
Tang, R., X. Zhang, J. Lin, and F. Türe, "What Do Llamas Really Think? Revealing Preference Biases in Language Model Representations", ArXiv, vol. abs/2311.18812, 2023.
Zong, S., J. Seltzer, J. Pan, K. Cheng, and J. Lin, "Which Model Shall I Choose? Cost/Quality Trade-Offs for Text Classification Tasks", ArXiv, vol. abs/2301.07006, 2023.
Adeyemi, M., A. Oladipo, R. Pradeep, and J. Lin, "Zero-Shot Cross-Lingual Reranking With Large Language Models for Low-Resource Languages", ArXiv, vol. abs/2312.16159, 2023.
Ma, X., X. Zhang, R. Pradeep, and J. Lin, "Zero-Shot Listwise Document Reranking With a Large Language Model", ArXiv, vol. abs/2305.02156, 2023.
Lin, S-C., and J. Lin, "A Dense Representation Framework for Lexical and Semantic Matching", ArXiv, vol. abs/2206.09912, 2022.
Chen, J., Y. Huang, M. Wang, S. Salihoglu, and K. Salem, "Accurate Summary-Based Cardinality Estimation Through the Lens Of Cardinality Estimation Graphs", Proceedings of the VLDB Endowment (PVLDB), vol. 15, issue 8, pp. 1533--1545, 2022.
Ogundepo, O., X. Zhang, and J. Lin, "Better Than Whitespace: Information Retrieval for Languages Without Custom Tokenizers", ArXiv, vol. abs/2210.05481, 2022.
Lin, J., "Building a Culture of Reproducibility in Academic Research", ArXiv, vol. abs/2212.13534, 2022.
Xin, J., R. Tang, Z. Jiang, Y. Yu, and J. Lin, "Building an Efficiency Pipeline: Commutativity and Cumulativeness Of Efficiency Operators for Transformers", ArXiv, vol. abs/2208.00483, 2022.
Mazmudar, M., T. Humphries, J. Liu, M. Rafuse, and X. He, "Cache Me if You Can: Accuracy-Aware Inference Engine for Differentially Private Data Exploration", ArXiv, vol. abs/2211.15732, 2022.
Mazmudar, M., T. Humphries, J. Liu, M. Rafuse, and X. He, "Cache Me if You Can: Accuracy-Aware Inference Engine for Differentially Private Data Exploration", Proceedings of the VLDB Endowment (PVLDB), vol. 16, issue 4, pp. 574--586, 2022.
Voorhees, E. M., I. Soboroff, and J. Lin, "Can Old TREC Collections Reliably Evaluate Modern Neural Retrieval Models?", ArXiv, vol. abs/2201.11086, 2022.
Li, M., X. Zhang, J. Xin, H. Zhang, and J. Lin, "Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking", ArXiv, vol. abs/2205.09638, 2022.
Li, M., S-C. Lin, B. Oguz, A. Ghoshal, J. Lin, Y. Mehdad, W-tau. Yih, and X. Chen, "CITADEL: Conditional Token Interaction via Dynamic Lexical Routing For Efficient and Effective Multi-Vector Retrieval", ArXiv, vol. abs/2211.10411, 2022.
Kassaie, B., E. L. Irving, and F. Tompa, "Computer-Assisted Cohort Identification in Practice", ACM Transactions on Computing for Healthcare, vol. 3, issue 2, pp. 17:1--17:28, 2022.
Zheng, Z., L. Zheng, M. Alipour Langouri, F. Chiang, L. Golab, J. Szlichta, and S. Baskaran, "Contextual Data Cleaning With Ontology Functional Dependencies", Journal of Data and Information Quality, vol. 14, issue 3, pp. 20:1--20:26, 2022.
Sadri, N., and G. Cormack, "Continuous Active Learning Using Pretrained Transformers", ArXiv, vol. abs/2208.06955, 2022.
Ilyas, I., and F. Naumann, "Data Errors: Symptoms, Causes and Origins", IEEE Data Engineering Bulletin, vol. 45, issue 1, pp. 4--9, 2022.
Thakur, N., N. Reimers, and J. Lin, "Domain Adaptation for Memory-Efficient Dense Retrieval", ArXiv, vol. abs/2205.11498, 2022.
Pappachan, P., S. Zhang, X. He, and S. Mehrotra, "Don't Be a Tattle-Tale: Preventing Leakages Through Data Dependencies On Access Control Protected Data", Proceedings of the VLDB Endowment (PVLDB), vol. 15, issue 11, pp. 2437--2449, 2022.
Pappachan, P., S. Zhang, X. He, and S. Mehrotra, "Don't Be a Tattle-Tale: Preventing Leakages Through Data Dependencies On Access Control Protected Data", ArXiv, vol. abs/2207.08757, 2022.
Shehata, D., N. Arabzadeh, and C. Clarke, "Early Stage Sparse Retrieval With Entity Linking", ArXiv, vol. abs/2208.04887, 2022.
Artikis, A., N. Tatbul, L. Golab, and M. Sadoghi, "Editorial", Information Systems, vol. 109, pp. 102088, 2022.
Kargar, M., L. Golab, D. Srivastava, J. Szlichta, and M. Zihayat, "Effective Keyword Search Over Weighted Graphs", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 34, issue 2, pp. 601--616, 2022.
Zhong, W., J-H. Yang, and J. Lin, "Evaluating Token-Level and Passage-Level Dense Retrieval Models For Math Information Retrieval", ArXiv, vol. abs/2203.11163, 2022.
Dadvar, V., L. Golab, and D. Srivastava, "Exploring Data Using Patterns: A Survey", Information Systems, vol. 108, pp. 101985, 2022.
Hebert, L., L. Golab, P. Poupart, and R. Cohen, "FedFormer: Contextual Federation With Attention in Reinforcement Learning", ArXiv, vol. abs/2205.13697, 2022.
Jiang, Z., Y. Dai, J. Xin, M. Li, and J. Lin, "Few-Shot Non-Parametric Learning With Deep Latent Variable Model", ArXiv, vol. abs/2206.11573, 2022.
Yan, D., G. Guo, J. Khalil, T. Ozsu, W-S. Ku, and J. C. S. Lui, "G-Thinker: A General Distributed Framework for Finding Qualified Subgraphs In a Big Graph With Load Balancing", The VLDB Journal, vol. 31, issue 2, pp. 287--320, 2022.
Dehghan, M., D. Kumar, and L. Golab, "GRS: Combining Generation and Revision in Unsupervised Sentence Simplification", ArXiv, vol. abs/2203.09742, 2022.
Yan, X., C. Luo, C. Clarke, N. Craswell, E. M. Voorhees, and P. Castells, "Human Preferences as Dueling Bandits", ArXiv, vol. abs/2204.10362, 2022.
Zhong, Y., J. Xiao, T. Vetterli, M. Matin, E. Loo, J. Lin, R. Bourgon, and O. Shapira, "Improving Precancerous Case Characterization via Transformer-Based Ensemble Learning", ArXiv, vol. abs/2212.05150, 2022.
Herodotou, H., P. K. Chrysanthis, S. Chen, M. Hsu, K. Daudjee, Y. Wu, and C. Costa, "Introduction to the special issue on self‑managing and hardware‑optimized database systems 2020", Distributed and Parallel Databases, vol. 40, issue 1, pp. 1--3, 2022.
Xia, K., W. Zhao, A. Jolfaei, and T. Ozsu, "Introduction to the Special Section on Edge/Fog Computing for Infectious Disease Intelligence", ACM Transactions on Internet Technology (TOIT), vol. 22, issue 3, pp. 63e:1--63e:2, 2022.
Jiang, Z., M. Y. R. Yang, M. Tsirlin, R. Tang, and J. Lin, "Less Is More: Parameter-Free Text Classification With Gzip", ArXiv, vol. abs/2212.09410, 2022.
Ilyas, I., and T. Rekatsinas, "Machine Learning and Data Cleaning: Which Serves the Other?", Journal of Data and Information Quality, vol. 14, issue 3, pp. 13:1--13:11, 2022.
Zhang, X., N. Thakur, O. Ogundepo, E. Kamalloo, D. Alfonso-Hermelo, X. Li, Q. Liu, M. Rezagholizadeh, and J. Lin, "Making a MIRACL: Multilingual Information Retrieval Across a Continuum Of Languages", ArXiv, vol. abs/2210.09984, 2022.
Jin, G., and S. Salihoglu, "Making RDBMSs Efficient on Graph Workloads Through Predefined Joins", Proceedings of the VLDB Endowment (PVLDB), vol. 15, issue 5, pp. 1011--1023, 2022.
Ghayyur, S., D. Ghosh, X. He, and S. Mehrotra, "MIDE: Accuracy Aware Minimally Invasive Data Exploration for Decision Support", Proceedings of the VLDB Endowment (PVLDB), vol. 15, issue 11, pp. 2653--2665, 2022.
Mhedhbi, A., and S. Salihoglu, "Modern Techniques for Querying Graph-Structured Relations: Foundations, System Implementations, and Open Challenges", Proceedings of the VLDB Endowment (PVLDB), vol. 15, issue 12, pp. 3762--3765, 2022.
Ammar, K., S. Sahu, S. Salihoglu, and T. Ozsu, "Optimizing Differentially-Maintained Recursive Queries on Dynamic Graphs", ArXiv, vol. abs/2208.00273, 2022.
Ammar, K., S. Sahu, S. Salihoglu, and T. Ozsu, "Optimizing Differentially-Maintained Recursive Queries on Dynamic Graphs", Proceedings of the VLDB Endowment (PVLDB), vol. 15, issue 11, pp. 3186--3198, 2022.
Dadvar, V., L. Golab, and D. Srivastava, "POEM: Pattern-Oriented Explanations of CNN Models", Proceedings of the VLDB Endowment (PVLDB), vol. 15, issue 12, pp. 3618--3621, 2022.
Gao, L., X. Ma, J. Lin, and J. Callan, "Precise Zero-Shot Dense Retrieval Without Relevance Labels", ArXiv, vol. abs/2212.10496, 2022.
Liu, L., M. Li, J. Lin, S. Riedel, and P. Stenetorp, "Query Expansion Using Contextual Clue Sampling With Language Models", ArXiv, vol. abs/2210.07093, 2022.
Ozsu, T., "Reminiscences on Influential Papers", SIGMOD Record, vol. 51, issue 2, pp. 44--46, 2022.
Yamamoto, T., Z. Dou, N. Kando, C. Clarke, M. P. Kato, and Y. Liu, "Report on the 16th Round of NII Testbeds and Community for Information Access Research (NTCIR-16)", SIGIR Forum, vol. 56, issue 2, pp. 7:1--7:8, 2022.
Ilyas, I., T. Rekatsinas, V. Konda, J. Pound, X. Qi, and M. A. Soliman, "Saga: A Platform for Continuous Construction and Serving of Knowledge At Scale", ArXiv, vol. abs/2204.07309, 2022.
Sheshbolouki, A., and T. Ozsu, "sGrapp: Butterfly Approximation in Streaming Graphs", ACM Transactions on Knowledge Discovery from Data, vol. 16, issue 4, pp. 76:1--76:43, 2022.
Arabzadeh, N., A. Vtyurina, X. Yan, and C. Clarke, "Shallow Pooling for Sparse Labels", Information Retrieval Journal, vol. 25, issue 4, pp. 365--385, 2022.
Li, Y., L. Zou, T. Ozsu, and D. Zhao, "Space-Efficient Subgraph Search Over Streaming Graph With Timing Order Constraint", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 34, issue 9, pp. 4453--4467, 2022.
Tang, R., K. Kumar, G. Yang, A. Pandey, Y. Mao, V. Belyaev, M. Emmadi, C. G. Murray, F. Türe, and J. Lin, "SpeechNet: Weakly Supervised, End-to-End Speech Recognition at Industrial Scale", ArXiv, vol. abs/2211.11740, 2022.
Gao, L., X. Ma, J. Lin, and J. Callan, "Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval", ArXiv, vol. abs/2203.05765, 2022.
Wang, R., J. Wang, S. Idreos, T. Ozsu, and W. G. Aref, "The Case for Distributed Shared-Memory Databases With RDMA-Enabled Memory Disaggregation", ArXiv, vol. abs/2207.03027, 2022.
Wang, R., J. Wang, S. Idreos, T. Ozsu, and W. G. Aref, "The Case for Distributed Shared-Memory Databases With RDMA-Enabled Memory Disaggregation", Proceedings of the VLDB Endowment (PVLDB), vol. 16, issue 1, pp. 15--22, 2022.
Abebe, M., H. Lazu, and K. Daudjee, "Tiresias: Enabling Predictive Autonomous Storage and Indexing", Proceedings of the VLDB Endowment (PVLDB), vol. 15, issue 11, pp. 3126--3136, 2022.
Li, H., S. Wang, S. Zhuang, A. Mourad, X. Ma, J. Lin, and G. Zuccon, "To Interpolate or Not to Interpolate: PRF, Dense and Sparse Retrievers", ArXiv, vol. abs/2205.00235, 2022.
Zhang, X., K. Ogueji, X. Ma, and J. Lin, "Towards Best Practices for Training Multilingual Dense Retrieval Models", ArXiv, vol. abs/2204.02363, 2022.
Arabzadeh, N., M. Seifikar, and C. Clarke, "Unsupervised Question Clarity Prediction Through Retrieved Item Coherency", ArXiv, vol. abs/2208.04882, 2022.
Nanayakkara, P., J. Bater, X. He, J. Hullman, and J. Rogers, "Visualizing Privacy-Utility Trade-Offs in Differentially Private Data Releases", ArXiv, vol. abs/2201.05964, 2022.
Nanayakkara, P., J. Bater, X. He, J. Hullman, and J. Rogers, "Visualizing Privacy-Utility Trade-Offs in Differentially Private Data Releases", Proceedings on Privacy Enhancing Technologies (PoPETs), vol. 2022, issue 2, pp. 601--618, 2022.
Durvasula, S., R. Kiguru, S. Mathur, J. Xu, J. Lin, and N. Vijaykumar, "VoxelCache: Accelerating Online Mapping in Robotics and 3D Reconstruction Tasks", ArXiv, vol. abs/2210.08729, 2022.
Tang, R., A. Pandey, Z. Jiang, G. Yang, K. Kumar, J. Lin, and F. Türe, "What the DAAM: Interpreting Stable Diffusion Using Cross Attention", ArXiv, vol. abs/2210.04885, 2022.
Shi, P., R. Zhang, H. Bai, and J. Lin, "XRICL: Cross-Lingual Retrieval-Augmented in-Context Learning For Cross-Lingual Text-to-SQL Semantic Parsing", ArXiv, vol. abs/2210.13693, 2022.
Lin, J., "A Proposed Conceptual Framework for a Representational Approach To Information Retrieval", SIGIR Forum, vol. 55, issue 2, pp. 4:1--4:29, 2021.
Ma, X., K. Sun, R. Pradeep, and J. Lin, "A Replication Study of Dense Passage Retriever", ArXiv, vol. abs/2104.05740, 2021.
Chen, J., Y. Huang, M. Wang, S. Salihoglu, and K. Salem, "Accurate Summary-Based Cardinality Estimation Through the Lens Of Cardinality Estimation Graphs", ArXiv, vol. abs/2105.08878, 2021.
Clarke, C., A. Vtyurina, and M. Smucker, "Assessing Top- Preferences", ACM Transactions on Information Systems (TOIS), vol. 39, issue 3, pp. 33:1--33:21, 2021.
Liu, J., K. Knopf, Y. Tan, B. Ding, and X. He, "Catch a Blowfish Alive: A Demonstration of Policy-Aware Differential Privacy for Interactive Data Exploration", Proceedings of the VLDB Endowment (PVLDB), vol. 14, issue 12, pp. 2859--2862, 2021.
Parsa, M. S., L. Golab, and S. Keshav, "Climate Action During COVID-19 Recovery and Beyond: A Twitter Text Mining Study", ArXiv, vol. abs/2105.12190, 2021.
Gupta, P., A. Mhedhbi, and S. Salihoglu, "Columnar Storage and List-Based Processing for Graph Database Management Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 14, issue 11, pp. 2491--2504, 2021.
Lin, S-C., J-H. Yang, and J. Lin, "Contextualized Query Embeddings for Conversational Search", ArXiv, vol. abs/2104.08707, 2021.
Shi, P., R. Zhang, H. Bai, and J. Lin, "Cross-Lingual Training With Dense Retrieval for Document Retrieval", ArXiv, vol. abs/2109.01628, 2021.
Lin, S-C., and J. Lin, "Densifying Sparse Representations for Passage Retrieval by Representational Slicing", ArXiv, vol. abs/2112.04666, 2021.
Near, J. P., and X. He, "Differential Privacy for Databases", Foundations and Trends in Databases, vol. 11, issue 2, pp. 109--225, 2021.
Zheng, Z., L. Zheng, M. Alipour Langouri, F. Chiang, L. Golab, and J. Szlichta, "Discovery and Contextual Data Cleaning With Ontology Functional Dependencies", ArXiv, vol. abs/2105.08105, 2021.
Valduriez, P., R. Jiménez-Peris, and T. Ozsu, "Distributed Database Systems: The Case for NewSQL", Transactions on Large-Scale Data- and Knowledge-Centered Systems, vol. 48, pp. 1--15, 2021.
Wagh, S., X. He, A. Machanavajjhala, and P. Mittal, "DP-cryptography: Marrying Differential Privacy and Cryptography In Emerging Applications", Communications of the ACM, vol. 64, issue 2, pp. 84--93, 2021.
Karegar, R., P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "Efficient Discovery of Approximate Order Dependencies", ArXiv, vol. abs/2101.02174, 2021.
Hofstätter, S., S-C. Lin, J-H. Yang, J. Lin, and A. Hanbury, "Efficiently Teaching an Effective Dense Retriever With Balanced Topic Aware Sampling", ArXiv, vol. abs/2104.06967, 2021.
Suri, S., I. Ilyas, C. Ré, and T. Rekatsinas, "Ember: No-Code Context Enrichment via Similarity-Based Keyless Joins", ArXiv, vol. abs/2106.01501, 2021.
Suri, S., I. Ilyas, C. Ré, and T. Rekatsinas, "Ember: No-Code Context Enrichment via Similarity-Based Keyless Joins", Proceedings of the VLDB Endowment (PVLDB), vol. 15, issue 3, pp. 699--712, 2021.
Li, M., and J. Lin, "Encoder Adaptation of Dense Passage Retrieval for Open-Domain Question Answering", ArXiv, vol. abs/2110.01599, 2021.
Pacaci, A., A. Bonifati, and T. Ozsu, "Evaluating Complex Queries on Streaming Graphs", ArXiv, vol. abs/2101.12305, 2021.
Fritz, S., I. Milligan, N. Ruest, and J. Lin, "Fostering Community Engagement Through Datathon Events: The Archives Unleashed Experience", Digital Humanities Quarterly, vol. 15, issue 1, 2021.
Chen, Y., T. Ozsu, G. Xiao, Z. Tang, and K. Li, "GSmart: An Efficient SPARQL Query Engine Using Sparse Matrix Algebra - Full Version", ArXiv, vol. abs/2106.14038, 2021.
Li, H., S. Zhuang, A. Mourad, X. Ma, J. Lin, and G. Zuccon, "Improving Query Representations for Dense Retrieval With Pseudo Relevance Feedback: A Reproducibility Study", ArXiv, vol. abs/2112.06400, 2021.
Gupta, P., A. Mhedhbi, and S. Salihoglu, "Integrating Column-Oriented Storage and Query Processing Techniques Into Graph Database Management Systems", ArXiv, vol. abs/2103.02284, 2021.
Nogueira, R., Z. Jiang, and J. Lin, "Investigating the Limitations of the Transformers With Simple Arithmetic Tasks", ArXiv, vol. abs/2102.13019, 2021.
Ge, C., S. Mohapatra, X. He, and I. Ilyas, "Kamino: Constraint-Aware Differentially Private Data Synthesis", Proceedings of the VLDB Endowment (PVLDB), vol. 14, issue 10, pp. 1886--1899, 2021.
Jin, G., and S. Salihoglu, "Making RDBMSs Efficient on Graph Workloads Through Predefined Joins", ArXiv, vol. abs/2108.10540, 2021.
Zhang, X., X. Ma, P. Shi, and J. Lin, "Mr. TyDi: A Multi-Lingual Benchmark for Dense Retrieval", ArXiv, vol. abs/2108.08787, 2021.
Craswell, N., B. Mitra, E. Yilmaz, D. Campos, and J. Lin, "MS MARCO: Benchmarking Ranking Models in the Large-Data Regime", ArXiv, vol. abs/2105.04021, 2021.
Lin, S-C., J-H. Yang, R. Nogueira, M-F. Tsai, C-J. Wang, and J. Lin, "Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term Importance Estimation and Neural Query Rewriting", ACM Transactions on Information Systems (TOIS), vol. 39, issue 4, pp. 48:1--48:29, 2021.
Peng, P., Q. Ge, L. Zou, T. Ozsu, Z. Xu, and D. Zhao, "Optimizing Multi-Query Evaluation in Federated RDF Systems", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 33, issue 4, pp. 1692--1707, 2021.
Mhedhbi, A., C. Kankanamge, and S. Salihoglu, "Optimizing One-Time and Continuous Subgraph Queries Using Worst-Case Optimal Joins", ACM Transactions on Database Systems (TODS), vol. 46, issue 2, pp. 6:1--6:45, 2021.
Shafieinejad, M., F. Kerschbaum, and I. Ilyas, "PCOR: Private Contextual Outlier Release via Differentially Private Search", ArXiv, vol. abs/2103.05173, 2021.
Arabzadeh, N., X. Yan, and C. Clarke, "Predicting Efficiency/Effectiveness Trade-Offs for Dense vs. Sparse Retrieval Strategy Selection", ArXiv, vol. abs/2109.10739, 2021.
Lin, J., X. Ma, S-C. Lin, J-H. Yang, R. Pradeep, and R. Nogueira, "Pyserini: An Easy-to-Use Python Toolkit to Support Replicable IR Research With Sparse and Dense Representations", ArXiv, vol. abs/2102.10073, 2021.
Saxena, H., L. Golab, S. Idreos, and I. Ilyas, "Real-Time LSM-Trees for HTAP Workloads", ArXiv, vol. abs/2101.06801, 2021.
Kato, M. P., Y. Liu, N. Kando, and C. Clarke, "Report on the 15th Round of NII Testbeds and Community for Information Access Research (NTCIR-15)", SIGIR Forum, vol. 55, issue 2, pp. 21:1--21:6, 2021.
Sheshbolouki, A., and T. Ozsu, "Scale-Invariant Strength Assortativity of Streaming Butterflies", ArXiv, vol. abs/2111.12217, 2021.
Sheshbolouki, A., and T. Ozsu, "sGrapp: Butterfly Approximation in Streaming Graphs", ArXiv, vol. abs/2101.12334, 2021.
Arabzadeh, N., A. Vtyurina, X. Yan, and C. Clarke, "Shallow Pooling for Sparse Labels", ArXiv, vol. abs/2109.00062, 2021.
Lin, J., D. Campos, N. Craswell, B. Mitra, and E. Yilmaz, "Significant Improvements Over the State of the Art? A Case Study Of the MS MARCO Document Ranking Leaderboard", ArXiv, vol. abs/2102.12887, 2021.
Yang, J-H., X. Ma, and J. Lin, "Sparsifying Sparse Representations for Passage Retrieval by Top-K Masking", ArXiv, vol. abs/2112.09628, 2021.
Grossman, M., and G. Cormack, "The eDiscovery Medicine Show", ArXiv, vol. abs/2109.13908, 2021.
Pradeep, R., R. Nogueira, and J. Lin, "The Expando-Mono-Duo Design Pattern for Text Ranking With Pretrained Sequence-to-Sequence Models", ArXiv, vol. abs/2101.05667, 2021.
Sakr, S., A. Bonifati, H. Voigt, A. Iosup, K. Ammar, R. Angles, W. G. Aref, M. Arenas, M. Besta, P. A. Boncz, et al., "The Future Is Big Graphs: A Community View on Graph Processing Systems", Communications of the ACM, vol. 64, issue 9, pp. 62--71, 2021.
Gauch, M., J. Mai, and J. Lin, "The Proper Care and Feeding of CAMELS: How Limited Training Data Affects Streamflow Prediction", Environmental Modelling and Software, vol. 135, pp. 104926, 2021.
Mohapatra, S., S. Sasy, X. He, G. Kamath, and O. Thakkar, "The Role of Adaptive Optimizers for Honest Private Hyperparameter Selection", ArXiv, vol. abs/2111.04906, 2021.
Xue, H., F. D. Salim, Y. Ren, and C. Clarke, "Translating Human Mobility Forecasting Through Natural Language Generation", ArXiv, vol. abs/2112.11481, 2021.
Covington, C., X. He, J. Honaker, and G. Kamath, "Unbiased Statistical Estimation and Valid Confidence Intervals Under Differential Privacy", ArXiv, vol. abs/2110.14465, 2021.
Mackenzie, J., A. Trotman, and J. Lin, "Wacky Weights in Learned Sparse Representations and the Revenge Of Score-at-a-Time Query Evaluation", ArXiv, vol. abs/2110.11540, 2021.
Gauch, M., and J. Lin, "A Data Scientist's Guide to Streamflow Prediction", ArXiv, vol. abs/2006.12975, 2020.
Lin, J., "A Prototype of Serverless Lucene", ArXiv, vol. abs/2002.01447, 2020.
Ozsu, T., "A Systematic View of Data Science", IEEE Data Engineering Bulletin, vol. 43, issue 3, pp. 3--11, 2020.
Mhedhbi, A., P. Gupta, S. Khaliq, and S. Salihoglu, "A+ Indexes: Lightweight and Highly Flexible Adjacency Lists For Graph Database Management Systems", ArXiv, vol. abs/2004.00130, 2020.
Chen, Y., G. Xiao, T. Ozsu, C. Liu, A. Y. Zomaya, and T. Li, "aeSpTV: An Adaptive and Efficient Framework for Sparse Tensor-Vector Product Kernel on a High-Performance Computing Platform", IEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 31, issue 10, pp. 2329--2345, 2020.
Livshits, E., A. Heidari, I. Ilyas, and B. Kimelfeld, "Approximate Denial Constraints", Proceedings of the VLDB Endowment (PVLDB), vol. 13, issue 10, pp. 1682--1695, 2020.
Livshits, E., A. Heidari, I. Ilyas, and B. Kimelfeld, "Approximate Denial Constraints", ArXiv, vol. abs/2005.08540, 2020.
Clarke, C., A. Vtyurina, and M. Smucker, "Assessing Top-K Preferences", ArXiv, vol. abs/2007.11682, 2020.
Oliveira, P. H., D. S. Kaster, C. Traina, Jr., and I. Ilyas, "Batchwise Probabilistic Incremental Data Cleaning", ArXiv, vol. abs/2011.04730, 2020.
Fritz, S., I. Milligan, N. Ruest, and J. Lin, "Building Community at Distance: A Datathon During COVID-19", Digital Library Perspectives, vol. 36, issue 4, pp. 415--428, 2020.
Khan, A., L. Golab, M. Kargar, J. Szlichta, and M. Zihayat, "Compact Group Discovery in Attributed Graphs and Social Networks", Information Processing and Management, vol. 57, issue 2, pp. 102054, 2020.
Tao, Y., X. He, A. Machanavajjhala, and S. Roy, "Computing Local Sensitivities of Counting Queries With Joins", ArXiv, vol. abs/2004.04656, 2020.
Lin, S-C., J-H. Yang, R. Nogueira, M-F. Tsai, C-J. Wang, and J. Lin, "Conversational Question Reformulation via Sequence-to-Sequence Architectures And Pretrained Language Models", ArXiv, vol. abs/2004.01909, 2020.
Zhang, E., N. Gupta, R. Tang, X. Han, R. Pradeep, K. Lu, Y. Zhang, R. Nogueira, K. Cho, H. Fang, et al., "Covidex: Neural Ranking Models and Keyword Search Infrastructure For The COVID-19 Open Research Dataset", ArXiv, vol. abs/2007.07846, 2020.
Xin, J., R. Tang, J. Lee, Y. Yu, and J. Lin, "DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference", ArXiv, vol. abs/2004.12993, 2020.
Kassaie, B., and F. Tompa, "Detecting Opportunities for Differential Maintenance of Extracted Views", ArXiv, vol. abs/2007.01973, 2020.
Karegar, R., M. Mirsafian, P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "Discovering Domain Orders Through Order Dependencies", ArXiv, vol. abs/2005.14068, 2020.
Lin, S-C., J-H. Yang, and J. Lin, "Distilling Dense Representations for Ranking Using Tightly-Coupled Teachers", ArXiv, vol. abs/2010.11386, 2020.
Nogueira, R., Z. Jiang, and J. Lin, "Document Ranking With a Pretrained Sequence-to-Sequence Model", ArXiv, vol. abs/2003.06713, 2020.
Wagh, S., X. He, A. Machanavajjhala, and P. Mittal, "DP-Cryptography: Marrying Differential Privacy and Cryptography In Emerging Applications", ArXiv, vol. abs/2004.08887, 2020.
Zhang, H., G. Cormack, M. Grossman, and M. Smucker, "Evaluating Sentence-Level Relevance Feedback for High-Recall Information Retrieval", Information Retrieval Journal, vol. 23, issue 1, pp. 1--26, 2020.
Gorenflo, C., S. Lee, L. Golab, and S. Keshav, "FastFabric: Scaling Hyperledger Fabric to 20 000 Transactions Per Second", International Journal of Network Management, vol. 30, issue 5, 2020.
Lin, J., C. Zhong, D. Hu, C. Rudin, and M. I. Seltzer, "Generalized Optimal Sparse Decision Trees", ArXiv, vol. abs/2006.08690, 2020.
Sahu, S., and S. Salihoglu, "Graphsurge: Graph Analytics on View Collections Using Differential Computation", ArXiv, vol. abs/2004.05297, 2020.
Tang, R., J. Lee, A. Razi, J. Cambre, I. Bicking, J. Kaye, and J. Lin, "Howl: A Deployed, Open-Source Wake Word Detection System", ArXiv, vol. abs/2008.09606, 2020.
Jiang, Z., R. Tang, J. Xin, and J. Lin, "Inserting Information Bottlenecks for Attribution in Transformers", ArXiv, vol. abs/2012.13838, 2020.
Chen, S., P. K. Chrysanthis, K. Daudjee, M. Hsu, and M. Sadoghi, "Introduction to the Special Issue on Self-Managing and Hardware-Optimized Database Systems 2019", Distributed and Parallel Databases, vol. 38, issue 4, pp. 767--769, 2020.
Kumar, D., L. Mou, L. Golab, and O. Vechtomova, "Iterative Edit-Based Unsupervised Sentence Simplification", ArXiv, vol. abs/2006.09639, 2020.
Ge, C., S. Mohapatra, X. He, and I. Ilyas, "Kamino: Constraint-Aware Differentially Private Data Synthesis", ArXiv, vol. abs/2012.15713, 2020.
Li, M., H. Bai, L. Tan, K. Xiong, M. Li, and J. Lin, "Latte-Mix: Measuring Sentence Semantic Similarity With Latent Categorical Mixtures", ArXiv, vol. abs/2010.11351, 2020.
Chen, L., and L. Golab, "Micro-Journal Mining to Understand Mood Triggers", Computing, vol. 102, issue 5, pp. 1227--1244, 2020.
Abebe, M., B. Glasbergen, and K. Daudjee, "MorphoSys: Automatic Physical Design Metamorphosis for Distributed Database Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 13, issue 13, pp. 3573--3587, 2020.
Nogueira, R., Z. Jiang, K. Cho, and J. Lin, "Navigation-Based Candidate Expansion and Pretrained Language Models For Citation Recommendation", Scientometrics, vol. 125, issue 3, pp. 3001--3016, 2020.
Nogueira, R., Z. Jiang, K. Cho, and J. Lin, "Navigation-Based Candidate Expansion and Pretrained Language Models For Citation Recommendation", ArXiv, vol. abs/2001.08687, 2020.
Heidari, A., S. Kushagra, and I. Ilyas, "On Sampling From Data With Duplicate Records", ArXiv, vol. abs/2008.10549, 2020.
Wang, X-J., M. Grossman, and S. Gyu Hyun, "Participation in TREC 2020 COVID Track Using Continuous Active Learning", ArXiv, vol. abs/2011.01453, 2020.
Lin, J., R. Nogueira, and A. Yates, "Pretrained Transformers for Text Ranking: BERT and Beyond", ArXiv, vol. abs/2010.06467, 2020.
Lin, S-C., J-H. Yang, R. Nogueira, M-F. Tsai, C-J. Wang, and J. Lin, "Query Reformulation Using Query History for Passage Retrieval in Conversational Search", ArXiv, vol. abs/2005.02230, 2020.
Gauch, M., F. Kratzert, D. Klotz, G. Nearing, J. Lin, and S. Hochreiter, "Rainfall-Runoff Prediction at Multiple Timescales With a Single Long Short-Term Memory Network", ArXiv, vol. abs/2010.07921, 2020.
Zhang, R., W. Yang, L. Lin, Z. Tu, Y. Xie, Z. Fu, Y. Xie, L. Tan, K. Xiong, and J. Lin, "Rapid Adaptation of BERT for Information Extraction on Domain-Specific Business Documents", ArXiv, vol. abs/2002.01861, 2020.
Tang, R., R. Nogueira, E. Zhang, N. Gupta, P. Cam, K. Cho, and J. Lin, "Rapidly Bootstrapping a Question Answering Dataset for COVID-19", ArXiv, vol. abs/2004.11339, 2020.
Zhang, E., N. Gupta, R. Nogueira, K. Cho, and J. Lin, "Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research Dataset: Preliminary Thoughts and Lessons Learned", ArXiv, vol. abs/2004.05125, 2020.
Heidari, A., G. Michalopoulos, S. Kushagra, I. Ilyas, and T. Rekatsinas, "Record Fusion: A Learning Approach", ArXiv, vol. abs/2006.10208, 2020.
Pacaci, A., A. Bonifati, and T. Ozsu, "Regular Path Query Evaluation on Streaming Graphs", ArXiv, vol. abs/2004.02012, 2020.
Bryson, S., H. Davoudi, L. Golab, M. Kargar, Y. Lytvyn, P. Mierzejewski, J. Szlichta, and M. Zihayat, "Robust Keyword Search in Large Attributed Graphs", Information Retrieval Journal, vol. 23, issue 5, pp. 502--524, 2020.
Bater, J., Y. Park, X. He, X. Wang, and J. Rogers, "SAQE: Practical Privacy-Preserving Approximate Query Processing For Data Federations", Proceedings of the VLDB Endowment (PVLDB), vol. 13, issue 11, pp. 2691--2705, 2020.
Guo, G., D. Yan, T. Ozsu, Z. Jiang, and J. Khalil, "Scalable Mining of Maximal Quasi-Cliques: An Algorithm-System Codesign Approach", Proceedings of the VLDB Endowment (PVLDB), vol. 14, issue 4, pp. 573--585, 2020.
Guo, G., D. Yan, T. Ozsu, and Z. Jiang, "Scalable Mining of Maximal Quasi-Cliques: An Algorithm-System Codesign Approach", ArXiv, vol. abs/2005.00081, 2020.
Pradeep, R., X. Ma, R. Nogueira, and J. Lin, "Scientific Claim Verification With VERT5ERINI", ArXiv, vol. abs/2010.11930, 2020.
Bai, H., P. Shi, J. Lin, L. Tan, K. Xiong, W. Gao, and M. Li, "SegaBERT: Pre-Training of Segment-Aware BERT for Language Understanding", ArXiv, vol. abs/2004.14996, 2020.
Bai, H., P. Shi, J. Lin, L. Tan, K. Xiong, W. Gao, J. Liu, and M. Li, "Semantics of the Unwritten", ArXiv, vol. abs/2004.02251, 2020.
Glasbergen, B., M. Abebe, K. Daudjee, and A. Levi, "Sentinel: Universal Analysis and Insight for Data Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 13, issue 11, pp. 2720--2733, 2020.
Tang, R., J. Lee, J. Xin, X. Liu, Y. Yu, and J. Lin, "Showing Your Work Doesn't Always Work", ArXiv, vol. abs/2004.13705, 2020.
Salem, K., "Special Issue on Best Papers of DaMoN 2018", The VLDB Journal, vol. 29, issue 2-3, pp. 755, 2020.
Boncz, P. A., and K. Salem, "Special Issue on Best Papers of VLDB 2017", The VLDB Journal, vol. 29, issue 1, pp. 483--484, 2020.
Lin, J., J. M. Mackenzie, C. Kamphuis, C. Macdonald, A. Mallia, M. Siedlaczek, A. Trotman, and A. P. de Vries, "Supporting Interoperability Between Open-Source Search Engines With The Common Index File Format", ArXiv, vol. abs/2003.08276, 2020.
Ruest, N., J. Lin, I. Milligan, and S. Fritz, "The Archives Unleashed Project: Technology, Process, and Community To Improve Scholarly Access to Web Archives", ArXiv, vol. abs/2001.05399, 2020.
Sakr, S., A. Bonifati, H. Voigt, A. Iosup, K. Ammar, R. Angles, W. G. Aref, M. Arenas, M. Besta, P. A. Boncz, et al., "The Future Is Big Graphs! A Community View on Graph Processing Systems", ArXiv, vol. abs/2012.06171, 2020.
Sahu, S., A. Mhedhbi, S. Salihoglu, J. Lin, and T. Ozsu, "The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing: Extended Survey", The VLDB Journal, vol. 29, issue 2-3, pp. 595--618, 2020.
Zhang, M., L. Tan, Z. Tu, Z. Fu, K. Xiong, M. Li, and J. Lin, "To Paraphrase or Not to Paraphrase: User-Controllable Selective Paraphrase Generation", ArXiv, vol. abs/2008.09290, 2020.
Lin, S-C., J-H. Yang, R. Nogueira, M-F. Tsai, C-J. Wang, and J. Lin, "TTTTTackling WinoGrande Schemas", ArXiv, vol. abs/2003.08380, 2020.
Toman, D., and G. Weddell, "Using Feature-Based Description Logics to Avoid Duplicate Elimination In Object-Relational Query Languages", German Journal of Artificial Intelligence (KI), vol. 34, issue 3, pp. 355--363, 2020.
Yang, H-W., Y. Zou, P. Shi, W. Lu, J. Lin, and X. Sun, "Aligning Cross-Lingual Entities With Multi-Aspect Information", ArXiv, vol. abs/1910.06575, 2019.
Heidari, A., I. Ilyas, and T. Rekatsinas, "Approximate Inference in Structured Instances With Noisy Categorical Observations", ArXiv, vol. abs/1907.00141, 2019.
Liu, L., H. Wang, J. Lin, R. Socher, and C. Xiong, "Attentive Student Meets Multi-Task Teacher: Improved Knowledge Distillation For Pretrained Models", ArXiv, vol. abs/1911.03588, 2019.
Alway, K., E. Blais, and S. Salihoglu, "Box Covers and Domain Orderings for Beyond Worst-Case Join Processing", ArXiv, vol. abs/1909.12102, 2019.
Aluç, G., T. Ozsu, and K. Daudjee, "Building Self-Clustering RDF Databases Using Tunable-LSH", The VLDB Journal, vol. 28, issue 2, pp. 173--195, 2019.
Agarwal, R. Raj, D. Kumar, L. Golab, and S. Keshav, "Consentio: Managing Consent to Data Access Using Permissioned Blockchains", ArXiv, vol. abs/1910.07110, 2019.
Zhang, X., and T. Ozsu, "Correlation Constraint Shortest Path Over Large Multi-Relation Graphs", Proceedings of the VLDB Endowment (PVLDB), vol. 12, issue 5, pp. 488--501, 2019.
Shi, P., and J. Lin, "Cross-Lingual Relevance Transfer for Document Retrieval", ArXiv, vol. abs/1911.02989, 2019.
Ehsan, N., A. Shakery, and F. Tompa, "Cross-Lingual Text Alignment for Fine-Grained Plagiarism Detection", Journal of Information Science, vol. 45, issue 4, 2019.
Yang, W., Y. Xie, L. Tan, K. Xiong, M. Li, and J. Lin, "Data Augmentation for BERT Fine-Tuning in Open-Domain Question Answering", ArXiv, vol. abs/1904.06652, 2019.
Xiang, Z., B. Ding, X. He, and J. Zhou, "Design of Algorithms Under Policy-Aware Local Differential Privacy: Utility-Privacy Trade-Offs", ArXiv, vol. abs/1909.11778, 2019.
Karyakin, A., and K. Salem, "DimmStore: Memory Power Optimization for Database Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 12, issue 11, pp. 1499--1512, 2019.
Tang, R., Y. Lu, L. Liu, L. Mou, O. Vechtomova, and J. Lin, "Distilling Task-Specific Knowledge From BERT Into Simple Neural Networks", ArXiv, vol. abs/1903.12136, 2019.
Saxena, H., L. Golab, and I. Ilyas, "Distributed Dependency Discovery", ArXiv, vol. abs/1903.05228, 2019.
Saxena, H., L. Golab, and I. Ilyas, "Distributed Implementations of Dependency Discovery Algorithms", Proceedings of the VLDB Endowment (PVLDB), vol. 12, issue 11, pp. 1624--1636, 2019.
Adhikari, A., A. Ram, R. Tang, and J. Lin, "DocBERT: BERT for Document Classification", ArXiv, vol. abs/1904.08398, 2019.
Nogueira, R., W. Yang, J. Lin, and K. Cho, "Document Expansion by Query Prediction", ArXiv, vol. abs/1904.08375, 2019.
Yang, W., Y. Xie, A. Lin, X. Li, L. Tan, K. Xiong, M. Li, and J. Lin, "End-to-End Open-Domain Question Answering With BERTserini", ArXiv, vol. abs/1902.01718, 2019.
Godfrey, P., L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "Errata Note: Discovering Order Dependencies Through Order Compatibility", ArXiv, vol. abs/1905.02010, 2019.
Ram, A., J. Xin, M. Nagappan, Y. Yu, R. Cabrera Lozoya, A. Sabetta, and J. Lin, "Exploiting Token and Path-Based Representations of Code for Identifying Security-Relevant Commits", ArXiv, vol. abs/1911.07620, 2019.
Gorenflo, C., S. Lee, L. Golab, and S. Keshav, "FastFabric: Scaling Hyperledger Fabric to 20, 000 Transactions Per Second", ArXiv, vol. abs/1901.00910, 2019.
Zeng, L., L. Zou, T. Ozsu, L. Hu, and F. Zhang, "GSI: GPU-friendly Subgraph Isomorphism", ArXiv, vol. abs/1906.03420, 2019.
Heidari, A., J. McGrath, I. Ilyas, and T. Rekatsinas, "HoloDetect: Few-Shot Learning for Error Detection", ArXiv, vol. abs/1904.02285, 2019.
Liu, C., X. He, T. Chanyaswad, S. Wang, and P. Mittal, "Investigating Statistical Privacy Frameworks From the Perspective Of Hypothesis Testing", Proceedings on Privacy Enhancing Technologies (PoPETs), vol. 2019, issue 3, pp. 233--254, 2019.
Teofili, T., and J. Lin, "Lucene for Approximate Nearest-Neighbors Search on Arbitrary Dense Vectors", ArXiv, vol. abs/1910.10208, 2019.
Azmy, M., P. Shi, J. Lin, and I. Ilyas, "Matching Entities Across Different Knowledge Graphs With Graph Embeddings", ArXiv, vol. abs/1903.06607, 2019.
Nogueira, R., W. Yang, K. Cho, and J. Lin, "Multi-Stage Document Ranking With BERT", ArXiv, vol. abs/1910.14424, 2019.
Mhedhbi, A., and S. Salihoglu, "Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins", Proceedings of the VLDB Endowment (PVLDB), vol. 12, issue 11, pp. 1692--1704, 2019.
Mhedhbi, A., and S. Salihoglu, "Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins", ArXiv, vol. abs/1903.02076, 2019.
Chowdhury, A. Roy, C. Wang, X. He, A. Machanavajjhala, and S. Jha, "Outis: Crypto-Assisted Differential Privacy on Untrusted Servers", ArXiv, vol. abs/1902.07756, 2019.
Livshits, E., I. Ilyas, B. Kimelfeld, and S. Roy, "Principles of Progress Indicators for Database Repairing", ArXiv, vol. abs/1904.06492, 2019.
Kotsogiannis, I., Y. Tao, X. He, M. Fanaeepour, A. Machanavajjhala, M. Hay, and G. Miklau, "PrivateSQL: A Differentially Private SQL Query Engine", Proceedings of the VLDB Endowment (PVLDB), vol. 12, issue 11, pp. 1371--1384, 2019.
Ge, C., I. Ilyas, and F. Kerschbaum, "Secure Multi-Party Functional Dependency Discovery", Proceedings of the VLDB Endowment (PVLDB), vol. 13, issue 2, pp. 184--196, 2019.
Yang, W., H. Zhang, and J. Lin, "Simple Applications of BERT for Ad Hoc Document Retrieval", ArXiv, vol. abs/1903.10972, 2019.
Shi, P., and J. Lin, "Simple BERT Models for Relation Extraction and Semantic Role Labeling", ArXiv, vol. abs/1904.05255, 2019.
Sun, J., D. Deng, I. Ilyas, G. Li, S. Madden, M. Ouzzani, M. Stonebraker, and N. Tang, "Technical Report: Optimizing Human Involvement for Entity Matching And Consolidation", ArXiv, vol. abs/1906.06574, 2019.
Lin, J., "The Neural Hype, Justified!: A Recantation", SIGIR Forum, vol. 53, issue 2, pp. 88--93, 2019.
Lin, J., L. Paniak, and G. Boerke, "The Performance Envelope of Inverted Indexing on Modern Hardware", ArXiv, vol. abs/1910.11028, 2019.
Gauch, M., J. Mai, and J. Lin, "The Proper Care and Feeding of CAMELS: How Limited Training Data Affects Streamflow Prediction", ArXiv, vol. abs/1911.07249, 2019.
Lee, J., R. Tang, and J. Lin, "What Would Elsa Do? Freezing Layers During Transformer Fine-Tuning", ArXiv, vol. abs/1911.03090, 2019.
Gorenflo, C., L. Golab, and S. Keshav, "XOX Fabric: A Hybrid Approach to Transaction Execution", ArXiv, vol. abs/1906.11229, 2019.
De Sa, C., I. Ilyas, B. Kimelfeld, C. Ré, and T. Rekatsinas, "A Formal Framework for Probabilistic Unclean Databases", ArXiv, vol. abs/1801.06750, 2018.
Ren, Y., M. Tomko, F. Dilys Salim, J. Chan, C. Clarke, and M. Sanderson, "A Location-Query-Browse Graph for Contextual Recommendation", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 30, issue 2, pp. 204--218, 2018.
Tang, R., and J. Lin, "Adaptive Pruning of Neural Language Models for Mobile Devices", ArXiv, vol. abs/1809.10282, 2018.
Koutris, P., S. Salihoglu, and D. Suciu, "Algorithmic Aspects of Parallel Data Processing", Foundations and Trends in Databases, vol. 8, issue 4, pp. 239--370, 2018.
Yang, P., H. Fang, and J. Lin, "Anserini: Reproducible Ranking Baselines Using Lucene", Journal of Data and Information Quality, vol. 10, issue 4, pp. 16:1--16:20, 2018.
Tang, G., S. Keshav, L. Golab, and K. Wu, "Bikeshare Pool Sizing for Bike-and-Ride Multimodal Transit", IEEE Transactions on Intelligent Transportation Systems, vol. 19, issue 7, pp. 2279--2289, 2018.
Stonebraker, M., and I. Ilyas, "Data Integration: The Current Status and the Way Forward", IEEE Data Engineering Bulletin, vol. 41, issue 2, pp. 3--9, 2018.
Ammar, K., F. McSherry, S. Salihoglu, and M. Joglekar, "Distributed Evaluation of Subgraph Queries Using Worst-Case Optimal And Low-Memory Dataflows", Proceedings of the VLDB Endowment (PVLDB), vol. 11, issue 6, pp. 691--704, 2018.
Ammar, K., F. McSherry, S. Salihoglu, and M. Joglekar, "Distributed Evaluation of Subgraph Queries Using Worstcase Optimal LowMemory Dataflows", ArXiv, vol. abs/1802.03760, 2018.
Szlichta, J., P. Godfrey, L. Golab, M. Kargar, and D. Srivastava, "Effective and Complete Discovery of Bidirectional Order Dependencies Via Set-Based Axioms", The VLDB Journal, vol. 27, issue 4, pp. 573--591, 2018.
Lamb, C., D. G. Brown, and C. Clarke, "Evaluating Computational Creativity: An Interdisciplinary Tutorial", ACM Computing Surveys, vol. 51, issue 2, pp. 28:1--28:34, 2018.
Zhang, H., G. Cormack, M. Grossman, and M. Smucker, "Evaluating Sentence-Level Relevance Feedback for High-Recall Information Retrieval", ArXiv, vol. abs/1803.08988, 2018.
Hopfgartner, F., A. Hanbury, H. Müller, I. Eggel, K. Balog, T. Brodt, G. Cormack, J. Lin, J. Kalpathy-Cramer, N. Kando, et al., "Evaluation-as-a-Service for the Computational Sciences: Overview And Outlook", Journal of Data and Information Quality, vol. 10, issue 4, pp. 15:1--15:32, 2018.
Ammar, K., and T. Ozsu, "Experimental Analysis of Distributed Graph Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 11, issue 10, pp. 1151--1164, 2018.
Ammar, K., and T. Ozsu, "Experimental Analysis of Distributed Graph Systems", ArXiv, vol. abs/1806.08082, 2018.
Gebaly, K. El, G. Feng, L. Golab, F. Korn, and D. Srivastava, "Explanation Tables", IEEE Data Engineering Bulletin, vol. 41, issue 3, pp. 43--51, 2018.
Tang, R., A. Adhikari, and J. Lin, "FLOPs as a Direct Optimization Objective for Learning Sparse Neural Networks", ArXiv, vol. abs/1811.03060, 2018.
Gebaly, K. El, and J. Lin, "In-Browser Split-Execution Support for Interactive Analytics in The Cloud", ArXiv, vol. abs/1804.08822, 2018.
Rao, J., W. Yang, Y. Zhang, F. Türe, and J. Lin, "Multi-Perspective Relevance Matching With Hierarchical ConvNets For Social Media Search", ArXiv, vol. abs/1805.08159, 2018.
Tang, R., and J. Lin, "Progress and Tradeoffs in Neural Language Models", ArXiv, vol. abs/1811.00942, 2018.
Lin, J., and P. Yang, "Repeatability Corner Cases in Document Ranking: The Impact of Score Ties", ArXiv, vol. abs/1807.05798, 2018.
Liu, Y., M. P. Kato, C. Clarke, N. Kando, and T. Sakai, "Report on NTCIR-13: The Thirteenth Round of NII Testbeds and Community For Information Access Research", SIGIR Forum, vol. 52, issue 1, pp. 102--110, 2018.
J. Culpepper, S., F. Diaz, and M. Smucker, "Research Frontiers in Information Retrieval: Report From the Third Strategic Workshop on Information Retrieval in Lorne (SWIRL 2018)", SIGIR Forum, vol. 52, issue 1, pp. 34--90, 2018.
Salihoglu, S., and T. Ozsu, "Response to "Scale Up or Scale Out for Graph Processing"", IEEE Internet Computing, vol. 22, issue 5, pp. 18--24, 2018.
El-Roby, A., K. Ammar, A. Aboulnaga, and J. Lin, "Sapphire: Querying RDF Data Made Simple", ArXiv, vol. abs/1805.11728, 2018.
Lin, J., "Scale Up or Scale Out for Graph Processing?", IEEE Internet Computing, vol. 22, issue 3, pp. 72--78, 2018.
Kushagra, S., S. Ben-David, and I. Ilyas, "Semi-Supervised Clustering for De-Duplication", ArXiv, vol. abs/1810.04361, 2018.
Kim, Y., and J. Lin, "Serverless Data Analytics With Flint", ArXiv, vol. abs/1803.06354, 2018.
Bater, J., X. He, W. Ehrich, A. Machanavajjhala, and J. Rogers, "Shrinkwrap: Differentially-Private Query Processing in Private Data Federations", ArXiv, vol. abs/1810.01816, 2018.
Bater, J., X. He, W. Ehrich, A. Machanavajjhala, and J. Rogers, "ShrinkWrap: Efficient SQL Query Processing in Differentially Private Data Federations", Proceedings of the VLDB Endowment (PVLDB), vol. 12, issue 3, pp. 307--320, 2018.
Shi, P., J. Rao, and J. Lin, "Simple Attention-Based Representation Learning for Ranking Short Social Media Posts", ArXiv, vol. abs/1811.01013, 2018.
Tang, R., G. Yang, H. Wei, Y. Mao, F. Türe, and J. Lin, "Streaming Voice Query Recognition Using Causal Convolutional Recurrent Neural Networks", ArXiv, vol. abs/1812.07754, 2018.
Lin, J., "The Neural Hype and Comparisons Against Weak Baselines", SIGIR Forum, vol. 52, issue 2, pp. 40--51, 2018.
Li, Y., L. Zou, T. Ozsu, and D. Zhao, "Time Constrained Continuous Subgraph Search Over Streaming Graphs", ArXiv, vol. abs/1801.09240, 2018.
Tang, R., W. Wang, Z. Tu, and J. Lin, "An Experimental Analysis of the Power Consumption of Convolutional Neural Networks for Keyword Spotting", ArXiv, vol. abs/1711.00333, 2017.
Tu, Z., M. Crane, R. Sequiera, J. Zhang, and J. Lin, "An Exploration of Approaches to Integrating Neural Reranking Models In Multi-Stage Ranking Architectures", ArXiv, vol. abs/1707.08275, 2017.
Abdelaziz, I., R. Harbi, S. Salihoglu, and P. Kalnis, "Combining Vertex-Centric Graph Processing With SPARQL for Large-Scale RDF Data Analytics", IEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 28, issue 12, pp. 3374--3388, 2017.
Sadiq, S. Wasim, T. Dasu, X. Luna Dong, J. Freire, I. Ilyas, S. Link, R. J. Miller, F. Naumann, X. Zhou, and D. Srivastava, "Data Quality: The Role of Empiricism", SIGMOD Record, vol. 46, issue 4, pp. 35--43, 2017.
Bejnordi, B. Ehteshami, J. Lin, B. Glass, M. Mullooly, G. L. Gierach, M. E. Sherman, N. Karssemeijer, J. van der Laak, and A. H. Beck, "Deep Learning-Based Assessment of Tumor-Associated Stroma for Diagnosing Breast Cancer in Histopathology Images", ArXiv, vol. abs/1702.05803, 2017.
Tang, R., and J. Lin, "Deep Residual Learning for Small-Footprint Keyword Spotting", ArXiv, vol. abs/1710.10361, 2017.
Mohammed, S., N. Ghelani, and J. Lin, "Distant Supervision for Topic Classification of Tweets in Curated Streams", ArXiv, vol. abs/1704.06726, 2017.
Szlichta, J., P. Godfrey, L. Golab, M. Kargar, and D. Srivastava, "Effective and Complete Discovery of Order Dependencies via Set-Based Axiomatization", Proceedings of the VLDB Endowment (PVLDB), vol. 10, issue 7, pp. 721--732, 2017.
Mackenzie, J. M., S. J. Culpepper, R. Blanco, M. Crane, C. Clarke, and J. Lin, "Efficient and Effective Tail Latency Minimization in Multi-Stage Retrieval Systems", ArXiv, vol. abs/1704.03970, 2017.
Deng, D., W. Tao, Z. Abedjan, A. K. Elmagarmid, I. Ilyas, S. Madden, M. Ouzzani, M. Stonebraker, and N. Tang, "Entity Consolidation: The Golden Record Problem", ArXiv, vol. abs/1709.10436, 2017.
Sequiera, R., G. Baruah, Z. Tu, S. Mohammed, J. Rao, H. Zhang, and J. Lin, "Exploring the Effectiveness of Convolutional Neural Networks for Answer Selection in End-to-End Question Answering", ArXiv, vol. abs/1707.07804, 2017.
Yan, D., H. Chen, J. Cheng, T. Ozsu, Q. Zhang, and J. C. S. Lui, "G-Thinker: Big Graph Mining Made Easier and Faster", ArXiv, vol. abs/1709.03110, 2017.
Zou, L., and T. Ozsu, "Graph-Based RDF Data Management", Data Science and Engineering, vol. 2, issue 1, pp. 56--70, 2017.
Rekatsinas, T., X. Chu, I. Ilyas, and C. Ré, "HoloClean: Holistic Data Repairs With Probabilistic Inference", Proceedings of the VLDB Endowment (PVLDB), vol. 10, issue 11, pp. 1190--1201, 2017.
Rekatsinas, T., X. Chu, I. Ilyas, and C. Ré, "HoloClean: Holistic Data Repairs With Probabilistic Inference", ArXiv, vol. abs/1702.00820, 2017.
Tang, R., and J. Lin, "Honk: A PyTorch Reimplementation of Convolutional Neural Networks For Keyword Spotting", ArXiv, vol. abs/1710.06554, 2017.
Vadehra, A., M. Grossman, and G. Cormack, "Impact of Feature Selection on Micro-Text Classification", ArXiv, vol. abs/1708.08123, 2017.
Lin, J., "In Defense of MapReduce", IEEE Internet Computing, vol. 21, issue 3, pp. 94--98, 2017.
Rao, J., H. He, H. Zhang, F. Türe, R. Sequiera, S. Mohammed, and J. Lin, "Integrating Lexical and Temporal Signals in Neural Ranking Models For Searching Social Media Streams", ArXiv, vol. abs/1707.07792, 2017.
Konow, R., G. Navarro, C. Clarke, and A. López-Ortiz, "Inverted Treaps", ACM Transactions on Information Systems (TOIS), vol. 35, issue 3, pp. 22:1--22:45, 2017.
Ünel, G., and D. Toman, "Logic Programming Approach to Automata-Based Decision Procedures", Journal of Logic Programming, vol. 86, issue 1, pp. 391--407, 2017.
Mior, M. J., K. Salem, A. Aboulnaga, and R. Liu, "NoSE: Schema Design for NoSQL Applications", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 29, issue 10, pp. 2275--2289, 2017.
Allan, J., N. J. Belkin, P. N. Bennett, J. Callan, C. Clarke, F. Diaz, S. T. Dumais, N. Ferro, D. Harman, D. Hiemstra, et al., "Overview of Special Issue", SIGIR Forum, vol. 51, issue 2, pp. 1--25, 2017.
Ge, C., I. Ilyas, X. He, and A. Machanavajjhala, "Private Exploration Primitives for Data Cleaning", ArXiv, vol. abs/1712.10266, 2017.
He, X., A. Machanavajjhala, C. J. Flynn, and D. Srivastava, "Scaling Private Record Linkage Using Output Constrained Differential Privacy", ArXiv, vol. abs/1702.00535, 2017.
Liu, X., L. Golab, W. M. Golab, I. Ilyas, and S. Jin, "Smart Meter Data Analytics: Systems, Algorithms, and Benchmarking", ACM Transactions on Database Systems (TODS), vol. 42, issue 1, pp. 2:1--2:39, 2017.
Mohammed, S., P. Shi, and J. Lin, "Strong Baselines for Simple Question Answering Over Knowledge Graphs With and Without Neural Networks", ArXiv, vol. abs/1712.01969, 2017.
Rao, J., F. Türe, H. He, O. Jojic, and J. Lin, "Talking to Your TV: Context-Aware Voice Search With Hierarchical Recurrent Neural Networks", ArXiv, vol. abs/1705.04892, 2017.
Lin, J., "The Lambda and the Kappa", IEEE Internet Computing, vol. 21, issue 5, pp. 60--66, 2017.
Lin, J., and A. Trotman, "The Role of Index Compression in Score-at-a-Time Query Evaluation", Information Retrieval Journal, vol. 20, issue 3, pp. 199--220, 2017.
Sahu, S., A. Mhedhbi, S. Salihoglu, J. Lin, and T. Ozsu, "The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing", Proceedings of the VLDB Endowment (PVLDB), vol. 11, issue 4, pp. 420--431, 2017.
Sahu, S., A. Mhedhbi, S. Salihoglu, J. Lin, and T. Ozsu, "The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing: A User Survey", ArXiv, vol. abs/1709.03188, 2017.
Yang, Y., L. Golab, and T. Ozsu, "ViewDF: Declarative Incremental View Maintenance for Streaming Data", Information Systems, vol. 71, pp. 55--67, 2017.
Lin, J., I. Milligan, J. Wiebe, and A. Zhou, "Warcbase: Scalable Analytics Infrastructure for Exploring Web Archives", ACM Journal on Computing and Cultural Heritage, vol. 10, issue 4, pp. 22:1--22:30, 2017.
He, X., N. Raval, and A. Machanavajjhala, "A Demonstration of VisDPT: Visual Exploration of Differentially Private Trajectories", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 13, pp. 1489--1492, 2016.
Yan, D., J. Cheng, T. Ozsu, F. Yang, Y. Lu, J. C. S. Lui, Q. Zhang, and W. Ng, "A General-Purpose Query-Centric Framework for Querying Big Graphs", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 7, pp. 564--575, 2016.
Ozsu, T., "A Survey of RDF Data Management Systems", Frontiers of Computer Science, vol. 10, issue 3, pp. 418--432, 2016.
Ozsu, T., "A Survey of RDF Data Management Systems", ArXiv, vol. abs/1601.00707, 2016.
Gebaly, K. El, and J. Lin, "Afterburner: The Case for in-Browser Analytics", ArXiv, vol. abs/1605.04035, 2016.
Clarke, C., S. J. Culpepper, and A. Moffat, "Assessing Efficiency-Effectiveness Tradeoffs in Multi-Stage Retrieval Systems Without Using Relevance Judgments", Information Retrieval Journal, vol. 19, issue 4, pp. 351--377, 2016.
Zihayat, M., A. An, L. Golab, M. Kargar, and J. Szlichta, "Authority-Based Team Discovery in Social Networks", ArXiv, vol. abs/1611.02992, 2016.
Jiang, Y. Helen, S. Javaad Syed, and L. Golab, "Data Mining of Undergraduate Course Evaluations", Informatics in Education, vol. 15, issue 1, pp. 85--102, 2016.
Bär, A., P. Casas, A. D'Alconzo, P. Fiadino, L. Golab, M. Mellia, and E. Schikuta, "DBStream: A Holistic Approach to Large-Scale Network Traffic Monitoring And Analysis", Computer Networks, vol. 107, pp. 5--19, 2016.
Abedjan, Z., X. Chu, D. Deng, R. Castro Fernandez, I. Ilyas, M. Ouzzani, P. Papotti, M. Stonebraker, and N. Tang, "Detecting Data Errors: Where Are We and What Needs to Be Done?", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 12, pp. 993--1004, 2016.
Machanavajjhala, A., X. He, and M. Hay, "Differential Privacy in the Wild: A Tutorial on Current Practices & Open Challenges", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 13, pp. 1611--1614, 2016.
Chu, X., I. Ilyas, and P. Koutris, "Distributed Data Deduplication", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 11, pp. 864--875, 2016.
J. Culpepper, S., C. Clarke, and J. Lin, "Dynamic Trade-Off Prediction in Multi-Stage Retrieval Systems", ArXiv, vol. abs/1610.02502, 2016.
Bizer, C., L. Dong, I. Ilyas, and M-E. Vidal, "Editorial: Special Issue on Web Data Quality", Journal of Data and Information Quality, vol. 8, issue 1, pp. 1:1--1:3, 2016.
Szlichta, J., P. Godfrey, L. Golab, M. Kargar, and D. Srivastava, "Effective and Complete Discovery of Order Dependencies via Set-Based Axiomatization", ArXiv, vol. abs/1608.06169, 2016.
Ilyas, I., "Effective Data Cleaning With Continuous Evaluation", IEEE Data Engineering Bulletin, vol. 39, issue 2, pp. 38--46, 2016.
Clarke, C., and E. Yilmaz, "EVIA 2016: The Seventh International Workshop on Evaluating Information Access", SIGIR Forum, vol. 50, issue 2, pp. 44--46, 2016.
Sharma, A., J. Jiang, P. Bommannavar, B. Larson, and J. Lin, "GraphJet: Real-Time Content Recommendations at Twitter", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 13, pp. 1281--1292, 2016.
Khabsa, M., A. K. Elmagarmid, I. Ilyas, H. Hammady, and M. Ouzzani, "Learning to Identify Relevant Studies for Systematic Reviews Using Random Forest and External Information", Machine Learning, vol. 102, issue 3, pp. 465--482, 2016.
Quamar, A., A. Deshpande, and J. Lin, "NScale: Neighborhood-Centric Large-Scale Graph Analytics in the Cloud", The VLDB Journal, vol. 25, issue 2, pp. 125--150, 2016.
Drzadzewski, G., and F. Tompa, "Partial Materialization for Online Analytical Processing Over Multi-Tagged Document Collections", Knowledge and Information Systems (KAIS), vol. 47, issue 3, pp. 697--732, 2016.
Peng, P., L. Zou, T. Ozsu, L. Chen, and D. Zhao, "Processing SPARQL Queries Over Distributed RDF Graphs", The VLDB Journal, vol. 25, issue 2, pp. 243--268, 2016.
Chu, X., and I. Ilyas, "Qualitative Data Cleaning", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 13, pp. 1605--1608, 2016.
Yan, D., J. Cheng, T. Ozsu, F. Yang, Y. Lu, J. C. S. Lui, Q. Zhang, and W. Ng, "Quegel: A General-Purpose Query-Centric Framework for Querying Big Graphs", ArXiv, vol. abs/1601.06497, 2016.
El-Roby, A., K. Ammar, A. Aboulnaga, and J. Lin, "Sapphire: Querying RDF Data Made Simple", Proceedings of the VLDB Endowment (PVLDB), vol. 9, issue 13, pp. 1481--1484, 2016.
Lin, J., C. Clarke, and G. Baruah, "Searching From Mars", IEEE Internet Computing, vol. 20, issue 1, pp. 78--82, 2016.
Clarke, C., G. Cormack, J. Lin, and A. Roegiest, "Ten Blue Links on Mars", ArXiv, vol. abs/1610.06468, 2016.
Tan, L., J. Lin, A. Roegiest, and C. Clarke, "The Effects of Latency Penalties in Evaluating Push Notification Systems", ArXiv, vol. abs/1606.03066, 2016.
Lin, J., and K. El Gebaly, "The Future of Big Data Is ... JavaScript?", IEEE Internet Computing, vol. 20, issue 5, pp. 82--88, 2016.
Tan, L., and C. Clarke, "A Family of Rank Similarity Measures Based on Maximized Effectiveness Difference", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 27, issue 11, pp. 2865--2877, 2015.
Chowdhury, S. Rahman, A. Raton Roy, M. Shaikh, and K. Daudjee, "A Taxonomy of Decentralized Online Social Networks", Peer-to-Peer Networking and Applications, vol. 8, issue 3, pp. 367--383, 2015.
Agrawal, D., A. El Abbadi, and K. Salem, "A Taxonomy of Partitioned Replicated Cloud-Based Database Systems", IEEE Data Engineering Bulletin, vol. 38, issue 1, pp. 4--9, 2015.
Clarke, C., S. J. Culpepper, and A. Moffat, "Assessing Efficiency-Effectiveness Tradeoffs in Multi-Stage Retrieval Systems Without Using Relevance Judgments", ArXiv, vol. abs/1506.00717, 2015.
Cormack, G., and M. Grossman, "Autonomy and Reliability of Continuous Active Learning for Technology-Assisted Review", ArXiv, vol. abs/1504.06868, 2015.
Aluç, G., T. Ozsu, and K. Daudjee, "Clustering RDF Databases Using Tunable-LSH", ArXiv, vol. abs/1504.02523, 2015.
He, X., G. Cormode, A. Machanavajjhala, C. M. Procopiuc, and D. Srivastava, "DPT: Differentially Private Trajectory Synthesis Using Hierarchical Reference Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 8, issue 11, pp. 1154--1165, 2015.
Kargar, M., L. Golab, and J. Szlichta, "Effective Keyword Search in Graphs", ArXiv, vol. abs/1512.06395, 2015.
Hanbury, A., H. Müller, K. Balog, T. Brodt, G. Cormack, I. Eggel, T. Gollub, F. Hopfgartner, J. Kalpathy-Cramer, N. Kando, et al., "Evaluation-as-a-Service: Overview and Outlook", ArXiv, vol. abs/1512.07454, 2015.
He, H., J. Lin, and A. Lopez, "Gappy Pattern Matching on GPUs for on-Demand Extraction of Hierarchical Translation Grammars", Transactions of the Association for Computational Linguistics, vol. 3, pp. 87--100, 2015.
Han, M., and K. Daudjee, "Giraph Unchained: Barrierless Asynchronous Parallel Execution in Pregel-Like Graph Processing Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 8, issue 9, pp. 950--961, 2015.
Lin, J., "Is Big Data a Transient Problem?", IEEE Internet Computing, vol. 19, issue 5, pp. 86--90, 2015.
Chu, X., M. Ouzzani, J. Morcos, I. Ilyas, P. Papotti, N. Tang, and Y. Ye, "KATARA: Reliable Data Cleaning With Knowledge Bases and Crowdsourcing", Proceedings of the VLDB Endowment (PVLDB), vol. 8, issue 12, pp. 1952--1955, 2015.
Buntain, C., J. Lin, and J. Golbeck, "Learning to Discover Key Moments in Social Media Streams", ArXiv, vol. abs/1508.00488, 2015.
Balkesen, C., J. Teubner, G. Alonso, and T. Ozsu, "Main-Memory Hash Joins on Modern Processor Architectures", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 27, issue 7, pp. 1754--1766, 2015.
Abu-Khzam, F. N., K. Daudjee, A. E. Mouawad, and N. Nishimura, "On Scalable Parallel Recursive Backtracking", Journal of Parallel and Distributed Computing, vol. 84, pp. 65--75, 2015.
Abedjan, Z., L. Golab, and F. Naumann, "Profiling Relational Data: A Survey", The VLDB Journal, vol. 24, issue 4, pp. 557--581, 2015.
Hopfgartner, F., A. Hanbury, H. Müller, N. Kando, S. Mercer, J. Kalpathy-Cramer, M. Potthast, T. Gollub, A. Krithara, J. Lin, et al., "Report on the Evaluation-as-a-Service (EaaS) Expert Workshop", SIGIR Forum, vol. 49, issue 1, pp. 57--65, 2015.
Arguello, J., M. Crane, F. Diaz, J. Lin, and A. Trotman, "Report on the SIGIR 2015 Workshop on Reproducibility, Inexplicability, And Generalizability of Results (RIGOR)", SIGIR Forum, vol. 49, issue 2, pp. 107--116, 2015.
Abdelaziz, I., R. Harbi, S. Salihoglu, P. Kalnis, and N. Mamoulis, "SPARTex: A Vertex-Centric Framework for RDF Data Analytics", Proceedings of the VLDB Endowment (PVLDB), vol. 8, issue 12, pp. 1880--1883, 2015.
Calvanese, D., M. Koubarakis, and D. Toman, "Special Issue of the Journal of Web Semantics on Ontology-Based Data Access", Journal of Web Semantics, vol. 33, pp. 1--2, 2015.
Zanibbi, R., K. Davila, A. Kane, and F. Tompa, "The Tangent Search Engine: Improved Similarity Metrics and Scalability For Math Formula Search", ArXiv, vol. abs/1507.06235, 2015.
Ilyas, I., and X. Chu, "Trends in Cleaning Relational Data: Consistency and Deduplication", Foundations and Trends in Databases, vol. 5, issue 4, pp. 281--393, 2015.
Tan, L., and C. Clarke, "A Family of Rank Similarity Measures Based on Maximized Effectiveness Difference", ArXiv, vol. abs/1408.3587, 2014.
Wu, J., A. K. Hudek, D. Toman, and G. Weddell, "Absorption for ABoxes", Journal of Automated Reasoning, vol. 53, issue 3, pp. 215--243, 2014.
Serafini, M., E. Mansour, A. Aboulnaga, K. Salem, T. Rafiq, and U. Farooq Minhas, "Accordion: Elastic Scalability for Database Systems Supporting Distributed Transactions", Proceedings of the VLDB Endowment (PVLDB), vol. 7, issue 12, pp. 1035--1046, 2014.
Han, M., K. Daudjee, K. Ammar, T. Ozsu, X. Wang, and T. Jin, "An Experimental Comparison of Pregel-Like Graph Processing Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 7, issue 12, pp. 1047--1058, 2014.
Chairunnanda, P., K. Daudjee, and T. Ozsu, "ConfluxDB: Multi-Master Replication for Partitioned Snapshot Isolation Databases", Proceedings of the VLDB Endowment (PVLDB), vol. 7, issue 11, pp. 947--958, 2014.
Golab, L., H. J. Karloff, F. Korn, B. Saha, and D. Srivastava, "Discovering Conservation Rules", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 26, issue 6, pp. 1332--1348, 2014.
Li, F., B. Chin Ooi, T. Ozsu, and S. Wu, "Distributed Data Management Using MapReduce", ACM Computing Surveys, vol. 46, issue 3, pp. 31:1--31:42, 2014.
Türe, F., and J. Lin, "Exploiting Representations From Statistical Machine Translation For Cross-Language Information Retrieval", ACM Transactions on Information Systems (TOIS), vol. 32, issue 4, pp. 19:1--19:32, 2014.
Zou, L., T. Ozsu, L. Chen, X. Shen, R. Huang, and D. Zhao, "gStore: A Graph-Based SPARQL Query Engine", The VLDB Journal, vol. 23, issue 4, pp. 565--590, 2014.
Afrati, F. N., M. Joglekar, C. Ré, S. Salihoglu, and J. D. Ullman, "GYM: A Multiround Join Algorithm in MapReduce", ArXiv, vol. abs/1410.4156, 2014.
Weissman, S., S. Ayhan, J. Bradley, and J. Lin, "Identifying Duplicate and Contradictory Information in Wikipedia", ArXiv, vol. abs/1406.1143, 2014.
Liu, X., and K. Salem, "Integrating SSD Caching Into Database Systems", IEEE Data Engineering Bulletin, vol. 37, issue 2, pp. 35--43, 2014.
Gebaly, K. El, P. Agrawal, L. Golab, F. Korn, and D. Srivastava, "Interpretable and Informative Explanations of Outcomes", Proceedings of the VLDB Endowment (PVLDB), vol. 8, issue 1, pp. 61--72, 2014.
Ashkan, A., and C. Clarke, "Location- And Query-Aware Modeling of Browsing and Click Behavior In Sponsored Search", ACM Transactions on Intelligent Systems and Technology (TIST), vol. 5, issue 4, pp. 59:1--59:31, 2014.
Quamar, A., A. Deshpande, and J. Lin, "NScale: Neighborhood-Centric Analytics on Large Graphs", Proceedings of the VLDB Endowment (PVLDB), vol. 7, issue 13, pp. 1673--1676, 2014.
Quamar, A., A. Deshpande, and J. Lin, "NScale: Neighborhood-Centric Large-Scale Graph Analytics in the Cloud", ArXiv, vol. abs/1405.1499, 2014.
Salihoglu, S., and J. Widom, "Optimizing Graph Algorithms on Pregel-Like Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 7, issue 7, pp. 577--588, 2014.
Peng, P., L. Zou, T. Ozsu, L. Chen, and D. Zhao, "Processing SPARQL Queries Over Linked Data-a Distributed Graph-Based Approach", ArXiv, vol. abs/1411.6763, 2014.
Gupta, P., V. Satuluri, A. Grewal, S. Gurumurthy, V. Zhabiuk, Q. Li, and J. Lin, "Real-Time Twitter Recommendation: Online Motif Detection in Large Dynamic Graphs", Proceedings of the VLDB Endowment (PVLDB), vol. 7, issue 13, pp. 1379--1380, 2014.
Albakour, M-D., C. Macdonald, I. Ounis, C. Clarke, and V. Bicer, "Report on the 1st International Workshop on Information Access In Smart Cities (I-Asc 2014)", SIGIR Forum, vol. 48, issue 2, pp. 96--104, 2014.
Balog, K., D. Elsweiler, E. Kanoulas, L. Kelly, and M. Smucker, "Report on the CIKM Workshop on Living Labs for Information Retrieval Evaluation", SIGIR Forum, vol. 48, issue 1, pp. 21--28, 2014.
Asadi, N., J. Lin, and A. P. de Vries, "Runtime Optimizations for Tree-Based Machine Learning Models", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 26, issue 9, pp. 2281--2292, 2014.
Beskales, G., I. Ilyas, L. Golab, and A. Galiullin, "Sampling From Repairs of Conditional Functional Dependency Violations", The VLDB Journal, vol. 23, issue 1, pp. 103--128, 2014.
P. Boykin, O., S. Ritchie, I. O'Connell, and J. Lin, "Summingbird: A Framework for Integrating Batch and Online MapReduce Computations", Proceedings of the VLDB Endowment (PVLDB), vol. 7, issue 13, pp. 1441--1451, 2014.
Dallachiesa, M., T. Palpanas, and I. Ilyas, "Top-K Nearest Neighbor Search in Uncertain Data Series", Proceedings of the VLDB Endowment (PVLDB), vol. 8, issue 1, pp. 13--24, 2014.
Toman, D., and G. Weddell, "Undecidability of Finite Model Reasoning in DLFD", ArXiv, vol. abs/1408.4468, 2014.
Aluç, G., T. Ozsu, and K. Daudjee, "Workload Matters: Why RDF Databases Need a New Design", Proceedings of the VLDB Endowment (PVLDB), vol. 7, issue 10, pp. 837--840, 2014.
Ozsu, T., "ACM Books to Launch", Communications of the ACM, vol. 56, issue 12, pp. 5, 2013.
Abu-Khzam, F. N., K. Daudjee, A. E. Mouawad, and N. Nishimura, "An Easy-to-Use Scalable Framework for Parallel Recursive Backtracking", ArXiv, vol. abs/1312.7626, 2013.
He, X., A. Machanavajjhala, and B. Ding, "Blowfish Privacy: Tuning Privacy-Utility Trade-Offs Using Policies", ArXiv, vol. abs/1312.3913, 2013.
Liu, R., A. Aboulnaga, and K. Salem, "DAX: A Widely Distributed Multi-Tenant Storage Service for DBMS Hosting", Proceedings of the VLDB Endowment (PVLDB), vol. 6, issue 4, pp. 253--264, 2013.
Chu, X., I. Ilyas, and P. Papotti, "Discovering Denial Constraints", Proceedings of the VLDB Endowment (PVLDB), vol. 6, issue 13, pp. 1498--1509, 2013.
Golab, L., M. Hadjieleftheriou, H. J. Karloff, and B. Saha, "Distributed Data Placement via Graph Partitioning", ArXiv, vol. abs/1312.0285, 2013.
Asadi, N., and J. Lin, "Document Vector Representations for Feature Extraction in Multi-Stage Document Ranking", Information Retrieval Journal, vol. 16, issue 6, pp. 747--768, 2013.
Asadi, N., J. Lin, and M. Busch, "Dynamic Memory Allocation Policies for Postings in Real-Time Twitter Search", ArXiv, vol. abs/1302.5302, 2013.
Lin, J., and M. Efron, "Evaluation as a Service for Information Retrieval", SIGIR Forum, vol. 47, issue 2, pp. 8--14, 2013.
Akinyemi, J. A., and C. Clarke, "Fast and Effective Soft Links", Software - Practice and Experience (SPE), vol. 43, issue 5, pp. 577--593, 2013.
Asadi, N., and J. Lin, "Fast Candidate Generation for Real-Time Tweet Search With Bloom Filter Chains", ACM Transactions on Information Systems (TOIS), vol. 31, issue 3, pp. 13, 2013.
Asadi, N., and J. Lin, "Fast, Incremental Inverted Indexing in Main Memory for Web-Scale Collections", ArXiv, vol. abs/1305.0699, 2013.
Capra, R., L. Freund, C. L. Smith, M. Smucker, and R. W. White, "HCIR 2013: The Seventh International Symposium on Human-Computer Interaction and Information Retrieval", SIGIR Forum, vol. 47, issue 2, pp. 33--40, 2013.
K. Kumar, A., J. Gluck, A. Deshpande, and J. Lin, "Hone: "Scaling Down" Hadoop on Shared-Memory Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 6, issue 12, pp. 1354--1357, 2013.
Liu, X., and K. Salem, "Hybrid Storage Management for Database Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 6, issue 8, pp. 541--552, 2013.
Ashkan, A., and C. Clarke, "Impact of Query Intent and Search Context on Clickthrough Behavior In Sponsored Search", Knowledge and Information Systems (KAIS), vol. 34, issue 2, pp. 425--452, 2013.
Golbus, P. B., J. A. Aslam, and C. Clarke, "Increasing Evaluation Sensitivity to Diversity", Information Retrieval Journal, vol. 16, issue 4, pp. 530--555, 2013.
Balkesen, C., G. Alonso, J. Teubner, and T. Ozsu, "Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited", Proceedings of the VLDB Endowment (PVLDB), vol. 7, issue 1, pp. 85--96, 2013.
Ebaid, A., A. K. Elmagarmid, I. Ilyas, M. Ouzzani, J-A. Quiané-Ruiz, N. Tang, and S. Yin, "NADEEF: A Generalized Data Cleaning System", Proceedings of the VLDB Endowment (PVLDB), vol. 6, issue 12, pp. 1218--1221, 2013.
Chen, T., L. Chen, T. Ozsu, and N. Xiao, "Optimizing Multi-Top-K Queries Over Uncertain Data Streams", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 25, issue 8, pp. 1814--1829, 2013.
Chen, L., I. Ilyas, C. Ré, and X. Zhou, "Probabilistic Web Data Management", World Wide Web (WWW), vol. 16, issue 3, pp. 271--272, 2013.
Minhas, U. Farooq, S. Rajagopalan, B. Cully, A. Aboulnaga, K. Salem, and A. Warfield, "RemusDB: Transparent High Availability for Database Systems", The VLDB Journal, vol. 22, issue 1, pp. 29--45, 2013.
Clarke, C., L. Freund, M. Smucker, and E. Yilmaz, "Report on the SIGIR 2013 Workshop on Modeling User Behavior For Information Retrieval Evaluation (MUBE 2013)", SIGIR Forum, vol. 47, issue 2, pp. 84--95, 2013.
Murdock, V., C. Clarke, J. Kamps, and J. Karlgren, "Report on the Workshop on Search and Exploration of X-Rated Information (Sexi 2013)", SIGIR Forum, vol. 47, issue 1, pp. 31--37, 2013.
Afrati, F. N., A. Das Sarma, S. Salihoglu, and J. D. Ullman, "Upper and Lower Bounds on the Cost of a Map-Reduce Computation", Proceedings of the VLDB Endowment (PVLDB), vol. 6, issue 4, pp. 277--288, 2013.
Lin, J., and G. Mishne, "A Study of "Churn" in Tweets and Real-Time Search Queries (Extended Version)", ArXiv, vol. abs/1205.6855, 2012.
Zou, L., L. Chen, T. Ozsu, and D. Zhao, "Answering Pattern Match Queries in Large Graph Databases via Graph Embedding", The VLDB Journal, vol. 21, issue 1, pp. 97--120, 2012.
Mishne, G., J. Dalton, Z. Li, A. Sharma, and J. Lin, "Fast Data in the Era of Big Data: Twitter's Real-Time Related Query Suggestion Architecture", ArXiv, vol. abs/1210.7350, 2012.
Beskales, G., I. Ilyas, L. Golab, and A. Galiullin, "On the Relative Trust Between Inconsistent Data and Inaccurate Constraints", ArXiv, vol. abs/1207.5226, 2012.
Trotman, A., C. Clarke, I. Ounis, S. J. Culpepper, M-A. Cartright, and S. Geva, "Open Source Information Petrieval: A Report on the SIGIR 2012 Workshop", SIGIR Forum, vol. 46, issue 2, pp. 95--101, 2012.
Asadi, N., J. Lin, and A. P. de Vries, "Runtime Optimizations for Prediction With Tree-Based Models", ArXiv, vol. abs/1212.2287, 2012.
Golab, L., T. Johnson, and V. Shkapenyuk, "Scalable Scheduling of Updates in Streaming Data Warehouses", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 24, issue 6, pp. 1092--1105, 2012.
Lin, J., and D. V. Ryaboy, "Scaling Big Data Mining Infrastructure: The Twitter Experience", SIGKDD Explorations, vol. 14, issue 2, pp. 6--19, 2012.
Beskales, G., G. Das, A. K. Elmagarmid, I. Ilyas, F. Naumann, M. Ouzzani, P. Papotti, J-A. Quiané-Ruiz, and N. Tang, "The Data Analytics Group at the Qatar Computing Research Institute", SIGMOD Record, vol. 41, issue 4, pp. 33--38, 2012.
Lee, G., J. Lin, C. Liu, A. Lorek, and D. V. Ryaboy, "The Unified Logging Infrastructure for Data Analytics at Twitter", Proceedings of the VLDB Endowment (PVLDB), vol. 5, issue 12, pp. 1771--1780, 2012.
Lee, G., J. Lin, C. Liu, A. Lorek, and D. V. Ryaboy, "The Unified Logging Infrastructure for Data Analytics at Twitter", ArXiv, vol. abs/1208.4171, 2012.
Afrati, F. N., A. Das Sarma, S. Salihoglu, and J. D. Ullman, "Upper and Lower Bounds on the Cost of a Map-Reduce Computation", ArXiv, vol. abs/1206.4377, 2012.
Afrati, F. N., A. Das Sarma, S. Salihoglu, and J. D. Ullman, "Vision Paper: Towards an Understanding of the Limits of Map-Reduce Computation", ArXiv, vol. abs/1204.1754, 2012.
Chen, G., H. Tam Vo, S. Wu, B. Chin Ooi, and T. Ozsu, "A Framework for Supporting DBMS-like Indexes in the Cloud", Proceedings of the VLDB Endowment (PVLDB), vol. 4, issue 11, pp. 702--713, 2011.
Ataullah, A. A., and F. Tompa, "Business Policy Modeling and Enforcement in Databases", Proceedings of the VLDB Endowment (PVLDB), vol. 4, issue 11, pp. 921--931, 2011.
Golab, L., F. Korn, and D. Srivastava, "Efficient and Effective Analysis of Data Quality Using Pattern Tableaux", IEEE Data Engineering Bulletin, vol. 34, issue 3, pp. 26--33, 2011.
Cormack, G., M. Smucker, and C. Clarke, "Efficient and Effective Spam Filtering and Re-Ranking for Large Web Datasets", Information Retrieval Journal, vol. 14, issue 5, pp. 441--465, 2011.
Zou, L., J. Mo, L. Chen, T. Ozsu, and D. Zhao, "gStore: Answering SPARQL Queries via Subgraph Matching", Proceedings of the VLDB Endowment (PVLDB), vol. 4, issue 8, pp. 482--493, 2011.
Yakout, M., A. K. Elmagarmid, J. Neville, M. Ouzzani, and I. Ilyas, "Guided Data Repair", Proceedings of the VLDB Endowment (PVLDB), vol. 4, issue 5, pp. 279--289, 2011.
Yakout, M., A. K. Elmagarmid, J. Neville, M. Ouzzani, and I. Ilyas, "Guided Data Repair", ArXiv, vol. abs/1103.3103, 2011.
Whissell, J. S., and C. Clarke, "Improving Document Clustering Using Okapi BM25 Feature Weighting", Information Retrieval Journal, vol. 14, issue 5, pp. 466--487, 2011.
Kane, A., and F. Tompa, "Janus: The Intertextuality Search Engine for the Electronic Manipulus Florum Project", Digital Scholarship in the Humanities (DSH), vol. 26, issue 4, pp. 407--415, 2011.
Wong, R. Chi- Wing, T. Ozsu, A. Wai- Chee Fu, P. S. Yu, L. Liu, and Y. Liu, "Maximizing Bichromatic Reverse Nearest Neighbor for L P -Norm In Two- And Three-Dimensional Spaces", The VLDB Journal, vol. 20, issue 6, pp. 893--919, 2011.
Minhas, U. Farooq, S. Rajagopalan, B. Cully, A. Aboulnaga, K. Salem, and A. Warfield, "RemusDB: Transparent High Availability for Database Systems", Proceedings of the VLDB Endowment (PVLDB), vol. 4, issue 11, pp. 738--748, 2011.
Belkin, N. J., C. Clarke, N. Gao, J. Kamps, and J. Karlgren, "Report on the SIGIR Workshop on "Entertain Me": Supporting Complex Search Tasks", SIGIR Forum, vol. 45, issue 2, pp. 51--59, 2011.
Kling, P., T. Ozsu, and K. Daudjee, "Scaling XML Query Processing: Distribution, Localization and Pruning", Distributed and Parallel Databases, vol. 29, issue 5-6, pp. 445--490, 2011.
Bateni, MH., L. Golab, MT. Hajiaghayi, and H. J. Karloff, "Scheduling to Minimize Staleness and Stretch in Real-Time Data Warehouses", Theory of Computing Systems, vol. 49, issue 4, pp. 757--780, 2011.
Chockler, G. V., E. Dekel, J. F. JáJá, and J. Lin, "Special Issue on Cloud Computing", Journal of Parallel and Distributed Computing, vol. 71, issue 6, pp. 731, 2011.
Macdonald, C., C. Clarke, and J. Wang, "The 1st International Workshop on Diversity in Document Retrieval", SIGIR Forum, vol. 45, issue 2, pp. 87--93, 2011.
Lo, E., C. Binnig, D. Kossmann, T. Ozsu, and W-K. Hon, "A Framework for Testing DBMS Features", The VLDB Journal, vol. 19, issue 2, pp. 203--230, 2010.
Soror, A. A., U. Farooq Minhas, A. Aboulnaga, K. Salem, P. Kokosielis, and S. Kamath, "Automatic Virtual Machine Configuration for Database Workloads", ACM Transactions on Database Systems (TODS), vol. 35, issue 1, pp. 7:1--7:47, 2010.
Soliman, M. A., I. Ilyas, and M. Saleeb, "Building Ranked Mashups of Unstructured Sources With Uncertain Information", Proceedings of the VLDB Endowment (PVLDB), vol. 3, issue 1, pp. 826--837, 2010.
Golab, L., H. J. Karloff, F. Korn, and D. Srivastava, "Data Auditor: Exploring Data Quality and Semantics Using Pattern Tableaux", Proceedings of the VLDB Endowment (PVLDB), vol. 3, issue 2, pp. 1641--1644, 2010.
Cormack, G., M. Smucker, and C. Clarke, "Efficient and Effective Spam Filtering and Re-Ranking for Large Web Datasets", ArXiv, vol. abs/1004.5168, 2010.
Srivastava, D., L. Golab, R. Greer, T. Johnson, J. Seidel, V. Shkapenyuk, O. Spatscheck, and J. Yates, "Enabling Real Time Data Analysis", Proceedings of the VLDB Endowment (PVLDB), vol. 3, issue 1, pp. 1--2, 2010.
Kling, P., T. Ozsu, and K. Daudjee, "Generating Efficient Execution Plans for Vertically Partitioned XML Databases", Proceedings of the VLDB Endowment (PVLDB), vol. 4, issue 1, pp. 1--11, 2010.
Ben-David, S., R. J. Trefler, and G. Weddell, "Model Checking Using Description Logic", Journal of Logic and Computation, vol. 20, issue 1, pp. 111--131, 2010.
Wang, Q., K. Daudjee, and T. Ozsu, "Popularity-Aware Prefetch in P2P Range Caching", Peer-to-Peer Networking and Applications, vol. 3, issue 2, pp. 145--160, 2010.
Pound, J., I. Ilyas, and G. Weddell, "QUICK: Expressive and Flexible Search Over Knowledge Bases and Text Collections", Proceedings of the VLDB Endowment (PVLDB), vol. 3, issue 2, pp. 1573--1576, 2010.
Azzopardi, L., K. Järvelin, J. Kamps, and M. Smucker, "Report on the SIGIR 2010 Workshop on the Simulation of Interaction", SIGIR Forum, vol. 44, issue 2, pp. 35--47, 2010.
Beskales, G., I. Ilyas, and L. Golab, "Sampling the Repairs of Functional Dependency Violations Under Hard Constraints", Proceedings of the VLDB Endowment (PVLDB), vol. 3, issue 1, pp. 197--207, 2010.
Stanchev, L., and G. Weddell, "Saving Space and Time Using Index Merging", Data & Knowledge Engineering (DKE), vol. 69, issue 10, pp. 1062--1080, 2010.
Soliman, M. A., I. Ilyas, and S. Ben-David, "Supporting Ranking Queries on Uncertain and Incomplete Data", The VLDB Journal, vol. 19, issue 4, pp. 477--501, 2010.
Ailamaki, A., L. M. Haas, H. V. Jagadish, D. Maier, T. Ozsu, and M. Winslett, "Time for Our Field to Grow Up", Proceedings of the VLDB Endowment (PVLDB), vol. 3, issue 2, pp. 1658, 2010.
Lin, J., C. G. Murray, B. J. Dorr, J. Hajic, and P. Pecina, "A Cost-Effective Lexical Acquisition Process for Large-Scale Thesaurus Translation", Language Resources and Evaluation (LRE), vol. 43, issue 1, pp. 27--40, 2009.
Klavans, J. L., C. Sheffield, E. G. Abels, J. Lin, R. J. Passonneau, T. Sidhu, and D. Soergel, "Computational Linguistics for Metadata Building (CLiMB): Using Text Mining for the Automatic Identification, Categorization, and Disambiguation Of Subject Terms for Image Metadata", Multimedia Tools and Applications, vol. 42, issue 1, pp. 115--138, 2009.
Wan, Q., R. Chi- Wing Wong, I. Ilyas, T. Ozsu, and Y. Peng, "Creating Competitive Products", Proceedings of the VLDB Endowment (PVLDB), vol. 2, issue 1, pp. 898--909, 2009.
Aboulnaga, A., K. Salem, A. A. Soror, U. Farooq Minhas, P. Kokosielis, and S. Kamath, "Deploying Database Appliances in the Cloud", IEEE Data Engineering Bulletin, vol. 32, issue 1, pp. 13--20, 2009.
Haas, P. J., I. Ilyas, G. M. Lohman, and V. Markl, "Discovering and Exploiting Statistical Properties for Query Optimization In Relational Databases: A Survey", Statistical Analysis and Data Mining, vol. 1, issue 4, pp. 223--250, 2009.
Zou, L., L. Chen, and T. Ozsu, "DistanceJoin: Pattern Match Query in a Large Graph Database", Proceedings of the VLDB Endowment (PVLDB), vol. 2, issue 1, pp. 886--897, 2009.
Wong, R. Chi- Wing, T. Ozsu, P. S. Yu, A. Wai- Chee Fu, and L. Liu, "Efficient Method for Maximizing Bichromatic Reverse Nearest Neighbor", Proceedings of the VLDB Endowment (PVLDB), vol. 2, issue 1, pp. 1126--1137, 2009.
Hawes, T., J. Lin, and P. Resnik, "Elements of a Computational Model for Multi-Party Discourse: The Turn-Taking Behavior of Supreme Court Justices", Journal of the Association for Information Science and Technology (JASIST), vol. 60, issue 8, pp. 1607--1615, 2009.
Ilyas, I., "Guest Editorial: Special Issue on Ranking in Databases", Distributed and Parallel Databases, vol. 26, issue 1, pp. 1--2, 2009.
Lin, J., "Is Searching Full Text More Effective Than Searching Abstracts?", BMC Bioinformatics, vol. 10, 2009.
Zou, L., L. Chen, and T. Ozsu, "K-Automorphism: A General Framework for Privacy Preserving Network Publication", Proceedings of the VLDB Endowment (PVLDB), vol. 2, issue 1, pp. 946--957, 2009.
Lin, J., and J. W. Wilbur, "Modeling Actions of PubMed Users With n-Gram Language Models", Information Retrieval Journal, vol. 12, issue 4, pp. 487--503, 2009.
Beskales, G., M. A. Soliman, I. Ilyas, and S. Ben-David, "Modeling and Querying Possible Repairs in Duplicate Detection", Proceedings of the VLDB Endowment (PVLDB), vol. 2, issue 1, pp. 598--609, 2009.
Aboulnaga, A., and K. Salem, "Report: 4th Int'l Workshop on Self-Managing Database Systems (SMDB 2009)", IEEE Data Engineering Bulletin, vol. 32, issue 4, pp. 2--5, 2009.
Golab, L., H. J. Karloff, F. Korn, A. Saha, and D. Srivastava, "Sequential Dependencies", Proceedings of the VLDB Endowment (PVLDB), vol. 2, issue 1, pp. 574--585, 2009.
Chockler, G. V., E. Dekel, J. F. JáJá, and J. Lin, "Special Issue of the Journal of Parallel and Distributed Computing: Cloud Computing", Journal of Parallel and Distributed Computing, vol. 69, issue 9, pp. 813, 2009.
El-Helw, A., I. Ilyas, and C. Zuzarte, "StatAdvisor: Recommending Statistical Views", Proceedings of the VLDB Endowment (PVLDB), vol. 2, issue 2, pp. 1306--1317, 2009.
Clarke, C., G. Cormack, T. R. Lynam, C. Buckley, and D. Harman, "Swapping Documents and Terms", Information Retrieval Journal, vol. 12, issue 6, pp. 680--694, 2009.
Jaeger, P. T., J. Lin, J. M. Grimes, and S. N. Simmons, "Where Is the Cloud? Geography, Economics, Environment, and Jurisdiction In Cloud Computing", First Monday, vol. 14, issue 5, 2009.
Li, Y., T. Ozsu, and K-L. Tan, "XCube: Processing XPath Queries in a Hypercube Overlay Network", Peer-to-Peer Networking and Applications, vol. 2, issue 2, pp. 128--145, 2009.
Ilyas, I., G. Beskales, and M. A. Soliman, "A Survey of Top-k Query Processing Techniques in Relational Database Systems", ACM Computing Surveys, vol. 40, issue 4, pp. 11:1--11:58, 2008.
Beskales, G., M. A. Soliman, and I. Ilyas, "Efficient Search for the Top-K Probable Nearest Neighbors in Uncertain Databases", Proceedings of the VLDB Endowment (PVLDB), vol. 1, issue 1, pp. 326--339, 2008.
Plattner, C., G. Alonso, and T. Ozsu, "Extending DBMSs With Satellite Databases", The VLDB Journal, vol. 17, issue 4, pp. 657--682, 2008.
Büttcher, S., and C. Clarke, "Hybrid Index Maintenance for Contiguous Inverted Lists", Information Retrieval Journal, vol. 11, issue 3, pp. 175--207, 2008.
Lin, J., M. DiCuccio, V. Grigoryan, and J. W. Wilbur, "Navigating Information Spaces: A Case Study of Related Article Search In PubMed", Information Processing and Management, vol. 44, issue 5, pp. 1771--1783, 2008.
Golab, L., H. J. Karloff, F. Korn, D. Srivastava, and B. Yu, "On Generating Near-Optimal Tableaux for Conditional Functional Dependencies", Proceedings of the VLDB Endowment (PVLDB), vol. 1, issue 1, pp. 376--390, 2008.
Toman, D., and G. Weddell, "On Keys and Functional Dependencies as First-Class Citizens in Description Logics", Journal of Automated Reasoning, vol. 40, issue 2-3, pp. 117--132, 2008.
Korth, H. F., P. A. Bernstein, M. F. Fernández, L. Gruenwald, P. G. Kolaitis, K. S. McKinley, and T. Ozsu, "Paper and Proposal Reviews: Is the Process Flawed?", SIGMOD Record, vol. 37, issue 3, pp. 36--39, 2008.
Soliman, M. A., I. Ilyas, and K. Chen- Chuan Chang, "Probabilistic Top-k and Ranking-Aggregate Queries", ACM Transactions on Database Systems (TODS), vol. 33, issue 3, pp. 13:1--13:54, 2008.
Ailamaki, A., S. Babu, P. Furtado, S. Lightstone, G. M. Lohman, P. Martin, V. R. Narasayya, G. Pauley, K. Salem, K-U. Sattler, et al., "Report: 3rd Int'l Workshop on Self-Managing Database Systems (SMDB 2008)", IEEE Data Engineering Bulletin, vol. 31, issue 4, pp. 2--5, 2008.
Zajic, D. M., B. J. Dorr, and J. Lin, "Single-Document and Multi-Document Summarization Techniques for Email Threads Using Sentence Compression", Information Processing and Management, vol. 44, issue 4, pp. 1600--1610, 2008.
Lin, J., P. Wu, and E. G. Abels, "Toward Automatic Facet Analysis and Need Negotiation: Lessons From Mediated Search", ACM Transactions on Information Systems (TOIS), vol. 27, issue 1, pp. 6:1--6:42, 2008.
Gil, J., W. Pugh, G. Weddell, and Y. Zibin, "Two-Dimensional Bidirectional Object Layout", ACM Transactions on Programming Languages and Systems (TOPLAS), vol. 30, issue 5, pp. 28:1--28:38, 2008.
Lin, J., "An Exploration of the Principles Underlying Redundancy-Based Factoid Question Answering", ACM Transactions on Information Systems (TOIS), vol. 25, issue 2, pp. 6, 2007.
Demner-Fushman, D., and J. Lin, "Answering Clinical Questions With Knowledge-Based and Statistical Techniques", Computational Linguistics, vol. 33, issue 1, pp. 63--103, 2007.
Zhang, H., N. Zhang, K. Salem, and D. Zhuo, "Compact Access Control Labeling for Efficient Secure XML Query Evaluation", Data & Knowledge Engineering (DKE), vol. 60, issue 2, pp. 326--344, 2007.
Bartolini, I., P. Ciaccia, V. Oria, and T. Ozsu, "Flexible Integration of Multimedia Sub-Queries With Qualitative Preferences", Multimedia Tools and Applications, vol. 33, issue 3, pp. 273, 2007.
Bartolini, I., P. Ciaccia, V. Oria, and T. Ozsu, "Flexible Integration of Multimedia Sub-Queries With Qualitative Preferences", Multimedia Tools and Applications, vol. 33, issue 3, pp. 275--300, 2007.
Callan, J., J. Allan, C. Clarke, S. T. Dumais, D. A. Evans, M. Sanderson, and CX. Zhai, "Meeting of the MINDS: An Information Retrieval Research Agenda", SIGIR Forum, vol. 41, issue 2, pp. 25--34, 2007.
Zajic, D. M., B. J. Dorr, J. Lin, and R. M. Schwartz, "Multi-Candidate Reduction: Sentence Compression as a Tool for Document Summarization Tasks", Information Processing and Management, vol. 43, issue 6, pp. 1549--1570, 2007.
Cormack, G., and T. R. Lynam, "Online Supervised Spam Filter Evaluation", ACM Transactions on Information Systems (TOIS), vol. 25, issue 3, pp. 11, 2007.
Kelly, D., and J. Lin, "Overview of the TREC 2006 ciQA Task", SIGIR Forum, vol. 41, issue 1, pp. 107--116, 2007.
Kantor, P. B., and J. Lin, "Presentation Schemes for Component Analysis in IR Experiments", SIGIR Forum, vol. 41, issue 1, pp. 34--39, 2007.
Lin, J., and J. W. Wilbur, "PubMed Related Articles: A Probabilistic Topic-Based Model for Content Similarity", BMC Bioinformatics, vol. 8, 2007.
Ilyas, I., and G. Das, "Report on the First International Workshop on Ranking in Databases (DBRank'07)", SIGMOD Record, vol. 36, issue 4, pp. 49--51, 2007.
Ailamaki, A., S. Chaudhuri, S. Lightstone, G. M. Lohman, P. Martin, K. Salem, and G. Weikum, "Report on the Second International Workshop on Self-Managing Database Systems (SMDB 2007)", IEEE Data Engineering Bulletin, vol. 30, issue 2, pp. 2--4, 2007.
Goodman, J., G. Cormack, and D. Heckerman, "Spam and the Ongoing Battle for the Inbox", Communications of the ACM, vol. 50, issue 2, pp. 24--33, 2007.
Chomicki, J., and D. Toman, "Special Issue: TIME 2005", Information and Computation, vol. 205, issue 1, pp. 1, 2007.
Lin, J., and J. W. Wilbur, "Syntactic Sentence Compression in the Biomedical Domain: Facilitating Access to Related Articles", Information Retrieval Journal, vol. 10, issue 4-5, pp. 393--414, 2007.
Lin, J., "User Simulations for Evaluating Answers to Question Series", Information Processing and Management, vol. 43, issue 3, pp. 717--729, 2007.
Ilyas, I., W. G. Aref, A. K. Elmagarmid, H. G. Elmongui, R. Shah, and J. Scott Vitter, "Adaptive Rank-Aware Query Optimization in Relational Databases", ACM Transactions on Database Systems (TODS), vol. 31, issue 4, pp. 1257--1304, 2006.
Büttcher, S., and C. Clarke, "Adding Full-Text Filesystem Search to Linux", login - The Usenix Magazine, vol. 31, issue 3, 2006.
M. Attar, H. Sheikh, and T. Ozsu, "Alternative Architectures and Protocols for Providing Strong Consistency In Dynamic Web Applications", World Wide Web (WWW), vol. 9, issue 3, pp. 215--251, 2006.
Lian, J., K. Naik, G. B. Agnew, L. Chen, and T. Ozsu, "BBS: An Energy Efficient Localized Routing Scheme for Query Processing In Wireless Sensor Networks", International Journal of Distributed Sensor Networks, vol. 2, issue 1, pp. 23--54, 2006.
Lin, J., and B. Katz, "Building a Reusable Test Collection for Question Answering", Journal of the Association for Information Science and Technology (JASIST), vol. 57, issue 7, pp. 851--861, 2006.
Cormack, G., "Email Spam Filtering: A Systematic Review", Foundations and Trends in Information Retrieval, vol. 1, issue 4, pp. 335--455, 2006.
Ögüdücü, S. Gündüz, and T. Ozsu, "Incremental Click-Stream Tree Model: Learning From New Users for Web Page Prediction", Distributed and Parallel Databases, vol. 19, issue 1, pp. 5--27, 2006.
Lin, J., and D. Demner-Fushman, "Methods for Automatically Evaluating Answers to Complex Questions", Information Retrieval Journal, vol. 9, issue 5, pp. 565--587, 2006.
Che, D., K. Aberer, and T. Ozsu, "Query Optimization in XML Structured-Document Databases", The VLDB Journal, vol. 15, issue 3, pp. 263--289, 2006.
Cormack, G., "Random Factors in IOI 2005 Test Case Scoring", Informatics in Education, vol. 5, issue 1, pp. 5--14, 2006.
Bratko, A., G. Cormack, B. Filipic, T. R. Lynam, and B. Zupan, "Spam Filtering Using Statistical Data Compression Models", Journal of Machine Learning Research (JMLR), vol. 7, pp. 2673--2698, 2006.
Cormack, G., I. J. Munro, T. Vasiga, and G. Kemkes, "Structure, Scoring and Purpose of Computing Competitions", Informatics in Education, vol. 5, issue 1, pp. 15--36, 2006.
Bernstein, P. A., E. Bertino, A. Heuer, C. S. Jensen, H. Meyer, T. Ozsu, R. T. Snodgrass, and K-Y. Whang, "An Apples-to-Apples Comparison of Two Database Journals", SIGMOD Record, vol. 34, issue 4, pp. 61--64, 2005.
Toman, D., and G. Weddell, "On Reasoning About Structural Equality in XML: A Description Logic Approach", Theoretical Computer Science, vol. 336, issue 1, pp. 181--203, 2005.
Bowman, I. T., and K. Salem, "Optimization of Query Streams Using Semantic Prefetching", ACM Transactions on Database Systems (TODS), vol. 30, issue 4, pp. 1056--1101, 2005.
Pacitti, E., C. Coulon, P. Valduriez, and T. Ozsu, "Preventive Replication in a Database Cluster", Distributed and Parallel Databases, vol. 18, issue 3, pp. 223--251, 2005.
Ozsu, T., D. Kossmann, and R. J. Miller, "Special Issue: Best Papers of VLDB 2004", The VLDB Journal, vol. 14, issue 4, pp. 355--356, 2005.
Clarke, C., N. Craswell, and I. Soboroff, "The TREC Terabyte Retrieval Track", SIGIR Forum, vol. 39, issue 1, pp. 25, 2005.
Voruganti, K., T. Ozsu, and R. C. Unrau, "An Adaptive Data-Shipping Architecture for Client Caching Data Management Systems", Distributed and Parallel Databases, vol. 15, issue 2, pp. 137--177, 2004.
Oria, V., T. Ozsu, and P. Iglinski, "Foundation of the DISIMA Image Query Languages", Multimedia Tools and Applications, vol. 23, issue 3, pp. 185--201, 2004.
Chen, L., T. Ozsu, and V. Oria, "MINDEX: An Efficient Index Structure for Salient-Object-Based Queries In Video Databases", Multimedia Systems, vol. 10, issue 1, pp. 56--71, 2004.
Ross, K. A., P. A. Boncz, I. Ilyas, V. Markl, and V. Vassalos, "Reminiscences on Influential Papers", SIGMOD Record, vol. 33, issue 4, pp. 91--92, 2004.
Gertz, M., T. Ozsu, G. Saake, and K-U. Sattler, "Report on the Dagstuhl Seminar: "Data Quality on the Web"", SIGMOD Record, vol. 33, issue 1, pp. 127--132, 2004.
Ilyas, I., W. G. Aref, and A. K. Elmagarmid, "Supporting Top-K Join Queries in Relational Databases", The VLDB Journal, vol. 13, issue 3, pp. 207--221, 2004.
Cox, A., and C. Clarke, "Three-Layered Source-Code Modelling", Electronic Notes in Theoretical Computer Science (ENTCS), vol. 94, pp. 71--79, 2004.
Berry, D. M., K. Daudjee, J. Dong, I. Fainchtein, M. Augusta V. Nelson, T. Nelson, and L. Ou, "User's Manual as a Requirements Specification: Case Studies", Requirements Engineering, vol. 9, issue 1, pp. 67--82, 2004.
Aref, W. G., A. Christine Catlin, A. K. Elmagarmid, J. Fan, M. A. Hammad, I. Ilyas, M. S. Marzouk, S. Prabhakar, Y-C. Tu, and X. Zhu, "VDBMS: A Testbed Facility for Research in Video Database Benchmarking", Multimedia Systems, vol. 9, issue 6, pp. 575--585, 2004.
Golab, L., and T. Ozsu, "Issues in Data Stream Management", SIGMOD Record, vol. 32, issue 2, pp. 5--14, 2003.
Ozsu, T., "New Partnership With ACM and Update on the Journal", The VLDB Journal, vol. 12, issue 1, pp. 1, 2003.
Young-Lai, M., and F. Tompa, "One-Pass Evaluation of Region Algebra Expressions", Information Systems, vol. 28, issue 3, pp. 159--168, 2003.
Bowman, I. T., and D. Toman, "Optimizing Temporal Queries: Efficient Handling of Duplicates", Data & Knowledge Engineering (DKE), vol. 44, issue 2, pp. 143--164, 2003.
Lushman, B., and G. Cormack, "Proof of Correctness of Ressel's adOPTed Algorithm", Information Processing Letters, vol. 86, issue 6, pp. 303--310, 2003.
Khizder, V. L., and G. Weddell, "Reasoning About Uniqueness Constraints in Object Relational Databases", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 15, issue 5, pp. 1295--1306, 2003.
Chomicki, J., D. Q. Goldin, G. M. Kuper, and D. Toman, "Variable Independence in Constraint Databases", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 15, issue 6, pp. 1422--1436, 2003.
Oria, V., and T. Ozsu, "Views or Points of View on Images", International Journal of Image and Graphics, vol. 3, issue 1, pp. 55--80, 2003.
Zhang, H., and F. Tompa, "XQuery Rewriting at the Relational Algebra Level", Computer Systems: Science & Engineering, vol. 18, issue 5, pp. 241--262, 2003.
Li, Q., and T. Ozsu, "Editorial: Introduction to Web Media Information Systems", World Wide Web (WWW), vol. 5, issue 2, pp. 179--180, 2002.
Cao, L. Y., and T. Ozsu, "Evaluation of Strong Consistency Web Caching Techniques", World Wide Web (WWW), vol. 5, issue 2, pp. 95--124, 2002.
Leontiev, Y., T. Ozsu, and D. Szafron, "On Type Systems for Object-Oriented Database Programming Languages", ACM Computing Surveys, vol. 34, issue 4, pp. 409--449, 2002.
Marathe, A. P., and K. Salem, "Query Processing Techniques for Arrays", The VLDB Journal, vol. 11, issue 1, pp. 68--91, 2002.
Attaluri, G. K., and K. Salem, "The Presumed-Either Two-Phase Commit Protocol", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 14, issue 5, pp. 1190--1196, 2002.
Chomicki, J., D. Toman, and M. H. Böhlen, "Querying ATSQL Databases With Temporal Logic", ACM Transactions on Database Systems (TODS), vol. 26, issue 2, pp. 145--178, 2001.
Aref, W. G., and I. Ilyas, "SP-GiST: An Extensible Database Index for Supporting Space Partitioning Trees", Journal of Intelligent Information Systems (JIIS), vol. 17, issue 2-3, pp. 215--240, 2001.
Ozsu, T., H-J. Schek, K. Tanaka, and Y. Zhang, "Special Issue on the 2nd Web Information Systems Engineering Conference (Wise'01)", World Wide Web (WWW), vol. 4, issue 3, pp. 147--149, 2001.
Goralwalla, I. A., Y. Leontiev, T. Ozsu, D. Szafron, and C. Combi, "Temporal Granularity: Completing the Puzzle", Journal of Intelligent Information Systems (JIIS), vol. 16, issue 1, pp. 41--63, 2001.
Chai, J. Y., J. Lin, W. Zadrozny, Y. Ye, M. Stys-Budzikowska, V. Horvath, N. Kambhatla, and C. G. Wolf, "The Role of a Natural Language Conversational Interface in Online Sales: A Case Study", International Journal of Speech Technology, vol. 4, issue 3-4, pp. 285--295, 2001.
Ozsu, T., and P. Iglinski, "An Interoperable Multimedia Catalog System for Electronic Commerce", IEEE Data Engineering Bulletin, vol. 23, issue 1, pp. 17--22, 2000.
Cormack, G., C. Clarke, C. R. Palmer, and S. S. L. To, "Passage-Based Query Refinement (MultiText Experiments for TREC-6)", Information Processing and Management, vol. 36, issue 1, pp. 133--153, 2000.
Clarke, C., G. Cormack, and E. A. Tudhope, "Relevance Ranking for One to Three Term Queries", Information Processing and Management, vol. 36, issue 2, pp. 291--311, 2000.
Ozsu, T., "Review - Record-Boundary Discovery in Web Documents", ACM SIGMOD Digital Review, vol. 2, 2000.
Clarke, C., and G. Cormack, "Shortest-Substring Retrieval and Ranking", ACM Transactions on Information Systems (TOIS), vol. 18, issue 1, pp. 44--78, 2000.
Young-Lai, M., and F. Tompa, "Stochastic Grammatical Inference of Text Database Structure", Machine Learning, vol. 40, issue 2, pp. 111--137, 2000.
Salminen, A., and F. Tompa, "Grammars++ for Modelling Information in Text", Information Systems, vol. 24, issue 1, pp. 1--24, 1999.
Brown, L. J., M. P. Consens, I. J. Davis, C. R. Palmer, and F. Tompa, "A Structured Text ADT for Object-Relational Databases", TAPOS - Theory and Practice of Object Systems, vol. 4, issue 4, pp. 227--244, 1998.
Goralwalla, I. A., D. Szafron, T. Ozsu, and R. J. Peters, "A Temporal Approach to Managing Schema Evolution in Object Database Systems", Data & Knowledge Engineering (DKE), vol. 28, issue 1, pp. 73--105, 1998.
Clarke, C., G. Cormack, and C. R. Palmer, "An Overview of MultiText", SIGIR Forum, vol. 32, issue 2, pp. 14--15, 1998.
Toman, D., and J. Chomicki, "Datalog With Integer Periodicity Constraints", Journal of Logic Programming, vol. 35, issue 3, pp. 263--290, 1998.
Dogac, A., C. Dengi, and T. Ozsu, "Distributed Object Computing Platforms", Communications of the ACM, vol. 41, issue 9, pp. 95--103, 1998.
Ozsu, T., and S. Christodoulakis, "Introduction (Special Issue on Multimedia Databases)", The VLDB Journal, vol. 7, issue 4, pp. 205, 1998.
Cowan, D. D., C. I. Mayfield, F. Tompa, and W. Gasparini, "New Role for Community Networks", Communications of the ACM, vol. 41, issue 4, pp. 61--63, 1998.
Zhou, M., and F. Tompa, "The Suffix-Signature Method for Searching for Phrases in Text", Information Systems, vol. 23, issue 8, pp. 567--588, 1998.
Akyürek, S., and K. Salem, "Adaptive Block Rearrangement Under UNIX", Software - Practice and Experience (SPE), vol. 27, issue 1, pp. 1--23, 1997.
Peters, R. J., and T. Ozsu, "An Axiomatic Model of Dynamic Schema Evolution in Objectbase Systems", ACM Transactions on Database Systems (TODS), vol. 22, issue 1, pp. 75--114, 1997.
Wong, J. W., K. A. Lyons, D. Evans, R. J. Velthuys, G. von Bochmann, E. Dubois, N. D. Georganas, G. W. Neufeld, T. Ozsu, J. Brinskelle, et al., "Enabling Technology for Distributed Multimedia Applications", IBM Systems Journal, vol. 36, issue 4, pp. 489--507, 1997.
Toman, D., "Memoing Evaluation for Constraint Extensions of Datalog", Constraints - An International Journal, vol. 2, issue 3/4, pp. 337--359, 1997.
Goralwalla, I. A., T. Ozsu, and D. Szafron, "Modeling Medical Trials in Pharmacoeconomics Using a Temporal Object Model", Computers in Biology and Medicine, vol. 27, issue 5, pp. 369--387, 1997.
Clarke, C., and G. Cormack, "On the Use of Regular Expressions for Searching Text", ACM Transactions on Programming Languages and Systems (TOPLAS), vol. 19, issue 3, pp. 413--426, 1997.
Clarke, C., and D. V. Mason, "Compacting Garbage Collection Can Be Fast and Simple", Software - Practice and Experience (SPE), vol. 26, issue 2, pp. 177--194, 1996.
Ozsu, T., and P. Valduriez, "Distributed and Parallel Database Systems", ACM Computing Surveys, vol. 28, issue 1, pp. 125--128, 1996.
Raymond, D. R., F. Tompa, and D. Wood, "From Data Representation to Data Model: Meta-Semantic Issues in The Evolution of SGML", Computer Standards & Interfaces, vol. 18, issue 1, pp. 25--36, 1996.
Ozsu, T., "Future of Database Systems: Changing Applications and Technological Developments", ACM Computing Surveys, vol. 28, issue 4es, pp. 85, 1996.
Duggan, D., G. Cormack, and J. Ophel, "Kinded Type Inference for Parametric Overloading", Acta Informatica, vol. 33, issue 1, pp. 21--68, 1996.
Akyürek, S., and K. Salem, "Adaptive Block Rearrangement", ACM Transactions on Computer Systems (TOCS), vol. 13, issue 2, pp. 89--121, 1995.
Clarke, C., G. Cormack, and F. J. Burkowski, "An Algebra for Structured Text Search and a Framework for Its Implementation", The Computer Journal, vol. 38, issue 1, pp. 43--56, 1995.
Ozsu, T., D. Szafron, G. El-Medani, and C. Vittal, "An Object-Oriented Multimedia Database System for a News-on-Demand Applications", Multimedia Systems, vol. 3, issue 5-6, pp. 182--203, 1995.
Chomicki, J., and D. Toman, "Implementing Temporal Integrity Constraints Using an Active DBMS", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 7, issue 4, pp. 566--582, 1995.
Ito, M., and G. Weddell, "Implication Problems for Functional Constraints on Databases Supporting Complex Objects", Journal of Computer and System Sciences (JCSS), vol. 50, issue 1, pp. 165--187, 1995.
Akyürek, S., and K. Salem, "Management of Partially Safe Buffers", IEEE Transactions on Computers, vol. 44, issue 3, pp. 394--407, 1995.
Garcia-Molina, H., and K. Salem, "Non-Deterministic Queue Operations", Journal of Computer and System Sciences (JCSS), vol. 51, issue 2, pp. 211--222, 1995.
Straube, D. D., and T. Ozsu, "Query Optimization and Execution Plan Generation in Object-Oriented Data Management Systems", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 7, issue 2, pp. 210--227, 1995.
Ozsu, T., R. J. Peters, D. Szafron, B. Irani, A. Lipka, and A. Muñoz, "TIGUKAT: A Uniform Behavioral Objectbase Management System", The VLDB Journal, vol. 4, issue 3, pp. 445--492, 1995.
Shen, J., and G. Cormack, "Access Control for Private Declarations in Ada", Computer Languages, Systems & Structures, vol. 20, issue 2, pp. 117--126, 1994.
Salem, K., H. Garcia-Molina, and J. Shands, "Altruistic Locking", ACM Transactions on Database Systems (TODS), vol. 19, issue 1, pp. 117--165, 1994.
Ito, M., and G. Weddell, "Implication Problems for Functional Constraints on Databases Supporting Complex Objects", Journal of Computer and System Sciences (JCSS), vol. 49, issue 3, pp. 726--768, 1994.
van Bommel, M. F., and G. Weddell, "Reasoning About Equations and Functional Dependencies on Complex Objects", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 6, issue 3, pp. 455--469, 1994.
Garcia-Molina, H., and K. Salem, "Services for a Workflow Management System", IEEE Data Engineering Bulletin, vol. 17, issue 1, pp. 40--44, 1994.
Pissinou, N., R. T. Snodgrass, R. Elmasri, I. Singh Mumick, T. Ozsu, B. Pernici, A. Segev, B. Theodoulidis, and U. Dayal, "Towards an Infrastructure for Temporal Databases: Report of an Invitational ARPA/NSF Workshop", SIGMOD Record, vol. 23, issue 1, pp. 35--51, 1994.
Ozsu, T., U. Dayal, and P. Valduriez, "Workshop Report: International Workshop on Distributed Object Management", SIGMOD Record, vol. 22, issue 1, pp. 40--54, 1993.
Garcia-Molina, H., and K. Salem, "Main Memory Database Systems: An Overview", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 4, issue 6, pp. 509--516, 1992.
Weddell, G., "Reasoning About Functional Dependencies Generalized for Semantic Data Models", ACM Transactions on Database Systems (TODS), vol. 17, issue 1, pp. 32--64, 1992.
G. Blake, E., T. Bray, and F. Tompa, "Shortening the OED: Experience With a Grammar-Defined Database", ACM Transactions on Information Systems (TOIS), vol. 10, issue 3, pp. 213--232, 1992.
Ozsu, T., and P. Valduriez, "Distributed Database Systems: Where Are We Now?", IEEE Computer, vol. 24, issue 8, pp. 68--78, 1991.
Garcia-Molina, H., D. Gawlick, J. Klein, K. Kleissner, and K. Salem, "Modeling Long-Running Activities as Nested Sagas", IEEE Data Engineering Bulletin, vol. 14, issue 1, pp. 14--18, 1991.
Buchmann, A. P., T. Ozsu, and D. Georgakopoulos, "Towards a Transaction Management System for DOM", GTE Laboratories Incorporated, vol. TR-0146-06-91-165, 1991.
Tompa, F., and J. I. Icaza, "Adaptive Selection of Query Execution Strategies by Learning Automata", Information Sciences, vol. 50, issue 3, pp. 219--240, 1990.
Ozsu, T., and D. J. Meechan, "Finding Heuristics for Processing Selection Queries in Relational Database Systems", Information Systems, vol. 15, issue 3, pp. 359--373, 1990.
Ozsu, T., and D. J. Meechan, "Join Processing Heuristics in Relational Database Systems", Information Systems, vol. 15, issue 4, pp. 429--444, 1990.
Dueck, G. D. P., and G. Cormack, "Modular Attribute Grammars", The Computer Journal, vol. 33, issue 2, pp. 164--172, 1990.
Straube, D. D., and T. Ozsu, "Queries and Query Processing in Object-Oriented Database Systems", ACM Transactions on Information Systems (TOIS), vol. 8, issue 4, pp. 387--430, 1990.
Salem, K., and H. Garcia-Molina, "System M: A Transaction Processing Testbed for Memory Resident Data", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 2, issue 1, pp. 161--172, 1990.
Tompa, F., "A Data Model for Flexible Hypertext Database Systems", ACM Transactions on Information Systems (TOIS), vol. 7, issue 1, pp. 85--100, 1989.
Salomon, D. J., and G. Cormack, "Corrections to the Paper: Scannerless NSLR(1) Parsing of Programming Languages", ACM SIGPLAN Notices, vol. 24, issue 11, pp. 80--83, 1989.
Raymond, D. R., A. J. Cañas, F. Tompa, and F. R. Safayeni, "Measuring the Effectiveness of Personal Database Structures", International Journal of Human-Computer Studies, vol. 31, issue 3, pp. 237--256, 1989.
Weddell, G., "Selection of Indexes to Memory-Resident Entities for Semantic Data Models", IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 1, issue 2, pp. 274--284, 1989.
Farrag, A. Aziz, and T. Ozsu, "Using Semantic Knowledge of Transactions to Increase Concurrency", ACM Transactions on Database Systems (TODS), vol. 14, issue 4, pp. 503--525, 1989.
Cormack, G., "A Micro-Kernel for Concurrency in C", Software - Practice and Experience (SPE), vol. 18, issue 5, pp. 485--491, 1988.
Raymond, D. R., and F. Tompa, "Hypertext and the Oxford English Dictionary", Communications of the ACM, vol. 31, issue 7, pp. 871--879, 1988.
Tompa, F., and J. A. Blakeley, "Maintaining Materialized Views Without Accessing Base Data", Information Systems, vol. 13, issue 4, pp. 393--406, 1988.
Garcia-Molina, H., and K. Salem, "The Impact of Disk Striping on Reliability", IEEE Data Engineering Bulletin, vol. 11, issue 1, pp. 26--39, 1988.
Alonso, R., H. Garcia-Molina, and K. Salem, "Concurrency Control and Recovery for Global Procedures in Federated Database Systems", IEEE Data Engineering Bulletin, vol. 10, issue 3, pp. 5--11, 1987.
Cormack, G., and N. R. Horspool, "Data Compression Using Dynamic Markov Modelling", The Computer Journal, vol. 30, issue 6, pp. 541--550, 1987.
R. Horspool, N., and G. Cormack, "Hashing as a Compaction Technique for LR Parser Tables", Software - Practice and Experience (SPE), vol. 17, issue 6, pp. 413--416, 1987.
Strothotte, T., and G. Cormack, "Structured Program Lookahead", Computer Languages, Systems & Structures, vol. 12, issue 2, pp. 95--108, 1987.
Farrag, A. Aziz, and T. Ozsu, "Towards a General Concurrency Control Algorithm for Database Systems", IEEE Transactions on Software Engineering (TSE), vol. 13, issue 10, pp. 1073--1079, 1987.
Medeiros, C. Bauzer, and F. Tompa, "Understanding the Implications of View Update Policies", Algorithmica, vol. 1, issue 3, pp. 337--360, 1986.
Cormack, G., "Data Compression on a Database System", Communications of the ACM, vol. 28, issue 12, pp. 1336--1342, 1985.
Ozsu, T., "Modeling and Analysis of Distributed Database Concurrency Control Algorithms Using an Extended Petri Net Formalism", IEEE Transactions on Software Engineering (TSE), vol. 11, issue 10, pp. 1225--1240, 1985.
Cormack, G., N. R. Horspool, and M. Kaiserswerth, "Practical Perfect Hashing", The Computer Journal, vol. 28, issue 1, pp. 54--58, 1985.
Cormack, G., and N. R. Horspool, "Algorithms for Adaptive Huffman Codes", Information Processing Letters, vol. 18, issue 3, pp. 159--165, 1984.
Gonnet, G. H., and F. Tompa, "A Constructive Approach to the Design of Algorithms and Their Data Structures", Communications of the ACM, vol. 26, issue 11, pp. 912--920, 1983.
Cormack, G., "Extensions to Static Scoping", ACM SIGPLAN Notices, vol. 18, issue 6, pp. 187--191, 1983.
Gonnet, G. H., P-Å. Larson, I. J. Munro, D. Rotem, D. J. Taylor, and F. Tompa, "Database Storage Structures Research at the University of Waterloo", IEEE Data Engineering Bulletin, vol. 5, issue 1, pp. 49--52, 1982.
Ramírez, R. J., F. Tompa, and I. J. Munro, "Optimum Reorganization Points for Arbitrary Database Costs", Acta Informatica, vol. 18, pp. 17--30, 1982.
Ling, T. Wang, F. Tompa, and T. Kameda, "An Improved Third Normal Form for Relational Databases", ACM Transactions on Database Systems (TODS), vol. 6, issue 2, pp. 329--346, 1981.
Tompa, F., J. Gecsei, and G. von Bochmann, "Special Feature: Data Structuring Facilities for Interactive Videotex Systems", IEEE Computer, vol. 14, issue 8, pp. 72--81, 1981.
Tompa, F., "A Practical Example of the Specification of Abstract Data Types", Acta Informatica, vol. 13, pp. 205--224, 1980.
Gotlieb, C. C., and F. Tompa, "Choosing a Storage Schema", Acta Informatica, vol. 3, pp. 297--319, 1974.
van Dam, A., and F. Tompa, "Software Data Paging and Segmentation for Complex Systems", Information Processing Letters, vol. 1, issue 3, pp. 80--86, 1972.
R. Bergeron, D., J. D. Gannon, D. P. Shecter, F. Tompa, and A. van Dam, "Systems Programming Languages", Advances in Computers, vol. 12, pp. 175--284, 1972.

Conference Paper

Arabzadeh, N., A. Bigdeli, and C. Clarke, "Adapting Standard Retrieval Benchmarks to Evaluate Generated Answers", European Conference on Information Retrieval (ECIR), 2024.
Usta, A., C. Liu, and S. Salihoglu, "Analysis of Open Government Datasets From a Data Design and Integration Perspective", International Conference on Extending Database Technology (EDBT), 2024.
Mousavi, A., X. Zhan, H. Bai, P. Shi, T. Rekatsinas, B. Han, Y. Li, J. Pound, J. M. Susskind, N. Schluter, et al., "Construction of Paired Knowledge Graph - Text Datasets Informed By Cyclic Evaluation", International Conference on Computational Linguistics (COLING), 2024.
Arabzadeh, N., and C. Clarke, "Fréchet Distance for Offline Evaluation of Information Retrieval Systems With Sparse Labels", Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024.
Lin, J., J. Li, J. Gao, W. Ma, and Y. Liu, "Jointly Modeling Spatio-Temporal Features of Tactile Signals for Action Classification", AAAI Conference on Artificial Intelligence (AAAI), 2024.
Arabzadeh, N., K. Golzadeh, C. Risi, C. Clarke, and J. Zhao, "KnowFIRES: A Knowledge-Graph Framework for Interpreting Retrieved Entities From Search", European Conference on Information Retrieval (ECIR), 2024.
Hebert, L., G. Sahu, Y. Guo, N. Kishore Sreenivas, L. Golab, and R. Cohen, "Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media", AAAI Conference on Artificial Intelligence (AAAI), 2024.
Esmaeilzadeh, A., J. Rorseth, A. Yu, P. Godfrey, L. Golab, D. Srivastava, J. Szlichta, and K. Taghva, "On Integrating the Data-Science and Machine-Learning Pipelines For Responsible AI", Workshop in Governance, Understanding and Integration of Data for Effective and Responsible AI (GUIDE-AI), 2024.
Sahu, S., and S. Salihoglu, "Optimizing Differential Computation for Large-Scale Graph Processing", International Workshop on Graph Data Management Experiences and Systems (GRADES), 2024.
Khalaji, M., T. Brown, K. Daudjee, and V. Aksenov, "Practical Hardware Transactional vEB Trees", ACM Symposium on Principles & Practice of Parallel Programming (PPoPP), 2024.
Bonifati, A., T. Ozsu, Y. Tian, H. Voigt, W. Yu, and W. Zhang, "The Future of Graph Analytics", ACM International Conference on Management of Data (SIGMOD), 2024.
Azzopardi, L., C. Clarke, P. B. Kantor, B. Mitra, J. R. Trippas, and Z. Ren, "The Search Futures Workshop", European Conference on Information Retrieval (ECIR), 2024.
Pradeep, R., and J. Lin, "Towards Automated End-to-End Health Misinformation Free Search With A Large Language Model", European Conference on Information Retrieval (ECIR), 2024.
Xian, J., T. Teofili, R. Pradeep, and J. Lin, "Vector Search With OpenAI Embeddings: Lucene Is All You Need", Web Search and Data Mining (WSDM), 2024.
Jiang, Z., M. Y. R. Yang, M. Tsirlin, R. Tang, Y. Dai, and J. Lin, ""Low-Resource" Text Classification: A Parameter-Free Classification Method With Compressors", Association for Computational Linguistics (ACL), 2023.
Arabzadeh, N., O. Kmet, B. Carterette, C. Clarke, C. Hauff, and P. Chandar, "A Is for Adele: An Offline Evaluation Metric for Instant Search", International Conference on the Theory of Information Retrieval (ICTIR), 2023.
Seifikar, M., L. Nhi Phan Minh, N. Arabzadeh, C. Clarke, and M. Smucker, "A Preference Judgment Tool for Authoritative Assessment", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Fernando, L., H. Bindra, and K. Daudjee, "An Experimental Analysis of Quantile Sketches Over Data Streams", International Conference on Extending Database Technology (EDBT), 2023.
Zhang, C., A. Bonifati, and T. Ozsu, "An Overview of Reachability Indexes on Graphs", ACM International Conference on Management of Data (SIGMOD), 2023.
Ma, X., T. Teofili, and J. Lin, "Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes", International Conference on Information and Knowledge Management (CIKM), 2023.
Zhong, W., Y. Xie, and J. Lin, "Answer Retrieval for Math Questions Using Structural and Dense Retrieval", Conference and Labs of the Evaluation Forum (CLEF), 2023.
Yang, J-H., C. Lassance, R. Sampaio de Rezende, K. Srinivasan, M. Redi, S. Clinchant, and J. Lin, "AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Oladipo, A., M. Adeyemi, O. Ahia, A. Toluwase Owodunni, O. Ogundepo, D. Ifeoluwa Adelani, and J. Lin, "Better Quality Pre-Training Data and T5 Models for African Languages", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Adeyemi, M., A. Oladipo, X. Zhang, D. Alfonso-Hermelo, M. Rezagholizadeh, B. Chen, and J. Lin, "CIRAL at FIRE 2023: Cross-Lingual Information Retrieval for African Languages", Forum for Information Retrieval Evaluation (FIRE), 2023.
Li, M., S-C. Lin, B. Oguz, A. Ghoshal, J. Lin, Y. Mehdad, W-tau. Yih, and X. Chen, "CITADEL: Conditional Token Interaction via Dynamic Lexical Routing For Efficient and Effective Multi-Vector Retrieval", Association for Computational Linguistics (ACL), 2023.
Rorseth, J., P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "CREDENCE: Counterfactual Explanations for Document Ranking", IEEE International Conference on Data Engineering (ICDE), 2023.
Wang, R., J. Wang, P. Kadam, T. Ozsu, and W. G. Aref, "dLSM: An LSM-Based Index for Memory Disaggregation", IEEE International Conference on Data Engineering (ICDE), 2023.
Chai, A., A. Vezvaei, L. Golab, M. Kargar, D. Srivastava, J. Szlichta, and M. Zihayat, "EAGER: Explainable Question Answering Using Knowledge Graphs", International Workshop on Graph Data Management Experiences and Systems (GRADES), 2023.
Ma, X., H. Fun, X. Yin, A. Mallia, and J. Lin, "Enhancing Sparse Retrieval via Unsupervised Learning", ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region (SIGIR-AP), 2023.
Kamalloo, E., X. Zhang, O. Ogundepo, N. Thakur, D. Alfonso-Hermelo, M. Rezagholizadeh, and J. Lin, "Evaluating Embedding APIs for Information Retrieval", Association for Computational Linguistics (ACL), 2023.
Kamalloo, E., N. Dziri, C. Clarke, and D. Rafiei, "Evaluating Open-Domain Question Answering in the Era of Large Language Models", Association for Computational Linguistics (ACL), 2023.
Hebert, L., L. Golab, P. Poupart, and R. Cohen, "FedFormer: Contextual Federation With Attention in Reinforcement Learning", International Joint Conference on Autonomous Agents & Multiagent Systems (AAMAS), 2023.
Bayat, F. Fatahi, K. Qian, B. Han, Y. Sang, A. Belyi, S. Khorshidi, F. Wu, I. Ilyas, and Y. Li, "FLEEK: Factual Error Detection and Correction With Evidence Retrieved From External Knowledge", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Piktus, A., O. Ogundepo, C. Akiki, A. Oladipo, X. Zhang, H. Schoelkopf, S. Biderman, M. Potthast, and J. Lin, "GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration", Association for Computational Linguistics (ACL), 2023.
Hu, L., L. Zou, and T. Ozsu, "GAMMA: A Graph Pattern Mining Framework for Large Graphs on GPU", IEEE International Conference on Data Engineering (ICDE), 2023.
Pang, Y., L. Yang, L. Zou, and T. Ozsu, "gFOV: A Full-Stack SPARQL Query Optimizer & Plan Visualizer", International Conference on Information and Knowledge Management (CIKM), 2023.
Liu, C., A. Usta, J. Zhao, and S. Salihoglu, "Governor: Turning Open Government Data Portals Into Interactive Databases", ACM Conference on Human Factors in Computing Systems (CHI), 2023.
Ilyas, I., JP. Lacerda, Y. Li, U. Farooq Minhas, A. Mousavi, J. Pound, T. Rekatsinas, and C. Sumanth, "Growing and Serving Large Open-Domain Knowledge Graphs", ACM International Conference on Management of Data (SIGMOD), 2023.
Pradeep, R., K. Hui, J. Gupta, Á. D. Lelkes, H. Zhuang, J. Lin, D. Metzler, and V. Q. Tran, "How Does Generative Retrieval Scale to Millions of Passages?", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Lin, S-C., A. Asai, M. Li, B. Oguz, J. Lin, Y. Mehdad, W-tau. Yih, and X. Chen, "How to Train Your Dragon: Diverse Augmentation Towards Generalizable Dense Retrieval", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Conia, S., M. Li, D. Lee, U. Farooq Minhas, I. Ilyas, and Y. Li, "Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Esmaeilzadeh, A., L. Golab, and K. Taghva, "InfoMoD: Information-Theoretic Model Diagnostics", International Conference on Statistical and Scientific Database Management (SSDBM), 2023.
Bianchi, A., R. Karegar, P. Godfrey, L. Golab, M. Kargar, D. Srivastava, and J. Szlichta, "iORDER: Mining Implicit Domain Orders", IEEE International Conference on Data Engineering (ICDE), 2023.
Jin, G., X. Feng, Z. Chen, C. Liu, and S. Salihoglu, "KÙZU Graph Database Management System", Conference on Innovative Data Systems Research (CIDR), 2023.
Kamalloo, E., C. Clarke, and D. Rafiei, "Limitations of Open-Domain Question Answering Benchmarks for Document-Level Reasoning", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Buchanan, G. Robert, D. McKay, and C. Clarke, "Made to Measure: A Workshop on Human-Centred Metrics for Information Seeking", Conference on Human Information Interaction and Retrieval (CHIIR), 2023.
Lin, S-C., A. Ahmad, and J. Lin, "mAggretriever: A Simple Yet Effective Approach to Zero-Shot Multilingual Dense Retrieval", Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.
Kamphuis, C., A. Lin, S. Yang, J. Lin, A. P. de Vries, and F. Hasibi, "MMEAD: MS MARCO Entity Annotations and Disambiguations", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Ghasemitaheri, S., A. Holcomb, L. Golab, and S. Keshav, "On the Data Quality of Remotely Sensed Forest Maps", Very Large Data Bases Conference (VLDB), 2023.
Zhong, W., S-C. Lin, J-H. Yang, and J. Lin, "One Blade for One Purpose: Advancing Math Information Retrieval Using Hybrid Search", International Conference on Research and Development in Information Retrieval (SIGIR), 2023.
Xin, J., R. Tang, Z. Jiang, Y. Yu, and J. Lin, "Operator Selection and Ordering in a Pipeline Approach to Efficiency Optimizations for Transformers", Association for Computational Linguistics (ACL), 2023.
Adeyemi, M., A. Oladipo, X. Crystina Zhang, D. Alfonso-Hermelo, M. Rezagholizadeh, B. Chen, and J. Lin, "Overview of the CIRAL Track at FIRE 2023: Cross-Lingual Information Retrieval for African Languages", Forum for Information Retrieval Evaluation (FIRE), 2023.
Feng, E., A. Borgida, E. Franconi, P. F. Patel-Schneider, D. Toman, and G. Weddell, "Path Description Dependencies in Feature-Based DLs", International Workshop on Description Logics (DL), 2023.
Faggioli, G., L. Dietz, C. Clarke, G. Demartini, M. Hagen, C. Hauff, N. Kando, E. Kanoulas, M. Potthast, B. Stein, et al., "Perspectives on Large Language Models for Relevance Judgment", International Conference on the Theory of Information Retrieval (ICTIR), 2023.
Tamber, M. Singh, R. Pradeep, and J. Lin, "Pre-Processing Matters! Improved Wikipedia Corpora for Open-Domain Question Answering", European Conference on Information Retrieval (ECIR), 2023.
Gao, L., X. Ma, J. Lin, and J. Callan, "Precise Zero-Shot Dense Retrieval Without Relevance Labels", Association for Computational Linguistics (ACL), 2023.
Ehrlinger, L., H. Harmouch, I. Ilyas, and F. Naumann, "Preface QDB", Very Large Data Bases Conference (VLDB), 2023.
Ozsu, T., and X. Xue, "Preface SDA", Very Large Data Bases Conference (VLDB), 2023.
Clarke, C., F. Diaz, and N. Arabzadeh, "Preference-Based Offline Evaluation", Web Search and Data Mining (WSDM), 2023.