Please note: This PhD seminar will take place in DC 2585 and online.
Yiwen Dong, PhD candidate
David R. Cheriton School of Computer Science
Supervisor: Professor Chengnian Sun
Code snippets are prevalent on websites such as Stack Overflow and are effective in demonstrating API usages concisely. However, they are usually difficult to be used directly because most code snippets not only are syntactically incomplete but also lack dependency information, and thus do not compile. For example, Java snippets usually do not have import statements or required library names; only 6.88% of Java snippets on Stack Overflow include import statements necessary for compilation.
This talk focuses on SnR, a precise, efficient, constraint-based technique to automatically infer the exact types used in code snippets and the libraries containing the inferred types, to compile and therefore reuse the code snippets. SnR builds a knowledge base of APIs, i.e., various facts about the available APIs, from a corpus of Java libraries. Given a code snippet with missing import statements, SnR automatically extracts typing constraints from the snippet, solves the constraints against the knowledge base, and returns a set of APIs that satisfies the constraints to be imported into the snippet.
We found SnR to significantly outperform other state-of-the-art solutions. On a benchmark of 267 code snippets from Stack Overflow, SnR correctly infers 91% of the import statements, which makes 73.8% of the snippets compilable.