Skip to content

mrezende/cbsoft-sbes-2020-short-paper

Repository files navigation

Short paper published in 34th Brazilian Symposium on Software Engineering (SBES '20) proceedings.

Title: CoNCRA: A Convolutional Neural Network Code Retrieval Approach

Authors: Marcelo de Rezende Martins and Marco Aurélio Gerosa

Abstract: Software developers routinely search for code using general-purpose search engines. However, these search engines cannot find code semantically unless it has an accompanying description. We propose a technique for semantic code search: A Convolutional Neural Network approach to code retrieval (CoNCRA). Our technique aims to find the code snippet that most closely matches the developer's intent, expressed in natural language. We evaluated our approach's efficacy on a dataset composed of questions and code snippets collected from Stack Overflow. Our preliminary results showed that our technique, which prioritizes local interactions (words nearby), improved the state-of-the-art (SOTA) by 5% on average, retrieving the most relevant code snippets in the top 3 (three) positions by almost 80% of the time. Therefore, our technique is promising and can improve the efficacy of semantic code retrieval.

Source code available at: https://github.com/mrezende/concra

Short paper available at: https://arxiv.org/abs/2009.01959

About

SBES 2020 34th Brazilian Symposium on Software Engineering – Innovative Ideas and Emerging Results Track

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages