An inception architecture-based model for improving code readability classification

Qing Mi, Jacky Keung, Yan Xiao, Solomon Mensah, Xiupei Mei

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

14 Citations (Scopus)

Abstract

The process of classifying a piece of source code into a Readable or Unreadable class is referred to as Code Readability Classification. To build accurate classification models, existing studies focus on handcrafting features from different aspects that intuitively seem to correlate with code readability, and then exploring various machine learning algorithms based on the newly proposed features. On the contrary, our work opens up a new way to tackle the problem by using the technique of deep learning. Specifically, we propose IncepCRM, a novel model based on the Inception architecture that can learn multi-scale features automatically from source code with little manual intervention. We apply the information of human annotators as the auxiliary input for training IncepCRM and empirically verify the performance of IncepCRM on three publicly available datasets. The results show that: 1) Annotator information is beneficial for model performance as confirmed by robust statistical tests (i.e., the Brunner-Munzel test and Cliff's delta); 2) IncepCRM can achieve an improved accuracy against previously reported models across all datasets. The findings of our study confirm the feasibility and effectiveness of deep learning for code readability classification.

Original languageEnglish
Title of host publicationProceedings of the 22nd International Conference on Evaluation and Assessment in Software Engineering 2018, EASE 2018
PublisherAssociation for Computing Machinery
ISBN (Print)9781450364034
DOIs
Publication statusPublished - 28 Jun 2018
Externally publishedYes
Event22nd International Conference on Evaluation and Assessment in Software Engineering, EASE 2018 - Christchurch
Duration: 28 Jun 201829 Jun 2018

Publication series

NameACM International Conference Proceeding Series
VolumePart F137700

Conference

Conference22nd International Conference on Evaluation and Assessment in Software Engineering, EASE 2018
Country/TerritoryNew Zealand
CityChristchurch
Period28/06/1829/06/18

Keywords

  • Code Readability Classification
  • Deep Learning
  • Empirical Software Engineering
  • Inception Architecture

Fingerprint

Dive into the research topics of 'An inception architecture-based model for improving code readability classification'. Together they form a unique fingerprint.

Cite this