Keyword extraction from users' requirements using textrank and frequency analysis, and their classification into ISO/IEC 25000 quality categories Conference Paper uri icon

abstract

  • Software requirements are essential for the correct development and planning of a software project. Each requirement is related to a software quality category, i.e. usability or maintainability, and their classification into these categories could greatly help the requirements analysis process. Requirements are usually expressed in natural language as written documents and many methods have been proposed for their automatic analysis and classification, based mainly on word frequency analysis. In this paper, a method for extracting keywords from users%27 written requirements using the TextRank technique and inverse frequency analysis is presented. These keywords represent relevant computing-related terms that can be mapped to a certain quality category which allows us to identify core terms that are of major relevance in the text of a given requirement. A total of 946 software requirements from six online datasets were analyzed and 390 keywords were extracted. The quality categories defined in the ISO/IEC 25000 standard will be used for keyword classification. © 2018 IEEE.
  • Software requirements are essential for the correct development and planning of a software project. Each requirement is related to a software quality category, i.e. usability or maintainability, and their classification into these categories could greatly help the requirements analysis process. Requirements are usually expressed in natural language as written documents and many methods have been proposed for their automatic analysis and classification, based mainly on word frequency analysis. In this paper, a method for extracting keywords from users' written requirements using the TextRank technique and inverse frequency analysis is presented. These keywords represent relevant computing-related terms that can be mapped to a certain quality category which allows us to identify core terms that are of major relevance in the text of a given requirement. A total of 946 software requirements from six online datasets were analyzed and 390 keywords were extracted. The quality categories defined in the ISO/IEC 25000 standard will be used for keyword classification. © 2018 IEEE.

publication date

  • 2019-01-01