Document Classification by Order of Context, Concept and Semantic Relations: OCCSR

Authors

  • A. Venkata Ramana Department of Computer Science, S. V. University College of CM & CS, Tirupati, Andhra Pradesh-517502, India.
  • E. Kesavulu Reddy Department of Computer Science, S. V. University College of CM & CS, Tirupati, Andhra Pradesh-517502, India.

DOI:

https://doi.org/10.9734/bpi/mono/978-93-5547-265-6/CH7

Keywords:

Concept relations, context relations, document classification, feature selection, semantic relations, supervised learning, text mining, OCCSR

Abstract

The contemporary study in text or document mining is focusing on syntactic components and the semantic environment. In order to accomplish this, and with the motivation gained from our previous research contributions, we investigated a mining model to classify documents based on the Order of Context, Concept, and Semantic Relations (OCCSR). This proposed model categorises documents into three levels: context, concept, and semantic. The document context is defined by the meta-data in the document, the concept is defined by the order of features, and semantic relations are assessed by correlating the activities observed in the documents. The experimental results show that the OCCSR has high classification accuracy,  scalable, and  robust. The study findings lead us to the conclusion that context similarity, in addition to concept and semantic similarity, is more important in achieving classification accuracy in supervised learning. The OCCSR is evaluated using a confusion matrix and discriminator metrics. The model developed here is extremely useful, particularly for assessing the relationship of documents published in social communities such as electronic journals, publishers, and blogs.

Published

2021-12-21

How to Cite

A. Venkata Ramana, & E. Kesavulu Reddy. (2021). Document Classification by Order of Context, Concept and Semantic Relations: OCCSR. Research Issues on Datamining, 75–86. https://doi.org/10.9734/bpi/mono/978-93-5547-265-6/CH7

Issue

Section

Contents