1. How to submit my research paper? What’s the process of publication of my paper?
The journal receives submitted manuscripts via email only. Please submit your research paper in .doc or .pdf format to the submission email: jait@etpub.com.
2.Can I submit an abstract?
The journal publishes full research papers. So only full paper submission should be considered for possible publication. Papers with insufficient content may be rejected as well, make sure your paper is sufficient enough to be published...[Read More]

An Efficient Keyword Based Search of Big Data Using Map Reduce

P. Srinivasa Rao 1, M. H. M. Krishna Prasad 2, and K. Thammi Reddy 3
1. Department of CSE, MVGRCE, Vizianagaram
2. Department of CSE, JNTUK, Kakinada
3. Department of CSE, GITAM University, Visakhapatnam
Abstract—With the arrival of the data deluge, traditional and centralized tools used to extract knowledge from data become obsolete due to their limited ability to handle massive data. To cope with the need for scalable solutions, a new framework has emerged: Hadoop, an open-source ecosystem designed for storage and large-scale processing work on a cluster of commodity hardware. In order to overcome the limitations in key word based information retrieval systems, an efficient methodology has been designed. A system with the new approach mimics the real world, where every task is laced with certain indexing as this is basic idea behind knowledge processing. Hadoop and R: open source frame works for storing and processing large datasets, are used for preprocessing the text documents. First, a set of text documents are considered. Preprocessing is performed on a large domain of data using R. This includes the removal of the stop words along with stemming and excluding less frequency words. Despite this preprocessing, owing to the colossal number of index terms still floating in the considered domain data, the problem of high dimensionality is encountered. Therefore the dimensionality of such a group of terms is reduced by incorporating a keyword based methodology in Hadoop MapReduce Framework. The developed Model is useful for processing the query which gives us the relevant information with low response time from the data pool considered.

Index Terms—Hadoop, MapReduce, Bigdata, HDFS, information retrieval systems

Cite: P. Srinivasa Rao, M. H. M. Krishna Prasad, and K. Thammi Reddy, "An Efficient Keyword Based Search of Big Data Using Map Reduce," Vol. 8, No. 3, pp. 159-164, August, 2017. doi: 10.12720/jait.8.3.159-164
Copyright © 2013-2016 Journal of Advances in Information Technology, All Rights Reserved
E-mail: jait@etpub.com