This paper is published in Volume-4, Issue-1, 2018
Area
Computer Science
Author
Suruchi Padhy, Dr. Shashi Kumar D R
Org/Univ
Cambridge Institute of Technology, Bangalore, Karnataka, India
Keywords
Big Data, Apache Hadoop, HDFS, MapReduce
Citations
IEEE
Suruchi Padhy, Dr. Shashi Kumar D R. Big Data Analysis Using Apache Hadoop, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.
APA
Suruchi Padhy, Dr. Shashi Kumar D R (2018). Big Data Analysis Using Apache Hadoop. International Journal of Advance Research, Ideas and Innovations in Technology, 4(1) www.IJARIIT.com.
MLA
Suruchi Padhy, Dr. Shashi Kumar D R. "Big Data Analysis Using Apache Hadoop." International Journal of Advance Research, Ideas and Innovations in Technology 4.1 (2018). www.IJARIIT.com.
Suruchi Padhy, Dr. Shashi Kumar D R. Big Data Analysis Using Apache Hadoop, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.
APA
Suruchi Padhy, Dr. Shashi Kumar D R (2018). Big Data Analysis Using Apache Hadoop. International Journal of Advance Research, Ideas and Innovations in Technology, 4(1) www.IJARIIT.com.
MLA
Suruchi Padhy, Dr. Shashi Kumar D R. "Big Data Analysis Using Apache Hadoop." International Journal of Advance Research, Ideas and Innovations in Technology 4.1 (2018). www.IJARIIT.com.
Abstract
Traditional data management, warehousing, and analysis systems fall short of tools to analyze this data. Using traditional DBMS techniques like Joins and Indexing and other techniques like graph search is tedious and time-consuming.
In this paper, we suggest various methods for catering to the problems in hand through Map-Reduce framework over Hadoop Distributed File System (HDFS). Map Reduce is a Minimization technique which makes use of file indexing with mapping, sorting, shuffling and finally reducing. Map Reduce techniques have been studied in this paper which is implemented for Big Data analysis using HDFS.