High Speed Classification of Massive Data Streaming Using Spark

M.JALASRI; AARTHIKA.K; VISHNU PRIYA.A

Paper Details

📄 IJAERD-OJS-2460

High Speed Classification of Massive Data Streaming Using Spark

Author(s)	:	M.JALASRI, AARTHIKA.K, VISHNU PRIYA.A
Institution	:	Asst. Professor, Information Technology, JeppiaarMaamallan Engineering College
Published In	:	Vol. 5, Issue 2 — February 2018
Page No.	:	669-673
Domain	:	Engineering
Type	:	Research Paper
ISSN (Online)	:	2348-4470
ISSN (Print)	:	2348-6406

Abstract

Big data analytics deals with the mining of massive and high speed data streams with contemporarychallenges—In this paper we perform an efficient nearest neighbor solution to classify high-speed and massive datastreams using Apache Spark. A distributed metric tree has been designed to organize the case-base and consequently tospeed up the neighbor searches. DS-RNGE algorithm is an instance selection method to find out the object in the nearestneighbor searches .Resilient distributed data set is a base to check the record in searches .Smart partitioning of theincoming data streams to parallelize the proposed algorithm using Apache Kafka which is a Spark tool to process thehuge amount of data.. Spark is able to load data into memory and query it repeatedly, making it suitable for iterativeprocesses (e.g., machine learning algorithms). Pseudo Random mode is used to partition the data in effective mannercompared to references. We use the hashing algorithm to detect the duplicate records. Our work is used sequentially forreal time entities of analyzing the live streaming records in nearest neighbor searches.

🗎 Download PDF 🏆 Get Certificate

🕮 How to Cite

M.JALASRI, AARTHIKA.K, VISHNU PRIYA.A, “High Speed Classification of Massive Data Streaming Using Spark”, International Journal of Advance Engineering and Research Development (IJAERD), Vol. 5, Issue 2, pp. 669-673, February 2018.

📄 Submit Your Paper

Open Access • Peer Reviewed • CrossRef DOI
UGC Approved • Monthly Publication

Submit Now →

📅 Submission Deadline

30 Apr 2026

Vol. 13 | Issue 4
April 2026

📄 Journal Information

Journal Name	:	IJAERD — Int. Journal of Advance Engineering and Research Development
ISSN (Online)	:	2348-4470
ISSN (Print)	:	2348-6406
Impact Factor	:	7.37 (SJIF 2026)
Frequency	:	Monthly (12 Issues/Year)
Started Year	:	2014
Language	:	English
Format	:	Online & Print
Review Type	:	Double-Blind Peer Review
Review Time	:	2 – 3 Working Days
Access Type	:	Open Access
DOI	:	CrossRef DOI Assigned
UGC Approved	:	Yes
Discipline	:	Engineering & Technology
Email	:	editor.ijaerd@gmail.com
Website	:	https://www.ijaerd.org

🇮🇳 Indian Authors	:	₹ 1,500
🌍 International	:	USD 50
Payment Mode	:	UPI / NEFT / PayPal
Fee Charged	:	Only after acceptance
At Submission	:	No Fee

Email	:	editor.ijaerd@gmail.com
Website	:	https://www.ijaerd.org
Hours	:	Mon–Sat, 10AM–6PM IST
Response	:	Within 1–2 working days