A Big Data based Framework for executing complex query on Covid Data set in Apache Spark

  • M. Jayanth, Dr.K.RamMohanRao
Keywords: Spark, Scala, Hadoop, MapReduce, HDFS

Abstract

The outbreak of the novel coronavirus pandemic (COVID-19) is seriously threatening human health, life, production, social interactions, and international relations. Proper tools and methods need to be implemented and used to analyze the vast amount of COVID data, which helps the health industry track and minimize the effects of the virus. Researchers and developers need to identify the rate at which the virus spreads and gain a thorough understanding of the disease. COVID-19 cases occur in huge numbers in countries throughout the world, producing data for which only big data applications and NoSQL databases are suitable. Researchers, and especially application programmers who work on COVID databases, face difficulties because they must handle hybrid data models through different APIs and queries. Previous research has relied on Hadoop, HDFS/MapReduce, and NoSQL databases. This paper proposes a framework that uses Apache Spark with the Scala language to execute complex queries over a COVID data set. The proposed framework is divided into three phases: data collection, query processing, and analysis of the results. The framework uses Apache Spark for big data processing together with Scala, a programming language that is suited to parallel processing and flexible for the programmer. The proposed framework is flexible and requires less processing time.

Published
2021-10-29
How to Cite
Dr.K.RamMohanRao, M. J. (2021). A Big Data based Framework for executing complex query on Covid Data set in Apache Spark. Design Engineering, 8376-8383. Retrieved from http://www.thedesignengineering.com/index.php/DE/article/view/5878
Section
Articles