BATCH ANALYTICS
INTRODUCTION
This blog mainly consists of two parts.- Batch analytics
- Real-time analytics
- Data collection (Source of the data)
- Data processing
- Data visualization
First We'll look into the Batch analytics section.
BATCH ANALYTICS
Step 01 -Data Collection (Given Data Set)
Step 02 -Data Preprocessing
This involves the multiple steps for cleaning and pre-processing the dataset. There are two main packages in python which can be used to perform this: pandas and numpy.
Points to Remember:
The pandas package provides high-performance, easy to use structures and data analysis tools.
The pandas package provides high-performance, easy to use structures and data analysis tools.
- The numpy package is used to perform different operations.
- Below figures describe the process of the data pre-processing phase using Jupyter Notebook.
- Importing python packages & Understanding the data set.
- Checking the Data Types of the fields and null values.
- Pre Processing columns and rows which contain null values.
- Formatting the Date & Hour.
- Checking the Data.
- Extracting the cleaned data set to a new CSV file.

Step 02: Data Visualization
For this step, I used Tableau freely available software. Tableau can help anyone see and understand their data. Connect to almost any database, drag and drop to create visualizations, and share with a click.
Step 1: Connect to your data.
Step 2: Drag and drop to take the first look and select the most appropriate attributes for the scenarios
Step 3: Focus your results.
Step 4: Select the most suitable graph type.
Step 5: Add Filters.
Step 6: Build a dashboard to show your insights.









Comments
Post a Comment