Client Background

Client: A leading marketing firm in the USA

Industry Type:  Market Research

Services: Marketing, Consultancy

Organization Size: 100+

Project Description

Phase – 1: In this project first of all we have made heatmap between two columns named Author and Data Source. Then after two combining two tables named NY_data and nodeid_views made the report of all of the data.

Phase – 2: Success of story was given by if pageviews is more than 35000, if pageviews lies between 3500-35000 the story was labelled as needs improvement and if it was below 3500 the story was labelled as failure. 

Phase – 3: The powerbi report was made to find different insights in the data like different tables were drawn between different attributes of data like pie chart, time series chart, comparison charts. The data is updated every week and the report is generated automatically.

Our Solution

We provided them Phase 1 in the powerbi sql editor by combining two tables using sql queries. For phase 2 we just used the power bi program tool and written a script in Python to calculate the success of story. For Phase 3 we used the internal features of Power BI to find insights of the data.

Project Deliverables

We have provided a PowerBI report file as deliverable for the project.

Tools used

Python, PowerBI, Google Chrome

Language/techniques used

Python Programming and SQL queries editor.

Models used

Waterfall model used in this project.

Skills used

Data cleaning, Data Pre-processing, Data Visualisation are used in this project.

Databases used

We have used the traditional file systems as database storage.

What are the technical Challenges Faced during Project Execution

  1. Drawing heatmap in the PowerBI.
  2. Combining two tables on the basis of the pageviews.
  3. Converting the time series to data to 5 minute format.

How the Technical Challenges were Solved

We installed a new add on in the PowerBI to draw heatmap for the project and used the SQL editor to combine the tables on the basis of page views. We used python programming to convert the time series data to 5 minute time gap format.

Project Snapshots 

Project Video