Client Background
Client: A leading research institution in the word
Industry Type: Research, R&D
Services: R&D
Organization Size: 1000+
Project Objective
Make data ready for predictive modelling.
Making Google Data Studio dashboard.
Project Description
Phase – 1: In this project first of all we have to clean the data as the data was very noisy, we have to filter out only the needed columns of the data.
Phase – 2: Finding co-relation between the pitchbook data and the other output files.
Phase – 3: Making dashboard in Google Data Studio for the project.
Our Solution
We used pandas and numpy to clean the data and make useful for it to be used in predictive modelling. We have found the co-relation between the tempa msa pitchbook data and the output files like textual file, ai_ml_tm file etc. We have made the dashboard using the Google Data Studio.
Project Deliverables
We have provided a excel file consisting of clean data and the Google Data Studio report.
Tools used
Python, Google Data Studio, Google Chrome
Language/techniques used
Python Programming
Models used
Waterfall model used in this project.
Skills used
Data cleaning, Data Pre-processing, Data Visualisation are used in this project.
Databases used
We have used the traditional file systems as database storage.
What are the technical Challenges Faced during Project Execution
Cleaning the data was the major challenge faced while executing the project. The data has a lot of noise. It was difficult to find which data was useful and which data is not useful in this project. Secondly the co relation between the output files and pitchbook data. There was nothing common between both the datasets. So was difficult to find co-relation between them.
How the Technical Challenges were Solved
We used pandas dataframe to clean the data and make it ready for predictive modelling and used the Google Data studio to find insights between the different datasets.