Skip to content

Amanpreet04-1993/Data-Analytics-Visualization-ACCENTURE-Remote-internship

Repository files navigation

Data-Analytics-and-Visualization - ACCENTURE-Remote Internship (Mega Project)

Acc_Logo

Your are assigned a new project for a client called Social Buzz Co.

https://app.powerbi.com/groups/me/reports/efa46ead-1f72-41a5-bee5-8d34fdfeb2ae/cc8f6a73f66c4e2bb90c?experience=power-bi

Task 1: Project Understanding

What you'll learn-

What are the responsibilities of a Data Analyst at Accenture. What is a typical project team structure. How to review a client project brief.

What you'll do-

Meet your Accenture team, including the industry, IPO and data experts Read and analyze a client project brief to understand the client and business problem, identify the requirements that need to be delivered, and identify which tasks you need to focus on as a data analyst

let's get started!

Here are some key facts about this program:

You are working as a Data Analyst at Accenture.
You work within a larger team, where each member has a different role and level of responsibility.
Your team has been assigned a new project for a client called (Social Buzz).
this is an exciting opportunity for you to showcase your data analysis and visualization skills.

Your first task is to read the brief from Social Buzz and complete a short knowledge check before the call.

One of Accenture’s Managing Directors, Mae Mulligan, is the client lead for Social Buzz.

She has reviewed the brief provided by Social Buzz and has assembled a diverse team of Accenture experts to deliver the project.

Mae has scheduled a project kick off call with the internal Accenture project.

Ahead of the call, Mae has shared the project brief so you can get up to speed on what Social Buzz need help with.

Read the brief to:

Understand the client and business problem at hand.
Identify the requirements that need to be delivered for this project.
Identify which tasks you should focus on as a Data Analyst.

Data_Analytics Client Brief.pdf

Question 1
Which statement best describes the business problem that Accenture is tasked to address for this project?
Ans - The client has reached a massive scale within recent years &
does not have the resources internally to handle it.

Spot on, the client stated that scale was a big problem of theirs & they are struggling to manage the scale with the resources that they currently have. The brief said that the client is looking for help with the management of their journey into such a large scale.

Question 2
Which statement lists the three requirements that Accenture is tasked to fulfill for this project?
Ans - Audit of big data practice, recommendations for IPO, analysis of popular content

SS

: Project Understanding

Question 1
Which task out of these options is the most relevant to you as a data analyst, and therefore which one will you work on?
Ans - Analysis of sample data sets with visualizations to understand the popularity of different content categories

This is exactly the task a data analyst would be focusing on and this is what you will be working on!


Task 2: Data Cleaning & Modeling

What you'll learn

How to clean, model and analyze data to create valuable insights for the client

What you'll do

Identify which datasets will be required to answer the client’s business question Clean the datasets and merge them to prepare the data for analysis Determine the answer to the client’s business question

Let's dive into the data, The client has sent through:

7 data sets - each data set contains different columns and values
A data model - this shows the relationships between all of the data sets, as well as any links that you can use to merge tables.

There is a lot of information here and it’s easy to get lost in the data. So, to make sure you are using the right data to answer the business questions you’ll follow these steps:

1. Requirements gathering
2. Data cleaning
3. Data modelling

First up, requirements gathering

As we mentioned, you have been sent 7 datasets and a data model.

Often you won’t need all these datasets to find what you’re looking for.

So, the first step is to use this data model to identify which datasets will be required to answer your business question - which is to to figure out the top 5 categories with the largest popularity.

Data model - Task 2.pdf

Definitions of different data types:

String - Sequence of characters, digits, or symbols—always treated as text
UUID - Universally Unique Identifiers
Array - List with a number of elements in a specific order—typically of the same type
Integer - Numeric data type for numbers without fractions
Timestamp - Number of seconds that have elapsed since midnight (00:00:00 UTC), 1st January 1970 (Unix time)

Source: Direct extract from Amplitude - https://amplitude.com/blog/data-types

Question 1
Which three data sets will you need to complete your analysis?
Ans - Reaction, Content, Reaction Types

Data sets - Quick Explanation

You’ve identified Reaction, Content, and Reaction Types as our relevant data sets.

To clarify why you made this selection:

1. The brief carefully it states that the client wanted to see 
  “An analysis of their content categories showing the top 5 categories with the largest popularity”.
 2. As explained in the data model, popularity is quantified by the “Score” given to each reaction type.
  3. We therefore need data showing the content ID, category, content type, reaction type, and reaction score.
   4. So, to figure out popularity, we’ll have to add up which content categories have the largest score.

But! Before we begin to work with the data sets, we’ll need to ensure that the data is clean and ready for analysis…

Data Cleaning-

Data cleaning is a common and very important task when working with data.

What you need to do:

First: Open the three data sets below - (Raw File)

Uploading ReactionTypes (1).csv…

Uploading Reactions (1).csv…

Uploading Content (1).csv…

**Data Modelling ** Now we want to figure out the top 5 categories. To complete your data modelling, follow these steps:

  1. Create a final data set by merging your three tables together

    Ans -  “VLookUp” formula - the Reaction table as base table,
     joined relevant columns from Content data set, and then the Reaction Types data set.
    
  2. Figure out the Top 5 performing categories

    Ans -  Added up the total scores for each category  “Sum If” formula
    
  3. The end result should be one spreadsheet which contains:

A cleaned dataset The top 5 categories

Tool : Excel (To create your final data set)


Final Master File - Task 2 master copy (Data Modelling).csv


Task 3: Data Visualization & Storytelling

What you'll learn

How to connect the business and the data, & how to communicate relevant and engaging insights in a presentation.

What you'll do

The data visualizations that best support the story you want to tell to the client

PowerPoint presentation that reports on the client’s content performance For each slide, think about:

Agenda - What will your presentation cover?

Project Recap - What are the key points from the brief?
Problem - What is the problem that you answer in this presentation?
The Analytics team - Who is on your team?
As a reminder from the earlier task - this includes: Andrew Fleming (Chief Technical Architect), 
Marcus Rompton (Senior Principle), and yourself!
Process - How did you complete your analysis?

Like i’ve said, it’s important to keep presentation simple and clean. Take a look at the annotated example to see how you could have visualized the data in the form of presentation the client wants to understand the top 5 content categories. Have a think about what are the best ways to present this? In a pie chart? In a bar chart?

There are also some other interesting insights. For example:

How many unique categories are there?
How many reactions are there to the most popular category?
What was the month with the most posts?  

It is now time to submit final presentation to the client.


PowerPoint Presentation- Accenture Final Presentation.pptx


https://app.powerbi.com/groups/me/reports/efa46ead-1f72-41a5-bee5-8d34fdfeb2ae/cc8f6a73f66c4e2bb90c?experience=power-bi


Screenshot Social buzz

Summary:

ANALYSIS Animals and science are the two most popular categories of content, showing that people enjoy "Real-life" and "factual" content the most.

INSIGHT Food is a common theme with the top 5 categories with "Healthy Eating" ranking the highest. This may give an indication to the audience within your user base. You could use this insight to create a campaign and work with healthy eating brands to boost user engagement.

NEXT STEPS This ad-hoc analysis is insightful, but it's time to take this analysis into large scale production for real-time understanding of your business. We can show you how to do this.