Five Steps to Becoming a Better Data Analyst

简介: In this post, a data analyst from wangjubao.com, a leading data-marketing firm in China, shares some advice on making better data insights.

Introduction

Data analysis has become a must for many professions. In this post, a data analyst from wangjubao.com, a leading data-marketing firm in China, shares some advice on making better data insights. We will look at five key elements in user data analysis to help you advance your operation performance.

1) Analyzing the Proportion of New and Returning Users

A key data that is often disregarded is the ration of new and returning customers. This data helps us analyze the effectiveness of marketing campaigns, as well as understanding the behavior of customers based on trends. In the plot below, we can see a large discrepancy between the number of new and returning customers within the period of 2015/01/14 to 2015/01/16.

1

Horizontal axis (time range): Payment dates (in January 2015)
Vertical axis (Number of members): Number of members
Compared dimensions:
1.The red trend line represents returning customers, which have shopped more than twice
2.The blue trend line represents new customers, which have only shopped once
Analysis:
The following is the derivative of the data analysis in the scenario mentioned above:
During the period from December 2015 to January 2016, the number of returning customers is increasing, while that of new customers is dropping. Why?
Train of thought:
1.Is it due to poor attraction campaigns for new customers?
2.Is it because the Double 11 and Double 12 shopping carnivals have exhausted the market?

2) Analyzing Repurchase Rates by Members

Similar to the first trend, in this section, we analyze the differences between new and returning customers. However, the difference is that we are focused on the actual amount of payment made. As shown in the table below, 86.46% of payment amount originate from new members.

2

Horizontal axis dimension: Number of members (percentages), sum of payments (percentages)
Vertical axis dimension: Total number of purchases (1/2/3/4/5/6/7)
Time range: January 1, 2014 - February 1, 2016
Analysis:
From the data, we can see that the proportion of customers who have made one purchase is 63.79% and their payment amount accounts for 86%. The per capita contribution of new customers is greater than that of old customers.
Is this data trend positive or negative? Why do customers display this behavior?
Train of thought:
1.Do the products or prices attract the new customers?
2.Why is there a low repurchase rate by old customers? Is it because of the product features or the experience and policy considerations of the old customers?

3) Analysis of Repurchase Rates for Various Products

For this section, we take the idea from the previous section a step further. We analyze the repurchase rates based on specific products. In the table, we can clearly see that Product B is typically bought by new users.

3

Horizontal axis dimension: Number of members (percentages)
Vertical axis dimension: Total number of purchases (1/2/3/4/5/6/7)
Compared dimensions: Categories
Time range: January 1, 2014 - February 1, 2016
Analysis:
Insights into the data showed that the repurchase rate of buyers of product A is higher than that of products B and C. Different product categories have different repurchase rates. However, if there is a big difference between similar categories, what is the reason? Is it due to the characteristics of the product itself or the operational strategy?
Train of thought:
1.Is it due to the product display arrangements?
2.If member buyers of product B are recommended with a combination (including products B and A), the user will probably buy the combination when he or she returns to repurchase product B, and hence become a user of product A – the high repurchased category. As a result, it improves their repurchase rate.

4) Comparison of Product Purchase Frequency Rankings

Similar to the previous section, we analyze the purchase frequency for specific products. The goal here however is to determine a product's suitability based on the customers' journey.

4

Horizontal axis dimension: Product name
Vertical axis dimension: Number of members (percentages)
Compared dimensions: Total number of purchases (1/2/3/4 and above)
Time range: November 11, 2016 - November 11, 2016
Analysis:
The data allows us to gain insight into our customers' preferences. Which products are suitable for new customers? Which products are suitable recommendations to return customers to achieve a higher conversion? What products do regular customers favor? Based on the results of above analysis, we can adjust the product display layout in the shop and develop marketing and recommendation strategies tailored for different groups of people.
Train of thought:
1.Is it because of the product display layout in the shop?
2.Does the marketing strategy need to be adjusted?

5) Rankings of Products for Members of Various Activity Levels

Last but not least, understanding customer activity is an important measure that should not be overlooked. This data can help us understand the effectiveness of marketing efforts, as well as determining the optimal launch dates of campaigns.

5

Horizontal axis dimension: Product name
Vertical axis dimension: Number of members (percentages)
Categorization condition: The time difference between the payment date and the current time is respectively: 0 – 30 days for active; 30 – 60 days for silent; 60 – 90 days for sleeping; and 90 - 120 days for lost customers.
Analysis:
What products do customers in different degrees of activity buy? Let us take the "sleeping" customers for example; is their last payment time just before the Double 11? That is, the products became "sleeping" after product purchase, and the Double 11 failed to wake them up. Are the last purchased products to blame for the customer churn?
Train of thought:
1.Adjust the product display layout in the shop.
2.Adjust the marketing strategies for different groups of people.
3.Consider eliminating the "sleep-triggering" product.
4.Study the rankings of the goods purchased by members in various cities and rankings of products at various prices purchased by members among other data to develop precise marketing strategies.

What is Panoramic Insight?

Panoramic insight, also known as data visualization, allows for the decentralization of enterprise data assets to middle-level and first-line business personnel. Visualized operations enable non-technical staff to conveniently and quickly carry out business data exploration to form a closed exploration-execution-exploration business loop by connecting the exploration and business execution processes to assist business personnel in business growth.

Differences between Panoramic Insight and Enterprise BI

Panoramic Insight

Enterprise BI

High usage frequency Low usage frequency
Execution operation level High-level decision-making
Online analysis T+N
Coupling capability with the business Decision-making support
Convenient visualized operations Technical support required

Conclusion

Enterprise BI still plays an important role in providing strategic decision-making support to top executives. Panorama insight into cloud computing-based data and enterprise BI can complement each other to help companies tap value hidden in data. Since 2016, many companies have applied data visualization to daily business segments. Data visualization is an effective way for enterprises to stay well informed on the value of data assets.

目录
相关文章
Warning: Can save best model only with val_acc available, skipping
本文解决了在使用DenseNet网络结构保存最优模型时出现的"Warning: Can save best model only with val_acc available, skipping"问题,方法是将`ModelCheckpoint`回调函数中的`monitor`参数值从`val_acc`改为`val_accuracy`。
|
6月前
|
机器学习/深度学习 人工智能
【CatBoost报错解决】CatBoostError: Bad value for num feature[non default doc idx=0,feature idx=19]=
【CatBoost报错解决】CatBoostError: Bad value for num feature[non default doc idx=0,feature idx=19]=
Expected more than 1 value per channel when training, got input size torch.Size
因为模型中用了batchnomolization,训练中用batch训练的时候当前batch恰好只含一个sample,而由于BatchNorm操作需要多于一个数据计算平均值,因此造成该错误。
902 0
|
JSON 数据格式
ValueError: With n_samples=0, test_size=0.15 and train_size=None, the resulting train set will be em
ValueError: With n_samples=0, test_size=0.15 and train_size=None, the resulting train set will be em
515 0
ValueError: With n_samples=0, test_size=0.15 and train_size=None, the resulting train set will be em
|
PyTorch 算法框架/工具
Please ensure they have the same size. return F.mse_loss(input, target, reduction=self.reduction) 怎么解决?
这个通常是由于 input 和 target 张量的维度不匹配导致的,因此可以通过调整它们的维度来解决。
312 0
The SqQueue for Sequential queue (Continuous updates) | Data
The code in Data book (5th Edition) from the 99 page to 100 page
98 0
成功解决ValueError: With n_samples=0, test_size=0.3 and train_size=None, the resulting train set will be
成功解决ValueError: With n_samples=0, test_size=0.3 and train_size=None, the resulting train set will be
成功解决ValueError: With n_samples=0, test_size=0.3 and train_size=None, the resulting train set will be
成功解决_catboost.CatBoostError: Invalid cat_features[4] = 8 value: index must be < 8.
成功解决_catboost.CatBoostError: Invalid cat_features[4] = 8 value: index must be < 8.
|
SQL 关系型数据库 RDS
9 Easy Steps for Successful Data Migration
Migrate your database without service interruptions with Alibaba Cloud Data Transmission Service
1984 0
9 Easy Steps for Successful Data Migration
|
开发工具
R-Organize Data(step 2)
R is a data analysis and visualization platform.
936 0