Calculation of Lead Yield with rule-based classification
A game company wants to define new level-based customer profiles (personas) from a few customer attributes, group these personas into segments, and estimate how much revenue a new customer is likely to bring in on average, based on the segment they fall into.
The Persona.csv dataset contains the prices of products sold by an international game company and some demographic information about the users who bought them. Each record corresponds to one sales transaction, so the table is not deduplicated: a user with a given demographic profile may appear in more than one purchase.
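Because every row is a single sale, the same demographic profile can show up many times. A minimal sketch of counting those repeats, using only the Python standard library. The rows below are made up for illustration and are not taken from Persona.csv; only the column names follow the dataset description.

```python
import csv
import io
from collections import Counter

# Made-up sample in the Persona.csv column layout (illustrative values only).
SAMPLE = """PRICE,SOURCE,SEX,COUNTRY,AGE
39,android,male,bra,17
39,android,male,bra,17
49,ios,female,usa,25
29,android,male,bra,17
"""

# For the real file, open("Persona.csv", newline="") would replace the StringIO.
rows = list(csv.DictReader(io.StringIO(SAMPLE)))

# Count how many transactions share the same demographic profile.
profile_counts = Counter(
    (r["COUNTRY"], r["SOURCE"], r["SEX"], r["AGE"]) for r in rows
)

for profile, n in profile_counts.most_common():
    print(profile, n)
```

A count above 1 for a profile means its purchases have to be aggregated (for example, averaged) before the profile can be treated as one persona.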
Variables:
PRICE – Customer's spending amount
SOURCE – Type of device the customer connects from
SEX – Customer's gender
COUNTRY – Country of the customer
AGE – Customer's age
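One way the persona, segment and average-revenue steps could fit together is sketched below. It is only an illustration under stated assumptions: the age bands, the label format (COUNTRY_SOURCE_SEX_AGEBAND), the four segments A to D, and the helper names age_band, persona and expected_yield are all hypothetical choices, and the transactions are made-up rows rather than data from Persona.csv.

```python
from collections import defaultdict

# Made-up transactions in the Persona.csv column layout (illustrative values only).
transactions = [
    {"PRICE": 39, "SOURCE": "android", "SEX": "male", "COUNTRY": "bra", "AGE": 17},
    {"PRICE": 29, "SOURCE": "android", "SEX": "male", "COUNTRY": "bra", "AGE": 18},
    {"PRICE": 49, "SOURCE": "ios", "SEX": "female", "COUNTRY": "usa", "AGE": 25},
    {"PRICE": 59, "SOURCE": "ios", "SEX": "female", "COUNTRY": "usa", "AGE": 27},
    {"PRICE": 19, "SOURCE": "android", "SEX": "female", "COUNTRY": "tur", "AGE": 33},
    {"PRICE": 9, "SOURCE": "ios", "SEX": "male", "COUNTRY": "deu", "AGE": 45},
]

# Assumed age bands (inclusive upper bound, label); the dataset does not define them.
AGE_BANDS = [(18, "0_18"), (23, "19_23"), (30, "24_30"), (40, "31_40")]


def age_band(age):
    """Map an age to its assumed band label."""
    for upper, label in AGE_BANDS:
        if age <= upper:
            return label
    return "41_PLUS"


def persona(country, source, sex, age):
    """Build a level-based persona label such as BRA_ANDROID_MALE_0_18."""
    return "_".join([country, source, sex, age_band(age)]).upper()


# Collapse repeated transactions: collect prices per persona, then average them.
prices = defaultdict(list)
for t in transactions:
    key = persona(t["COUNTRY"], t["SOURCE"], t["SEX"], t["AGE"])
    prices[key].append(t["PRICE"])
mean_price = {p: sum(v) / len(v) for p, v in prices.items()}

# Rank personas by mean price and cut the ranking into four equal parts,
# A (highest) to D (lowest). This is one possible segmentation rule.
ranked = sorted(mean_price, key=mean_price.get, reverse=True)
segment = {p: "ABCD"[i * 4 // len(ranked)] for i, p in enumerate(ranked)}

# Average of the persona means inside each segment.
by_segment = defaultdict(list)
for p, s in segment.items():
    by_segment[s].append(mean_price[p])
segment_mean = {s: sum(v) / len(v) for s, v in by_segment.items()}


def expected_yield(country, source, sex, age):
    """Return (segment, average price) for a new customer, or None if the persona is unseen."""
    s = segment.get(persona(country, source, sex, age))
    return None if s is None else (s, segment_mean[s])


print(expected_yield("bra", "android", "male", 16))
```

The lookup is purely rule-based: a new customer is mapped to a persona label, the label to a segment, and the segment to its average price. A persona that never occurs in the data has no estimate, which the sketch signals by returning None.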