A Working Potential Client Detection Model and a 2nd-Place Finish in the Kaggle Data Science Competition
Provided Data Description
Two data sets were provided: a people file and an activity file, which could be joined on a shared person identifier.
All unique people and their respective people_ids were gathered in the people file.
All unique activities with corresponding activity_ids and activity characteristics were gathered in the activity file.
The activity file contained several types of activities, distinguishable by the number of known characteristics associated with each type.
Each activity had a corresponding yes/no field that defined the business value outcome: whether the person completed the outcome within a fixed period of time after performing that activity.
The people_id was used as the common key for joining the two files.
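The layout described above can be illustrated with a minimal pandas sketch. Everything in it is an assumption made for illustration: the toy rows, the column names (`person_attr`, `activity_type`, `char_1`, `char_2`, `char_10`, `outcome`), and the use of `people_id` as the name of the key column. It is not the competition data or the pipeline actually used.

```python
import pandas as pd

# Hypothetical people file: one row per unique person.
people = pd.DataFrame({
    "people_id": ["p1", "p2", "p3"],
    "person_attr": ["a", "b", "a"],
})

# Hypothetical activity file: one row per unique activity.
# Different activity types have different characteristics filled in.
activities = pd.DataFrame({
    "activity_id": ["act1", "act2", "act3", "act4"],
    "people_id": ["p1", "p1", "p2", "p3"],
    "activity_type": ["type 1", "type 2", "type 2", "type 1"],
    "char_1": ["x", None, None, "y"],
    "char_2": ["u", None, None, "v"],
    "char_10": [None, "k", "m", None],
    "outcome": [1, 0, 0, 1],  # the yes/no business value outcome
})

# Join each activity to its person. validate="many_to_one" raises an error
# if a people_id appears more than once in the people file.
merged = activities.merge(
    people, on="people_id", how="left", validate="many_to_one"
)

# Count the known (non-missing) characteristics per activity, then
# summarize per activity type to see how the types differ.
char_cols = ["char_1", "char_2", "char_10"]
known = activities[char_cols].notna().sum(axis=1)
known_by_type = known.groupby(activities["activity_type"]).max()

print(merged[["activity_id", "people_id", "person_attr", "outcome"]])
print(known_by_type)
```

A left join from the activity side keeps one row per activity, which matches the fact that the outcome is defined per activity rather than per person.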