THE CODE USED IN THIS PROJECT IS AVAILABLE ON GITHUB HERE.

This data analysis explains the steps I took to solve a take-home assessment I was offered as part of the interview pipeline for a Data Consultant gig at YipitData, a company that analyzes data to provide accurate, granular insights to over 480 investment funds and innovative companies, thereby assisting them in making better business decisions.

Note: I did not use any libraries other than Pandas to perform this task since that would render this exercise pointless.

Prompt:

At YipitData, we collect and analyze unique data sets that allow us to develop detailed insights about many companies. Through these insights we provide competitive intelligence and market research that allows our clients, to better understand the markets in which they are competing, and their own performance therein.

For the purpose of this exercise, consider that one of the companies we cover is Groupon. For Groupon, the main metric we track is called gross billings. Every quarter, Groupon reports gross billings in their financial statements. We use our proprietary data to estimate gross billings across different segments, in order for Groupon’s competitors to benchmark their own performance as well as for strategic insights. Pretending it is January 2014, before Groupon reports 4Q13 earnings, your goal is to use the attached data file to estimate Groupon’s 4Q13 North America gross billings by segment (Local, Travel, and Goods). Additionally, please describe the performance of each segment. The data is based on real data we collected for Groupon in 2013, and you will need to overcome real challenges we faced back then in order to arrive at your estimate. Remember, we are pretending it is January 2014, so you can’t use any information that is after January 2014.

Background Information:

My approach to solving this problem