In the attached file there is a very detailed description of data I want to use and a about the methodology. If you find any better method to do it I am opened to it :)
The dataset is huge, I hope you have a sample dataset to check my work upon. Rest assured I worked in Machine Learning so I have experience in data proce