This project received 15 bids from talented freelancers with an average bid price of $193 USD.Get free quotes for a project like this
Project Budget$30 - $250 USD
Deadline: Thursday 10/ Nov/2016
Using python, pandas, numpy and scikit learn.
For visualizations, you will not need anything more complex than scatter-plots, histograms or line plots. You will provide a single ipython notebook that contains the code for all the answers. Use a separate tab for each question. For each task, also write your appropriate answers in a .txt, .doc or .pdf and submit this along with your code.
1. I have provided you with a dataset called data1. It contains a train and test dataset. Use a suitable method to predict the “Value” given the features (there are 100 features) (there are a number of redundancies in the features). Evaluate and present your results using an appropriate error measure.
2. I have provided you with two datasets in data2.zip. For each dataset:
a. Analyze the data using an appropriate visualization
b. Use an appropriate method to cluster similar data-points together. Justify why you
picked the specific method for each dataset.
c. Output the clustered points using an appropriate visualization.
Browse Related Skills
Other things people do on Freelancer
Looking to make some money?
- Set your budget and the timeframe
- Outline your proposal
- Get paid for your work
Hire Freelancers who also bid on this project
Looking for work?
Work on projects like this and make money from home!Sign Up Now
- The New York Times
- Wall Street Journal
- Times Online