Find Jobs
Hire Freelancers

Algorithm Optimization 3

$250-750 USD

Cancelled
Posted almost 10 years ago

$250-750 USD

Paid on delivery
Objectives • We will provide one datasets with one target variable (“Score”), a timestamp and 24 independent variables. The dataset contains ~55 thousand observations (however your solution should be scalable to accommodate a much larger dataset). • The goal is to write at most 6 sets of “greater than” and “less than” restrictions on the independent variables. Each set of restrictions will return a subsample of the dataset on which we evaluate an objective function. • Specifically, the objective function is the sum of the target of the observations in the selected subsample. Each query (set of restrictions) has to return at least 10 valid responses. • In addition, any observations that come less than 60 seconds after a valid observation in this subsample will be removed. So each query has to return at least 10 responses that are 60 seconds apart from each other. • In other words, your goal in this project is cornering up to 6 regions of the dataset using intervals on the independent variables, and maximize the density of positive values of the target. Logistics: • You can find the dataset in the Excel file “[login to view URL]”. • You will see some variables have version A or version B (for instance W2 R2). In such cases you can use either one or the other, not both. • Your restrictions cannot be applied using a higher number of decimal places than occur in the observations. For instance a restriction to W1R1 cannot be 0.015, it must be either 0.01 or 0.02. Tips: • Regression analysis, Neural Networks, SVM, and K-clusters will not help you much. These methods classify observations by applying a weighted average of the independent variables. The classification rule has to be on the independent variables directly, cannot be on a weighted average of them or any other function. • Make sure to order the timestamp chronologically. • A start could be plotting the density of the independent variables for the subsample of positive target values and for the subsample of negative target values. Then you can identify regions with a high density of positive target observations. Reward/Milestones: • All accepted bids will be awarded on completion • We are going to judge the performance of each bid both inside the sample (milestone 1) and outside the sample (milestone 2). A good performance consists in a high aggregate sum of the target variable. • After this stage we will ask you provide details of how you would maintain the existing algorithms (milestone 3) over a much larger data set, ~500 thousand observations.
Project ID: 5995085

About the project

10 proposals
Remote project
Active 10 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
10 freelancers are bidding on average $688 USD for this job
User Avatar
I am expert in matlab vectorization techniqu. Using that if we deal with large data set, we actually need less time like fraction of second but if you write the same program using loops then in that case for large data set it may require even 20-30 minute depending or ur data set. I have experirence in doing such works. So u can definitely rely on me regarding your project. Regards
$666 USD in 10 days
5.0 (52 reviews)
5.5
5.5
User Avatar
A proposal has not yet been provided
$998 USD in 10 days
5.0 (2 reviews)
4.8
4.8
User Avatar
Hi, very interesting project. Could you send me the dataset? I'd code this in R if you don't mind. Best regards, marcin
$400 USD in 7 days
4.8 (13 reviews)
4.4
4.4
User Avatar
Hi, I know the previous project was granted to someone else and since you said you need several ones to work in parallel, hence I would like to bid for this project. Please let me know specifically which task is expected from you. Thank you.
$388 USD in 10 days
4.7 (5 reviews)
4.1
4.1
User Avatar
HI Brother, I am Data Scientist working in Multinational Company. My work is to see the hidden pattern in the large and complex data sets and predictive analytics, Data mining,Machine Learning and also uses the statistical techniques on those datasets to see what data behaves in past. By seeing the past what will be the behavior the that data set in future. I am also working on "kaggle" competitions currently .I am also writing the column on All Analytics and my rank on All analytics in Statistician. I am also working on the Time Series analysis using ARIMA, SARIMA models and also working on econometric analsysi using Linear, Nonlinear, Logistic, Probit models, hypothesis testing and Cox regression for the survival analysis and also survey analysis etc. I am also the writer of the various research papers and one of them is relating medical data you can sight my paper at IEEEXPLORE. I am also using R, SPSS Statistics, Spss Modler, Stata, SAS, MATLAB, WEKA, Statistica and Pyhthon for Data Analysis for Machine Learning, Data Mining and Data Science. Thanks Regards, Muhammad Shoaib. Data Scientist
$555 USD in 10 days
4.9 (7 reviews)
3.2
3.2
User Avatar
I am a Subject Matter Expert in Mathematics, Statistics, Computer Science and Physics, and a SEO search engine optimization specialist. I worked as Matlab and Statistics Consultant for several years for many companies. I undertake SEO projects for websites. I have a Masters Degree in Mathematics from Purdue University, USA. I also provide tutoring, training, exam preparation, and assignment solution help in advanced college level Mathematics, Statistics, Physics and Computer Science courses.
$666 USD in 10 days
5.0 (2 reviews)
3.2
3.2
User Avatar
Hi, I have more than 14 years of exp and I am expert in this kind of work. I have completed more than 210 projects. Please look at the feedback left by my employers to know more about my work. Waiting for your positive response. Thanks.
$500 USD in 20 days
3.6 (1 review)
3.8
3.8
User Avatar
A proposal has not yet been provided
$750 USD in 10 days
5.0 (3 reviews)
2.8
2.8

About the client

Flag of UNITED KINGDOM
Birmingham, United Kingdom
4.9
26
Payment method verified
Member since Apr 20, 2009

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.