
Analysis of coalesced hashing1

$10-30 USD

Completed
Posted over 9 years ago

Paid on delivery
Problem Definition: In this project you are to make a serious study of coalesced hashing. The technique is discussed in the paper “Implementations for Coalesced Hashing” by Jeffrey Scott Vitter, CACM, Dec 1982 (a link is on the web site). The paper divides a hash table into an address region and a cellar; the cellar stores records that collide when inserted. The paper indicates that near-optimal performance occurs at B = 0.86, where B is the ratio of the size of the address region to the size of the entire table. Your project is to write a simulation that supports this statement. Run your simulation for a variety of hash table sizes and B values, and draw graphs to support your work. Write up the project in Word with the graphs (as generated by PIL) embedded. Attach the source to this paper, add a header page, and staple. Turn in on the above date.

NOTES: Your main focus in this project is to obtain data that would allow you to draw a graph such as the one on page 925. We restrict the project to successful searching, measured by successful probe counts only, so don’t worry about unsuccessful searching. Once a table is loaded, it is easy to calculate the average probe count for that set of data; see Fig. 1(a) for an example. It would suffice to draw four curves on the same graph, one for each of the loadings 0.7, 0.8, 0.9, and 1.0. The graph on page 925 has a loading of 1.0. You will need to execute multiple runs over a range of address factors from, say, 0.4 to 1.0, in whatever step size you choose, as long as a minimum on the curve around 0.86 is visible. You only need to implement the basic algorithm, i.e., late insertion. Make enough runs that averaging them makes the curves reasonably smooth. As a final comment, note that you can use any address size you choose; it does not have to be a prime number, because we are not hashing real data that may be clustered.
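As a rough illustration of the table described above, here is a minimal Python sketch of late-insertion coalesced hashing with an address region and a cellar. The class name `CoalescedHashTable`, its method names, and the use of `-1` as an end-of-chain marker are assumptions made for this example and do not come from the paper or the project description. Empty slots are found by scanning downward from the top of the table, so the cellar fills first.

```python
class CoalescedHashTable:
    """Late-insertion coalesced hashing with a cellar (illustrative sketch)."""

    def __init__(self, table_size, address_size):
        if not 0 < address_size <= table_size:
            raise ValueError("need 0 < address_size <= table_size")
        self.table_size = table_size       # whole table: address region + cellar
        self.address_size = address_size   # slots reachable by the hash function
        self.keys = [None] * table_size
        self.links = [-1] * table_size     # -1 marks the end of a chain
        self.free = table_size - 1         # empty-slot pointer, scans downward
        self.count = 0

    def _hash(self, key):
        # Division method: key mod (size of the address region).
        return key % self.address_size

    def insert(self, key):
        i = self._hash(key)
        if self.keys[i] is None:           # home slot is empty: store directly
            self.keys[i] = key
            self.count += 1
            return
        while True:                        # walk to the end of the chain
            if self.keys[i] == key:
                return                     # key already present
            if self.links[i] == -1:
                break
            i = self.links[i]
        # Find the highest-numbered empty slot (the cellar is used up first).
        while self.free >= 0 and self.keys[self.free] is not None:
            self.free -= 1
        if self.free < 0:
            raise OverflowError("table is full")
        self.keys[self.free] = key
        self.links[i] = self.free          # late insertion: link at chain end
        self.count += 1

    def probes(self, key):
        """Number of slots examined by a successful search for key."""
        i = self._hash(key)
        n = 1
        while self.keys[i] != key:
            i = self.links[i]
            if i == -1:
                raise KeyError(key)
            n += 1
        return n

    def average_successful_probes(self):
        stored = [k for k in self.keys if k is not None]
        return sum(self.probes(k) for k in stored) / len(stored)
```

For example, with `table_size=10` and `address_size=7`, inserting the keys 0, 7, 14, and 1 puts 7 and 14 in the cellar on the chain that starts at slot 0, so the probe counts are 1, 2, 3, and 1, and `average_successful_probes()` returns 1.75.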
We are loading the table with randomly generated data, so placement in the table is properly spread out. You can use the usual division method discussed in the overheads, i.e., n mod m, where m is the size of the address region. This makes it easy to select the address region size for a specific array size. Let me say this again: you pick an array size, say 1000, and then use a variety of address region sizes in that array to collect data. That is, m’ is assumed to be constant for data collection. If you change m’, the data collected for each value should be shown on different graphs. I also expect the project to be written in Python or C++, with the hash table and its associated operations placed in a Python class or C++ class. If you are using Python, you may want to use numpy for your arrays.
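The data collection described here (fixed array size, varying address region size, several loadings, averaged runs) could be sketched roughly as follows. The function names `avg_probes` and `sweep`, the step of 0.02 in the address factor, and the default of 20 runs are assumptions chosen for illustration. For brevity this sketch uses plain lists and a function rather than the class the project asks for, and it counts each record's probes at insertion time, which matches the later successful-search cost because late insertion only ever appends to the end of a chain.

```python
import random


def avg_probes(table_size, address_size, n_records, rng):
    """Load n_records distinct random keys; return the mean successful probe count."""
    if n_records > table_size:
        raise ValueError("cannot store more records than slots")
    occupied = [False] * table_size
    link = [-1] * table_size            # -1 marks the end of a chain
    free = table_size - 1               # empty-slot pointer, scans downward
    total = 0
    for key in rng.sample(range(10**9), n_records):
        i = key % address_size          # division method: n mod m
        probes = 1
        if occupied[i]:
            while link[i] != -1:        # walk to the end of the chain
                i = link[i]
                probes += 1
            while occupied[free]:       # highest empty slot; cellar fills first
                free -= 1
            link[i] = free              # late insertion: append to the chain
            i = free
            probes += 1
        occupied[i] = True
        total += probes
    return total / n_records


def sweep(table_size=1000, loads=(0.7, 0.8, 0.9, 1.0), runs=20, seed=0):
    """Return {load: [(address_factor, mean_probes), ...]} for one table size."""
    rng = random.Random(seed)
    results = {}
    for load in loads:
        n_records = round(load * table_size)
        curve = []
        for pct in range(40, 101, 2):   # address factors 0.40, 0.42, ..., 1.00
            address_size = table_size * pct // 100
            mean = sum(avg_probes(table_size, address_size, n_records, rng)
                       for _ in range(runs)) / runs
            curve.append((pct / 100, mean))
        results[load] = curve
    return results
```

Calling `sweep(table_size=1000)` returns one list of (address factor, mean probes) points per loading, which can then be plotted as four curves on one graph. A different array size would be a separate call and a separate graph.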
Project ID: 6736701

About the project

3 proposals
Remote project
Active 9 yrs ago

Awarded to:
Hi, I've made a simulation of the algorithm that's described in the paper. It's written in Python 3. I just noticed (as mentioned in the chat) that it's not quite producing correct results, so I still need to tweak it to see if I can find the reason. I'll keep you updated. EDIT: The bid includes providing commented source code, simulation results and graphs by Sunday morning (EST time). EDIT2: The software and results are now ready. I'm unable to update the milestone, so this bid offer will override whatever it currently says in the milestone. Best regards, Matti
$30 USD in 1 day
0.0 (0 reviews)
0.0
0.0
3 freelancers are bidding on average $72 USD for this job
A proposal has not yet been provided
$157 USD in 2 days
5.0 (13 reviews)
4.5
4.5
I have more than 2 years of experience in python and C++ programming. Thanks. Sriram
$30 USD in 2 days
0.0 (0 reviews)
0.0
0.0
Hi, I tried to send you a message, but it did not go through because the project was cancelled by you or the Freelancer team. Yes, I can make the graphs too.
$50 USD in 1 day
0.0 (0 reviews)
0.0
0.0

About the client

texas, United States
4.9
4
Payment method verified
Member since Nov 12, 2014