Hello, I need help in crawling a website’s particular link recursively (ASP pages). There is a table on each page which needs to be parsed and dumped into csv/excel along with the hierarchy information.
I need the scripts in Python. You can use scrapy/selenium + beautifulsoup. I would need the script along with documentation for the key sections.
Hierarchy structure will be as per below.
Level1: 12 <static links>
Level2 (within Level1): Within each static ~40 to 80 <ASP post calls>links
Level 3 (within each Level2): ~50 links <ASP post calls>
Level 4( within each level3): ~50 links <ASP Post calls
On Each page there will be a table with
b) Sub header
c) 8 to 9 columns < this needs to be identified>
Each of a/b/c needs to be dumped in a csv/excel. Further since there are recursive calls, hence the recursion levels also need to captured in columns in the csv <for recreating the data hierarchy>.
Let me know if you are interested, time frame and cost/charges for doing the complete project.
Website link will be shared post initial interest phase.
There will be followup projects in scraping post this initial project.
12 freelancers are bidding on average ₹7847 for this job
Hello sir, I am a professional web scraper in python and I am very interested in your project or any future projects you would have. My rate is 15$ per hour and I estimate this project would take me around 2 days.