Parse HTML metadata, images, headings, rich snippets to JSON (Node or Go OK)
$30-250 USD
Paid on delivery
For this project we need a program that takes a URL and returns the following as JSON. Every field needs a key/value pair in the JSON, for example meta_description: "the page description". We will provide sample JSON of the expected output to the winning bidder.
You may write the program in Node.js or Golang.
title
description
language
headings (h1, h2, h3, h4, h5, h6)
total image quantity
images with missing alt attributes
quantity of links
quantity of internal links
quantity of external links
whether the HTML is valid (true/false)
whether [login to view URL] exists
whether [login to view URL] is valid
favicon paths
hreflang paths
open graph meta tags (all in page)
Project ID: #24051663
About the project
Awarded to:
I can make this simple page quality checker in node.js or PHP. My average project completion time is within 3-5 hours on the same day. The skills I have include PHP, HTML5, CSS3, JavaScript, jQuery, WordPress Themes & More
9 freelancers are bidding on average $125 for this job
Hello there. I have read your job description carefully and fully understand your requirements. You want a project where the user can input a URL and get information about that URL in JSON format. I am a senior N...
Hi, I'm really experienced in scraping tool development, as I've worked on a lot of similar projects, and I'm available to start immediately. It's doable within the next few hours for sure. Contact me to discuss it in detai...
Hi, I am a Java programmer good at data scraping. I have done many similar projects before. I can finish your task quickly and correctly. I am ready to work now.
Hey, I'm interested in your project. I am an expert in programming in almost all languages. In particular, I have rich experience in Node.js. High quality, clean and well-structured code, 100% responsive (desktop and mobile...
I have built MANY web scrapers such as the one you describe. I would use a headless Chrome system like Selenium to quickly grab ALL the page HTML and whatever else, then use powerful REGEX parsers to extract the data...