App for cleaning offline articles.

Completed Posted Jan 25, 2014 Paid on delivery
Completed Paid on delivery

I have a lot of news articles that i've saved to my hard disc.

I need something to...

1) clean them down to only keeping the

1a: main text itself

1b: author

1c: date published

1d: source newspaper

Thus removing all scripts and stuff)

See attached examples of original and cleaned article.

2) i also need it to find the source on the internet, so it finds and adds the web location of the text. Perhaps this can sometimes find the source newspaper.

I want to keep the "one article in one file" (so it should be an option to just save with the same file name) but how does one then add info to it? Perhaps adding the source web link? Or should it be a database????

How can this be done? What would it cost?

C++ Programming Data Processing HTML

Project ID: #5357967

About the project

6 proposals Remote project Active Jan 26, 2014

Awarded to:

alquarizm

I have written about 300 code in c++ and have good knowledge in this domain. you can through these links. [login to view URL] [login to view URL] This show More

$50 USD in 3 days
(9 Reviews)
3.7

6 freelancers are bidding on average $259 for this job

miraclesolution

A proposal has not yet been provided

$210 USD in 10 days
(22 Reviews)
5.6
denissilanov

Hello! I have 8 years experience in programming. I can make this project for you. I understand your requirements about cleaning articles, but please tell me more about finding a web location and adding information to a More

$200 USD in 10 days
(8 Reviews)
4.6
anuyadav1

it is possible to clean data in these 4 ways . .

$150 USD in 3 days
(9 Reviews)
4.3