Find Jobs
Hire Freelancers

Categorizing 40,000 Wikipedia Articles

$250-750 USD

Closed
Posted about 5 years ago

$250-750 USD

Paid on delivery
Greetings. I have accumulated a collection of around ~40,000 random uncategorized Wikipedia article's URLs. I'd like to sort these URLs and assign them to its respected category. I have established some general parent-categories which I feel the articles should fall under. Architecture, Arts, Film and Music Communication, Education and Literature Companies and Organizations Economics and Finance Energy and Environment Food and Drink Geography and Places Health and Medicine Law and Politics Mathematics Media (Books, Movies and TV) People Philosophy, Religion and Spirituality Psychology Recreation and Sports Science and Technology Social Science (Anthropology, History and Sociology) These are just the parent categories; each article should then be sorted by its sub-categories as well (Example: in Mathematics - Probability, Geometry, etc.; in Geography - Cities, National Parks, Islands, etc.; in Religion - Buddhism, Judaism, etc.; in Technology - Networking, AI, etc.; in People - Business, Sports, Politics, etc.) The URLs are in a plain text format (.txt) and the output can be the same. ------------------------ Example Uncategorized: [login to view URL] [login to view URL](philosophy) [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] [login to view URL] ------------------------ Categorized: [Communication - Journalism] [login to view URL] [Communication - Literature] [login to view URL] [Companies - Financial] [login to view URL] [Companies - Technology] [login to view URL] [Companies - Transport] [login to view URL] [Finance - Foreign Exchange] [login to view URL] [Finance - Insurance] [login to view URL] [Finance - Options] [login to view URL] [Finance - Taxation] [login to view URL] [Health - Diseases] [login to view URL] [Health - Sleep] [login to view URL] [Geography - Parks] [login to view URL] [Geography - Salt Flats] [login to view URL] [Geography - Valleys] [login to view URL] [People - Artist] [login to view URL] [People - Businessmen] [login to view URL] [People - Philosopher] [login to view URL] [People - Politics] [login to view URL] [Philosophy] [login to view URL] [Philosophy - Concepts] [login to view URL](philosophy) [Religion - Buddhism] [login to view URL] [Religion - Hinduism] [login to view URL] ------------------------ The above example has to be applied to 40,000 URLs. Avoiding overcategorization is a must. Strive to keep the sub-categories broad. I came across a few links which may be useful: [login to view URL] [login to view URL] [login to view URL]
Project ID: 18827984

About the project

13 proposals
Remote project
Active 5 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
13 freelancers are bidding on average $624 USD for this job
User Avatar
HI, I have a big team. So i think this work will be easy for me. IF you are interested to work with me so message to me. Thanks.
$250 USD in 15 days
4.9 (353 reviews)
7.9
7.9
User Avatar
Hello, I have more than 10 years experience in machine learning, natural language processing, data mining and other AI related fields. I have worked on many previous similar projects and can do this project in a perfect way. My previous projects includes generating process diagrams starting from texts, Social networks data analysis systems, Recommendation systems, Opinion mining Systems, Bird sounds recognition system, Natural Language interface to a relational database (like SIRI), Question Answering Systems (in limited domain), chatbots ,..
$555 USD in 10 days
5.0 (23 reviews)
6.9
6.9
User Avatar
Hello sir I have 9 years of experience of web scraping. I have extremely knowledge of NLP and am very familiar with categorizing the wikipedia articles. I can make a python script to categorize the articles using the library. I am ready to start the work. Best Regards, Yongtao
$250 USD in 3 days
4.9 (44 reviews)
6.6
6.6
User Avatar
Hi Sir, I’m expert in categorisation and I have scraped wiki a lot of times. I can finish 40k urls into respective main and sub categories. Thanks
$750 USD in 6 days
4.7 (167 reviews)
6.9
6.9
User Avatar
Hi, I'm an active Wikipedia editor and aware of all Wikipedia's policies and terms of reference. Wikipedia is more than just throwing the pages up, it requires a proper strategy and methods to bring them live. I have contributed to Wikipedia a lot, almost 9 years and will make sure your article is successfully published and approved. Wikipedia is a critical platform with a set of rules that noobies doesn't know well, I will explain them to you and check your notability before taking the project. I have published for companies, artists, software etc. I'll provide lifetime assistance. We can discuss any question you have, my services are honest and reliable. You can read my profile feedback, all are Wikipedia jobs. Thank you.
$2,500 USD in 60 days
4.9 (22 reviews)
6.1
6.1
User Avatar
Hello, I can build a program to categorize all your articles. The results will be fast and accurate ! I know what I'm talking about, because I have experience in this field.
$555 USD in 10 days
5.0 (4 reviews)
3.6
3.6
User Avatar
Hi! I am an IT-consultant and a virtual assistant so this project would fit me perfect. I can start asap and work flexible hours according to your needs. Please contact me for further information and lets get this project started. I am a fast and hard worker.. You wont regret hire me, I will do my best to reach your goals. I am very fast with my computer typing and research so I will do this project flawless. Give me a chance to prove myself. Lets get this project started right now! Looking forward to work work with you:)
$250 USD in 10 days
5.0 (5 reviews)
2.9
2.9
User Avatar
Hello I would really like to work with you on this one if possible! I do have a couple of questions, but first I would like to make you an offer and some background so you can check my work out. I am a professional Internet Research, Virtual Assistant, Photoshop, MS Office, Data Entry Expert. Over the last 4 years, I have developed my skills and gained experiences in Internet Research, Virtual Assistant, Photoshop, MS Office, and Data Entry. However, I am almost new to here and looking to get a few clients that I can build upon. If you look at my work and feel that I could help you. You will get all the expected stuff like a great professional service and a fast turnaround, at a bit less, and I get a bit more exposure. If the above offer sounds like something you would be interested in, I would love to hear from you. Regards, Muhammed Kabir
$250 USD in 30 days
5.0 (9 reviews)
2.2
2.2
User Avatar
Hello, Hope you are doing well. Just wanted to share that i have a good hand with probability and statistics. I am fairly comfortable with Python & R to support variety of Data science and Statistical Analysis tasks. I would split this into two simple phases : 1. Approach identification and accuracy 2. Actual implementation Feel free to drop in over chat so that we can discuss more on this. Cheers, Akarsh
$250 USD in 14 days
3.4 (1 review)
1.2
1.2
User Avatar
HI there! My partner and I would love work on your project. I offer you a calssification article based on Machine Learning algorithms such as Random Forest Classifier with Natural Processiong Language form NLTK all this made in Python one of the most usable language for scraping and ML. We have a ML certified by Stanford and we have experienced in scraping, in fact, we scraped freelancer.com to get this ML projects just straight in our Data base! and classify the projects. I you want to classify al the urls, chat with me and discuss more details about it. You won't be regret it. Greetings from Mexico!
$700 USD in 13 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I will be happy to help you, 100% original work, has been working as an academic for the last 15 years. I deliver high standard and willing to carry any changes till you are fully satisfied.
$555 USD in 10 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of UNITED STATES
United States
0.0
0
Member since Dec 20, 2015

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.