I need help with a NLP Project. I would prefer for it to be done on Pyspark, python, preferrably on Databricks. But let me know if you prefer something else but it must be on python, and preferrably on pyspark framework. It is involving wine review/description data set. I will provide the data set to you in csv. It is about 130,000 entries. After data cleanup and pre-processing, it will be about 80,000 entries for you to do the machine learning on.
I have the codes/algorithm for it. Just need someone to execute it.
There are two parts to it:
Part I: NLP Deep Learning
I have 2 ideas in mind for this Part I. Let me know which one you can help with.
The 2 ideas are:
1. Predict if the score of the wine is above average or not, based on the description from the wine critics.
Reference Code (scroll down to the bottom of the webpage): [login to view URL]@cdabakoglu/wine-reviews-visualization-and-natural-language-process-nlp-34d650365f1f
2. Predict the variety of wine based on the wine's description from the wine critics.
[login to view URL]
Part II: NLP Clustering
Alongside one of the 2 ideas above, I would also need you to do this:
Create clusters/groups of wines based on description.
Reference Code: [login to view URL]
26 freelancers are bidding on average $667 for this job
Hi, I have worked on NLP & ML projects like sentiment analysis & smart home energy usage pattern clustering. I would like to work on your project. Let me know if you want to discuss further. Regards, Monir