Natural Language Processing (NLP) Project: Word2vec conversion for Text Data

Zaprto Objavljeno pred 5 letoma/leti Plačilo ob prevzemu
Zaprto Plačilo ob prevzemu

Require a python developer with experience in NLP to code/develop a module that vectorizes text data and generates different clusters. This task can be subdivided into following tasks:

1) Extract Text data from solr database or from a set of csv files

2) Preprocess text data which includes

stop word removal

lower case conversion

stemming

lemmatization

Coder can add preprocessing based on their experience.

You are allowed to use standard NLTK libraries

3) Create word embedding models for the given dataset.

Open to explore different NLP techniques

4) Vectorize different documents by using many different techniques:

A). TFiDF

B). Word embedding (Use model developed in step 3)

C). Frequency Based

5) Cluster and visualization:

A). cluster the document vectors

B). Develop different visualization tools such as t-SNE

6) Topic model all clusters

Additional Notes:-

Each of these subtasks should be written in object-oriented fashion.

The code should be flexible such that the user can try different options, for example a user may choose not to stem the words in step 2 above.

Similarly, each subtask should offer flexibility in the overall structure.

We are also open to other techniques to complete this task.

Results of clustering will be validated by an expert in the field

This module is part of a larger project that require continual development. Looking for a developer interested in forming a longer time relation for work. Expectations: Professional code with comments , Object oriented programming for each subtask is a must

C++ programiranje Podatkovno rudarjenje Machine Learning (ML) Python Arhitektura porgramske opreme

ID projekta: #17727078

Več o projektu

19 predlogov Oddaljen projekt Aktiven pred 5 letoma/leti

19 freelancerjev ponuja v povprečju za $108 na tem delu

hbxfnzwpf

I am very proficient in c and c++. I have 16 years c++ developing experience now, and have worked for more than 7 years. My work is online game developing, and mainly focus on server side, using c++ under Linux environ Več

$100 USD v 2 dneh
(153 ocen)
7.1
alkarajput3131

I am a data scientist and am proficient at implementing machine learning models and deep learning based models, both in R and python. I also am familiar with web-scrapping using python. I have good command over variou Več

$120 USD v 3 dneh
(137 ocen)
6.4
invincible1428

Hello, Greetings of the day.!! I have worked on text mining projects. I know all the terminlogies of text mining I have hands on exp of tm packages of R which uses for text mining also python packages like ge Več

$25 USD v 1 dnevu
(65 ocen)
6.0
ach5a60adf39e644

Hi, I'm working in machine learning filed for few years and I really like to explore different fields. Right now I'm working in NLP projects only and exploring word2vec and other word embedding techniques. I'm accustom Več

$30 USD v 1 dnevu
(27 ocen)
5.7
DarkKnight2206

Hello!\nI am a python developer.\nI looked at your project and it seems interesting.\nI have all necessary skills required for this project.\nPing me to discuss in detail.

$30 USD v 2 dneh
(40 ocen)
5.6
Techiedev

I have good experiences developing code in Python Programming language for scientific computing , data science , machine learning and deep learning. I ve done some NLP projects with NLTK library and also some machin Več

$28 USD v 1 dnevu
(34 ocen)
5.0
raghavajay3

A dedicated analyst, proficient in running successful method-oriented operations & taking initiatives for business excellence. Specialized in creating Predictive modelling of data using supervised (Machine Learning) an Več

$55 USD v 2 dneh
(39 ocen)
4.8
d3bd33p

I've been working with ML past 5years and your requirement is a piece of cake for me. Let me know about this project in further detail. Best, Debdeep

$35 USD v 1 dnevu
(6 ocen)
4.1
origami07

Hello, I am a python developer. I must have a milestone created and complete specifications sent before i can do anything at all. Kind Regards

$61 USD v 3 dneh
(12 ocen)
3.8
Dvskhamele

I am expert in NLP ( NLTK, Word2vec, tasks with applying machine learning / neural networks ), where done several of projects and looking for a long term project on freelancer. Please check my profile.

$25 USD v 1 dnevu
(2 ocen)
0.3
intellosid

I am a Computer Engineering student in India having a science background, know programming languages (python and java) and can build algorithms for most of the work to automate it to take less time.

$25 USD v 1 dnevu
(0 ocen)
0.0
Hillary2018

I am ready for work Relevant Skills and Experience I can do it. And I've been doing it.

$25 USD v 1 dnevu
(0 ocen)
0.0
anumbilal2893

I have recently taken data mining and big data courses. I have experience of doing data pre-processing and finding patterns. I have done it in both Python as well as R studio.

$25 USD v 1 dnevu
(0 ocen)
0.0