Completed

Peyton Data Mining

You are going to read some text files and classify them according to their labels. The Reuters corpus is one of the most famous datasets for text categorization tasks. We provide a subset of this dataset on Brightspace. Use these files to build your classifier. More information about this dataset is available at [login to view URL]

1- Download the zip file and extract it. Note that this data is a subset of the full Reuters corpus, so that you can process it without needing a powerful server.

2- Each file contains some XML files. Explore the XML files and compile a list of all the fields available in them.

3- Write a function that extracts a pandas DataFrame containing: (1) headline, (2) text, (3) bip:topics, (4) [login to view URL], (5) itemid, (6) XMLfilename

4- Write a Python function to find all the possible values for bip:topics. Note that each news item can belong to more than one topic.

5- Write a function to prepare your text data using methods such as removing stop words. You are allowed to use the NLTK library.

6- Extract features from the text using any approach you like. Write a function that takes the DataFrame from step 3 as input and generates a new DataFrame of your features and labels.

7- Divide your data into a training set and a test set. You can use any method, such as cross-validation. You need to justify your choice here.

8- Write a function that takes the DataFrame from step 6 and a set of parameters, and returns a classifier trained to predict all the labels you found in step 4.

9- Write a function to evaluate the quality of your classifier (for example accuracy, F-score, AUC, ...). Explain why you think this function is the best choice.

10- Generate five different classifiers (Random Forest, Decision Tree, Linear Regression, Neural Network, and SVM) using step 8. Tune each one to find its best parameters. Identify the best classifier and explain why.
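
As a rough illustration of steps 2 to 4, the sketch below parses one news item with Python's standard `xml.etree.ElementTree` module. It assumes a layout that is not confirmed by this posting: a `newsitem` root carrying an `itemid` attribute, a `headline` element, a `text` element made of `p` paragraphs, and `codes` elements whose `class` attribute starts with `bip:topics`. The sample document, the file name `sample.xml`, and the helper names `list_fields`, `parse_newsitem`, and `all_topics` are hypothetical. Field (4) is left out because its name is not visible in the description.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample in the ASSUMED layout; the real files may differ.
SAMPLE_XML = """<newsitem itemid="2330">
  <headline>Tylan stock jumps</headline>
  <text><p>First paragraph.</p><p>Second paragraph.</p></text>
  <metadata>
    <codes class="bip:countries:1.0"><code code="USA"/></codes>
    <codes class="bip:topics:1.0"><code code="C15"/><code code="C152"/></codes>
  </metadata>
</newsitem>"""


def list_fields(root):
    """Step 2: return the sorted set of tag names found under root."""
    return sorted({element.tag for element in root.iter()})


def parse_newsitem(xml_string, filename):
    """Step 3: flatten one news item into a record (field 4 omitted)."""
    root = ET.fromstring(xml_string)
    text_node = root.find("text")
    paragraphs = []
    if text_node is not None:
        paragraphs = ["".join(p.itertext()).strip() for p in text_node.iter("p")]
    # Keep only the codes listed under a bip:topics block (assumed naming).
    topics = [
        code.get("code")
        for codes in root.iter("codes")
        if codes.get("class", "").startswith("bip:topics")
        for code in codes.iter("code")
    ]
    return {
        "headline": (root.findtext("headline") or "").strip(),
        "text": "\n".join(paragraphs),
        "bip:topics": topics,  # a list, since an item can have several topics
        "itemid": root.get("itemid"),
        "XMLfilename": filename,
    }


def all_topics(records):
    """Step 4: collect every distinct topic code across all records."""
    return sorted({topic for record in records for topic in record["bip:topics"]})


records = [parse_newsitem(SAMPLE_XML, "sample.xml")]
fields = list_fields(ET.fromstring(SAMPLE_XML))
topics = all_topics(records)
```

If pandas is installed, passing `records` to `pandas.DataFrame` should give the table asked for in step 3. Storing the topics as a list per row keeps the multi-label nature of the data visible for the later steps.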

Skills: Python, Data Mining, Information Systems Architecture, Data Processing, XML

About the employer:
(0 reviews) Middle Sackville, Canada

Project ID: #21831994

Awarded to:

Zohaib748

Hello Dear! Alert: I will give you a 20% discount on my bid rate and on all my services. Grab this special offer, as it is limited. Let's get to the point. I came to know that you are looking for a developer which ...

$131 CAD in 3 days
(4 reviews)
2.4

17 freelancers are bidding an average of $177 for this job

DevStar925

Hi, I read your project description and I am interested in your job. As you can see from my profile, I am a full-time developer and have just completed many projects. In particular, I have top skills in C/C++, C#, Java, Py...

$200 CAD in 2 days
(69 reviews)
7.4
polarjin2017

Hello? How are you? I am excited to work with you on this project. I have done a lot of jobs with Python, like Django admin, Flask, Python scraping, pysql, Python tkinter GUI, etc. Here is one of my scrap with python wor...

$155 CAD in 3 days
(130 reviews)
7.0
yongjin818

Dear, As an expert in Python, I have developed many scripts and applications using Python, PyQt, wxpython, and tkinter. I developed FlightPlanner and Wamdam database management using Python and wxpython. My recent work: ...

$140 CAD in 7 days
(59 reviews)
5.9
smsaurabhv

Hi, I have gone through your requirement to scrape lots of websites. I am an EXPERT in building scraping tools/scripts. Hence, I can SURELY work on your project. I have 4 YEARS of EXPERIENCE in developing PHP-PYTHON ...

$108 CAD in 3 days
(70 reviews)
5.5
topexpert713

Hi, Nice to meet you! I have read your requirements carefully and I am very interested in your project. I am confident about this project as I'm a professional Python and Data Mining expert with over 5 years of experience. ...

$140 CAD in 7 days
(22 reviews)
5.0
Arahan00

Hi, I have worked with NLP for sentiment analysis. I used Python for the development. I would like to work on your project. Let me know if you want to discuss further. Regards, Monir

$250 CAD in 14 days
(9 reviews)
4.5
superdev1888

Hi. I have checked your requirement and understand it well. I have a lot of experience in Python. I am a full stack developer with enough experience and skills in Django & ReactJs & VueJs & ASP.NET & PHP & JAVA ...

$140 CAD in 7 days
(12 reviews)
4.1
razajen

I am a professional data scientist from Scotland. I have a vast amount of experience in data mining. I am more than happy to go ahead and discuss your project with you; please drop me a text here.

$277 CAD in 1 day
(4 reviews)
3.8
SnakeGeneral

Hi, there! I read your description carefully and I think it best fits my skill set. I'm a Python expert and have extensive experience in data processing. Scraping is my major skill and I can build your project using differ...

$150 CAD in 7 days
(7 reviews)
3.4
agrepatil12345

Hi Sir, I have expertise in natural language processing using Python. I have also worked on different classification algorithms from machine learning and deep learning. Let's connect for further discussion. Thanks

$200 CAD in 2 days
(1 review)
0.0
soooky92

I can do it in a couple of days. I would use cross-validation because it is the one that I normally use.

$100 CAD in 10 days
(0 reviews)
0.0
luisnarvaez19

Certified in Java 1.2. I have been working with Java and JEE for 15 years. I have worked with several programming languages such as C, Python, JavaScript, and Visual Basic, among others. I have experience doing compilers and in ...

$250 CAD in 7 days
(0 reviews)
0.0
manager21

We have a good team to do the project. We are already doing Python AUS projects with on-time delivery. We can do Python/R and data science.

$140 CAD in 7 days
(0 reviews)
0.0
danx666

Hi! Your project is similar to the project done in chapters 9, 10, and 11 of "Data Science with Python and Dask". I have already done that project, so I can work on your project with confidence. I have experience choosi...

$225 CAD in 7 days
(0 reviews)
0.0
willson112203

I have gone through your job description carefully and I am very interested in your project. I am very professional in WordPress design, bug fixing, PHP, and JavaScript, and I can manage your project perfectly. Thank you!

$155 CAD in 3 days
(0 reviews)
2.8
syntechsolution

Hello, Checking your requirements, we found ourselves fit to proceed with this project. I have some queries. It would be really great if we can get connected here to understand the requirement and clarify everything in mor...

$250 CAD in 12 days
(2 reviews)
0.0