Suljettu

Data Preprocessing java code

In this project, the students are to implement data pre-processing techniques and apply them to a gene expression dataset.

The dataset contains 62 samples collected from colon-cancer patients. 40 of the samples are labeled as ”negative” and 22 are labeled as ”positive.” Each tuple (row) in the dataset is a sample containing the readings for the genes, and the class (which is the last column) of the sample. Each gene is an attribute. The columns are separated by ”,”, which is a commonly used format in data mining. We will refer to the genes as G0, ..., GN, assigned in the left-to-right order as given in the original file.

You will write a C++ or Java program to handle the following two tasks:

Task 1. Task 2.

Discretize the data using equi-density binning with 3 bins for each of the first k attributes.

Use the entropy-based binning method to discretize all genes and to select the top-k genes, ranked in decreasing information gain order. Use 3 bins for each gene. Information gain for three bins is a generalization of the two-bins case (based on size-weighted entropy). To get three bins you should first divide the range of a given attribute into two bins and then divide one of the two bins into two more bins. The two splits should maximize the size-weighted entropy gain for the three intervals. (You should select between the two splits (one for the left interval and one for the right interval) as the the second split based on size-weighted entropy gain.)

Taidot: tiedonlouhinta, Java

Näytä lisää: java redirect data code, java send data usb

About the Employer:
( 0 reviews ) United States

Projektin tunnus: #13120155

11 freelanceria on tarjonnut keskimäärin 89 $ tähän työhön

dobreiiita

Hello I am Java expert and interested in this project. I have reviewed the attached files and confident to handle it perfectly. I have a lot of experience in helping in students with assignments, so I will k Lisää

100 $ USD 2 päivässä
(376 arvostelua)
7.4
100 $ USD 1 päivässä
(195 arvostelua)
6.5
70 $ USD 2 päivässä
(88 arvostelua)
5.9
koustav2006

Hi, I am good at core java programming and familiar with required data processing algorithms. I can get the work done in 24 hours. With Regards, Koustav

80 $ USD 1 päivässä
(48 arvostelua)
5.1
110 $ USD 1 päivässä
(6 arvostelua)
4.3
moeenahmed21

I am an experienced C++ and Java developer. I will solve your problem and develop the program. Feel free to contact me for further discussion. Regards, Moeen Ahmed

70 $ USD 1 päivässä
(13 arvostelua)
4.1
point5nyble

Hello, I have been working with a MNC based company since last ~ 6 years, as an IT BI DWH Professional. The project is a BI DWH project which covers 3 layers of BI (ETL, Data warehouse & Reporting) and helped me in Lisää

100 $ USD 1 päivässä
(6 arvostelua)
3.6
pawarpankaj923

Hello, Nice to see your post,I am having 5+ years of experience in development,just share me your detail requirement with me so we can discuss more.I am sure after discussion with me you are satisfy and we will wo Lisää

15 $ USD 1 päivässä
(2 arvostelua)
2.2
GITTechBAY

I am ready to work on your task as per the given requirement , please message me avaiaolbe 24/7 onlines for status update . Lisää

30 $ USD 1 päivässä
(7 arvostelua)
2.3
zain9674

Thank you for taking the time to review our bid! I just checked the description you have provided regarding the project and it would be a pleasure to assist you as well. I am really eager to work on your project with f Lisää

23 $ USD 0 päivässä
(0 arvostelua)
0.0
Dextersmind2

Hi, I can certainly do your task, having more than 8 years of experience, please reply me and discuss the details. High grades and short deadline is guaranteed. Regards

277 $ USD 15 päivässä
(0 arvostelua)
0.0