Data mining determining distributions

This is about statistical analysis of a data collection as well as different data reduction methods, and in particular, dimensionality reduction through feature extraction. You are given two datasets, each containing a data table of 1000 vector with 100 attributes (i.e., dimensions) in two files with 500 samples for each file. Each dataset is given by two tables of 500 samples each. Both datasets are given as text table files where each dataset is represented as a 1000 x 100 matrix where each row of the matrix is a vector. You are further told that for each dataset, for all the samples (i.e., vectors) the component values of each vector follow the same distribution.

1. Determine the distributions of the two vector component values for both datasets. For each dataset, randomly pick up 10 samples and report the distribution parameters for each of the 10 samples.

2. Compute the norms for all the samples for both datasets. Then determine the distributions for the norms of both datasets, respectively, and report their distribution parameters.

3. Implement PCA and DCT methods and apply them for feature extraction to the two datasets, respectively. Report the reduced dimensionalities for the two datasets after the feature extraction for PCA and DCT, respectively.

4. Compare the feature extraction results between the two methods for the two datasets, respectively, and report your comparison conclusion.

You can use whatever programming language you are comfortable with.(Preferrably c++)

Taidot: tiedonlouhinta, Datatiede, tietojenkäsittely, C++ -ohjelmointi

Näytä lisää: how to determine distribution of data in excel, data distribution, statistical distributions pdf, how to identify distribution of data, how to find the distribution of data in statistics, how to determine the type of distribution, how to find the distribution of data in matlab, normal distribution, excel data mining project, build data mining project, data mining marketing research, data mining research companies, example data mining, purchase data mining contract information, data mining cleaning, dataset data mining association, screen scraping data mining, role database developers data mining, data mining using aspnet, datasets data mining association

Tietoa työnantajasta:
( 6 arvostelua ) BINGHAMTON, United States

Projektin tunnus: #21736758

10 freelanceria on tarjonnut keskimäärin %project_bid_stats_avg_sub_18% %project_currencyDetails_sign_sub_19%/tunti tähän työhön


Hi there, I have read your project description and i'm confident i can do this project for you perfectly.I still have a few questions. please leave a message on my chat so we can discuss the budget and deadline of the Lisää

$50 USD / tunti
(23 arvostelua)

Dear sir, I've already done this kind of project before. I'm sure that I can complete your project 'Data mining determining distributions' as soon as possible. I am senior software developer and always provide fast ser Lisää

$5 USD / tunti
(12 arvostelua)

Hi dear, Nice to meet [login to view URL] for taking your valuable time for reviewing my proposal. I am a senior web site developer. I've just checked your description. I have a confidence that I can complete your project in Lisää

$5 USD / tunti
(6 arvostelua)

Hi,sir, I'm sure that I can be a excellent candidate for your project. Please contact me, so that we can discuss more over chat. I value my credits from clients. Thank you for your reading. I have worked for a long ti Lisää

$5 USD / tunti
(6 arvostelua)

Hello Thanks for your posting. I am a senior developer so i can do it very easily if you want.I’ve read your job description carefully and I am very interested in your project. I am sure that I can finish this project Lisää

$10 USD / tunti
(5 arvostelua)

Hi, Sir! I have just read your job description carefully. I understand you want an experienced C++ developer to help you. I have been developing many statistical tasks for over 7 years. I am also a statistical exper Lisää

$5 USD / tunti
(5 arvostelua)

Hello, Your job post caught my attention Because,I have lots of experience in Lead Generation, Email marketing, data mining, web scraping, data typing,ms word, ms excel, data entry, web research, database administrat Lisää

$5 USD / tunti
(1 arvostelu)

Hello I am an experienced Academic Research paper and Business report/plan writer. My expertise includes: Project Management, Python, R-Programming and Matlab Coding. Education: I have done Masters in Business marke Lisää

$11 USD / tunti
(0 arvostelua)

Hi, We could do this for you fast and easy. Please contact us for further information. Best regards!

$10 USD / tunti
(0 arvostelua)

Hello! I am very interested in your post project. While I read your description carefully, I was excited with feeling that I would be able to satisfy for your requirements in this job. We can negotiate on price/Budget Lisää

$2 USD / tunti
(0 arvostelua)