Build me a simple Data Profiler in Python 3 using pandas

In progress · Posted 4 years ago · Paid on delivery

Hi, my name is Amit. I'm currently diving into data science using Python and want to learn more about data profiling with pandas. I would like a base profiler that I can use and then extend.

My requirements are below.

Create a python data profiling tool with the following features:

The data profiling tool should enable the user to do the following, either individually or en masse:

1. Specify a particular data frame

2. Specify a particular column and generate basic statistical information and visuals for that column, e.g., histograms, Pareto plots, etc.

3. Specify a particular column and generate statistical information about how that column varies over a specified time window. For example, we might be interested in knowing how the distribution of the 'Number_of_Casualties' column varies by day, week or month for a given data frame.
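
The three features above could be sketched roughly as follows. This is only a minimal illustration, not a finished design: the class name `DataProfiler`, its method names, and the `Date` column used in the toy data are all assumptions made for the example, and the real road safety files may need extra date parsing options.

```python
import pandas as pd


class DataProfiler:
    """Hypothetical minimal profiler around one pandas DataFrame."""

    def __init__(self, frame: pd.DataFrame) -> None:
        # Feature 1: the caller chooses which data frame to profile.
        self.frame = frame

    @classmethod
    def from_csv(cls, path: str, **read_csv_kwargs) -> "DataProfiler":
        """Alternative constructor: load a CSV file into a profiler."""
        return cls(pd.read_csv(path, **read_csv_kwargs))

    def column_summary(self, column: str) -> pd.Series:
        """Feature 2: basic statistics for one column via describe()."""
        return self.frame[column].describe()

    def pareto_table(self, column: str) -> pd.DataFrame:
        """Feature 2: value counts (descending) with cumulative percentage."""
        counts = self.frame[column].value_counts()
        table = counts.to_frame(name="count")
        table["cumulative_pct"] = counts.cumsum() / counts.sum() * 100
        return table

    def plot_histogram(self, column: str, bins: int = 20):
        """Feature 2: histogram of one column (needs matplotlib installed)."""
        return self.frame[column].plot.hist(bins=bins)

    def summary_over_time(
        self, column: str, date_column: str, freq: str = "W"
    ) -> pd.DataFrame:
        """Feature 3: per-period statistics, e.g. freq="D" (day) or "W" (week)."""
        dates = pd.to_datetime(self.frame[date_column])
        values = pd.Series(
            self.frame[column].to_numpy(),
            index=pd.DatetimeIndex(dates),
            name=column,
        ).sort_index()
        return values.resample(freq).agg(["count", "mean", "std", "min", "max"])


# Toy data standing in for a road safety CSV; the "Date" column is assumed.
toy = pd.DataFrame(
    {
        "Date": ["2015-01-01", "2015-01-01", "2015-01-02"],
        "Number_of_Casualties": [1, 3, 2],
    }
)
profiler = DataProfiler(toy)
print(profiler.column_summary("Number_of_Casualties"))
print(profiler.summary_over_time("Number_of_Casualties", "Date", freq="D"))
```

Monthly windows work the same way, but the frequency alias depends on the installed pandas version (older releases use "M", while pandas 2.2 and later prefer "ME"), so it is left to the caller here.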

Dataset: One or more of the road safety data CSV files (2015, 2016, 2017, etc.) should be used as an example. See data link here: [login to view URL]

Core Output:

Creating a complete profiling package could take a long time and is not expected as there are multiple issues to contend with regarding data type and quality.

Instead, the idea here is to demonstrate the 'potential' for a data profiling tool to aid ML workflows. The focus should be on creating a clear minimum viable product for demo purposes in order to guide future development.

Work should demonstrate:

- Good coding practice

- Presence of unit, integration and acceptance tests

- Use of class methods
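
As an illustration of the unit-test requirement, a test module might look something like the sketch below. The function under test, `daily_mean`, is a hypothetical stand-in for one piece of the profiler, and the `Date` column name is assumed; integration and acceptance tests would sit on top of tests like these and exercise the tool against a real CSV file.

```python
import unittest

import pandas as pd


def daily_mean(frame: pd.DataFrame, column: str, date_column: str) -> pd.Series:
    """Hypothetical function under test: mean of `column` per calendar day."""
    days = pd.to_datetime(frame[date_column]).dt.normalize()
    return frame.groupby(days)[column].mean()


class DailyMeanTest(unittest.TestCase):
    def setUp(self) -> None:
        # Small hand-made frame so expected values are easy to check.
        self.frame = pd.DataFrame(
            {
                "Date": ["2015-01-01", "2015-01-01", "2015-01-02"],
                "Number_of_Casualties": [1, 3, 2],
            }
        )

    def test_one_row_per_day(self) -> None:
        result = daily_mean(self.frame, "Number_of_Casualties", "Date")
        self.assertEqual(len(result), 2)

    def test_mean_values(self) -> None:
        result = daily_mean(self.frame, "Number_of_Casualties", "Date")
        self.assertEqual(result.tolist(), [2.0, 2.0])

    def test_unknown_column_raises(self) -> None:
        with self.assertRaises(KeyError):
            daily_mean(self.frame, "missing", "Date")
```

Saved as a test module, this can be run with `python -m unittest`.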

Results should include:

- Documentation highlighting what the profiling tool does and how to execute it. It should also highlight its scope, scalability/limitations and future features that could be considered for development.

- Documentation of function and class method inputs/outputs should also be available in Sphinx.
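
For the Sphinx requirement, one common approach (assumed here, not specified in the brief) is to write docstrings with reStructuredText field lists and let the `sphinx.ext.autodoc` extension pull them into the generated pages. The function `missing_share` below is a hypothetical example used only to show the docstring layout.

```python
import pandas as pd


def missing_share(frame: pd.DataFrame, column: str) -> float:
    """Return the fraction of missing values in one column.

    :param frame: Data frame to profile.
    :param column: Name of the column to inspect.
    :returns: Share of missing entries, between 0.0 and 1.0.
    :raises KeyError: If ``column`` is not present in ``frame``.
    """
    # isna() marks missing entries; the mean of the booleans is their share.
    return float(frame[column].isna().mean())
```

With autodoc enabled, a directive such as `.. autofunction::` followed by the function's import path would render the parameters, return value and exceptions from this docstring.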

Python, Software Architecture, Data Mining, Data Science

Project ID: #21408805

About the project

12 proposals · Remote project · Active 4 years ago

12 freelancers have bid an average of £168 for this job

liveexperts123

Hi there, I have read your project description and I'm confident I can do this project for you perfectly. I still have a few questions. Please leave a message on my chat so we can discuss the budget and deadline of the ...

£250 GBP in 3 days
(66 reviews)
7.3
umg536

Hi there, please leave a message on my chat so we can discuss the budget and deadline of the project. I have read your project description and I'm confident I can do this project for you perfectly. Thanks.

£250 GBP in 3 days
(21 reviews)
6.2
sharktiger

Good day! I'm a licensed full-stack developer and designer. I have extensive experience in Python/Django, Python Selenium web scraping, and Python image processing with the OpenCV package. I have many ...

£135 GBP in 7 days
(7 reviews)
4.3
Mexi2705

Hello, I have gone through your note and am confident that I can work on your project. I have 10 years of rich experience as a Mobile & Web Developer and also know graphic design, which means that in my career I learn ...

£450 GBP in 12 days
(5 reviews)
3.7
bluestar1027

Hello, dear! I have read your description carefully. I can handle it with full confidence and have already done this type of project. Please give me an opportunity to work with you.

£135 GBP in 7 days
(8 reviews)
4.0
pinesucceed01

Hi there, I am a Python developer with the following skills: engineering professional with 10 years of experience in software development. Leading the development of applications/tools using Python for 6 ...

£135 GBP in 7 days
(3 reviews)
3.5
Valuesolutions

Hello, I have read the details provided. Please contact me to discuss the project deadline and a few other things.

£135 GBP in 4 days
(15 reviews)
4.9
Zied130

Hi, I am a mathematician and a researcher in natural language processing. I do my research in Python. I am also good at statistics and report writing.

£50 GBP in 3 days
(7 reviews)
2.4
ThisIsPouya

Hi, I have extensive knowledge of Python and pandas as well as data processing and manipulation. Also, as a requirement of my PhD studies, I worked extensively with R, Stata, Python, SPSS Modeler and Matlab for stat ...

£50 GBP in 1 day
(0 reviews)
0.0