Build me a simple Data Profiler in Python 3 using panda

V teku Objavljeno pred 4 letoma/leti Plačilo ob prevzemu
V teku Plačilo ob prevzemu

Hi My name is Amit and im currently diving into Datascience using python and wanted to learn more data profiling using pandas and would like a base profiler which i can use and then extend.

My requirements are below.

Create a python data profiling tool with the following features:

The data profiling tool should enable the user to either individually or on mass action the following:

1. Specify a particular data frame

2. Specify a particular column and generate basic statistical information and visuals for that column, i.e., histograms, pareto plots etc.

3. Specify a particular column and generate statistical information about how that column varies over a specified time window. For example, we might be interested in knowing how the distribution in the ‘Number_of_Casualties varies by day, week or month for a given dataframe.

Dataset - One or more of the road safety data csv files from 2015/16/17/etc., should be used as an example. See data link here: [login to view URL]

Core Output:

Creating a complete profiling package could take a long time and is not expected as there are multiple issues to contend with regarding data type and quality.

Instead the idea here is to demonstrate the ‘potential’ for a data profiling tool to aid ML workflows. The focus should be on creating a clear minimal viable product for demo purposes in order guide future development.

Work should demonstrate:

- Good coding practise

- Presence of unit, integration and acceptance tests

- Use of class methods

Results should include:

- Documentation highlighting what the profiling tool does and how to execute it. It should also highlight its scope, scalability/limitations and future features that could be considered for development.

- Documentation regarding func/class methods inputs/outputs should also be available in sphinx.

Python Arhitektura porgramske opreme Podatkovno rudarjenje Podatkovna znanost

ID projekta: #21408805

Več o projektu

12 predlogov Oddaljen projekt Aktiven pred 4 letoma/leti

12 freelancerjev ponuja v povprečju za £168 na tem delu

liveexperts123

Hi there, I have read your project description and i'm confident i can do this project for you perfectly.I still have a few questions. please leave a message on my chat so we can discuss the budget and deadline of the Več

£250 GBP v 3 dneh
(66 ocen)
7.3
umg536

Hi there, please leave a message on my chat so we can discuss the budget and deadline of the project. I have read your project description and i'm confident i can do this project for you perfectly. Thanks . .

£250 GBP v 3 dneh
(21 ocen)
6.2
sharktiger

Good day! I'm a licensed full stack programming developer and designer. I have many experiences in python/Django and python selenium webscraping and python image processing by using python openCV package. I have many Več

£135 GBP v 7 dneh
(7 ocen)
4.3
Mexi2705

Hello I have walked through your note and enough confidence that I can work on your project I am having 10 years of rich experience as Mobile & Web Developer and also know graphics designing means in my career i learn Več

£450 GBP v 12 dneh
(5 ocen)
3.7
bluestar1027

*****Hello, dear!!!***** I have read your description carefully. I can handle it with full confidence and have already done this type of projects. Please give me an opportunity to work with you.

£135 GBP v 7 dneh
(8 ocen)
4.0
pinesucceed01

Hi there, I am Python developer, having below given skills: Engineering professional with 10 years of experience in Software development. Mastering/Leading in the development of applications/tools using Python for 6 Več

£135 GBP v 7 dneh
(3 ocen)
3.5
Valuesolutions

Hello, i have read the details provided..please contact me to discuss more on the project deadline and some other few things

£135 GBP v 4 dneh
(15 ocen)
4.9
Zied130

Hi I am a mathematician and a researcher in natural language processing. I do my research in python. I am also good in statistics and reports writing.

£50 GBP v 3 dneh
(7 ocen)
2.4
ThisIsPouya

Hi, I have extensive knowledge of Python and Pandas as well as data processing and manipulation. Also as a requirement and of my PhD studies, I worked extensively with R, Stata, Python, SPSS Modeler and Matlab for stat Več

£50 GBP v 1 dnevu
(0 ocen)
0.0