ago
0 like 0 dislike
0 like 0 dislike
I’m facing difficulties in analysing the data when to use what commands/how to proceed to next step.

For example :-)

1. I don’t know when to remove which column/row data

2. when to use scatter, histo, box plots

3. when remove duplicate values, when to add median, mean for which data like uni or bi

4. After plotting graphs when to use heat maps, when to use subplots


Please do post some resources which can help to understand step by step process when to use what in analysing different datasets
ago
0 like 0 dislike
0 like 0 dislike
This is a cool thought provoking post but EDA is where the most individuality exists in data science. It's just you exploring.

I doubt many will have a comprehensive answer for you
ago
0 like 0 dislike
0 like 0 dislike
There are no golden rules for EDA. Indeed, we know when to use scatter plot and box plots. However, the key is what messages you are trying to convey. How you are building the visualizations should be highly dependent on this.
ago
by
0 like 0 dislike
0 like 0 dislike
run a pandas profiling  report
ago

No related questions found

33.4k questions

135k answers

0 comments

33.7k users

OhhAskMe is a math solving hub where high school and university students ask and answer loads of math questions, discuss the latest in math, and share their knowledge. It’s 100% free!