Today I officially kicked off my Titanic Survival Prediction project.
As the first step, I focused on getting familiar with the data through basic EDA (Exploratory Data Analysis).
- Loaded the train and test datasets
- Inspected them with head(), info(), and describe(include='all')
- Missing values (Age, Cabin, Embarked, Fare)
- Target column (Survived)
- Sex
- Pclass
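A minimal sketch of that first pass, in pandas. The rows below are made up so the snippet runs on its own; in the actual project the frame would presumably come from something like pd.read_csv("train.csv"), and that file name, the sample values, and the helper name summarize are all assumptions of mine.

```python
import io
import pandas as pd

# Hypothetical stand-in rows, NOT the real Titanic data.
# In the real project this would be read from the competition CSV
# (file name assumed, it is not given in the post).
SAMPLE_CSV = """PassengerId,Survived,Pclass,Sex,Age,Fare,Cabin,Embarked
1,0,3,male,22,7.25,,S
2,1,1,female,38,71.28,C85,C
3,1,3,female,,7.92,,S
4,0,2,male,35,,,
"""


def summarize(df: pd.DataFrame) -> pd.Series:
    """Print a basic overview and return missing-value counts per column."""
    print(df.head())                    # first rows
    df.info()                           # dtypes and non-null counts
    print(df.describe(include="all"))   # stats for numeric and text columns
    missing = df.isnull().sum()
    # Keep only columns that actually have gaps, largest first.
    return missing[missing > 0].sort_values(ascending=False)


train = pd.read_csv(io.StringIO(SAMPLE_CSV))
missing_counts = summarize(train)
print(missing_counts)
```

Running the same helper on both train and test is a cheap way to spot columns that have gaps in only one of the two files.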
Good EDA is about asking: “How will this help the model later?”
Rather than aimlessly plotting graphs, I tried to focus on features that might actually impact survival outcomes.
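One simple way to check whether a feature relates to survival is to compare the survival rate per category. The sketch below does this for Sex and Pclass with a groupby; the rows are invented for illustration and the helper name survival_rate is hypothetical, so the numbers say nothing about the real data.

```python
import pandas as pd

# Made-up rows for illustration only; not the real Titanic data.
train = pd.DataFrame({
    "Survived": [0, 1, 1, 0, 0, 1],
    "Sex": ["male", "female", "female", "male", "female", "male"],
    "Pclass": [3, 1, 3, 2, 1, 3],
})


def survival_rate(df: pd.DataFrame, feature: str) -> pd.Series:
    """Mean of Survived per category, i.e. the share of survivors."""
    return df.groupby(feature)["Survived"].mean()


rate_by_sex = survival_rate(train, "Sex")
rate_by_class = survival_rate(train, "Pclass")
print(rate_by_sex)
print(rate_by_class)
```

Because Survived is coded 0/1, the group mean is the survival rate, which keeps the check to a single line per feature.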
Columns with missing values: Age, Cabin, Embarked, and Fare.
Feels good to finally set sail 🚢
Let’s survive and thrive in this competition!