Today was all about diving deeper into the industry-level sales trends from the Seoul dataset I started cleaning yesterday.
After getting the basic structure set up, I wanted to understand:
๐ญ โWhich industries are consistently dominating the sales charts every quarter?โ
๐ญ โIs ํ์์์์ really the MVP? Or is ๋
ธ๋์ง ์์ฐ๋ฌผํ๋งค stealing the show?โ
๐ What I explored today
- Grouped total sales by industry and quarter
- Used
groupby
on ๊ธฐ์ค_๋
๋ถ๊ธฐ_์ฝ๋
and ์๋น์ค_์
์ข
_์ฝ๋_๋ช
- Aggregated national-level total sales per category
- Ranked the top 5 industries for each quarter in 2024
- Turns out ํ์์์์ (Korean restaurants) was on top more than once
- But seafood sales in ๋
ธ๋์ง2๋? Absolutely wild numbers.
- Thatโs a district-level outlier, not an industry-wide winner
- Converted sales units to ์ต (100M KRW)
- Because yes, my eyes were bleeding from all those zeros ๐
- Made a grouped barplot with value labels
- It actually turned out super readable. Might reuse that template again.
๐ง Insights
- Some industries (like Korean food) are steady earners across all quarters
- Others (like seafood) show extreme spikes in one region only
- Looks like this EDA will split nicely into macro (industry) and micro (district) levels
๐ฎ Whatโs next?
Tomorrowโs plan:
- Shift focus to ํ์ ๋ (district)-level analysis
- Estimate average sales per store (e.g., ์นํจ์ง in ๊ฐ๋จ๊ตฌ)
- Possibly join with ์ฌ์
์ฒด ์ data to get โstore-level profitabilityโ
Thatโs a wrap for today!
Feels good to have a clear separation between nationwide industry trends vs. local insights.
Weโre building something real here ๐