Data Analysis Project Based on Public Open Data
pd.melt() + pd.concat()) were collected to examine price trends per pyeong across 17 metropolitan regions.
Key insights were derived through data preprocessing and exploratory analysis using Python and Pandas.
pd.to_numeric()pd.melt()pd.concat() (4,692 rows)groupby("region") → regional averagesgroupby("year") → yearly trendsgroupby("area") → size category comparison| Column | Type | Non-Null |
|---|---|---|
| Region | object | 4,335 |
| Size Class | object | 4,335 |
| Year | int64 | 4,335 |
| Month | int64 | 4,335 |
| Price (㎡) | object | 4,058 |
| Column | Type | Non-Null |
|---|---|---|
| Region | object | 4,335 |
| Year | int64 | 4,335 |
| Month | int64 | 4,335 |
| Price | float64 | 3,957 |
| Price / Pyeong | float64 | 3,957 |
| Area Type | object | 4,335 |
Distribution of 3,957 price records across 17 regions (2015–2019). Right-skewed with outlier at 42,002.
Average price per pyeong by region and year. Darker blue = higher price. Combined from two datasets via pd.melt() + pd.concat().