Skip to content

Commit bcfe3d6

Browse files
committed
Fix code : reflect on the code review.
Signed-off-by: poo <[email protected]>
1 parent 89c6964 commit bcfe3d6

File tree

3 files changed

+17
-17
lines changed

3 files changed

+17
-17
lines changed

โ€Ž2-Working-With-Data/07-python/translations/README.ko.mdโ€Ž

Lines changed: 11 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -88,9 +88,9 @@ ax = monthly.plot(kind='bar')
8888
```
8989
![Monthly Time Series Averages](../images/timeseries-3.png)
9090

91-
### ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„
91+
### ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„(DataFrame)
9292

93-
๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์€ ๊ธฐ๋ณธ์ ์œผ๋กœ ๋™์ผํ•œ ์ธ๋ฑ์Šค๋ฅผ ๊ฐ€์ง„ ์‹œ๋ฆฌ์ฆˆ ๋ชจ์Œ์ž…๋‹ˆ๋‹ค. ์—ฌ๋Ÿฌ ์‹œ๋ฆฌ์ฆˆ๋ฅผ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์œผ๋กœ ๊ฒฐํ•ฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค:
93+
๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„(DataFrame)์€ ๊ธฐ๋ณธ์ ์œผ๋กœ ๋™์ผํ•œ ์ธ๋ฑ์Šค๋ฅผ ๊ฐ€์ง„ ์‹œ๋ฆฌ์ฆˆ ๋ชจ์Œ์ž…๋‹ˆ๋‹ค. ์—ฌ๋Ÿฌ ์‹œ๋ฆฌ์ฆˆ๋ฅผ DataFrame์œผ๋กœ ๊ฒฐํ•ฉํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค:
9494
```python
9595
a = pd.Series(range(1,10))
9696
b = pd.Series(["I","like","to","play","games","and","will","not","change"],index=range(0,9))
@@ -126,15 +126,15 @@ df = pd.DataFrame([a,b]).T..rename(columns={ 0 : 'A', 1 : 'B' })
126126
```
127127
์—ฌ๊ธฐ์„œ `.T`๋Š” ํ–‰๊ณผ ์—ด์„ ๋ณ€๊ฒฝํ•˜๋Š” DataFrame์„ ์ „์น˜ํ•˜๋Š” ์ž‘์—…, ์ฆ‰ ํ–‰๊ณผ ์—ด์„ ๋ณ€๊ฒฝํ•˜๋Š” ์ž‘์—…์„ ์˜๋ฏธํ•˜๋ฉฐ `rename` ์ž‘์—…์„ ์‚ฌ์šฉํ•˜๋ฉด ์ด์ „ ์˜ˆ์ œ์™€ ์ผ์น˜ํ•˜๋„๋ก ์—ด ์ด๋ฆ„์„ ๋ฐ”๊ฟ€ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
128128

129-
๋‹ค์Œ์€ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์—์„œ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๋Š” ๋ช‡ ๊ฐ€์ง€ ๊ฐ€์žฅ ์ค‘์š”ํ•œ ์ž‘์—…์ž…๋‹ˆ๋‹ค:
129+
๋‹ค์Œ์€ DataFrame์—์„œ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ๋Š” ๋ช‡ ๊ฐ€์ง€ ๊ฐ€์žฅ ์ค‘์š”ํ•œ ์ž‘์—…์ž…๋‹ˆ๋‹ค:
130130

131-
**ํŠน์ • ์ปฌ๋Ÿผ ์„ ํƒ(Column selection)**. `df['A']`๋ฅผ ์ž‘์„ฑํ•˜์—ฌ ๊ฐœ๋ณ„ ์—ด์„ ์„ ํƒํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์ด ์ž‘์—…์€ ์‹œ๋ฆฌ์ฆˆ๋ฅผ ๋ฐ˜ํ™˜ํ•ฉ๋‹ˆ๋‹ค. ๋˜ํ•œ `df[['B','A']]`๋ฅผ ์ž‘์„ฑํ•˜์—ฌ ์—ด์˜ ํ•˜์œ„ ์ง‘ํ•ฉ์„ ๋‹ค๋ฅธ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์œผ๋กœ ์„ ํƒํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๋ฉด ๋‹ค๋ฅธ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์ด ๋ฐ˜ํ™˜๋ฉ๋‹ˆ๋‹ค.
131+
**ํŠน์ • ์ปฌ๋Ÿผ ์„ ํƒ(Column selection)**. `df['A']`๋ฅผ ์ž‘์„ฑํ•˜์—ฌ ๊ฐœ๋ณ„ ์—ด์„ ์„ ํƒํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์ด ์ž‘์—…์€ ์‹œ๋ฆฌ์ฆˆ๋ฅผ ๋ฐ˜ํ™˜ํ•ฉ๋‹ˆ๋‹ค. ๋˜ํ•œ `df[['B','A']]`๋ฅผ ์ž‘์„ฑํ•˜์—ฌ ์—ด์˜ ํ•˜์œ„ ์ง‘ํ•ฉ์„ ๋‹ค๋ฅธ DataFrame์œผ๋กœ ์„ ํƒํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๋ฉด ๋‹ค๋ฅธ DataFrame์ด ๋ฐ˜ํ™˜๋ฉ๋‹ˆ๋‹ค.
132132

133133
**ํ•„ํ„ฐ๋ง(Filtering)** ์€ ๊ธฐ์ค€์— ๋”ฐ๋ผ ํŠน์ • ํ–‰๋งŒ ์ ์šฉํ•ฉ๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด `A` ์—ด์ด 5๋ณด๋‹ค ํฐ ํ–‰๋งŒ ๋‚จ๊ธฐ๋ ค๋ฉด `df[df['A']>5]`๋ผ๊ณ  ์“ธ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
134134

135-
> **์ฃผ์˜**: ํ•„ํ„ฐ๋ง์ด ์ž‘๋™ํ•˜๋Š” ๋ฐฉ์‹์€ ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค. ํ‘œํ˜„์‹ `df['A']<5`๋Š” ์›๋ž˜ ์‹œ๋ฆฌ์ฆˆ `df['A']`์˜ ๊ฐ ์š”์†Œ์— ๋Œ€ํ•ด ํ‘œํ˜„์‹์ด `True`์ธ์ง€ ์•„๋‹ˆ๋ฉด `False`์ธ์ง€๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” `๋ถ€์šธ(Boolean)` ์‹œ๋ฆฌ์ฆˆ๋ฅผ ๋ฐ˜ํ™˜ํ•ฉ๋‹ˆ๋‹ค. ๋ถ€์šธ ๊ณ„์—ด์ด ์ธ๋ฑ์Šค๋กœ ์‚ฌ์šฉ๋˜๋ฉด ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์—์„œ ํ–‰์˜ ํ•˜์œ„ ์ง‘ํ•ฉ์„ ๋ฐ˜ํ™˜ํ•ฉ๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ์ž„์˜์˜ Python ๋ถ€์šธ ํ‘œํ˜„์‹์„ ์‚ฌ์šฉํ•  ์ˆ˜ ์—†์Šต๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด `df[df['A']>5 ๋ฐ df['A']<7]`๋ฅผ ์ž‘์„ฑํ•˜๋Š” ๊ฒƒ์€ ์ž˜๋ชป๋œ ๊ฒƒ์ž…๋‹ˆ๋‹ค. ๋Œ€์‹ , ๋ถ€์šธ ๊ณ„์—ด์— ํŠน์ˆ˜ `&` ์—ฐ์‚ฐ์„ ์‚ฌ์šฉํ•˜์—ฌ `df[(df['A']>5) & (df['A']<7)]`๋กœ ์ž‘์„ฑํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค(*์—ฌ๊ธฐ์„œ ๋Œ€๊ด„ํ˜ธ๊ฐ€ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค*).
135+
> **์ฃผ์˜**: ํ•„ํ„ฐ๋ง์ด ์ž‘๋™ํ•˜๋Š” ๋ฐฉ์‹์€ ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค. ํ‘œํ˜„์‹ `df['A']<5`๋Š” ์›๋ž˜ ์‹œ๋ฆฌ์ฆˆ `df['A']`์˜ ๊ฐ ์š”์†Œ์— ๋Œ€ํ•ด ํ‘œํ˜„์‹์ด `True`์ธ์ง€ ์•„๋‹ˆ๋ฉด `False`์ธ์ง€๋ฅผ ๋‚˜ํƒ€๋‚ด๋Š” `๋ถ€์šธ(Boolean)` ์‹œ๋ฆฌ์ฆˆ๋ฅผ ๋ฐ˜ํ™˜ํ•ฉ๋‹ˆ๋‹ค. ๋ถ€์šธ ๊ณ„์—ด์ด ์ธ๋ฑ์Šค๋กœ ์‚ฌ์šฉ๋˜๋ฉด DataFrame์—์„œ ํ–‰์˜ ํ•˜์œ„ ์ง‘ํ•ฉ์„ ๋ฐ˜ํ™˜ํ•ฉ๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ ์ž„์˜์˜ Python ๋ถ€์šธ ํ‘œํ˜„์‹์„ ์‚ฌ์šฉํ•  ์ˆ˜ ์—†์Šต๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด `df[df['A']>5 ๋ฐ df['A']<7]`๋ฅผ ์ž‘์„ฑํ•˜๋Š” ๊ฒƒ์€ ์ž˜๋ชป๋œ ๊ฒƒ์ž…๋‹ˆ๋‹ค. ๋Œ€์‹ , ๋ถ€์šธ ๊ณ„์—ด์— ํŠน์ˆ˜ `&` ์—ฐ์‚ฐ์„ ์‚ฌ์šฉํ•˜์—ฌ `df[(df['A']>5) & (df['A']<7)]`๋กœ ์ž‘์„ฑํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค(*์—ฌ๊ธฐ์„œ ๋Œ€๊ด„ํ˜ธ๊ฐ€ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค*).
136136
137-
**์ƒˆ๋กœ์šด ๊ณ„์‚ฐ ๊ฐ€๋Šฅํ•œ ์—ด ๋งŒ๋“ค๊ธฐ**. ์šฐ๋ฆฌ๋Š” ์ง๊ด€์ ์ธ ํ‘œํ˜„์„ ์‚ฌ์šฉํ•˜์—ฌ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์— ๋Œ€ํ•œ ์ƒˆ๋กœ์šด ๊ณ„์‚ฐ ๊ฐ€๋Šฅํ•œ ์—ด์„ ์‰ฝ๊ฒŒ ๋งŒ๋“ค ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.:
137+
**์ƒˆ๋กœ์šด ๊ณ„์‚ฐ ๊ฐ€๋Šฅํ•œ ์—ด ๋งŒ๋“ค๊ธฐ**. ์šฐ๋ฆฌ๋Š” ์ง๊ด€์ ์ธ ํ‘œํ˜„์„ ์‚ฌ์šฉํ•˜์—ฌ DataFrame์— ๋Œ€ํ•œ ์ƒˆ๋กœ์šด ๊ณ„์‚ฐ ๊ฐ€๋Šฅํ•œ ์—ด์„ ์‰ฝ๊ฒŒ ๋งŒ๋“ค ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.:
138138
```python
139139
df['DivA'] = df['A']-df['A'].mean()
140140
```
@@ -152,7 +152,7 @@ df['LenB'] = df['B'].apply(lambda x : len(x))
152152
df['LenB'] = df['B'].apply(len)
153153
```
154154

155-
์œ„์˜ ์ž‘์—… ํ›„์— ๋‹ค์Œ๊ณผ ๊ฐ™์€ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์ด ์™„์„ฑ๋ฉ๋‹ˆ๋‹ค:
155+
์œ„์˜ ์ž‘์—… ํ›„์— ๋‹ค์Œ๊ณผ ๊ฐ™์€ DataFrame์ด ์™„์„ฑ๋ฉ๋‹ˆ๋‹ค:
156156

157157
| | A | B | DivA | LenB |
158158
| --- | --- | ------ | ---- | ---- |
@@ -166,12 +166,12 @@ df['LenB'] = df['B'].apply(len)
166166
| 7 | 8 | very | 3.0 | 4 |
167167
| 8 | 9 | much | 4.0 | 4 |
168168

169-
**์ˆซ์ž๋ฅผ ๊ธฐ์ค€์œผ๋กœ ํ–‰ ์„ ํƒ** `iloc(์ •์ˆ˜ ์œ„์น˜:integer location)` ๊ตฌ์„ฑ์„ ์‚ฌ์šฉํ•˜์—ฌ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์—์„œ ์ฒ˜์Œ 5๊ฐœ ํ–‰์„ ์„ ํƒํ•˜๋ ค๋ฉด:
169+
**์ˆซ์ž๋ฅผ ๊ธฐ์ค€์œผ๋กœ ํ–‰ ์„ ํƒ** `iloc(์ •์ˆ˜ ์œ„์น˜:integer location)` ๊ตฌ์„ฑ์„ ์‚ฌ์šฉํ•˜์—ฌ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด DataFrame์—์„œ ์ฒ˜์Œ 5๊ฐœ ํ–‰์„ ์„ ํƒํ•˜๋ ค๋ฉด:
170170
```python
171171
df.iloc[:5]
172172
```
173173

174-
**๊ทธ๋ฃนํ™”(Grouping)** ๋Š” ์ข…์ข… Excel์˜ *ํ”ผ๋ฒ— ํ…Œ์ด๋ธ”*๊ณผ ์œ ์‚ฌํ•œ ๊ฒฐ๊ณผ๋ฅผ ์–ป๋Š” ๋ฐ ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค. ์ฃผ์–ด์ง„ `LenB` ์ˆ˜์— ๋Œ€ํ•ด `A` ์—ด์˜ ํ‰๊ท  ๊ฐ’์„ ๊ณ„์‚ฐํ•˜๋ ค๊ณ  ํ•œ๋‹ค๊ณ  ๊ฐ€์ •ํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋Ÿฐ ๋‹ค์Œ `LenB`๋กœ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์„ ๊ทธ๋ฃนํ™”ํ•˜๊ณ  `mean`์„ ํ˜ธ์ถœํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค:
174+
**๊ทธ๋ฃนํ™”(Grouping)** ๋Š” ์ข…์ข… Excel์˜ *ํ”ผ๋ฒ— ํ…Œ์ด๋ธ”*๊ณผ ์œ ์‚ฌํ•œ ๊ฒฐ๊ณผ๋ฅผ ์–ป๋Š” ๋ฐ ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค. ์ฃผ์–ด์ง„ `LenB` ์ˆ˜์— ๋Œ€ํ•ด `A` ์—ด์˜ ํ‰๊ท  ๊ฐ’์„ ๊ณ„์‚ฐํ•˜๋ ค๊ณ  ํ•œ๋‹ค๊ณ  ๊ฐ€์ •ํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋Ÿฐ ๋‹ค์Œ `LenB`๋กœ DataFrame์„ ๊ทธ๋ฃนํ™”ํ•˜๊ณ  `mean`์„ ํ˜ธ์ถœํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค:
175175
```python
176176
df.groupby(by='LenB').mean()
177177
```
@@ -193,7 +193,7 @@ This gives us the following table:
193193

194194
### ๋ฐ์ดํ„ฐ ์–ป๊ธฐ
195195

196-
์šฐ๋ฆฌ๋Š” Python ๊ฐ์ฒด์—์„œ ์‹œ๋ฆฌ์ฆˆ ๋ฐ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์„ ๊ตฌ์„ฑํ•˜๋Š” ๊ฒƒ์ด ์–ผ๋งˆ๋‚˜ ์‰ฌ์šด์ง€ ๋ณด์•˜์Šต๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ๋ฐ์ดํ„ฐ๋Š” ์ผ๋ฐ˜์ ์œผ๋กœ ํ…์ŠคํŠธ ํŒŒ์ผ ๋˜๋Š” Excel ํ‘œ์˜ ํ˜•ํƒœ๋กœ ์ œ๊ณต๋ฉ๋‹ˆ๋‹ค. ์šด ์ข‹๊ฒŒ๋„ Pandas๋Š” ๋””์Šคํฌ์—์„œ ๋ฐ์ดํ„ฐ๋ฅผ ๋กœ๋“œํ•˜๋Š” ๊ฐ„๋‹จํ•œ ๋ฐฉ๋ฒ•์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด CSV ํŒŒ์ผ์„ ์ฝ๋Š” ๊ฒƒ์€ ๋‹ค์Œ๊ณผ ๊ฐ™์ด ๊ฐ„๋‹จํ•ฉ๋‹ˆ๋‹ค:
196+
์šฐ๋ฆฌ๋Š” Python ๊ฐ์ฒด์—์„œ ์‹œ๋ฆฌ์ฆˆ ๋ฐ DataFrame์„ ๊ตฌ์„ฑํ•˜๋Š” ๊ฒƒ์ด ์–ผ๋งˆ๋‚˜ ์‰ฌ์šด์ง€ ๋ณด์•˜์Šต๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ๋ฐ์ดํ„ฐ๋Š” ์ผ๋ฐ˜์ ์œผ๋กœ ํ…์ŠคํŠธ ํŒŒ์ผ ๋˜๋Š” Excel ํ‘œ์˜ ํ˜•ํƒœ๋กœ ์ œ๊ณต๋ฉ๋‹ˆ๋‹ค. ์šด ์ข‹๊ฒŒ๋„ Pandas๋Š” ๋””์Šคํฌ์—์„œ ๋ฐ์ดํ„ฐ๋ฅผ ๋กœ๋“œํ•˜๋Š” ๊ฐ„๋‹จํ•œ ๋ฐฉ๋ฒ•์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด CSV ํŒŒ์ผ์„ ์ฝ๋Š” ๊ฒƒ์€ ๋‹ค์Œ๊ณผ ๊ฐ™์ด ๊ฐ„๋‹จํ•ฉ๋‹ˆ๋‹ค:
197197
```python
198198
df = pd.read_csv('file.csv')
199199
```
@@ -202,7 +202,7 @@ df = pd.read_csv('file.csv')
202202

203203
### ์ถœ๋ ฅ(Printing) ๋ฐ ํ”Œ๋กœํŒ…(Plotting)
204204

205-
๋ฐ์ดํ„ฐ ๊ณผํ•™์ž๋Š” ์ข…์ข… ๋ฐ์ดํ„ฐ๋ฅผ ํƒ์ƒ‰ํ•ด์•ผ ํ•˜๋ฏ€๋กœ ์‹œ๊ฐํ™”ํ•  ์ˆ˜ ์žˆ๋Š” ๊ฒƒ์ด ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค. ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์ด ํด ๋•Œ ์ฒ˜์Œ ๋ช‡ ํ–‰์„ ์ธ์‡„ํ•˜์—ฌ ๋ชจ๋“  ์ž‘์—…์„ ์˜ฌ๋ฐ”๋ฅด๊ฒŒ ์ˆ˜ํ–‰ํ•˜๊ณ  ์žˆ๋Š”์ง€ ํ™•์ธํ•˜๋ ค๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋งŽ์Šต๋‹ˆ๋‹ค. ์ด๊ฒƒ์€ `df.head()`๋ฅผ ํ˜ธ์ถœํ•˜์—ฌ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. Jupyter Notebook์—์„œ ์‹คํ–‰ํ•˜๋Š” ๊ฒฝ์šฐ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์„ ๋ฉ‹์ง„ ํ‘œ ํ˜•์‹์œผ๋กœ ์ธ์‡„ํ•ฉ๋‹ˆ๋‹ค.
205+
๋ฐ์ดํ„ฐ ๊ณผํ•™์ž๋Š” ์ข…์ข… ๋ฐ์ดํ„ฐ๋ฅผ ํƒ์ƒ‰ํ•ด์•ผ ํ•˜๋ฏ€๋กœ ์‹œ๊ฐํ™”ํ•  ์ˆ˜ ์žˆ๋Š” ๊ฒƒ์ด ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค. DataFrame์ด ํด ๋•Œ ์ฒ˜์Œ ๋ช‡ ํ–‰์„ ์ธ์‡„ํ•˜์—ฌ ๋ชจ๋“  ์ž‘์—…์„ ์˜ฌ๋ฐ”๋ฅด๊ฒŒ ์ˆ˜ํ–‰ํ•˜๊ณ  ์žˆ๋Š”์ง€ ํ™•์ธํ•˜๋ ค๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋งŽ์Šต๋‹ˆ๋‹ค. ์ด๊ฒƒ์€ `df.head()`๋ฅผ ํ˜ธ์ถœํ•˜์—ฌ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. Jupyter Notebook์—์„œ ์‹คํ–‰ํ•˜๋Š” ๊ฒฝ์šฐ DataFrame์„ ๋ฉ‹์ง„ ํ‘œ ํ˜•์‹์œผ๋กœ ์ธ์‡„ํ•ฉ๋‹ˆ๋‹ค.
206206

207207
๋˜ํ•œ ์ผ๋ถ€ ์—ด์„ ์‹œ๊ฐํ™”ํ•˜๊ธฐ ์œ„ํ•ด 'plot' ํ•จ์ˆ˜๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์„ ๋ณด์•˜์Šต๋‹ˆ๋‹ค. `plot`์€ ๋งŽ์€ ์ž‘์—…์— ๋งค์šฐ ์œ ์šฉํ•˜๊ณ  `kind=` ๋งค๊ฐœ๋ณ€์ˆ˜๋ฅผ ํ†ตํ•ด ๋‹ค์–‘ํ•œ ๊ทธ๋ž˜ํ”„ ์œ ํ˜•์„ ์ง€์›ํ•˜์ง€๋งŒ, ํ•ญ์ƒ ์›์‹œ `matplotlib` ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ๋” ๋ณต์žกํ•œ ๊ฒƒ์„ ๊ทธ๋ฆด ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ๋ฐ์ดํ„ฐ ์‹œ๊ฐํ™”๋Š” ๋ณ„๋„์˜ ๊ฐ•์˜์—์„œ ์ž์„ธํžˆ ๋‹ค๋ฃฐ ๊ฒƒ์ž…๋‹ˆ๋‹ค.
208208

โ€Ž2-Working-With-Data/08-data-preparation/translations/README.ko.mdโ€Ž

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@
88

99

1010

11-
์›๋ณธ์— ๋”ฐ๋ผ ์›์‹œ ๋ฐ์ดํ„ฐ์—๋Š” ๋ถ„์„ ๋ฐ ๋ชจ๋ธ๋ง์— ๋ฌธ์ œ๋ฅผ ์ผ์œผํ‚ฌ ์ˆ˜ ์žˆ๋Š” ์ผ๋ถ€ ๋ถˆ์ผ์น˜ ์š”์†Œ๊ฐ€ ํฌํ•จ๋  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์ฆ‰, ์ด ๋ฐ์ดํ„ฐ๋Š” "๋”ํ‹ฐ"๋กœ ๋ถ„๋ฅ˜๋  ์ˆ˜ ์žˆ์œผ๋ฉฐ ์‚ฌ์ „์— ์ฒ˜๋ฆฌํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค. ์ด ๋‹จ์›์—์„œ๋Š” ๋ˆ„๋ฝ, ํ˜น์€ ๋ถ€์ •ํ™•ํ•˜๊ฑฐ๋‚˜ ๋ถˆ์™„์ „ํ•œ ๋ฐ์ดํ„ฐ์˜ ๋ฌธ์ œ๋ฅผ ์ฒ˜๋ฆฌํ•˜๊ธฐ ์œ„ํ•ด ๋ฐ์ดํ„ฐ๋ฅผ ์ •๋ฆฌํ•˜๊ณ  ๋ณ€ํ™˜ํ•˜๋Š” ๊ธฐ์ˆ ์— ์ค‘์ ์„ ๋‘ก๋‹ˆ๋‹ค. ์ด ๊ฐ•์˜์—์„œ ๋‹ค๋ฃจ๋Š” ์ฃผ์ œ๋Š” Python๊ณผ Pandas ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ํ™œ์šฉํ•˜๋ฉฐ ์ด ๋””๋ ‰ํ† ๋ฆฌ์˜ [notebook](notebook.ipynb)์—์„œ ์‹œ์—ฐ๋ฉ๋‹ˆ๋‹ค.
11+
์›๋ณธ์— ๋”ฐ๋ผ ์›์‹œ ๋ฐ์ดํ„ฐ์—๋Š” ๋ถ„์„ ๋ฐ ๋ชจ๋ธ๋ง์— ๋ฌธ์ œ๋ฅผ ์ผ์œผํ‚ฌ ์ˆ˜ ์žˆ๋Š” ์ผ๋ถ€ ๋ถˆ์ผ์น˜ ์š”์†Œ๊ฐ€ ํฌํ•จ๋  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์ฆ‰, ์ด ๋ฐ์ดํ„ฐ๋Š” "๋”ํ‹ฐ"๋กœ ๋ถ„๋ฅ˜๋  ์ˆ˜ ์žˆ์œผ๋ฉฐ ์‚ฌ์ „์— ์ฒ˜๋ฆฌํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค. ์ด ๋‹จ์›์—์„œ๋Š” ๋ˆ„๋ฝ, ํ˜น์€ ๋ถ€์ •ํ™•ํ•˜๊ฑฐ๋‚˜ ๋ถˆ์™„์ „ํ•œ ๋ฐ์ดํ„ฐ์˜ ๋ฌธ์ œ๋ฅผ ์ฒ˜๋ฆฌํ•˜๊ธฐ ์œ„ํ•ด ๋ฐ์ดํ„ฐ๋ฅผ ์ •๋ฆฌํ•˜๊ณ  ๋ณ€ํ™˜ํ•˜๋Š” ๊ธฐ์ˆ ์— ์ค‘์ ์„ ๋‘ก๋‹ˆ๋‹ค. ์ด ๊ฐ•์˜์—์„œ ๋‹ค๋ฃจ๋Š” ์ฃผ์ œ๋Š” Python๊ณผ Pandas ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ํ™œ์šฉํ•˜๋ฉฐ ์ด ๋””๋ ‰ํ† ๋ฆฌ์˜ [notebook](../notebook.ipynb)์—์„œ ์‹œ์—ฐ๋ฉ๋‹ˆ๋‹ค.
1212

1313
## ์ •์ œ ๋ฐ์ดํ„ฐ์˜ ์ค‘์š”์„ฑ
1414

@@ -31,7 +31,7 @@
3131
## DataFrame ์ •๋ณด ํƒ์ƒ‰
3232
> **ํ•™์Šต ๋ชฉํ‘œ:** ํ•˜์œ„ ์„น์…˜์ด ๋๋‚ ๋•Œ๊นŒ์ง€, pandas DataFrame์— ์ €์žฅ๋œ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ ์ •๋ณด๋ฅผ ๋Šฅ์ˆ™ํ•˜๊ฒŒ ์ฐพ์„ ์ˆ˜ ์žˆ์„ ๊ฒƒ์ž…๋‹ˆ๋‹ค.
3333
34-
๋ฐ์ดํ„ฐ๋ฅผ pandas์— ๋กœ๋“œํ•˜๋ฉด DataFrame์— ์—†์„ ๊ฐ€๋Šฅ์„ฑ์ด ๋” ๋†’์•„์ง‘๋‹ˆ๋‹ค(์ด์ „ [๋‹จ์›](https://github.com/microsoft/Data-Science-For-Beginners/tree/main/2-Working-With-Data/07-python#dataframe) ์ฐธ์กฐ. ๊ทธ๋Ÿฌ๋‚˜ DataFrame์— ์žˆ๋Š” ๋ฐ์ดํ„ฐ์…‹์— 60,000๊ฐœ์˜ ํ–‰๊ณผ 400๊ฐœ์˜ ์—ด์ด ์žˆ๋Š” ๊ฒฝ์šฐ). ๋‹คํ–‰์Šค๋Ÿฝ๊ฒŒ๋„ [pandas](https://pandas.pydata.org/)๋Š” ์ฒ˜์Œ ๋ช‡ ํ–‰๊ณผ ๋งˆ์ง€๋ง‰ ๋ช‡ ํ–‰ ์™ธ์—๋„ DataFrame์— ๋Œ€ํ•œ ์ „์ฒด ์ •๋ณด๋ฅผ ๋น ๋ฅด๊ฒŒ ๋ณผ ์ˆ˜ ์žˆ๋Š” ๋ช‡ ๊ฐ€์ง€ ํŽธ๋ฆฌํ•œ ๋„๊ตฌ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
34+
๋ฐ์ดํ„ฐ๋ฅผ pandas์— ๋กœ๋“œํ•˜๋ฉด DataFrame์— ์—†์„ ๊ฐ€๋Šฅ์„ฑ์ด ๋” ๋†’์•„์ง‘๋‹ˆ๋‹ค(์ด์ „ [๋‹จ์›](https://github.com/microsoft/Data-Science-For-Beginners/tree/main/2-Working-With-Data/07-python/translations/README.ko.md#๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„) ์ฐธ์กฐ. ๊ทธ๋Ÿฌ๋‚˜ DataFrame์— ์žˆ๋Š” ๋ฐ์ดํ„ฐ์…‹์— 60,000๊ฐœ์˜ ํ–‰๊ณผ 400๊ฐœ์˜ ์—ด์ด ์žˆ๋Š” ๊ฒฝ์šฐ). ๋‹คํ–‰์Šค๋Ÿฝ๊ฒŒ๋„ [pandas](https://pandas.pydata.org/)๋Š” ์ฒ˜์Œ ๋ช‡ ํ–‰๊ณผ ๋งˆ์ง€๋ง‰ ๋ช‡ ํ–‰ ์™ธ์—๋„ DataFrame์— ๋Œ€ํ•œ ์ „์ฒด ์ •๋ณด๋ฅผ ๋น ๋ฅด๊ฒŒ ๋ณผ ์ˆ˜ ์žˆ๋Š” ๋ช‡ ๊ฐ€์ง€ ํŽธ๋ฆฌํ•œ ๋„๊ตฌ๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
3535

3636

3737
์ด ๊ธฐ๋Šฅ์„ ์‚ดํŽด๋ณด๊ธฐ ์œ„ํ•ด Python scikit-learn ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ๊ฐ€์ ธ์˜ค๊ณ  ์ƒ์ง•์ ์ธ ๋ฐ์ดํ„ฐ์…‹์ธ **Iris ๋ฐ์ดํ„ฐ์…‹** ์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค.

โ€Ž2-Working-With-Data/translations/README.ko.mdโ€Ž

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -7,10 +7,10 @@
77

88
### ์ฃผ์ œ
99

10-
1. [๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค](05-relational-databases/README.md)
11-
2. [๋น„๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค](06-non-relational/README.md)
12-
3. [Python ํ™œ์šฉํ•˜๊ธฐ](07-python/README.md)
13-
4. [๋ฐ์ดํ„ฐ ์ค€๋น„](08-data-preparation/README.md)
10+
1. [๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค](05-relational-databases/translation/README.ko.md)
11+
2. [๋น„๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค](06-non-relational/translation/README.ko.md)
12+
3. [Python ํ™œ์šฉํ•˜๊ธฐ](07-python/translation/README.ko.md)
13+
4. [๋ฐ์ดํ„ฐ ์ค€๋น„](08-data-preparation/translation/README.ko.md)
1414

1515
### ํฌ๋ ˆ๋”ง
1616

0 commit comments

Comments
ย (0)