Pandas 데이터 조회, 정렬, 조건필터 - loc, iloc, where, isin

Posted Aug 11, 2023 Updated Oct 9, 2025

By figure.2

2 min read

데이터 분석

index 기준 정렬
- df.sort_index() : 오름차순 정렬
- df.sort_index(ascending=False) : 내림차순 정렬
value에 대한 정렬
- df.sort_values(by='age').head() : 오름차순 정렬
- df.sort_values(by='age', ascending=False).head() : 내림차순 정렬
- df.sort_values(by=['fare', 'age']).head() : 2개 이상 컬럼을 기준으로 정렬
- df.sort_values(by=['fare', 'age'], ascending=[False, True]).head() : 각 컬럼 지정 정렬

indexing
- loc[행인덱스,열인덱스]
- df.loc[5, 'class']
- df.loc[2:5, ['age', 'fare', 'who']]
slicing
- df.loc[2:5], ['age', 'fare', 'who']]
- df.loc[2:5, 'class':'deck'].head()
조건필터
- condition = df['who'] == 'man' : 조건 condition 대입
- df[condition].head() : Case1
- df.loc[condition].head() : Case2 -> 이걸 추천
- df.loc[condition, 'age'] = 10
다중조건
- & , 연산자 사용
- condition1 = (df['fare'] > 30) 조건1
- condition2 = (df['who'] == 'woman') 조건2
- df.loc[condition1 & condition2]

pandas의 where과 numpy의 where은 다름 ```python DataFrame.where(cond, other=nan, inplace=False, axis=None, level=None, errors=’raise’, try_cast=False)
cond: True/False로 판단될 수 있는 식
other: condition을 만족하지 못하는 요소에 할당 할 값 ```
df['fare'].where(df['fare'] < 20, 0).tail(10)

특정 값의 포함 여부는 isin 함수를 통해 비교
sample['name'].isin(['kim', 'lee'])
조건 필터링
- condition = sample['name'].isin(['kim', 'lee'])
- sample.loc[condition]

This post is licensed under CC BY 4.0 by the author.