Filtering in Pandas by index and column value

Question

I have a dataframe similar to this one:

value_1 value_2
1       2
9       6
2       5
7       2
2       5

What I need to do is to only get the rows that have a value_1 greater than 3 for example, but also get the first and last row of the dataframe. Like this:

value_1 value_2
1       2
9       6
7       2
2       5

I know that I can filter by index with iloc and by column value with loc, but how can I use them both together?

mozway · Accepted Answer · 2022-06-20 14:02:58Z

1

Assuming a range index, you could use boolean indexing:

# is value_1 > 3?
m1 = df['value_1'].gt(3)
# is the index the first or last value?
m2 = df.index.isin([0, len(df)-1])
# keep if any condition above is True
out = df[m1|m2]

If you don't have a range index or if you have duplicates, you can use:

m2 = np.r_[True, [False]*len(df-2), True]

output:

   value_1  value_2
0        1        2
1        9        6
3        7        2
4        2        5

answered Jun 20, 2022 at 14:02

mozway

267k13 gold badges56 silver badges106 bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

Filtering in Pandas by index and column value

1 Answer 1

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Related