Pandas - Sorting By Column

Question

I have a pandas data frame known as "df":

I am splitting it up into two frames, and then trying to merge back together:

df_1 = df[df['x']==1]  
df_2 = df[df['x']!=1]

My goal is to get it back in the same order, but when I concat, I am getting the following:

frames = [df_1, df_2]
solution = pd.concat(frames)
solution.sort_values(by='x', inplace=False)

  x y
1 2 4
2 3 8
0 1 2

The problem is I need the 'x' values to go back into the new dataframe in the same order that I extracted. Is there a solution?

piRSquared · Accepted Answer · 2016-12-16 21:52:52Z

3

use .loc to specify the order you want. Choose the original index.

solution.loc[df.index]

Or, if you trust the index values in each component, then

solution.sort_index()

setup

df = pd.DataFrame([[1, 2], [2, 4], [3, 8]], columns=['x', 'y'])

df_1 = df[df['x']==1]  
df_2 = df[df['x']!=1] 

frames = [df_1, df_2]
solution = pd.concat(frames)

edited Dec 16, 2016 at 21:52

answered Dec 16, 2016 at 21:45

piRSquared

296k68 gold badges509 silver badges654 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Kelvin · Accepted Answer · 2016-12-16 21:45:49Z

0

Try this:

In [14]: pd.concat([df_1, df_2.sort_values('y')])
Out[14]:
   x  y
0  1  2
1  2  4
2  3  8

answered Dec 16, 2016 at 21:45

Kelvin

1,3672 gold badges13 silver badges23 bronze badges

Comments

Vaishali · Accepted Answer · 2016-12-16 21:52:09Z

0

When you are sorting the solution using solution.sort_values(by='x', inplace=False) you need to specify inplace = True. That would take care of it.

answered Dec 16, 2016 at 21:52

Vaishali

38.5k5 gold badges62 silver badges88 bronze badges

Comments

Mike Müller · Accepted Answer · 2016-12-16 21:59:35Z

0

Based on these assumptions on df:

Columns x and y are note necessarily ordered.
The index is ordered.

Just order your result by index:

df = pd.DataFrame({'x': [1, 2, 3], 'y': [2, 4, 8]})
df_1 = df[df['x']==1]  
df_2 = df[df['x']!=1] 
frames = [df_2, df_1]
solution = pd.concat(frames).sort_index()

Now, solution looks like this:

answered Dec 16, 2016 at 21:59

Mike Müller

86k21 gold badges174 silver badges165 bronze badges

Collectives™ on Stack Overflow

Pandas - Sorting By Column

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related