check if string contains sub string from the same column in pandas dataframe

Question

Hi I have the following dataframe:

> df1
  col1  
0 donald     
1 mike
2 donald trump
3 trump
4 mike pence
5 pence
6 jarred

i want to check for the strings that contain sub string from this column and create a new column that holds the bigger strings if the condition is full filled

something like this:

> df1
  col1           col2
0 donald        donald trump
1 mike          mike pence
2 donald trump  donald trump
3 trump         donald trump
4 mike pence    mike pence
5 pence         mike pence
6 jarred        jarred

Thanks in advance

zipa · Accepted Answer · 2018-03-27 13:21:07Z

3

This should do it:

df['Col2'] = df['Col1'].apply(lambda x: max([i for i in df['Col1'] if x in i], key=len))

answered Mar 27, 2018 at 13:21

zipa

28k6 gold badges45 silver badges62 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

jezrael Over a year ago

And dont forget add list comprehension alternative :) +1

zipa Over a year ago

@jezrael I can only think of filter at the moment.

Collectives™ on Stack Overflow

check if string contains sub string from the same column in pandas dataframe

1 Answer 1

2 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

2 Comments

Your Answer

Sign up or log in

Post as a guest

Related