Setting flag column depending on whether column contains a given string











up vote
7
down vote

favorite












Can anyone see why this isn't working?



Its trying to do; if Column Name Contains the text 'Andy', then make a column called Andy and set that row = to 1



df.loc[df['Name'].str.contains(['Andy']),'Andy']=1









share|improve this question




















  • 2




    I think you need df.loc[df['Name'].str.contains('Andy'),'Andy']=1 - remove
    – jezrael
    1 hour ago












  • Thanks thats great. Out of curiosity, can we ammend this to say if contains Andy and George?( ... make a column called Andy and = 1)
    – fred.schwartz
    1 hour ago










  • df.loc[df['Name'].str.contains('Andy'|'George'),'Andy']=1
    – fred.schwartz
    1 hour ago










  • I was thinking the above but its not working
    – fred.schwartz
    1 hour ago










  • df.loc[df['Name'].str.contains('Andy|George'),'Andy']=1
    – jezrael
    1 hour ago















up vote
7
down vote

favorite












Can anyone see why this isn't working?



Its trying to do; if Column Name Contains the text 'Andy', then make a column called Andy and set that row = to 1



df.loc[df['Name'].str.contains(['Andy']),'Andy']=1









share|improve this question




















  • 2




    I think you need df.loc[df['Name'].str.contains('Andy'),'Andy']=1 - remove
    – jezrael
    1 hour ago












  • Thanks thats great. Out of curiosity, can we ammend this to say if contains Andy and George?( ... make a column called Andy and = 1)
    – fred.schwartz
    1 hour ago










  • df.loc[df['Name'].str.contains('Andy'|'George'),'Andy']=1
    – fred.schwartz
    1 hour ago










  • I was thinking the above but its not working
    – fred.schwartz
    1 hour ago










  • df.loc[df['Name'].str.contains('Andy|George'),'Andy']=1
    – jezrael
    1 hour ago













up vote
7
down vote

favorite









up vote
7
down vote

favorite











Can anyone see why this isn't working?



Its trying to do; if Column Name Contains the text 'Andy', then make a column called Andy and set that row = to 1



df.loc[df['Name'].str.contains(['Andy']),'Andy']=1









share|improve this question















Can anyone see why this isn't working?



Its trying to do; if Column Name Contains the text 'Andy', then make a column called Andy and set that row = to 1



df.loc[df['Name'].str.contains(['Andy']),'Andy']=1






python string pandas series






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited 50 mins ago









jpp

88.1k195099




88.1k195099










asked 1 hour ago









fred.schwartz

2597




2597








  • 2




    I think you need df.loc[df['Name'].str.contains('Andy'),'Andy']=1 - remove
    – jezrael
    1 hour ago












  • Thanks thats great. Out of curiosity, can we ammend this to say if contains Andy and George?( ... make a column called Andy and = 1)
    – fred.schwartz
    1 hour ago










  • df.loc[df['Name'].str.contains('Andy'|'George'),'Andy']=1
    – fred.schwartz
    1 hour ago










  • I was thinking the above but its not working
    – fred.schwartz
    1 hour ago










  • df.loc[df['Name'].str.contains('Andy|George'),'Andy']=1
    – jezrael
    1 hour ago














  • 2




    I think you need df.loc[df['Name'].str.contains('Andy'),'Andy']=1 - remove
    – jezrael
    1 hour ago












  • Thanks thats great. Out of curiosity, can we ammend this to say if contains Andy and George?( ... make a column called Andy and = 1)
    – fred.schwartz
    1 hour ago










  • df.loc[df['Name'].str.contains('Andy'|'George'),'Andy']=1
    – fred.schwartz
    1 hour ago










  • I was thinking the above but its not working
    – fred.schwartz
    1 hour ago










  • df.loc[df['Name'].str.contains('Andy|George'),'Andy']=1
    – jezrael
    1 hour ago








2




2




I think you need df.loc[df['Name'].str.contains('Andy'),'Andy']=1 - remove
– jezrael
1 hour ago






I think you need df.loc[df['Name'].str.contains('Andy'),'Andy']=1 - remove
– jezrael
1 hour ago














Thanks thats great. Out of curiosity, can we ammend this to say if contains Andy and George?( ... make a column called Andy and = 1)
– fred.schwartz
1 hour ago




Thanks thats great. Out of curiosity, can we ammend this to say if contains Andy and George?( ... make a column called Andy and = 1)
– fred.schwartz
1 hour ago












df.loc[df['Name'].str.contains('Andy'|'George'),'Andy']=1
– fred.schwartz
1 hour ago




df.loc[df['Name'].str.contains('Andy'|'George'),'Andy']=1
– fred.schwartz
1 hour ago












I was thinking the above but its not working
– fred.schwartz
1 hour ago




I was thinking the above but its not working
– fred.schwartz
1 hour ago












df.loc[df['Name'].str.contains('Andy|George'),'Andy']=1
– jezrael
1 hour ago




df.loc[df['Name'].str.contains('Andy|George'),'Andy']=1
– jezrael
1 hour ago












2 Answers
2






active

oldest

votes

















up vote
5
down vote



accepted










You have to remove list, need only string:



df.loc[df['Name'].str.contains('Andy'),'Andy'] = 1


For multiple values chain by |:



df.loc[df['Name'].str.contains('Andy|George'),'Andy'] = 1





share|improve this answer




























    up vote
    3
    down vote













    pd.Series.str.contains requires for its pat argument a "Character sequence or regular expression", not a list.



    Just use Boolean assignment and convert to int. This will set unmatched rows to 0. For example:



    # Name includes 'Andy'
    df['Andy'] = df['Name'].str.contains('Andy').astype(int)

    # Name includes 'Andy' or 'George'
    df['Andy'] = df['Name'].str.contains('Andy|George').astype(int)





    share|improve this answer





















      Your Answer






      StackExchange.ifUsing("editor", function () {
      StackExchange.using("externalEditor", function () {
      StackExchange.using("snippets", function () {
      StackExchange.snippets.init();
      });
      });
      }, "code-snippets");

      StackExchange.ready(function() {
      var channelOptions = {
      tags: "".split(" "),
      id: "1"
      };
      initTagRenderer("".split(" "), "".split(" "), channelOptions);

      StackExchange.using("externalEditor", function() {
      // Have to fire editor after snippets, if snippets enabled
      if (StackExchange.settings.snippets.snippetsEnabled) {
      StackExchange.using("snippets", function() {
      createEditor();
      });
      }
      else {
      createEditor();
      }
      });

      function createEditor() {
      StackExchange.prepareEditor({
      heartbeatType: 'answer',
      convertImagesToLinks: true,
      noModals: true,
      showLowRepImageUploadWarning: true,
      reputationToPostImages: 10,
      bindNavPrevention: true,
      postfix: "",
      imageUploader: {
      brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
      contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
      allowUrls: true
      },
      onDemand: true,
      discardSelector: ".discard-answer"
      ,immediatelyShowMarkdownHelp:true
      });


      }
      });














      draft saved

      draft discarded


















      StackExchange.ready(
      function () {
      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53758621%2fsetting-flag-column-depending-on-whether-column-contains-a-given-string%23new-answer', 'question_page');
      }
      );

      Post as a guest















      Required, but never shown

























      2 Answers
      2






      active

      oldest

      votes








      2 Answers
      2






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes








      up vote
      5
      down vote



      accepted










      You have to remove list, need only string:



      df.loc[df['Name'].str.contains('Andy'),'Andy'] = 1


      For multiple values chain by |:



      df.loc[df['Name'].str.contains('Andy|George'),'Andy'] = 1





      share|improve this answer

























        up vote
        5
        down vote



        accepted










        You have to remove list, need only string:



        df.loc[df['Name'].str.contains('Andy'),'Andy'] = 1


        For multiple values chain by |:



        df.loc[df['Name'].str.contains('Andy|George'),'Andy'] = 1





        share|improve this answer























          up vote
          5
          down vote



          accepted







          up vote
          5
          down vote



          accepted






          You have to remove list, need only string:



          df.loc[df['Name'].str.contains('Andy'),'Andy'] = 1


          For multiple values chain by |:



          df.loc[df['Name'].str.contains('Andy|George'),'Andy'] = 1





          share|improve this answer












          You have to remove list, need only string:



          df.loc[df['Name'].str.contains('Andy'),'Andy'] = 1


          For multiple values chain by |:



          df.loc[df['Name'].str.contains('Andy|George'),'Andy'] = 1






          share|improve this answer












          share|improve this answer



          share|improve this answer










          answered 1 hour ago









          jezrael

          315k22256333




          315k22256333
























              up vote
              3
              down vote













              pd.Series.str.contains requires for its pat argument a "Character sequence or regular expression", not a list.



              Just use Boolean assignment and convert to int. This will set unmatched rows to 0. For example:



              # Name includes 'Andy'
              df['Andy'] = df['Name'].str.contains('Andy').astype(int)

              # Name includes 'Andy' or 'George'
              df['Andy'] = df['Name'].str.contains('Andy|George').astype(int)





              share|improve this answer

























                up vote
                3
                down vote













                pd.Series.str.contains requires for its pat argument a "Character sequence or regular expression", not a list.



                Just use Boolean assignment and convert to int. This will set unmatched rows to 0. For example:



                # Name includes 'Andy'
                df['Andy'] = df['Name'].str.contains('Andy').astype(int)

                # Name includes 'Andy' or 'George'
                df['Andy'] = df['Name'].str.contains('Andy|George').astype(int)





                share|improve this answer























                  up vote
                  3
                  down vote










                  up vote
                  3
                  down vote









                  pd.Series.str.contains requires for its pat argument a "Character sequence or regular expression", not a list.



                  Just use Boolean assignment and convert to int. This will set unmatched rows to 0. For example:



                  # Name includes 'Andy'
                  df['Andy'] = df['Name'].str.contains('Andy').astype(int)

                  # Name includes 'Andy' or 'George'
                  df['Andy'] = df['Name'].str.contains('Andy|George').astype(int)





                  share|improve this answer












                  pd.Series.str.contains requires for its pat argument a "Character sequence or regular expression", not a list.



                  Just use Boolean assignment and convert to int. This will set unmatched rows to 0. For example:



                  # Name includes 'Andy'
                  df['Andy'] = df['Name'].str.contains('Andy').astype(int)

                  # Name includes 'Andy' or 'George'
                  df['Andy'] = df['Name'].str.contains('Andy|George').astype(int)






                  share|improve this answer












                  share|improve this answer



                  share|improve this answer










                  answered 58 mins ago









                  jpp

                  88.1k195099




                  88.1k195099






























                      draft saved

                      draft discarded




















































                      Thanks for contributing an answer to Stack Overflow!


                      • Please be sure to answer the question. Provide details and share your research!

                      But avoid



                      • Asking for help, clarification, or responding to other answers.

                      • Making statements based on opinion; back them up with references or personal experience.


                      To learn more, see our tips on writing great answers.





                      Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


                      Please pay close attention to the following guidance:


                      • Please be sure to answer the question. Provide details and share your research!

                      But avoid



                      • Asking for help, clarification, or responding to other answers.

                      • Making statements based on opinion; back them up with references or personal experience.


                      To learn more, see our tips on writing great answers.




                      draft saved


                      draft discarded














                      StackExchange.ready(
                      function () {
                      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53758621%2fsetting-flag-column-depending-on-whether-column-contains-a-given-string%23new-answer', 'question_page');
                      }
                      );

                      Post as a guest















                      Required, but never shown





















































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown

































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown







                      Popular posts from this blog

                      How to ignore python UserWarning in pytest?

                      What visual should I use to simply compare current year value vs last year in Power BI desktop

                      Héron pourpré