Subset of a Pandas Dataframe consisting of rows with specific column values












1















I'm having a problem with a single line of my code.
Here is what I'd like to achieve:




  • reading_now is a string consisting of 3 characters

  • df2 is a data frame that is a subset of df1

  • I'd like df2 to consist of rows from df1 where the first three characters of the value in column "Code" is equal to "reading_now"


I tried using the following two lines with no success:



*df2 = df1.loc[(df1['Code'])[0:3] == reading_now]*
*df2 = df1[(str(df1.Code)[0:3] == reading_now)]*









share|improve this question


















  • 1





    Can you provide a example of the string and the dataframe.

    – MisterMonk
    Nov 26 '18 at 21:19
















1















I'm having a problem with a single line of my code.
Here is what I'd like to achieve:




  • reading_now is a string consisting of 3 characters

  • df2 is a data frame that is a subset of df1

  • I'd like df2 to consist of rows from df1 where the first three characters of the value in column "Code" is equal to "reading_now"


I tried using the following two lines with no success:



*df2 = df1.loc[(df1['Code'])[0:3] == reading_now]*
*df2 = df1[(str(df1.Code)[0:3] == reading_now)]*









share|improve this question


















  • 1





    Can you provide a example of the string and the dataframe.

    – MisterMonk
    Nov 26 '18 at 21:19














1












1








1








I'm having a problem with a single line of my code.
Here is what I'd like to achieve:




  • reading_now is a string consisting of 3 characters

  • df2 is a data frame that is a subset of df1

  • I'd like df2 to consist of rows from df1 where the first three characters of the value in column "Code" is equal to "reading_now"


I tried using the following two lines with no success:



*df2 = df1.loc[(df1['Code'])[0:3] == reading_now]*
*df2 = df1[(str(df1.Code)[0:3] == reading_now)]*









share|improve this question














I'm having a problem with a single line of my code.
Here is what I'd like to achieve:




  • reading_now is a string consisting of 3 characters

  • df2 is a data frame that is a subset of df1

  • I'd like df2 to consist of rows from df1 where the first three characters of the value in column "Code" is equal to "reading_now"


I tried using the following two lines with no success:



*df2 = df1.loc[(df1['Code'])[0:3] == reading_now]*
*df2 = df1[(str(df1.Code)[0:3] == reading_now)]*






python string pandas dataframe






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Nov 26 '18 at 21:17









segilmezsegilmez

102




102








  • 1





    Can you provide a example of the string and the dataframe.

    – MisterMonk
    Nov 26 '18 at 21:19














  • 1





    Can you provide a example of the string and the dataframe.

    – MisterMonk
    Nov 26 '18 at 21:19








1




1





Can you provide a example of the string and the dataframe.

– MisterMonk
Nov 26 '18 at 21:19





Can you provide a example of the string and the dataframe.

– MisterMonk
Nov 26 '18 at 21:19












2 Answers
2






active

oldest

votes


















0














Looks like you were really close with your 2nd attempt.



You could solve this a couple of different ways.



reading_now = 'AAA'
df1 = pd.DataFrame([{'Code': 'AAA'}, {'Code': 'BBB'}, {'Code': 'CCC'}])


solution:



df2 = df1[df1['Code'].str.startswith(reading_now)]



or



df2 = df1[df1['Code'][0:3] == reading_now]



The df2 dataframe will contain the row that starts with the reading_now string.






share|improve this answer































    0














    You could use



    df2 = df1[df1['Code'].str[0:3] == reading_now]


    For example:



    data = ['abcd', 'cbdz', 'abcz', 'bdaz']

    df1 = pd.DataFrame(data, columns=['Code'])
    df2 = df1[df1['Code'].str[0:3] == 'abc']


    df2 will result in a dataframe with 'Code' column containing 'abcd' and 'abcz'






    share|improve this answer























      Your Answer






      StackExchange.ifUsing("editor", function () {
      StackExchange.using("externalEditor", function () {
      StackExchange.using("snippets", function () {
      StackExchange.snippets.init();
      });
      });
      }, "code-snippets");

      StackExchange.ready(function() {
      var channelOptions = {
      tags: "".split(" "),
      id: "1"
      };
      initTagRenderer("".split(" "), "".split(" "), channelOptions);

      StackExchange.using("externalEditor", function() {
      // Have to fire editor after snippets, if snippets enabled
      if (StackExchange.settings.snippets.snippetsEnabled) {
      StackExchange.using("snippets", function() {
      createEditor();
      });
      }
      else {
      createEditor();
      }
      });

      function createEditor() {
      StackExchange.prepareEditor({
      heartbeatType: 'answer',
      autoActivateHeartbeat: false,
      convertImagesToLinks: true,
      noModals: true,
      showLowRepImageUploadWarning: true,
      reputationToPostImages: 10,
      bindNavPrevention: true,
      postfix: "",
      imageUploader: {
      brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
      contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
      allowUrls: true
      },
      onDemand: true,
      discardSelector: ".discard-answer"
      ,immediatelyShowMarkdownHelp:true
      });


      }
      });














      draft saved

      draft discarded


















      StackExchange.ready(
      function () {
      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53489202%2fsubset-of-a-pandas-dataframe-consisting-of-rows-with-specific-column-values%23new-answer', 'question_page');
      }
      );

      Post as a guest















      Required, but never shown

























      2 Answers
      2






      active

      oldest

      votes








      2 Answers
      2






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes









      0














      Looks like you were really close with your 2nd attempt.



      You could solve this a couple of different ways.



      reading_now = 'AAA'
      df1 = pd.DataFrame([{'Code': 'AAA'}, {'Code': 'BBB'}, {'Code': 'CCC'}])


      solution:



      df2 = df1[df1['Code'].str.startswith(reading_now)]



      or



      df2 = df1[df1['Code'][0:3] == reading_now]



      The df2 dataframe will contain the row that starts with the reading_now string.






      share|improve this answer




























        0














        Looks like you were really close with your 2nd attempt.



        You could solve this a couple of different ways.



        reading_now = 'AAA'
        df1 = pd.DataFrame([{'Code': 'AAA'}, {'Code': 'BBB'}, {'Code': 'CCC'}])


        solution:



        df2 = df1[df1['Code'].str.startswith(reading_now)]



        or



        df2 = df1[df1['Code'][0:3] == reading_now]



        The df2 dataframe will contain the row that starts with the reading_now string.






        share|improve this answer


























          0












          0








          0







          Looks like you were really close with your 2nd attempt.



          You could solve this a couple of different ways.



          reading_now = 'AAA'
          df1 = pd.DataFrame([{'Code': 'AAA'}, {'Code': 'BBB'}, {'Code': 'CCC'}])


          solution:



          df2 = df1[df1['Code'].str.startswith(reading_now)]



          or



          df2 = df1[df1['Code'][0:3] == reading_now]



          The df2 dataframe will contain the row that starts with the reading_now string.






          share|improve this answer













          Looks like you were really close with your 2nd attempt.



          You could solve this a couple of different ways.



          reading_now = 'AAA'
          df1 = pd.DataFrame([{'Code': 'AAA'}, {'Code': 'BBB'}, {'Code': 'CCC'}])


          solution:



          df2 = df1[df1['Code'].str.startswith(reading_now)]



          or



          df2 = df1[df1['Code'][0:3] == reading_now]



          The df2 dataframe will contain the row that starts with the reading_now string.







          share|improve this answer












          share|improve this answer



          share|improve this answer










          answered Nov 26 '18 at 21:33









          brandonbanksbrandonbanks

          92111




          92111

























              0














              You could use



              df2 = df1[df1['Code'].str[0:3] == reading_now]


              For example:



              data = ['abcd', 'cbdz', 'abcz', 'bdaz']

              df1 = pd.DataFrame(data, columns=['Code'])
              df2 = df1[df1['Code'].str[0:3] == 'abc']


              df2 will result in a dataframe with 'Code' column containing 'abcd' and 'abcz'






              share|improve this answer




























                0














                You could use



                df2 = df1[df1['Code'].str[0:3] == reading_now]


                For example:



                data = ['abcd', 'cbdz', 'abcz', 'bdaz']

                df1 = pd.DataFrame(data, columns=['Code'])
                df2 = df1[df1['Code'].str[0:3] == 'abc']


                df2 will result in a dataframe with 'Code' column containing 'abcd' and 'abcz'






                share|improve this answer


























                  0












                  0








                  0







                  You could use



                  df2 = df1[df1['Code'].str[0:3] == reading_now]


                  For example:



                  data = ['abcd', 'cbdz', 'abcz', 'bdaz']

                  df1 = pd.DataFrame(data, columns=['Code'])
                  df2 = df1[df1['Code'].str[0:3] == 'abc']


                  df2 will result in a dataframe with 'Code' column containing 'abcd' and 'abcz'






                  share|improve this answer













                  You could use



                  df2 = df1[df1['Code'].str[0:3] == reading_now]


                  For example:



                  data = ['abcd', 'cbdz', 'abcz', 'bdaz']

                  df1 = pd.DataFrame(data, columns=['Code'])
                  df2 = df1[df1['Code'].str[0:3] == 'abc']


                  df2 will result in a dataframe with 'Code' column containing 'abcd' and 'abcz'







                  share|improve this answer












                  share|improve this answer



                  share|improve this answer










                  answered Nov 26 '18 at 21:26









                  HugoHondaHugoHonda

                  285




                  285






























                      draft saved

                      draft discarded




















































                      Thanks for contributing an answer to Stack Overflow!


                      • Please be sure to answer the question. Provide details and share your research!

                      But avoid



                      • Asking for help, clarification, or responding to other answers.

                      • Making statements based on opinion; back them up with references or personal experience.


                      To learn more, see our tips on writing great answers.




                      draft saved


                      draft discarded














                      StackExchange.ready(
                      function () {
                      StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53489202%2fsubset-of-a-pandas-dataframe-consisting-of-rows-with-specific-column-values%23new-answer', 'question_page');
                      }
                      );

                      Post as a guest















                      Required, but never shown





















































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown

































                      Required, but never shown














                      Required, but never shown












                      Required, but never shown







                      Required, but never shown







                      Popular posts from this blog

                      A CLEAN and SIMPLE way to add appendices to Table of Contents and bookmarks

                      Calculate evaluation metrics using cross_val_predict sklearn

                      Insert data from modal to MySQL (multiple modal on website)