Python Count Leading and Trailing Whitespace













I have the following dataframe; note the leading and trailing whitespace in the strings:



import pandas as pd
data = ['foo ', ' bar', ' baz ', 'beetle juice']
df = pd.DataFrame(data)


I need to count all strings that have leading and/or trailing whitespace, ignoring whitespace in the middle of the string.



So, in the example above, the whitespace count should equal 3.



What's the best way to do this?










      python-3.x pandas dataframe






      asked Nov 22 at 20:26









      FunnyChef

          3 Answers

                    This code does what you want: it loops over the rows and counts every string whose first or last character is a space.



                    import pandas as pd

                    data = ['foo ', ' bar', ' baz ', 'beetle juice']

                    df = pd.DataFrame(data)
                    count = 0

                    for i, row in df.iterrows():
                        # row[0] is the string in column 0; test its first and last character
                        if row[0][0] == " " or row[0][-1] == " ":
                            count += 1

                    print(count)






                    edited Nov 22 at 20:42

























                    answered Nov 22 at 20:37









                    Esteban Quiros
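One caveat about the loop above: `row[0][0]` and `row[0][-1]` raise an `IndexError` on an empty string, and the comparison only recognizes the space character. Here is a minimal sketch of a safer check, assuming empty strings and tabs can show up in the data. The helper name `has_edge_whitespace` and the two extra sample values are hypothetical, not from the original post.

```python
def has_edge_whitespace(text):
    """Hypothetical helper: True if text starts or ends with whitespace."""
    # Slicing never raises on an empty string: ""[:1] is "", and "".isspace() is False.
    return text[:1].isspace() or text[-1:].isspace()


# Sample data from the question
data = ['foo ', ' bar', ' baz ', 'beetle juice']
count = sum(has_edge_whitespace(s) for s in data)
print(count)  # 3

# Assumed edge cases: an empty string and a leading tab
extended = data + ['', '\tqux']
extended_count = sum(has_edge_whitespace(s) for s in extended)
print(extended_count)  # 4
```

Because `str.isspace()` is used instead of comparing against `" "`, tabs and newlines at either end are counted as well. If only plain spaces should count, swap in `startswith(" ")` and `endswith(" ")`.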

                            With the .str accessor you can do it in one line:



                            (df[0].str.startswith(" ") | df[0].str.endswith(" ")).sum()






                            answered Nov 22 at 21:03









                            Julian Peller
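The one-liner above matches the space character only and assumes every cell is a string. As a rough sketch of a variant, assuming tabs or newlines should count too and the column may contain missing values, a regular expression can be passed to `Series.str.contains`. The extra sample values (`'\tqux'` and `None`) are my own additions for illustration.

```python
import pandas as pd

# Question data plus two assumed edge cases: a leading tab and a missing value
data = ['foo ', ' bar', ' baz ', 'beetle juice', '\tqux', None]
df = pd.DataFrame(data)

# ^\s matches a whitespace character at the start, \s$ one at the end.
# na=False treats missing values as "no match" instead of propagating NaN.
mask = df[0].str.contains(r'^\s|\s$', regex=True, na=False)
count = int(mask.sum())
print(count)  # 4
```

Summing the boolean mask counts the `True` rows, which is the same idea as the `startswith`/`endswith` version, just with a broader definition of whitespace.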

                                    Here is a solution using defaultdict from the collections module:



                                    from collections import defaultdict as df

                                    data = ['foo ', ' bar', ' baz ', 'beetle juice']
                                    result = df(int)

                                    for elm in data:
                                        # elif: a string with whitespace on both ends is counted once, under 'leading'
                                        if elm.startswith(' '):
                                            result['leading'] += 1
                                        elif elm.endswith(' '):
                                            result['trailing'] += 1

                                    print(result)
                                    print(dict(result))
                                    count = sum(k for k in result.values())
                                    print(count)


                                    Output:



                                    defaultdict(<class 'int'>, {'trailing': 1, 'leading': 2})
                                    {'trailing': 1, 'leading': 2}
                                    3






                                    answered Nov 22 at 20:45









                                    Chiheb Nexus
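In the defaultdict answer above, a string like `' baz '` ends up under `'leading'` because of the `elif`, so the per-key numbers are not a true leading/trailing breakdown even though the total is right. If the breakdown matters, something like the following sketch might be clearer. It uses `collections.Counter` instead of `defaultdict`, and the `classify` helper and the `'both'`/`'none'` labels are hypothetical names I picked for illustration.

```python
from collections import Counter


def classify(text):
    """Hypothetical helper: label where the edge whitespace sits."""
    leading = text[:1].isspace()
    trailing = text[-1:].isspace()
    if leading and trailing:
        return 'both'
    if leading:
        return 'leading'
    if trailing:
        return 'trailing'
    return 'none'


data = ['foo ', ' bar', ' baz ', 'beetle juice']
breakdown = Counter(classify(s) for s in data)

# Every label except 'none' means the string has edge whitespace
count = sum(n for label, n in breakdown.items() if label != 'none')

print(dict(breakdown))
print(count)  # 3
```

Each string gets exactly one label, so the total still counts strings rather than whitespace occurrences, which is what the question asks for.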
