How to use Pandas to get the count of every combination inclusiveHow to get all possible combinations of a...

How to use Pandas to get the count of every combination inclusive

Why don't electron-positron collisions release infinite energy?

Shell script can be run only with sh command

I’m planning on buying a laser printer but concerned about the life cycle of toner in the machine

Is Social Media Science Fiction?

Closed subgroups of abelian groups

Are tax years 2016 & 2017 back taxes deductible for tax year 2018?

Do airline pilots ever risk not hearing communication directed to them specifically, from traffic controllers?

Why CLRS example on residual networks does not follows its formula?

The use of multiple foreign keys on same column in SQL Server

New order #4: World

When blogging recipes, how can I support both readers who want the narrative/journey and ones who want the printer-friendly recipe?

What is the offset in a seaplane's hull?

Circuitry of TV splitters

Are white and non-white police officers equally likely to kill black suspects?

Schwarzchild Radius of the Universe

Why are 150k or 200k jobs considered good when there are 300k+ births a month?

Can an x86 CPU running in real mode be considered to be basically an 8086 CPU?

How do we improve the relationship with a client software team that performs poorly and is becoming less collaborative?

Accidentally leaked the solution to an assignment, what to do now? (I'm the prof)

Copycat chess is back

Can you lasso down a wizard who is using the Levitate spell?

How can I fix this gap between bookcases I made?

Is there a familial term for apples and pears?



How to use Pandas to get the count of every combination inclusive


How to get all possible combinations of a list’s elements?How to get the ASCII value of a character?How to get the current time in PythonHow to get line count cheaply in Python?How do I get the number of elements in a list in Python?How can I count the occurrences of a list item?How to drop rows of Pandas DataFrame whose value in certain columns is NaNHow do I get the row count of a Pandas dataframe?How to iterate over rows in a DataFrame in Pandas?Get list from pandas DataFrame column headersHow to deal with SettingWithCopyWarning in Pandas?






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty{ height:90px;width:728px;box-sizing:border-box;
}







8















I am trying to figure out what combination of clothing customers are buying together. I can figure out the exact combination, but the problem I can't figure out is the count that includes the combination + others.



For example, I have:



Cust_num  Item    Rev
Cust1 Shirt1 $40
Cust1 Shirt2 $40
Cust1 Shorts1 $40
Cust2 Shirt1 $40
Cust2 Shorts1 $40


This should result in:



Combo                  Count
Shirt1,Shirt2,Shorts1 1
Shirt1,Shorts1 2


The best I can do is unique combinations:



Combo                 Count
Shirt1,Shirt2,Shorts1 1
Shirt1,Shorts1 1


I tried:



df = df.pivot(index='Cust_num',columns='Item').sum()
df[df.notnull()] = "x"
df = df.loc[:,"Shirt1":].replace("x", pd.Series(df.columns, df.columns))
col = df.stack().groupby(level=0).apply(','.join)
df2 = pd.DataFrame(col)
df2.groupby([0]).size().reset_index(name='counts')


But that is just the unique counts.










share|improve this question







New contributor




Taylor Smith is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
















  • 1





    I feel like this is one sort of problem pandas would not be suitable for.

    – coldspeed
    59 mins ago


















8















I am trying to figure out what combination of clothing customers are buying together. I can figure out the exact combination, but the problem I can't figure out is the count that includes the combination + others.



For example, I have:



Cust_num  Item    Rev
Cust1 Shirt1 $40
Cust1 Shirt2 $40
Cust1 Shorts1 $40
Cust2 Shirt1 $40
Cust2 Shorts1 $40


This should result in:



Combo                  Count
Shirt1,Shirt2,Shorts1 1
Shirt1,Shorts1 2


The best I can do is unique combinations:



Combo                 Count
Shirt1,Shirt2,Shorts1 1
Shirt1,Shorts1 1


I tried:



df = df.pivot(index='Cust_num',columns='Item').sum()
df[df.notnull()] = "x"
df = df.loc[:,"Shirt1":].replace("x", pd.Series(df.columns, df.columns))
col = df.stack().groupby(level=0).apply(','.join)
df2 = pd.DataFrame(col)
df2.groupby([0]).size().reset_index(name='counts')


But that is just the unique counts.










share|improve this question







New contributor




Taylor Smith is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
















  • 1





    I feel like this is one sort of problem pandas would not be suitable for.

    – coldspeed
    59 mins ago














8












8








8








I am trying to figure out what combination of clothing customers are buying together. I can figure out the exact combination, but the problem I can't figure out is the count that includes the combination + others.



For example, I have:



Cust_num  Item    Rev
Cust1 Shirt1 $40
Cust1 Shirt2 $40
Cust1 Shorts1 $40
Cust2 Shirt1 $40
Cust2 Shorts1 $40


This should result in:



Combo                  Count
Shirt1,Shirt2,Shorts1 1
Shirt1,Shorts1 2


The best I can do is unique combinations:



Combo                 Count
Shirt1,Shirt2,Shorts1 1
Shirt1,Shorts1 1


I tried:



df = df.pivot(index='Cust_num',columns='Item').sum()
df[df.notnull()] = "x"
df = df.loc[:,"Shirt1":].replace("x", pd.Series(df.columns, df.columns))
col = df.stack().groupby(level=0).apply(','.join)
df2 = pd.DataFrame(col)
df2.groupby([0]).size().reset_index(name='counts')


But that is just the unique counts.










share|improve this question







New contributor




Taylor Smith is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.












I am trying to figure out what combination of clothing customers are buying together. I can figure out the exact combination, but the problem I can't figure out is the count that includes the combination + others.



For example, I have:



Cust_num  Item    Rev
Cust1 Shirt1 $40
Cust1 Shirt2 $40
Cust1 Shorts1 $40
Cust2 Shirt1 $40
Cust2 Shorts1 $40


This should result in:



Combo                  Count
Shirt1,Shirt2,Shorts1 1
Shirt1,Shorts1 2


The best I can do is unique combinations:



Combo                 Count
Shirt1,Shirt2,Shorts1 1
Shirt1,Shorts1 1


I tried:



df = df.pivot(index='Cust_num',columns='Item').sum()
df[df.notnull()] = "x"
df = df.loc[:,"Shirt1":].replace("x", pd.Series(df.columns, df.columns))
col = df.stack().groupby(level=0).apply(','.join)
df2 = pd.DataFrame(col)
df2.groupby([0]).size().reset_index(name='counts')


But that is just the unique counts.







python pandas






share|improve this question







New contributor




Taylor Smith is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.











share|improve this question







New contributor




Taylor Smith is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









share|improve this question




share|improve this question






New contributor




Taylor Smith is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.









asked 1 hour ago









Taylor SmithTaylor Smith

412




412




New contributor




Taylor Smith is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.





New contributor





Taylor Smith is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.






Taylor Smith is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.








  • 1





    I feel like this is one sort of problem pandas would not be suitable for.

    – coldspeed
    59 mins ago














  • 1





    I feel like this is one sort of problem pandas would not be suitable for.

    – coldspeed
    59 mins ago








1




1





I feel like this is one sort of problem pandas would not be suitable for.

– coldspeed
59 mins ago





I feel like this is one sort of problem pandas would not be suitable for.

– coldspeed
59 mins ago












4 Answers
4






active

oldest

votes


















3














Using pandas.DataFrame.groupby:



grouped_item = df.groupby('Cust_num')['Item']
subsets = grouped_item.apply(lambda x: set(x)).tolist()
Count = [sum(s2.issubset(s1) for s1 in subsets) for s2 in subsets]
combo = grouped_item.apply(lambda x:','.join(x))
combo = combo.reset_index()
combo['Count']=Count


Output:



  Cust_num                   Item  Count
0 Cust1 Shirt1,Shirt2,Shorts1 1
1 Cust2 Shirt1,Shorts1 2





share|improve this answer































    0














    I think you need to create a combination of items first.



    How to get all possible combinations of a list’s elements?



    I used the function from Dan H's answer.



    from itertools import chain, combinations
    def all_subsets(ss):
    return chain(*map(lambda x: combinations(ss, x), range(0, len(ss)+1)))


    Then get the unique items.



    uq_items = df.Item.unique()

    list(all_subsets(uq_items))

    [(),
    ('Shirt1',),
    ('Shirt2',),
    ('Shorts1',),
    ('Shirt1', 'Shirt2'),
    ('Shirt1', 'Shorts1'),
    ('Shirt2', 'Shorts1'),
    ('Shirt1', 'Shirt2', 'Shorts1')]


    And use groupby each customer to get their items combination.



    ls = []

    for _, d in df.groupby('Cust_num', group_keys=False):
    # Get all possible subset of items
    pi = np.array(list(all_subsets(d.Item)))

    # Fliter only > 1
    ls.append(pi[[len(l) > 1 for l in pi]])


    Then convert to Series and use value_counts().



    pd.Series(np.concatenate(ls)).value_counts()

    (Shirt1, Shorts1) 2
    (Shirt2, Shorts1) 1
    (Shirt1, Shirt2, Shorts1) 1
    (Shirt1, Shirt2) 1





    share|improve this answer































      0














      Late answer, but you can use:



      df = df.groupby(['Cust_num'], as_index=False).agg(','.join).drop(columns=['Rev']).set_index(['Item']).rename_axis("combo").rename(columns={"Cust_num": "Count"})
      df['Count'] = df['Count'].str.replace(r'Cust','')




      combo                   Count                 
      Shirt1,Shirt2,Shorts1 1
      Shirt1,Shorts1 2





      share|improve this answer

































        -1














        My version which I believe is easier to understand



        new_df = df.groupby("Cust_num").agg({lambda x: ''.join(x.unique())})

        new_df ['count'] = range(1, len(new_df ) + 1)


        Output:



                                    Item      Rev count
        <lambda> <lambda>
        Cust_num
        Cust1 Shirt1 Shirt2 Shorts1 $40 1
        Cust2 Shirt1 Shorts1 $40 2


        Since you do not need the Rev column, you can drop it:



        new_df = new_df = new_df.drop(columns=["Rev"]).reset_index()

        new_df


        Output:



          Cust_num                    Item count
        <lambda>
        0 Cust1 Shirt1 Shirt2 Shorts1 1
        1 Cust2 Shirt1 Shorts1 2





        share|improve this answer


























        • How is the count in your answer the count of inclusive combination of df['Item']? Making new column with range is not an answer.

          – Chris
          7 mins ago














        Your Answer






        StackExchange.ifUsing("editor", function () {
        StackExchange.using("externalEditor", function () {
        StackExchange.using("snippets", function () {
        StackExchange.snippets.init();
        });
        });
        }, "code-snippets");

        StackExchange.ready(function() {
        var channelOptions = {
        tags: "".split(" "),
        id: "1"
        };
        initTagRenderer("".split(" "), "".split(" "), channelOptions);

        StackExchange.using("externalEditor", function() {
        // Have to fire editor after snippets, if snippets enabled
        if (StackExchange.settings.snippets.snippetsEnabled) {
        StackExchange.using("snippets", function() {
        createEditor();
        });
        }
        else {
        createEditor();
        }
        });

        function createEditor() {
        StackExchange.prepareEditor({
        heartbeatType: 'answer',
        autoActivateHeartbeat: false,
        convertImagesToLinks: true,
        noModals: true,
        showLowRepImageUploadWarning: true,
        reputationToPostImages: 10,
        bindNavPrevention: true,
        postfix: "",
        imageUploader: {
        brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
        contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
        allowUrls: true
        },
        onDemand: true,
        discardSelector: ".discard-answer"
        ,immediatelyShowMarkdownHelp:true
        });


        }
        });






        Taylor Smith is a new contributor. Be nice, and check out our Code of Conduct.










        draft saved

        draft discarded


















        StackExchange.ready(
        function () {
        StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55565916%2fhow-to-use-pandas-to-get-the-count-of-every-combination-inclusive%23new-answer', 'question_page');
        }
        );

        Post as a guest















        Required, but never shown

























        4 Answers
        4






        active

        oldest

        votes








        4 Answers
        4






        active

        oldest

        votes









        active

        oldest

        votes






        active

        oldest

        votes









        3














        Using pandas.DataFrame.groupby:



        grouped_item = df.groupby('Cust_num')['Item']
        subsets = grouped_item.apply(lambda x: set(x)).tolist()
        Count = [sum(s2.issubset(s1) for s1 in subsets) for s2 in subsets]
        combo = grouped_item.apply(lambda x:','.join(x))
        combo = combo.reset_index()
        combo['Count']=Count


        Output:



          Cust_num                   Item  Count
        0 Cust1 Shirt1,Shirt2,Shorts1 1
        1 Cust2 Shirt1,Shorts1 2





        share|improve this answer




























          3














          Using pandas.DataFrame.groupby:



          grouped_item = df.groupby('Cust_num')['Item']
          subsets = grouped_item.apply(lambda x: set(x)).tolist()
          Count = [sum(s2.issubset(s1) for s1 in subsets) for s2 in subsets]
          combo = grouped_item.apply(lambda x:','.join(x))
          combo = combo.reset_index()
          combo['Count']=Count


          Output:



            Cust_num                   Item  Count
          0 Cust1 Shirt1,Shirt2,Shorts1 1
          1 Cust2 Shirt1,Shorts1 2





          share|improve this answer


























            3












            3








            3







            Using pandas.DataFrame.groupby:



            grouped_item = df.groupby('Cust_num')['Item']
            subsets = grouped_item.apply(lambda x: set(x)).tolist()
            Count = [sum(s2.issubset(s1) for s1 in subsets) for s2 in subsets]
            combo = grouped_item.apply(lambda x:','.join(x))
            combo = combo.reset_index()
            combo['Count']=Count


            Output:



              Cust_num                   Item  Count
            0 Cust1 Shirt1,Shirt2,Shorts1 1
            1 Cust2 Shirt1,Shorts1 2





            share|improve this answer













            Using pandas.DataFrame.groupby:



            grouped_item = df.groupby('Cust_num')['Item']
            subsets = grouped_item.apply(lambda x: set(x)).tolist()
            Count = [sum(s2.issubset(s1) for s1 in subsets) for s2 in subsets]
            combo = grouped_item.apply(lambda x:','.join(x))
            combo = combo.reset_index()
            combo['Count']=Count


            Output:



              Cust_num                   Item  Count
            0 Cust1 Shirt1,Shirt2,Shorts1 1
            1 Cust2 Shirt1,Shorts1 2






            share|improve this answer












            share|improve this answer



            share|improve this answer










            answered 1 hour ago









            ChrisChris

            3,700422




            3,700422

























                0














                I think you need to create a combination of items first.



                How to get all possible combinations of a list’s elements?



                I used the function from Dan H's answer.



                from itertools import chain, combinations
                def all_subsets(ss):
                return chain(*map(lambda x: combinations(ss, x), range(0, len(ss)+1)))


                Then get the unique items.



                uq_items = df.Item.unique()

                list(all_subsets(uq_items))

                [(),
                ('Shirt1',),
                ('Shirt2',),
                ('Shorts1',),
                ('Shirt1', 'Shirt2'),
                ('Shirt1', 'Shorts1'),
                ('Shirt2', 'Shorts1'),
                ('Shirt1', 'Shirt2', 'Shorts1')]


                And use groupby each customer to get their items combination.



                ls = []

                for _, d in df.groupby('Cust_num', group_keys=False):
                # Get all possible subset of items
                pi = np.array(list(all_subsets(d.Item)))

                # Fliter only > 1
                ls.append(pi[[len(l) > 1 for l in pi]])


                Then convert to Series and use value_counts().



                pd.Series(np.concatenate(ls)).value_counts()

                (Shirt1, Shorts1) 2
                (Shirt2, Shorts1) 1
                (Shirt1, Shirt2, Shorts1) 1
                (Shirt1, Shirt2) 1





                share|improve this answer




























                  0














                  I think you need to create a combination of items first.



                  How to get all possible combinations of a list’s elements?



                  I used the function from Dan H's answer.



                  from itertools import chain, combinations
                  def all_subsets(ss):
                  return chain(*map(lambda x: combinations(ss, x), range(0, len(ss)+1)))


                  Then get the unique items.



                  uq_items = df.Item.unique()

                  list(all_subsets(uq_items))

                  [(),
                  ('Shirt1',),
                  ('Shirt2',),
                  ('Shorts1',),
                  ('Shirt1', 'Shirt2'),
                  ('Shirt1', 'Shorts1'),
                  ('Shirt2', 'Shorts1'),
                  ('Shirt1', 'Shirt2', 'Shorts1')]


                  And use groupby each customer to get their items combination.



                  ls = []

                  for _, d in df.groupby('Cust_num', group_keys=False):
                  # Get all possible subset of items
                  pi = np.array(list(all_subsets(d.Item)))

                  # Fliter only > 1
                  ls.append(pi[[len(l) > 1 for l in pi]])


                  Then convert to Series and use value_counts().



                  pd.Series(np.concatenate(ls)).value_counts()

                  (Shirt1, Shorts1) 2
                  (Shirt2, Shorts1) 1
                  (Shirt1, Shirt2, Shorts1) 1
                  (Shirt1, Shirt2) 1





                  share|improve this answer


























                    0












                    0








                    0







                    I think you need to create a combination of items first.



                    How to get all possible combinations of a list’s elements?



                    I used the function from Dan H's answer.



                    from itertools import chain, combinations
                    def all_subsets(ss):
                    return chain(*map(lambda x: combinations(ss, x), range(0, len(ss)+1)))


                    Then get the unique items.



                    uq_items = df.Item.unique()

                    list(all_subsets(uq_items))

                    [(),
                    ('Shirt1',),
                    ('Shirt2',),
                    ('Shorts1',),
                    ('Shirt1', 'Shirt2'),
                    ('Shirt1', 'Shorts1'),
                    ('Shirt2', 'Shorts1'),
                    ('Shirt1', 'Shirt2', 'Shorts1')]


                    And use groupby each customer to get their items combination.



                    ls = []

                    for _, d in df.groupby('Cust_num', group_keys=False):
                    # Get all possible subset of items
                    pi = np.array(list(all_subsets(d.Item)))

                    # Fliter only > 1
                    ls.append(pi[[len(l) > 1 for l in pi]])


                    Then convert to Series and use value_counts().



                    pd.Series(np.concatenate(ls)).value_counts()

                    (Shirt1, Shorts1) 2
                    (Shirt2, Shorts1) 1
                    (Shirt1, Shirt2, Shorts1) 1
                    (Shirt1, Shirt2) 1





                    share|improve this answer













                    I think you need to create a combination of items first.



                    How to get all possible combinations of a list’s elements?



                    I used the function from Dan H's answer.



                    from itertools import chain, combinations
                    def all_subsets(ss):
                    return chain(*map(lambda x: combinations(ss, x), range(0, len(ss)+1)))


                    Then get the unique items.



                    uq_items = df.Item.unique()

                    list(all_subsets(uq_items))

                    [(),
                    ('Shirt1',),
                    ('Shirt2',),
                    ('Shorts1',),
                    ('Shirt1', 'Shirt2'),
                    ('Shirt1', 'Shorts1'),
                    ('Shirt2', 'Shorts1'),
                    ('Shirt1', 'Shirt2', 'Shorts1')]


                    And use groupby each customer to get their items combination.



                    ls = []

                    for _, d in df.groupby('Cust_num', group_keys=False):
                    # Get all possible subset of items
                    pi = np.array(list(all_subsets(d.Item)))

                    # Fliter only > 1
                    ls.append(pi[[len(l) > 1 for l in pi]])


                    Then convert to Series and use value_counts().



                    pd.Series(np.concatenate(ls)).value_counts()

                    (Shirt1, Shorts1) 2
                    (Shirt2, Shorts1) 1
                    (Shirt1, Shirt2, Shorts1) 1
                    (Shirt1, Shirt2) 1






                    share|improve this answer












                    share|improve this answer



                    share|improve this answer










                    answered 34 mins ago









                    ResidentSleeperResidentSleeper

                    35210




                    35210























                        0














                        Late answer, but you can use:



                        df = df.groupby(['Cust_num'], as_index=False).agg(','.join).drop(columns=['Rev']).set_index(['Item']).rename_axis("combo").rename(columns={"Cust_num": "Count"})
                        df['Count'] = df['Count'].str.replace(r'Cust','')




                        combo                   Count                 
                        Shirt1,Shirt2,Shorts1 1
                        Shirt1,Shorts1 2





                        share|improve this answer






























                          0














                          Late answer, but you can use:



                          df = df.groupby(['Cust_num'], as_index=False).agg(','.join).drop(columns=['Rev']).set_index(['Item']).rename_axis("combo").rename(columns={"Cust_num": "Count"})
                          df['Count'] = df['Count'].str.replace(r'Cust','')




                          combo                   Count                 
                          Shirt1,Shirt2,Shorts1 1
                          Shirt1,Shorts1 2





                          share|improve this answer




























                            0












                            0








                            0







                            Late answer, but you can use:



                            df = df.groupby(['Cust_num'], as_index=False).agg(','.join).drop(columns=['Rev']).set_index(['Item']).rename_axis("combo").rename(columns={"Cust_num": "Count"})
                            df['Count'] = df['Count'].str.replace(r'Cust','')




                            combo                   Count                 
                            Shirt1,Shirt2,Shorts1 1
                            Shirt1,Shorts1 2





                            share|improve this answer















                            Late answer, but you can use:



                            df = df.groupby(['Cust_num'], as_index=False).agg(','.join).drop(columns=['Rev']).set_index(['Item']).rename_axis("combo").rename(columns={"Cust_num": "Count"})
                            df['Count'] = df['Count'].str.replace(r'Cust','')




                            combo                   Count                 
                            Shirt1,Shirt2,Shorts1 1
                            Shirt1,Shorts1 2






                            share|improve this answer














                            share|improve this answer



                            share|improve this answer








                            edited 5 mins ago

























                            answered 32 mins ago









                            Pedro LobitoPedro Lobito

                            50.5k16138172




                            50.5k16138172























                                -1














                                My version which I believe is easier to understand



                                new_df = df.groupby("Cust_num").agg({lambda x: ''.join(x.unique())})

                                new_df ['count'] = range(1, len(new_df ) + 1)


                                Output:



                                                            Item      Rev count
                                <lambda> <lambda>
                                Cust_num
                                Cust1 Shirt1 Shirt2 Shorts1 $40 1
                                Cust2 Shirt1 Shorts1 $40 2


                                Since you do not need the Rev column, you can drop it:



                                new_df = new_df = new_df.drop(columns=["Rev"]).reset_index()

                                new_df


                                Output:



                                  Cust_num                    Item count
                                <lambda>
                                0 Cust1 Shirt1 Shirt2 Shorts1 1
                                1 Cust2 Shirt1 Shorts1 2





                                share|improve this answer


























                                • How is the count in your answer the count of inclusive combination of df['Item']? Making new column with range is not an answer.

                                  – Chris
                                  7 mins ago


















                                -1














                                My version which I believe is easier to understand



                                new_df = df.groupby("Cust_num").agg({lambda x: ''.join(x.unique())})

                                new_df ['count'] = range(1, len(new_df ) + 1)


                                Output:



                                                            Item      Rev count
                                <lambda> <lambda>
                                Cust_num
                                Cust1 Shirt1 Shirt2 Shorts1 $40 1
                                Cust2 Shirt1 Shorts1 $40 2


                                Since you do not need the Rev column, you can drop it:



                                new_df = new_df = new_df.drop(columns=["Rev"]).reset_index()

                                new_df


                                Output:



                                  Cust_num                    Item count
                                <lambda>
                                0 Cust1 Shirt1 Shirt2 Shorts1 1
                                1 Cust2 Shirt1 Shorts1 2





                                share|improve this answer


























                                • How is the count in your answer the count of inclusive combination of df['Item']? Making new column with range is not an answer.

                                  – Chris
                                  7 mins ago
















                                -1












                                -1








                                -1







                                My version which I believe is easier to understand



                                new_df = df.groupby("Cust_num").agg({lambda x: ''.join(x.unique())})

                                new_df ['count'] = range(1, len(new_df ) + 1)


                                Output:



                                                            Item      Rev count
                                <lambda> <lambda>
                                Cust_num
                                Cust1 Shirt1 Shirt2 Shorts1 $40 1
                                Cust2 Shirt1 Shorts1 $40 2


                                Since you do not need the Rev column, you can drop it:



                                new_df = new_df = new_df.drop(columns=["Rev"]).reset_index()

                                new_df


                                Output:



                                  Cust_num                    Item count
                                <lambda>
                                0 Cust1 Shirt1 Shirt2 Shorts1 1
                                1 Cust2 Shirt1 Shorts1 2





                                share|improve this answer















                                My version which I believe is easier to understand



                                new_df = df.groupby("Cust_num").agg({lambda x: ''.join(x.unique())})

                                new_df ['count'] = range(1, len(new_df ) + 1)


                                Output:



                                                            Item      Rev count
                                <lambda> <lambda>
                                Cust_num
                                Cust1 Shirt1 Shirt2 Shorts1 $40 1
                                Cust2 Shirt1 Shorts1 $40 2


                                Since you do not need the Rev column, you can drop it:



                                new_df = new_df = new_df.drop(columns=["Rev"]).reset_index()

                                new_df


                                Output:



                                  Cust_num                    Item count
                                <lambda>
                                0 Cust1 Shirt1 Shirt2 Shorts1 1
                                1 Cust2 Shirt1 Shorts1 2






                                share|improve this answer














                                share|improve this answer



                                share|improve this answer








                                edited 5 mins ago

























                                answered 15 mins ago









                                Lee MtotiLee Mtoti

                                13210




                                13210













                                • How is the count in your answer the count of inclusive combination of df['Item']? Making new column with range is not an answer.

                                  – Chris
                                  7 mins ago





















                                • How is the count in your answer the count of inclusive combination of df['Item']? Making new column with range is not an answer.

                                  – Chris
                                  7 mins ago



















                                How is the count in your answer the count of inclusive combination of df['Item']? Making new column with range is not an answer.

                                – Chris
                                7 mins ago







                                How is the count in your answer the count of inclusive combination of df['Item']? Making new column with range is not an answer.

                                – Chris
                                7 mins ago












                                Taylor Smith is a new contributor. Be nice, and check out our Code of Conduct.










                                draft saved

                                draft discarded


















                                Taylor Smith is a new contributor. Be nice, and check out our Code of Conduct.













                                Taylor Smith is a new contributor. Be nice, and check out our Code of Conduct.












                                Taylor Smith is a new contributor. Be nice, and check out our Code of Conduct.
















                                Thanks for contributing an answer to Stack Overflow!


                                • Please be sure to answer the question. Provide details and share your research!

                                But avoid



                                • Asking for help, clarification, or responding to other answers.

                                • Making statements based on opinion; back them up with references or personal experience.


                                To learn more, see our tips on writing great answers.




                                draft saved


                                draft discarded














                                StackExchange.ready(
                                function () {
                                StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55565916%2fhow-to-use-pandas-to-get-the-count-of-every-combination-inclusive%23new-answer', 'question_page');
                                }
                                );

                                Post as a guest















                                Required, but never shown





















































                                Required, but never shown














                                Required, but never shown












                                Required, but never shown







                                Required, but never shown

































                                Required, but never shown














                                Required, but never shown












                                Required, but never shown







                                Required, but never shown







                                Popular posts from this blog

                                Anexo:Material bélico de la Fuerza Aérea de Chile Índice Aeronaves Defensa...

                                Always On Availability groups resolving state after failover - Remote harden of transaction...

                                update json value to null Announcing the arrival of Valued Associate #679: Cesar Manara ...