Description:

Background

When working with raw tabular data, a common goal is to split the data into groups and aggregate each group, which simplifies the data and makes it easier to draw meaningful conclusions from it. The aggregation function can be anything that reduces the data (sum, mean, standard deviation, etc.). For the purpose of this kata, it will always be the sum function.

Task

Define a function that accepts two arguments: the first is a list of lists that represents the raw data (one inner list per row), and the second is a list of column indices to group by.

The return value should be a dictionary in which each key is a group, expressed as a tuple of the values in the grouping columns, and each value is a list containing the sums of the remaining columns for that group.
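As a rough illustration of the data shapes described above, the sketch below splits a single row into its group key and the values left over for aggregation. The helper name `split_row` is hypothetical and not part of the kata, and the sketch assumes the key follows the order of the given indices.

```python
def split_row(row, idx):
    """Split one row into (group key, values to aggregate).

    `split_row` is a hypothetical helper name used only for illustration.
    Assumption: the key keeps the order of the indices in `idx`.
    """
    idx_set = set(idx)  # set membership makes the "not a grouping column" test cheap
    key = tuple(row[i] for i in idx)
    values = [v for i, v in enumerate(row) if i not in idx_set]
    return key, values


print(split_row([1, 6, 2, 10], [0, 1]))  # ((1, 6), [2, 10])
```

A tuple is used for the key because lists are not hashable and so cannot serve as dictionary keys in Python.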

Example

arr = [
  [1, 6, 2, 10],
  [8, 9, 4, 11],
  [9, 8, 7, 12],
  [1, 6, 3, 20],
]

idx = [0, 1]

group(arr, idx) == {
  (1, 6): [5, 30],      # [2 + 3, 10 + 20]
  (8, 9): [4, 11],
  (9, 8): [7, 12]
}

>>> arr = [ [1,6,2,10]
          , [8,9,4,11]
          , [9,8,7,12]
          , [1,6,3,20] ]
>>> idx = [0,1]
>>> mapM_ print (groupSum arr idx)
([1,6],[5,30])  -- [2 + 3, 10 + 20]
([8,9],[4,11])
([9,8],[7,12])

Explanation

Columns 0 and 1 are used for grouping, so columns 2 and 3 are aggregated.
Rows 0 and 3 are grouped together because they have the same values in the idx columns, so their values in the columns that are not part of idx are summed.
Rows 1 and 2 each have a unique combination of values in the idx columns, so they are not grouped with any other row, and their aggregated results are simply their own values in the columns that are not part of idx.
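The grouping steps explained above can be sketched in Python as follows. This is one possible approach under the kata's stated guarantees (valid, non-empty inputs), not a reference solution; the function name `group` is taken from the example.

```python
def group(arr, idx):
    """Group rows by the columns in `idx` and sum the remaining columns."""
    idx_set = set(idx)
    result = {}
    for row in arr:
        # The group key is the tuple of values in the grouping columns.
        key = tuple(row[i] for i in idx)
        # Everything else in the row is a value to be aggregated.
        values = [v for i, v in enumerate(row) if i not in idx_set]
        if key in result:
            # Same key seen before: add the new values column by column.
            result[key] = [a + b for a, b in zip(result[key], values)]
        else:
            # First row for this key: its own values are the running sums.
            result[key] = values
    return result


arr = [
    [1, 6, 2, 10],
    [8, 9, 4, 11],
    [9, 8, 7, 12],
    [1, 6, 3, 20],
]
print(group(arr, [0, 1]))
# {(1, 6): [5, 30], (8, 9): [4, 11], (9, 8): [7, 12]}
```

Building a new list on each merge, rather than updating in place, keeps the sketch from mutating the input rows.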

Notes

All inputs are valid.

Arguments will never be empty.

Fundamentals

More By Author:

Check out these other kata created by Fbasham

Stats:

Created	May 29, 2020
Published	May 29, 2020
Warriors Trained	769
Total Skips	61
Total Code Submissions	834
Total Times Completed	304
Python Completions	270
Java Completions	32
Haskell Completions	9
Total Stars	17

% of votes with a positive feedback rating	91% of 85
Total "Very Satisfied" Votes	73
Total "Somewhat Satisfied" Votes	8
Total "Not Satisfied" Votes	4
Total Rank Assessments	20
Average Assessed Rank	6 kyu
Highest Assessed Rank	4 kyu
Lowest Assessed Rank	8 kyu

Kata

Group-by and Sum

Status: Testing & feedback needed

Estimated Rank: 6 kyu
