Contents

Bin Continuous Data

Examples:

Data Size: Different versions of XLMiner™  have varying limits on size of data. The size of data depicted in the example below may not be supported by your version. Refer to Data Handling Specifications for details.

  1. Open the dataset Binning_Example.xls. 

     

  2. Select XlMiner --> Data Utilities --> Bin Continuous Data. You will get the following dialog.

     

  3. Select a variable below the "Variables" list, say, x3. The options are immediately activated. 

  4. Make the necessary changes to match the options selected in the following dialog. Select "Apply this option to the selected variable". You will see the selected entry under Name of binned variable.

  5. Select OK. 

    As specified by us earlier, 5 bins are made depending on the count of records and the values of x3 are kept in them. For all the values of x3 lying in one bin, the output variable, Binned_x3  gets a value. (This value is the same for all x3s in one bin).  The value to be assigned to Binned_x3  for x3s in first bin is 10, as we specified. Since we have chosen the interval to be 3, the value to be assigned to the values of x3 in the next bin will be 10+3 = 13. You can see above that Binned_x3 takes values 10, 13, 16, 19, 22.

    Though we have specified bins to be made with equal count here, the number of records in each interval may not be essentially same. A lot of factors, such as border values, total #records etc influence the number of records lying in the bins. See the binning interval below.

  6. Select the Mean of the bin instead of the rank.

    In the output Binned_x3 gets the value equal to the mean of all x3s in that bin.

    Similarly, if we select the median, Binned_x3 gets the value equal to the median of all x3s in that bin.

  7. Let us explore the Equal interval option. The interval is calculated as (Maximum value -Minimum value)/#bins. Let us select x4 as the binning variable.

    The output is shown below.

     

  8. See also: