Data Exploration: Employee Salaries
Salary data for two companies, Company A and Company B, is presented below. Both companies are in the same field and geographic region. We want to compare their salaries by looking at graphical representations of the data.
Salaried Employees: Company A
| [latex]68340[/latex] | [latex]87282[/latex] | [latex]103802[/latex] | [latex]128863[/latex] | [latex]140085[/latex] | [latex]162300[/latex] | [latex]177109[/latex] | 
| [latex]70138[/latex] | [latex]90553[/latex] | [latex]106562[/latex] | [latex]128933[/latex] | [latex]147419[/latex] | [latex]168676[/latex] | [latex]180174[/latex] | 
| [latex]71417[/latex] | [latex]95226[/latex] | [latex]120701[/latex] | [latex]130780[/latex] | [latex]149514[/latex] | [latex]169409[/latex] | [latex]180221[/latex] | 
| [latex]71867[/latex] | [latex]97042[/latex] | [latex]123313[/latex] | [latex]136204[/latex] | [latex]152008[/latex] | [latex]170031[/latex] | [latex]185837[/latex] | 
| [latex]84675[/latex] | [latex]100531[/latex] | [latex]125614[/latex] | [latex]138920[/latex] | [latex]155032[/latex] | [latex]175118[/latex] | [latex]189320[/latex] | 
Salaried Employees: Company B
| [latex]35472[/latex] | [latex]43467[/latex] | [latex]53624[/latex] | [latex]65096[/latex] | [latex]72290[/latex] | [latex]110351[/latex] | [latex]124732[/latex] | 
| [latex]36983[/latex] | [latex]46652[/latex] | [latex]57946[/latex] | [latex]66235[/latex] | [latex]75279[/latex] | [latex]117574[/latex] | [latex]228920[/latex] | 
| [latex]38382[/latex] | [latex]49655[/latex] | [latex]59096[/latex] | [latex]69721[/latex] | [latex]107368[/latex] | [latex]118810[/latex] | [latex]245427[/latex] | 
| [latex]41674[/latex] | [latex]53231[/latex] | [latex]59709[/latex] | [latex]71289[/latex] | [latex]108236[/latex] | [latex]119112[/latex] | [latex]275024[/latex] | 
| [latex]43256[/latex] | [latex]53506[/latex] | [latex]61724[/latex] | [latex]72211[/latex] | [latex]109472[/latex] | [latex]124678[/latex] | [latex]293012[/latex] | 
Hands-on Spreadsheet: Explore the Data
The examples below use Microsoft Excel, but you can also use another spreadsheet application such as Apache OpenOffice Calc or Google Sheets.
Step [latex]1[/latex]: Store the data
- Type or copy the data into a new spreadsheet. Title the tab Employee Salaries. Place the two data sets side by side, with Company A in column A and Company B in column B, and put a label at the top of each column.
- Obtain descriptive statistics for each company’s data.
- Analyze the descriptive statistics and compare the companies’ data.
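For readers who want to check the spreadsheet output another way, the descriptive statistics can also be sketched in Python. This is only an illustration using the standard `statistics` module; the names `company_a`, `company_b`, and `describe` are made up for this example, and the set of statistics shown is an assumption about what a typical spreadsheet summary reports.

```python
import statistics

# Salaries copied from the two tables above (order does not matter here).
company_a = [
    68340, 87282, 103802, 128863, 140085, 162300, 177109,
    70138, 90553, 106562, 128933, 147419, 168676, 180174,
    71417, 95226, 120701, 130780, 149514, 169409, 180221,
    71867, 97042, 123313, 136204, 152008, 170031, 185837,
    84675, 100531, 125614, 138920, 155032, 175118, 189320,
]
company_b = [
    35472, 43467, 53624, 65096, 72290, 110351, 124732,
    36983, 46652, 57946, 66235, 75279, 117574, 228920,
    38382, 49655, 59096, 69721, 107368, 118810, 245427,
    41674, 53231, 59709, 71289, 108236, 119112, 275024,
    43256, 53506, 61724, 72211, 109472, 124678, 293012,
]


def describe(salaries):
    """Return a small set of descriptive statistics for one company."""
    return {
        "count": len(salaries),
        "mean": statistics.mean(salaries),
        "median": statistics.median(salaries),
        "stdev": statistics.stdev(salaries),  # sample standard deviation
        "min": min(salaries),
        "max": max(salaries),
        "range": max(salaries) - min(salaries),
    }


stats_a = describe(company_a)
stats_b = describe(company_b)

for name, stats in (("Company A", stats_a), ("Company B", stats_b)):
    print(name)
    for key, value in stats.items():
        print(f"  {key:>6}: {value:,.2f}")
```

Comparing the two printed summaries side by side is one way to carry out the last bullet of this step: look at how the mean relates to the median and how the standard deviation and range differ between the companies.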
Step [latex]2[/latex]: Create a box and whisker plot with both data series on the same graph
- Select both columns of data together with their labels.
- Click on Insert, then Box and Whisker. Both sets of data should appear as parallel box plots in different colors on the same graph. Click the graph to select it.
- Click the plus sign next to the chart. Click to select Legend. The data column labels should appear at the top under the Chart Title. You can now delete the Chart Title.
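The numbers behind a box and whisker plot can be sketched in Python as well. The example below is a minimal sketch, not a description of how any spreadsheet computes its chart: it assumes quartiles are found with the "exclusive" method and that points more than 1.5 times the interquartile range beyond a quartile are drawn as outliers. The function name `box_plot_summary` is hypothetical.

```python
import statistics

# Salaries copied from the two tables above.
company_a = [
    68340, 87282, 103802, 128863, 140085, 162300, 177109,
    70138, 90553, 106562, 128933, 147419, 168676, 180174,
    71417, 95226, 120701, 130780, 149514, 169409, 180221,
    71867, 97042, 123313, 136204, 152008, 170031, 185837,
    84675, 100531, 125614, 138920, 155032, 175118, 189320,
]
company_b = [
    35472, 43467, 53624, 65096, 72290, 110351, 124732,
    36983, 46652, 57946, 66235, 75279, 117574, 228920,
    38382, 49655, 59096, 69721, 107368, 118810, 245427,
    41674, 53231, 59709, 71289, 108236, 119112, 275024,
    43256, 53506, 61724, 72211, 109472, 124678, 293012,
]


def box_plot_summary(salaries):
    """Compute quartiles, whisker ends, and outliers for one data series.

    Assumptions: exclusive quartile method and the 1.5 * IQR outlier rule.
    """
    ordered = sorted(salaries)
    q1, median, q3 = statistics.quantiles(ordered, n=4, method="exclusive")
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    inside = [s for s in ordered if low_fence <= s <= high_fence]
    outliers = [s for s in ordered if s < low_fence or s > high_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "whisker_low": inside[0],    # smallest value that is not an outlier
        "whisker_high": inside[-1],  # largest value that is not an outlier
        "outliers": outliers,
    }


summary_a = box_plot_summary(company_a)
summary_b = box_plot_summary(company_b)
print("Company A:", summary_a)
print("Company B:", summary_b)
```

Under these assumptions, the `outliers` entry lists the salaries that would appear as separate points beyond the whiskers.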
Step [latex]3[/latex]: Create a scatter plot with both data series on the same graph
- Follow the same steps as in Step [latex]2[/latex] above, except this time choose Scatter Plot instead of Box and Whisker.
 Figure 1. Create a scatter plot 
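A comparable scatter plot can be sketched in Python. The choice of horizontal axis here is an assumption made for illustration: each salary is paired with its rank within its own company (1 = lowest), which may differ from how a spreadsheet lays out the points. The helper names `scatter_points` and `plot_scatter` are hypothetical, and the plotting function relies on the third-party matplotlib library, which is imported only when that function is called.

```python
# Salaries copied from the two tables above.
company_a = [
    68340, 87282, 103802, 128863, 140085, 162300, 177109,
    70138, 90553, 106562, 128933, 147419, 168676, 180174,
    71417, 95226, 120701, 130780, 149514, 169409, 180221,
    71867, 97042, 123313, 136204, 152008, 170031, 185837,
    84675, 100531, 125614, 138920, 155032, 175118, 189320,
]
company_b = [
    35472, 43467, 53624, 65096, 72290, 110351, 124732,
    36983, 46652, 57946, 66235, 75279, 117574, 228920,
    38382, 49655, 59096, 69721, 107368, 118810, 245427,
    41674, 53231, 59709, 71289, 108236, 119112, 275024,
    43256, 53506, 61724, 72211, 109472, 124678, 293012,
]


def scatter_points(salaries):
    """Pair each salary with its rank (1 = lowest) to form (x, y) points."""
    return list(enumerate(sorted(salaries), start=1))


points_a = scatter_points(company_a)
points_b = scatter_points(company_b)

# Count the ranks at which Company B pays more than Company A.
ranks_where_b_is_higher = sum(
    1 for (_, a), (_, b) in zip(points_a, points_b) if b > a
)
print("Ranks where B exceeds A:", ranks_where_b_is_higher)


def plot_scatter(path="salaries_scatter.png"):
    """Draw both series on one set of axes and save the figure to a file.

    Requires matplotlib (third-party); the file name is only an example.
    """
    import matplotlib
    matplotlib.use("Agg")  # render to a file without needing a display
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    for label, points in (("Company A", points_a), ("Company B", points_b)):
        xs, ys = zip(*points)
        ax.scatter(xs, ys, label=label)
    ax.set_xlabel("Rank within company (1 = lowest salary)")
    ax.set_ylabel("Salary")
    ax.legend()
    fig.savefig(path)
    plt.close(fig)
```

Calling `plot_scatter()` writes the image if matplotlib is installed; the rank comparison above runs without it.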
Step [latex]4[/latex]: Analyze the data
As we saw in the descriptive statistics, Company A has a tighter distribution of salaries about its center. Company B has extremes at both ends of the salary range. The salaries in B are persistently and substantially lower than those in A, with the notable exception of four outliers at the top end. These four values pull the mean of B far above its median.
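The effect of the top-end outliers on Company B's mean can be illustrated with a short Python sketch. It treats the four largest salaries as the outliers described above and compares the mean and median with and without them. The variable names are made up for this example, and dropping the values is done only to show their influence, not as a recommended way to treat the data.

```python
import statistics

# Company B salaries copied from the table above.
company_b = [
    35472, 43467, 53624, 65096, 72290, 110351, 124732,
    36983, 46652, 57946, 66235, 75279, 117574, 228920,
    38382, 49655, 59096, 69721, 107368, 118810, 245427,
    41674, 53231, 59709, 71289, 108236, 119112, 275024,
    43256, 53506, 61724, 72211, 109472, 124678, 293012,
]

ordered = sorted(company_b)
top_four = ordered[-4:]        # the four highest salaries
without_top = ordered[:-4]     # the remaining 31 salaries

mean_all = statistics.mean(ordered)
median_all = statistics.median(ordered)
mean_trimmed = statistics.mean(without_top)
median_trimmed = statistics.median(without_top)

# Gap between mean and median, with and without the four highest salaries.
gap_all = mean_all - median_all
gap_trimmed = mean_trimmed - median_trimmed

print(f"All 35 salaries:  mean {mean_all:,.0f}, median {median_all:,.0f}")
print(f"Without top four: mean {mean_trimmed:,.0f}, median {median_trimmed:,.0f}")
print(f"Mean minus median: {gap_all:,.0f} vs. {gap_trimmed:,.0f}")
```

The median moves much less than the mean when the four values are set aside, which is why the median is often the more representative center for a skewed data set like this one.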
