Chapter 3 Data transformation
The dataset required little cleaning. We kept only the rows and columns needed for the analysis, which reduces the size of the dataset.
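As a rough illustration of this kind of trimming, the following base R sketch keeps a subset of columns and filters rows on a toy data frame. The data values, the `extra_col` column, the object names (`kills_raw`, `kills`), and the row filter (dropping rows with a missing killer name) are all assumptions made for the example; they are not the actual selection criteria used for this dataset.

```r
# Toy stand-in for the raw kill data. All values and the column `extra_col`
# are hypothetical and exist only to illustrate the idea.
kills_raw <- data.frame(
  match_id    = c("m1", "m1", "m2"),
  time        = c(622, 136, 785),
  killed_by   = c("Down and Out", "M416", "M16A4"),
  killer_name = c("playerA", NA, "playerC"),
  victim_name = c("playerX", "playerY", "playerZ"),
  extra_col   = c("a", "b", "c"),
  stringsAsFactors = FALSE
)

# Columns to keep (a subset of those shown in the kill dataset head).
keep_cols <- c("match_id", "time", "killed_by", "killer_name", "victim_name")

# Assumed row filter: drop rows where the killer name is missing.
keep_rows <- !is.na(kills_raw$killer_name)

# Subset rows and columns in one step, then renumber the rows.
kills <- kills_raw[keep_rows, keep_cols]
rownames(kills) <- NULL

# Compare dimensions before and after trimming.
print(dim(kills_raw))
print(dim(kills))
head(kills)
```

Selecting columns by name rather than by position keeps the step robust if the column order of the raw file changes.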
Head of the kill dataset
## match_id time killed_by killer_placement killer_name killer_position_x killer_position_y victim_name
## 1 2U4GBNA0YmkAtDV8BIUtsazckNPT4YjBr8TFfiPl9ZSsCr8L4W_NKq7eHFO2HMHt 622 Down and Out 3 HZCBIUBIUBIU 417104.3 481058.3 RealCoconutjuice
## 2 2U4GBNA0YmkAtDV8BIUtsazckNPT4YjBr8TFfiPl9ZSsCr8L4W_NKq7eHFO2HMHt 136 M416 19 RunFastKneeGrow 569463.8 381541.6 colonelcrunker
## 3 2U4GBNA0YmkAtDV8BIUtsazckNPT4YjBr8TFfiPl9ZSsCr8L4W_NKq7eHFO2HMHt 785 M16A4 3 Daniellalala 422891.6 484844.6 Le-Passer
## 4 2U4GBNA0YmkAtDV8BIUtsazckNPT4YjBr8TFfiPl9ZSsCr8L4W_NKq7eHFO2HMHt 436 UMP9 18 orangecsl 583931.7 480778.6 Jamarkus
## 5 2U4GBNA0YmkAtDV8BIUtsazckNPT4YjBr8TFfiPl9ZSsCr8L4W_NKq7eHFO2HMHt 1511 M416 6 niuhongfei1 271145.1 562685.8 Blind_LeeSin
## 6 2U4GBNA0YmkAtDV8BIUtsazckNPT4YjBr8TFfiPl9ZSsCr8L4W_NKq7eHFO2HMHt 801 Grenade 16 THUGNIFlCENT 575606.3 503138.2 THUGNIFlCENT
## victim_position_x victim_position_y
## 1 413859.3 484157.8
## 2 569128.9 381294.2
## 3 422423.9 483145.9
## 4 585560.0 483584.9
## 5 270507.0 562770.3
## 6 575606.3 503138.2
Head of the aggregated match dataset
## match_id player_name player_survive_time player_kills player_dist_ride player_dist_walk team_placement
## 1 2U4GBNA0YmmhivBOFUiipklIPVdC0DRgkX88eyhTYGiAnoXrjvEhNQHZMWi8d5y9 m3xdave 106.351 0 0.000 37.91984 88
## 2 2U4GBNA0YmmhivBOFUiipklIPVdC0DRgkX88eyhTYGiAnoXrjvEhNQHZMWi8d5y9 UDaddyHere 1690.021 3 292.205 3457.03613 8
## 3 2U4GBNA0YmmhivBOFUiipklIPVdC0DRgkX88eyhTYGiAnoXrjvEhNQHZMWi8d5y9 Dummy05 103.415 0 0.000 106.21590 89
## 4 2U4GBNA0YmmhivBOFUiipklIPVdC0DRgkX88eyhTYGiAnoXrjvEhNQHZMWi8d5y9 NA-Wuyifan 272.838 0 0.000 126.95983 67
## 5 2U4GBNA0YmmhivBOFUiipklIPVdC0DRgkX88eyhTYGiAnoXrjvEhNQHZMWi8d5y9 kickinchicken 134.342 0 0.000 43.34671 81
## 6 2U4GBNA0YmmhivBOFUiipklIPVdC0DRgkX88eyhTYGiAnoXrjvEhNQHZMWi8d5y9 WhiteCrane 193.885 0 0.000 218.51520 74