PythonFIFA 21 Players Dataset Data Cleaning & Transformation

FIFA21 Data Cleaning & Transformation

FIFA21 is a popular video game that simulates soccer matches. Often, data collected from this game might be messy, containing missing values and various formating issues. As an enthusiastic soccer fan, I put together this project to practice data cleaning and data exploring for analysis using Python and Pandas.

🛠️Tools

  • Python

📊The FIFA21 Dataset

I found the FIFA21 Dataset on Kaggle. The dataset contains 17,000+ players featuring in FIFA21, each with more than 70 attributes.

💡Highlights

  • England had the highest amount of players, making up 8.98% of all players.
  • Spain had the most players in the Top 100.
  • Identified highly valuable, underpaid players by comparing player overall performance and wages.

✏️Data Wrangling

  • Removed rows with missing values
  • Converted height and weight columns to appropriate data types
  • Cleaned and transformed the value, wage, and release clause columns into columns of integeters
  • Removed the unnecessary newlien characters
  • Separated the Team and Contract columns
  • Converted ‘star’ characters to numeric data
  • Converted attribute stats to an average of the sum