Standardization of Normal Variables
In the Z_score_Statistics.ipynb file, I perform standardizing normal variables using the Z-score method. This file includes the following steps:
Click to see details
- Mounting Google Drive in Google Colab: Access files stored in Google Drive to enable seamless data loading.
- Importing Libraries and Loading the Dataset: : Load the necessary Python packages and read the
scores.csvfile into a DataFrame. - Visualizing the Data Distribution:
- Use the Seaborn library for visualization.
- Plot the distributions of
SATandACTscores.
- Calculating Mean and Standard Deviation:
- Compute the mean and standard deviation for
SATandACTscores. - Determine the Z-score for the highest scorer in
SATandACTamong all applicants.
- Compute the mean and standard deviation for
- Applying the Z-Score to All Scores: Standardize the entire table by applying the Z-score formula to all values.
- Fit-Transform Using StandardScaler:
- Import
StandardScalerfromsklearn.preprocessing. - Initialize the scaler and apply it to the
SATscoreandACTscorecolumns usingfitandtransformmethods, or utilizefit_transformdirectly. - Display the updated DataFrame to confirm standardization.
- Import
This project highlights the use of Z-score standardization and the application of Python libraries to prepare data for further analysis.