"Are you Struggling to Stay on Top of Your Finances? Get My Budget Tracker & Take Charge Today!"
Documentation Process For Samuel’s Christmas Gift Co And The Link To The Cleaned Dataset Is Attached
The manager at Samuel's Christmas Gift Co tasked me with preparing the dataset for their end of year data analysis report.
Here’s the link to the Dataset https://docs.google.com/spreadsheets/d/1czebTHoB2qh26ZiuixHdQPNzxTwYfn7t/edit?usp=sharing&ouid=102804961337009291794&rtpof=true&sd=true
The manager of Samuel's Gift & Co gave me these steps to follow below
1. Identify and remove repeated data entries so as to help in preserving unique data.
2. Inspect the "Gift_Item" column for irregularities in the texts. Implement a transformation to ensure all gift item names are written normally.
3. Investigate the dataset for any excess spaces in column values.
4. Generate a new column that combines the child's name and age, separated by a hyphen (e.g., "James - 11").
Here’s a step by step guide of how I prepared the dataset for end of year Data Analysis Report
I downloaded the dataset from Google Drive to my Laptop
I opened it up on Microsoft Excel
I looked through the dataset carefully to understand the data. My first observation is that the dataset is in the Wide Form of Dataset so I had to select the cells containing the dataset and transposed it in the second sheet to a Long Form of Dataset so that I can easily work on the dataset.
I used the “Data” tab to remove duplicates and 3 duplicates where found and 97 unique values remains
I inspected the “Gift_Item” Column for irregularities in the texts. I used the TRIM function to remove the extra spaces in the “Gift_Item” column, I also used the PROPER FUNCTION to Capitalize each word in the “Gift_Item” Column to uniform the dataset. With this process, I ensured that all gift item names were written normally
I investigated the dataset for any excess spaces in column values, and I saw that the column names had spaces, I used the TRIM FUNCTION to remove the spaces.
I Generated a new column that combines the child’s name and age, separated by a hyphen. I used the CONCATE FUNCTION to achieve this.
Here’s a link to the cleaned dataset that is ready for the end of year Data Analysis Report.
https://1drv.ms/x/c/236489399b769a59/EUtFO4glgsBBl5vSu3n6UCMBs1rjxb70PRuRez-PVUYBRg
But it is best to view the dataset using this link. https://1drv.ms/x/c/236489399b769a59/EUtFO4glgsBBl5vSu3n6UCMBs1rjxb70PRuRez-PVUYBRg