๐ Study Resources
Explore tutorials, revision guides, practice papers, and more to sharpen your data skills.
๐ Datasets Used in This Guide
The study guides and practice paper sets used on this website are publicly available datasets from reputable sources such as Kaggle, GitHub, and UCI Machine Learning Repository. These datasets are selected for educational use and are freely accessible for learning purposes. If you have any questions, please feel free to contact us here or HERE.
๐ Free Version โ Linked Resources
In the free version of this guide, you’ll find:
- Direct links to all datasets hosted on external platforms
- Instructions for downloading and importing into Power BI
- Attribution to original data owners and sources
Note: You are responsible for reviewing and complying with each datasetโs license or usage terms when using the linked resources.
๐ก Free Tutorials โค
๐น Managing Digital Content For Data Librarians ๐นYour Infogram Project ๐นPower BI For Beginners ๐นPower BI For Intermediates ๐นHow To Become A Data Analyst ๐นHow To Become A Data Scientist In 2024 ๐นBecoming A Data Analyst ๐นHow To Become A Data Engineer In 2024๐ฅ Free Video Tutorials โค
๐น Data Collection (For Beginners) ๐นData Cleaning For Beginners ๐นImporting Data For Beginners ๐นHow To Become A Data Analyst In 2023 ๐นPower BI Tutorial ๐นHow To Become A Data Scientist In 2024 ๐นHow To Become A Data Engineer In 2024๐ฎ Free Data Games โค
๐น Tableau For Beginners ๐น Principles, Statistical And Computational Tools For Data Science ๐นMachine Learning And AI, And Python (For Intermediates) ๐นStatistics And R For Intermediates ๐นData Science and R Basics For Intermediates ๐น12 Python Data Visualization Libraries To Explore For Business Intelligence ๐นData Visualization Made Easy: With 10 Great Tips.pdf๐ง Free Podcasts โค
๐น Welcome To DataViz Explorer ๐นData Visualization Made Easy Podcast 2 ๐นData Visualization Made Easy Podcast 3 ๐นBulletproof Mindset Podcast 2 ๐นBulletproof Mindset Podcast 4ย
ย
ย
๐ 10 Free Open Data Sources โ Used For Practice
Practice your skills with these free, easy-to-follow, readily available and accessible data sources.
๐ Data Cleaning in Power BI โ Practice Papers
Put your skills to the test with realistic practice tasks designed to reinforce each core data cleaning technique covered in the revision guide. These are hands-on and ideal for preparing for assessments or real-world use.
๐งชPractice Task 1: Customer Contact Fix
๐ File Provided: customer-contacts.csv
๐ File Provided: contact-customers.csv
Skills Covered: Removing nulls, standardizing text, fixing column types.
โ Task Instructions:
- Import the dataset into Power BI.
- Remove rows with missing email addresses.
- Convert all names to proper case (e.g., “john doe” โ “John Doe”).
- Standardize phone numbers to the format
(XXX) XXX-XXXX
.
- Remove duplicate rows.
๐ Bonus: Add a column to label contacts missing a phone number as โIncompleteโ.
ย
๐งชPractice Task 2: Sales Data Merge
๐Files Provided:
๐ง Skills Covered: Appending, merging queries, replacing values, transforming data types.
โ Task Instructions:
- ย
- Append the January and February sales files into a single table. ย
- Merge with the product details table using
ProductID
.
ย
- Replace null values in the
Price
column with 0.
ย
- Ensure
Quantity Sold
andPrice
columns are numeric.
ย
- Add a calculated column:
Total Sales = Quantity ร Price
.
๐Bonus: Sort the data by total sales in descending so that the highest sales appear at the top.
(Optional extra step for even more insight) โ Add a Rank column:
- In Power Query: Use โAdd Index Columnโ after sorting, then rename it โSales Rank.โ
- In DAX: Create a measure like:
- ย
ย Sales Rank = RANKX( ย ย ALL('MergedSales'), ย ย 'MergedSales'[Total Sales], ย ย , ย ย DESC, ย ย Dense ย ) ย
- ย ย
ย
ย
๐งช Practice Task 3: Website Log Clean-Up
๐ File Provided: site-logs.csv
Skills Covered: Filtering rows, replacing errors, formatting columns, handling dates.
โ Task Instructions:
- Remove any rows where โSessionIDโ is null.
- Replace error values in the โDurationโ column with 0.
- Format โVisit Dateโ as a valid Date.
- Trim whitespace in the โUser Agentโ column.
- Remove duplicates.
๐ Bonus: Create a grouped summary of total sessions per browser type.
ย
๐งช Practice Task 4: Survey Response Cleanup
๐ File Provided:
๐ File Provided:
๐ง Skills Covered: Conditional replacement, column renaming, filtering nulls, value mapping.
โ Task Instructions:
- Replace all instances of โN/Aโ in the โAgeโ column with null.
- Remove responses where โConsentโ is marked as โNoโ.
- Rename columns to be more descriptive (e.g., โQ1โ โ โOverall Satisfactionโ).
- Map values in a new column: โVery Satisfiedโ = 5, โSatisfiedโ = 4, etc.
- Convert all responses to lowercase.
๐ Bonus: Create a โCleaned Surveyโ query and load only selected columns.
โ Submission Guidelines
- Complete your practice using Power BI Desktop.
- Save your .pbix file with each task labeled in separate queries.
- Optional: Submit to your instructor/mentor for feedback or portfolio review.
ย
๐ A. Study Guides
Minimalist Data Wrangling with Python โ a free PDF/book covering core data cleaning techniques in Python (Pandas), ideal as a reference companion.
Kaggle Notebooks โ look up “data cleaning” on Kaggle to find shared notebooks and hands-on examples.
ย
๐ B. Practice Tests & Mock Exams
๐ฏ 1. Data Cleansing using Python โ Free Online Practice Test (TestPrepTraining)
Take a 100-question interactive test for Python-based data cleaning. (Note: some follow-up content is paid, but the initial practice is free.)
๐ฏ 2. Data Science with Python โ Free Practice Exam
โถ๏ธ 3. Theory & Practice of Data Cleaning Quiz โ A 15-question multiple-choice quiz covering schema-level cleaning, outliers, duplicates, and more.
โถ๏ธ 4. Data Cleaning Test โ A 12-question, 20-minute test focusing on practical skills like duplicate removal and outlier detection.
โถ๏ธ 5. Quiz: Data Exploration and Cleaning โ A 50-question interactive quiz that reinforces data cleaning concepts learned in Python, including integrity checks and data validation.
โ๏ธ 6. Data Science with Python โ Free Practice Mock Exam
- Free 50-question Data Science with Python Mock Exam (Simplilearn) โ It takes 60 minutes to complete, multiple choice format, unlimited reattempts.ย
โ๏ธ 7. Data Cleansing using Python โ Free Practice Mock Test
- Take the free Python cleansing test (100 questions) (Testpreptraining) โ A 100-question interactive test for Python-based data cleaning.
Copyright Disclaimer
โThis website includes copyrighted material from other websites. This material is used for educational, informational and commentary purposes, which falls under the fair use doctrine of copyright law. No copyright infringement is intended.โ Therefore, the copyright owners are the true and rightful owners of their โproperty’.