Comprehensive Report on Workplace Absenteeism

My first data storytelling project

It's me

I’m excited to share my first data storytelling project, done after completing the EDA (Exploratory Data Analysis). In this project, I explored some interesting questions like:

  • Does working longer hours improve performance or lead to more absenteeism?
  • Are people with health problems at a young age more likely to be absent?
  • Can activities like yoga, physiotherapy, or medical check-ups help reduce absenteeism?
Through this, I realized how important storytelling is in data analytics. It’s not just about understanding the numbers — it’s also about sharing insights clearly and effectively with others.

Exploratory Data Analysis on Workplace Absenteeism

Conducted an in-depth Exploratory Data Analysis on an absenteeism dataset to uncover patterns and insights related to employee attendance behavior

It's me

  • Performed extensive data cleaning, feature engineering, and visualization to understand correlations between absenteeism and factors such as workload, distance from work, education level, and health-related metrics.
  • Utilized libraries such as Pandas, Matplotlib, and Seaborn to identify trends, outliers, and missing data, enabling clearer understanding of employee productivity drivers.
  • Created heatmaps, pair plots, and categorical plots to visualize relationships, distributions, and potential predictors of absenteeism.
  • Extracted actionable insights that could help HR and management design data-driven strategies to reduce employee absenteeism and improve workplace efficiency.

Automating Table Extraction from Wikipedia

Built a Python script to scrape tables from Wikipedia and save them as CSV files. Used BeautifulSoup for HTML parsing, requests for fetching pages, and pandas for data handling.

It's me

  • Created a function get_allTables(url) to extract all tables as DataFrames.
  • Parsed headers and rows, cleaned text using .strip().
  • Exported each table to a separate CSV file, with support for downloading in Google Colab environment.
  • Handled file naming dynamically using f-strings for easy identification (table_1.csv, table_2.csv, ...).
  • Ensured clean CSV export by disabling index column with index=False.

Save the King: A Grid Path Problem with a Deadly Twist"

In this video, I explore a classic grid problem with a story twist — the White King must reach his Queen without crossing paths with a dangerous Black Queen. But if he does, he turns black… and the Queen won’t recognize him — she’ll strike him down.

It's me

  • A story-driven approach to a grid path problem (chess-themed!).
  • Using factorials to calculate the total number of valid paths.
  • Black Queen acts as a forbidden cell that changes the king’s fate.
  • How to exclude paths that pass through the Black Queen's position.
  • Simple yet powerful combinatorics applied to a real scenario.