Saturday, August 16, 2025

My Data Science Journey: Tackling Real-World Problems with WorldQuant University

As a data science enthusiast, I recently completed three impactful projects as part of the WorldQuant University Applied Data Science Lab, each tackling real-world challenges with data-driven solutions. From analyzing housing markets in Mexico and Buenos Aires to predicting air quality in Nairobi, these projects pushed me to hone my skills in data analysis, modeling, and critical thinking. I’m also gearing up for a fourth project on earthquake damage in Nepal, which promises to be just as exciting. Below, I share a glimpse into each project and how they’ve shaped my understanding of data science.

For accessibility, I’ve added a text-to-speech feature to this post, so you can listen to it with a click!

1. Decoding Housing Prices in Mexico

In this project, I dove into a dataset of 21,000 properties in Mexico to answer a key question: What drives real estate prices—property size or location? I started by importing and cleaning the data from a CSV file, handling missing values and outliers. Using Python libraries like Pandas and Matplotlib, I created visualizations to explore patterns, such as scatter plots showing price trends across regions. By calculating correlations, I uncovered how location often trumped size in influencing prices, especially in urban hotspots. This project taught me the importance of thorough data cleaning and how visualizations can reveal hidden insights.
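The cleaning and correlation steps described above can be sketched roughly as follows. This is a minimal illustration, not the project notebook: the data is synthetic, and the column names (`state`, `area_m2`, `price_usd`), the state premiums, and the 1st/99th percentile trimming rule are all assumptions made for the example.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for the real CSV; every column name here is an assumption.
rng = np.random.default_rng(0)
n = 500
df = pd.DataFrame({
    "state": rng.choice(["Distrito Federal", "Jalisco", "Yucatán"], size=n),
    "area_m2": rng.uniform(40, 400, size=n),
})
# Hypothetical price per square meter for each state.
premium = df["state"].map(
    {"Distrito Federal": 900.0, "Jalisco": 500.0, "Yucatán": 300.0}
)
df["price_usd"] = df["area_m2"] * premium * rng.uniform(0.8, 1.2, size=n)
df.loc[::50, "area_m2"] = np.nan   # inject some missing values
df.loc[3, "price_usd"] = 5e7       # inject one extreme outlier


def clean(frame):
    """Drop rows with missing values, then trim extreme prices."""
    out = frame.dropna(subset=["area_m2", "price_usd"])
    low, high = out["price_usd"].quantile([0.01, 0.99])
    return out[out["price_usd"].between(low, high)]


clean_df = clean(df)

# Size effect: Pearson correlation between area and price.
size_corr = clean_df["area_m2"].corr(clean_df["price_usd"])

# Location effect: average price per state, highest first.
mean_price_by_state = (
    clean_df.groupby("state")["price_usd"].mean().sort_values(ascending=False)
)

print(f"area vs. price correlation: {size_corr:.2f}")
print(mean_price_by_state.round(0))
```

Comparing the single correlation number with the spread of the per-state averages is one simple way to weigh size against location; a scatter plot of `area_m2` against `price_usd`, colored by `state`, would show the same thing visually.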

2. Predicting Apartment Prices in Buenos Aires

Next, I built a linear regression model to predict apartment prices in Buenos Aires, Argentina. This project was all about creating a robust data pipeline. I dealt with missing values, encoded categorical features like neighborhood types, and worked to reduce overfitting by fine-tuning the model. The result? A model that could reasonably predict prices based on features like square footage and amenities. This project reinforced my understanding of regression techniques and the critical need to balance model complexity to avoid overfitting.
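A pipeline along these lines might look like the sketch below. It is only an illustration under stated assumptions: the data is synthetic, the feature names (`surface_covered_m2`, `neighborhood`) and price levels are made up, and `Ridge` (a regularized linear regression) is swapped in as one common way to curb overfitting, which may differ from what the project actually used.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder

# Synthetic apartments; feature names and price levels are assumptions.
rng = np.random.default_rng(42)
n = 400
X = pd.DataFrame({
    "surface_covered_m2": rng.uniform(30, 150, size=n),
    "neighborhood": rng.choice(
        ["Palermo", "Recoleta", "Caballito", "Flores"], size=n
    ),
})
base = X["neighborhood"].map(
    {"Palermo": 60_000, "Recoleta": 70_000, "Caballito": 30_000, "Flores": 20_000}
)
y = base + 2_000 * X["surface_covered_m2"] + rng.normal(0, 10_000, size=n)
X.loc[::25, "surface_covered_m2"] = np.nan  # simulate missing surface values

# Impute numeric gaps with the mean and one-hot encode the categorical column.
preprocess = ColumnTransformer([
    ("num", SimpleImputer(strategy="mean"), ["surface_covered_m2"]),
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["neighborhood"]),
])
model = make_pipeline(preprocess, Ridge(alpha=1.0))

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Baseline: always predict the mean training price.
baseline_mae = mean_absolute_error(
    y_test, np.full(len(y_test), y_train.mean())
)

model.fit(X_train, y_train)
model_mae = mean_absolute_error(y_test, model.predict(X_test))

print(f"baseline MAE: {baseline_mae:,.0f}")
print(f"model MAE:    {model_mae:,.0f}")
```

Keeping imputation and encoding inside the pipeline means they are fitted on the training split only, so the test error stays an honest estimate. Comparing against the mean-price baseline shows whether the model has learned anything useful.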

3. Forecasting Air Quality in Nairobi

For my third project, I tackled air quality in Nairobi, Kenya, using an ARMA time-series model to predict particulate matter levels. The data, likely sourced from openAfrica, a major open data platform, was stored in a MongoDB database, which I queried using the pymongo library. After performing exploratory data analysis to spot trends in air pollution, I tuned the ARMA model’s hyperparameters to improve accuracy. This project was eye-opening, showing me how data science can address environmental challenges and inform public health policies.

4. On the Horizon: Earthquake Damage in Nepal

I’m currently preparing for my fourth project, which focuses on predicting earthquake damage to buildings in Nepal using logistic regression and decision tree models. This involves pulling data from a SQLite database and analyzing potential biases that could skew predictions, such as uneven representation of building types. I’m excited to explore how machine learning can help communities prepare for and mitigate natural disasters.
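As a rough preview of that workflow, the sketch below pulls rows from SQLite, checks how building types are represented, and compares the two model families. Everything specific in it is an assumption for illustration: the in-memory database, the `building` table, the column names (`age`, `floors`, `foundation_type`, `severe_damage`), and the synthetic damage rule are all invented, not the real Nepal data.

```python
import sqlite3

import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Build a small synthetic table in an in-memory SQLite database.
rng = np.random.default_rng(7)
n = 600
age = rng.integers(0, 80, size=n)
floors = rng.integers(1, 6, size=n)
foundation = rng.choice(["mud_mortar", "cement", "rc"], size=n, p=[0.7, 0.2, 0.1])
risk = 0.03 * age + 0.3 * floors + np.where(foundation == "mud_mortar", 1.5, 0.0)
severe = (risk + rng.normal(0, 1, size=n) > 3.0).astype(int)

conn = sqlite3.connect(":memory:")
pd.DataFrame({
    "age": age,
    "floors": floors,
    "foundation_type": foundation,
    "severe_damage": severe,
}).to_sql("building", conn, index=False)

# Pull the data back out with SQL, as one would from the real database.
df = pd.read_sql(
    "SELECT age, floors, foundation_type, severe_damage FROM building", conn
)
conn.close()

# Bias check: how is each building type represented, and how often damaged?
share_by_type = df["foundation_type"].value_counts(normalize=True)
damage_rate_by_type = df.groupby("foundation_type")["severe_damage"].mean()

X = pd.get_dummies(
    df.drop(columns="severe_damage"), columns=["foundation_type"], dtype=float
)
y = df["severe_damage"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=7, stratify=y
)

# Baseline: accuracy of always predicting the majority class.
baseline_acc = y_train.value_counts(normalize=True).max()

log_reg = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
log_reg.fit(X_train, y_train)
tree = DecisionTreeClassifier(max_depth=4, random_state=7).fit(X_train, y_train)

log_reg_acc = log_reg.score(X_test, y_test)
tree_acc = tree.score(X_test, y_test)

print(share_by_type.round(2))
print(f"baseline {baseline_acc:.2f} | logistic {log_reg_acc:.2f} | tree {tree_acc:.2f}")
```

If one building type dominates the table, as `mud_mortar` does in this toy data, overall accuracy can hide poor performance on the rarer types, so it is worth checking the metrics per group as well.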

Why These Projects Matter

Each project challenged me to think critically about data, from cleaning and preprocessing to building and evaluating models. They also highlighted the power of data science to address global issues like housing affordability, environmental health, and disaster preparedness. Working through real-world datasets gave me hands-on experience with tools like Python, SQL, and MongoDB, while also teaching me to consider ethical implications, such as biases in predictive models.

Meet the Authors
Zacharia Maganga’s blog features multiple contributors with clear activity status.
Zacharia Maganga, Lead Author (Active)
Linda Bahati, Co-Author (Active)
Jefferson Mwangolo, Co-Author (Active)
Florence Wavinya, Guest Author (Inactive)
Esther Njeri, Guest Author (Inactive)
Clemence Mwangolo, Guest Author (Inactive)
