Project Overview
Nova Scotia is experiencing significant demographic shifts — an aging population, changing socioeconomic conditions, and increasing pressure on its healthcare system. Despite this, there is no consolidated, data-driven view connecting population trends, socioeconomic indicators, and health outcomes across NS regions.
This project addresses that gap. Using publicly available data from Statistics Canada, the NS Open Data Portal, and CIHI, Team A builds an integrated Business Intelligence solution that surfaces regional health patterns, demographic trends, and forecasted healthcare demand — delivered as an interactive Power BI dashboard.
Project Progress
- Project charter
- GitHub repo
- Data acquisition
- Sprint plan
- SQL/Python cleaning
- Power Query ETL
- Data dictionary
- Schema design
- Pandas EDA
- Trend regression
- Forecast CSV
- PBI wireframe
- Power BI build
- DAX measures
- Slicers & drill
- Internal review
- Final report PDF
- Repo tag v1.0
- Peer evaluation
- Reflection
Business Understanding
CRISP-DM Phase 1Nova Scotia health authorities and policy planners currently lack a single integrated view of how demographic change relates to health outcomes at the regional level. This project provides that view to support better resource allocation and planning decisions.
| Stakeholder | Interest |
|---|---|
| Government of Nova Scotia | Policy planning, healthcare resource allocation |
| NS Health Authorities | Operational planning, regional health monitoring |
| Data Analysts / Researchers | Data-driven insights and modelling |
| Instructor/Evaluator | Academic assessment |
| General Public | Deployment of production-level AI systems |
- Nova Scotia population demographics data
- Health indicators (hospital usage, chronic diseases, aging
- Data cleaning and preprocessing(Python/SQL)
- Exploratory Data Analysis (EDA)
- Dashboard development (Power BI)
- Basic forecasting or trend analysis
- Documentation and GitHub repository
- Real-time hospital system intergration
- Clinical diagnosis or medical predictions
- Patient patient-level data
- Deployment of production-level AI systems
Data Understanding
CRISP-DM Phase 2| Source | URL | Coverage |
|---|---|---|
| NS Open Data Portal | data.novascotia.ca | NS health authority datasets, demographics |
| Statistics Canada | www150.statcan.gc.ca | Population, socioeconomic, health survey data |
| CIHI Open Data | cihi.ca | Hospitalization, wait times, health workforce |
Data types collected: population by age/gender, life expectancy by region, chronic disease prevalence rates, hospitalization statistics, income and education indicators.
📄 Data dictionary: /data/data_dictionary.md Week 2
Data Preparation
CRISP-DM Phase 3All raw data is stored in /data/raw/ and must not be edited directly. Cleaned outputs are written to /data/cleaned/.
| Script | Purpose | Status |
|---|---|---|
| scripts/01_cleaning.sql | SQL cleaning, joins, aggregations | Week 2 |
| scripts/02_cleaning.py | Python/pandas cleaning pipeline | Week 2 |
| scripts/03_power_query.md | Power Query M-code documentation | Week 2 |
Modelling
CRISP-DM Phase 4📓 Python notebook: /notebooks/analysis.ipynb Week 3
| Step | Description |
|---|---|
| EDA | Age group distributions by region, correlation matrix of socioeconomic vs. health variables, seaborn heatmaps |
| Trend Analysis | Time-series trend fitting per age cohort and health region |
| Predictive Model | Linear or polynomial regression — 5–10 year population and healthcare demand forecast |
| Export | Forecast table exported as CSV for Power BI import |
Dashboard
CRISP-DM Phase 5 & 6📊 Power BI file: /powerbi/NS_Health_Analytics.pbix Week 4
| Page | Key Visuals |
|---|---|
| Executive Summary | 4 KPI cards: NS Population, Avg Life Expectancy, Top Health Region, YoY Change |
| Population Trends | Area chart by age group, population pyramid, NS choropleth map |
| Health Outcomes | Chronic disease bar chart, hospitalization line chart, income vs. health scatter |
| Demographic Deep Dive | Stacked age/gender bar chart, community-level indicator table |
| Forecast | Python-generated forecast overlay with confidence interval ribbon |