In this project, I used census data, along with other data that I webscraped or used an API for to build a Power BI dashboard that presented the data.
First, I collected all the data, using a few Python libraries such as pandas and BeautifulSoup. Then I saved the data to an AWS S3 bucket. This GitHub repository contains the python scripts used to extract, transform and load the data into the S3 bucket.
I built an AWS Athena Database with the S3 bucket to organize and relate all the data.
![](https://static.wixstatic.com/media/0c39ad_3f4b4a77a7e441888e73b6b637b40cd8~mv2.png/v1/fill/w_49,h_34,al_c,q_85,usm_0.66_1.00_0.01,blur_2,enc_auto/0c39ad_3f4b4a77a7e441888e73b6b637b40cd8~mv2.png)
Using the Amazon Athena Power BI connector, I was able to pull the database into Power BI and build a model.
![](https://static.wixstatic.com/media/0c39ad_a45d33f48acf4501b9e7cb57121c0fba~mv2.png/v1/fill/w_49,h_30,al_c,q_85,usm_0.66_1.00_0.01,blur_2,enc_auto/0c39ad_a45d33f48acf4501b9e7cb57121c0fba~mv2.png)
Using this model, I design a dashboard that displayed the collected data
I used this project to learn the basics of AWS S3 management, and Athena database set-up and usage. To that end, I believe I was successful.
Thank you for reading this progression post.