Databricks Notebook Demos

Collection of notebooks I created in Jupyter while learning API calls, pandas, and python

Western Pennsylvania Regional Data Center (WPRDC) Data Downloader

Using the WPRDC API, download all of the Pittsburgh City wide revenues and expenses, and write to a delta table

Profile the WPRDC Data

Using Pandas Profiler, create a report that shows a high level breakdown of the data.

Analyze the contents of a PDF file and use AI to extract wanted data

Using PDFplumber and OpenAI I extract the text from a PDF and use AI to correctly parse the wanted data. Then, I write it to a delta table and step through the medallion architecture.

Nifty tech tag lists fromĀ Wouter Beeftink