Collection of notebooks I created in Jupyter while learning API calls, pandas, and python
Using the WPRDC API, download all of the Pittsburgh City wide revenues and expenses, and write to a delta table
Using Pandas Profiler, create a report that shows a high level breakdown of the data.
Using PDFplumber and OpenAI I extract the text from a PDF and use AI to correctly parse the wanted data. Then, I write it to a delta table and step through the medallion architecture.