Ryan Cali

Logo

Data Engineer with 9 years of experience working in the healthcare sector on cloud and AI applications integrated with ETL pipelines.

Previous experience working in computer vision and deep learning research in disease at Harvard Medical School and Massachusetts General Hospital.

Portfolio


Kafka GPS Location Streaming to Postgres

Docker-based service that allows mapping of GPS location with low latency based on movements sent via smartphone. Generates tables in a Postgres database that allow reconstruction of a path of travel based on coordinates and associated timestamps.

Code

description

Python PostgreSQL Docker Kafka


Synthesis of High-Resolution Research-Quality MRI Data from Clinical MRI Data in Patients with COVID-19

Publication

description

Python Jupyter TensorFlow


The Influence of Brain MRI Defacing Algorithms on Brain-Age Predictions via 3D Convolutional Neural Networks

Publication

description description

Python Jupyter TensorFlow


Adding provenance to new dataflow engine written in Python: Pydra

Google Summer of Code

description

Python Jupyter


Tech Stack

Languages & Frameworks

Python PySpark Shell

Data Platforms & Databases

Databricks PostgreSQL SQL Server

Cloud Providers

AWS Azure GCP

DevOps & Tools

Docker GitHub Pulumi

Analytics & Visualization

Tableau Looker QuickSight


Certifications

Databricks Certified Data Engineer

certification badge