Projects

A collection of projects spanning data engineering and AI/ML.

Video Transcription Service

GPU-accelerated video transcription pipeline on AWS with auto-scaling ECS workers and event-driven processing.

PythonAWSDockerCUDACloudFormation

Data Intelligence Tool

AI-powered schema analysis and validation code generator for PySpark applications.

PythonPySparkAnthropic Claudepandas