Hello, I'm

Dr Bradley Davy

Research Software Engineer

Infrastructure automation · Containerised platforms · HPC & Cloud · SRE

AWS · Kubernetes · Terraform · Ansible · Python · Docker

About Me

I'm a Research Software Engineer with a strong focus on infrastructure automation, containerised platforms, and performance of distributed compute systems.

I have experience building and operating cloud and on-prem environments using infrastructure as code, supporting users running large-scale workloads, and diagnosing system-level issues across compute, storage, and networking.

I hold a Ph.D. in Fluid Dynamics from the University of Leeds, giving me a rigorous analytical foundation that I apply to complex engineering problems. I'm looking to develop further in Site Reliability Engineering, with a particular interest in the observability, resilience, and operational readiness of production platforms.

PhD Fluid Dynamics
HPC & Cloud Platforms
SRE Focused

Technical Skills

☁️

Cloud & Infrastructure

  • AWS (VPC, EC2, networking)
  • Linux & distributed systems
  • Shared storage solutions

🏗️

Infrastructure as Code

  • Terraform
  • Ansible
  • Automated provisioning & configuration

🐳

Containers & Orchestration

  • Docker, Podman, Apptainer
  • Container build pipelines
  • Runtime debugging

⚙️

Scheduling & Compute

  • Slurm workload manager
  • GPU/CPU resource scheduling
  • HPC operations

📊

Observability & Ops

  • Log analysis & metrics collection
  • Resource monitoring
  • Performance dashboards

💻

Programming

  • Python (advanced)
  • Bash, Go, SQL
  • JavaScript

Work Experience

Research Software Engineer

OCF
Nov 2024 – Present

Build and support HPC and cloud platforms used by researchers and engineers for compute-intensive and GPU-accelerated workloads.

  • Automated deployment of distributed compute environments on AWS using Terraform and Ansible, including Slurm scheduling, shared storage, and container runtimes.
  • Supported users running production-like workloads, diagnosing issues related to job scheduling, resource contention, networking, and container execution.
  • Built and maintained containerised services using Docker and Apptainer, with emphasis on reproducibility and operational stability.
  • Developed and deployed a GPU-backed LLM service using PyTorch and Transformers, gaining experience with capacity planning, startup reliability, and monitoring of long-running services.

AWS · Terraform · Ansible · Slurm · Docker · PyTorch

Research Software Engineer

University of York
Apr 2024 – Nov 2024

Developed and supported research software services within an academic environment.

  • Built Python-based web services and APIs used by academic researchers.
  • Investigated and resolved issues across application and infrastructure layers in collaboration with system administrators and research teams.

Python · REST APIs · Linux

Education

2020 – 2024

Ph.D. in Fluid Dynamics

University of Leeds

2020 – 2022

MSc Fluid Dynamics (Merit)

University of Leeds

2016 – 2020

BSc Physics (First Class)

Sheffield Hallam University

Get in Touch

Open to SRE, platform engineering, and infrastructure roles. Feel free to reach out.