Ben Hayes

Building GPT-2 in PyTorch

GPT-2 was introduced in 2019 but we can learn how the fundamentals of transformer training and inference work by rebuilding GPT-2 in PyTorch.

Aug 5, 2024 23 min read Ben Hayes

GPT-2 was introduced in 2019 and primarily built within the TensorFlow framework. Let's take a look at how we can reproduce GPT-2 with PyTorch while following the methodology outlined in the accompanying GPT-2 research paper.

Optimizing Language Models with DSPy

Prompt engineering language models lacks necessary refinement especially for enterprise solutions. Optimize your language models with DSPy!

Jul 3, 2024 12 min read Ben Hayes

Optimizing language models is an important task as organizations integrate generative AI capabilities into their business processes. Relying on manual prompt engineering introduces risk, biases, and other problems. Continue reading to learn how to mitigate these risks with DSPy.

Multimodal AI with LLaVA on Databricks

LLaVA, among other multimodal AI models, leaps from standalone large language models and allows us to engage with AI in novel ways.

Mar 22, 2024 14 min read Ben Hayes

Understanding our world often requires interpreting stimuli from multiple sources including through vision or text. LLaVA accomplishes this by combining a vision encoder with a large language model. Continue reading to learn how to leverage multimodal AI with Databricks and LLaVA! 🌋

Object Detection with YOLOv8 on Databricks

Using Ultralytics YOLOv8 and Databricks, we can build and deploy an AI model to reliably detect objects.

Feb 24, 2024 12 min read Ben Hayes

For many AI-powered use cases, object detection emerges as a critical capability. Examples include driving cars, performing surgery, or identifying product defects. Continue reading to learn how to leverage Databricks and YOLOv8 to detect objects! 🤖

Blog

About

Contact

Resume/CV

Building GPT-2 in PyTorch

Optimizing Language Models with DSPy

Multimodal AI with LLaVA on Databricks

Object Detection with YOLOv8 on Databricks

👋 Hi, I'm Ben.

Recent Posts

Building GPT-2 in PyTorch

Optimizing Language Models with DSPy

Multimodal AI with LLaVA on Databricks

Object Detection with YOLOv8 on Databricks

Getting Started with OpenAI

Hands-on with Edge AI

Data Science at McKinsey

Building Data Pipelines with Kedro

About