Building GPT-2 in PyTorch

GPT-2 was introduced in 2019 but we can learn how the fundamentals of transformer training and inference work by rebuilding GPT-2 in PyTorch.

23 min read Ben Hayes

GPT-2 was introduced in 2019 and primarily built within the TensorFlow framework. Let's take a look at how we can reproduce GPT-2 with PyTorch while following the methodology outlined in the accompanying GPT-2 research paper.

Optimizing Language Models with DSPy

Prompt engineering language models lacks necessary refinement especially for enterprise solutions. Optimize your language models with DSPy!

12 min read Ben Hayes

Optimizing language models is an important task as organizations integrate generative AI capabilities into their business processes. Relying on manual prompt engineering introduces risk, biases, and other problems. Continue reading to learn how to mitigate these risks with DSPy.

Multimodal AI with LLaVA on Databricks

LLaVA, among other multimodal AI models, leaps from standalone large language models and allows us to engage with AI in novel ways.

14 min read Ben Hayes

Understanding our world often requires interpreting stimuli from multiple sources including through vision or text. LLaVA accomplishes this by combining a vision encoder with a large language model. Continue reading to learn how to leverage multimodal AI with Databricks and LLaVA! 🌋

Object Detection with YOLOv8 on Databricks

Using Ultralytics YOLOv8 and Databricks, we can build and deploy an AI model to reliably detect objects.

12 min read Ben Hayes

For many AI-powered use cases, object detection emerges as a critical capability. Examples include driving cars, performing surgery, or identifying product defects. Continue reading to learn how to leverage Databricks and YOLOv8 to detect objects! 🤖