Introduction to Reinforcement Learning for Absolute Beginners

Introduction to Reinforcement Learning for Absolute Beginners Image: A child learning to ride a bicycle through trial and error - the essence of reinforcement learning Imagine teaching a child to ride a bicycle. They learn by trying, wobbling, and maybe falling – trial and error guided by little victories and tumbles. Over time, they adjust their balance and steering to maximize the thrill of coasting (and minimize the painful falls). This process of trial-and-error learning is exactly what reinforcement learning (RL) is all about. In RL, a computer agent learns from feedback: it takes actions, observes the outcomes (rewards or penalties), and adapts its behavior to get better results in the future. Just as you might avoid actions that make a puppy grumpy and repeat those that make it wag its tail, an RL agent learns to favor actions that lead to positive rewards. ...

October 20, 2025 · 27 min · Anoop Maurya

LLMs as Judges: Using Large Language Models to Evaluate AI-Generated Text

LLMs as Judges: Using Large Language Models to Evaluate AI-Generated Text 1. Introduction The Challenge of Evaluating AI-Generated Text Imagine you’re a teacher grading thousands of essays, or a company evaluating customer service responses generated by AI. How do you determine which responses are good, which are bad, and which need improvement? This is one of the biggest challenges in artifical intelligence today. Traditionally, researchers have used mathematical formulas (called metrics like BLEU and ROUGE) to automatically score text. Think of these like spell-checkers – they can catch obvious errors, but they can’t tell if a piece of writing is truly engaging, accurate, or helpful. These traditional methods often miss the nuances that make text truly good: Does it flow naturally? Is it factually correct? Does it actually answer the question asked? ...

September 27, 2025 · 36 min · 7573 words · Anoop Maurya

Building AI Agents from Scratch: Understanding the Core Components Behind the Magic

Introduction As a data scientist and AI engineer, I’ve spent countless hours working with various agentic frameworks like LangGraph, AutoGen, and CrewAI. While these tools are incredibly powerfull and make our lives easier, I often found myself wondering: what actually happens under the hood? What makes an AI agent tick? How do all these pieces come together to create something that can think, plan, and act autonomously? That curiosity led me down a fascinating rabbit hole, and eventually to building an AI agent completly from scratch - no frameworks, no abstractions, just pure Python and a deep dive into the fundamental building blocks that make agents work. ...

September 21, 2025 · 8 min · 1660 words · Anoop Maurya