Skip to content

Blog

Writing

Engineering, AI experiments, and software craft.

Posts tagged "AI / LLMs"

Essays, field notes, and engineering write-ups by Al Amin Ahamed covering AI / LLMs — practical patterns, gotchas from production code, and lessons learned shipping real systems. Topics include architecture decisions, performance tuning, debugging stories, and integration notes you can apply directly to your own AI / LLMs work.

Filtered by AI / LLMs ✕ clear
May 31, 2026
· 0 min read

Anthropic Academy — Six Certifications Completed

I completed six Anthropic Academy courses covering the Claude API, Model Context Protocol, subagents, agentic workflows, Claude Code, and AI fluency — each with a verified certificate from Anthropic Education.

May 25, 2026
· 0 min read

Prompt Versioning in Production: Managing LLM Prompts Like Code

How EasyCommerce manages LLM prompts across Claude, OpenAI, and DeepSeek with a versioned registry, structured evals in CI, and a rollback path that does not require a deployment.

May 25, 2026
· 0 min read

RAG Without a Framework: A Minimal pgvector Pipeline with Claude

How to build a complete RAG pipeline in Python without LangChain or LlamaIndex — chunking, embedding with text-embedding-3-large, pgvector retrieval, and grounded generation with Claude — with the tradeoffs that matter at production scale.

May 25, 2026
· 0 min read

Building a Claude Code MCP Server in Python: Lessons from codebase-research-agent

How codebase-research-agent exposes a tool-using AI agent as a Claude Code MCP server — JSON-RPC over Streamable HTTP, tool registration, streaming SSE, and the four things that broke in production before it worked reliably.

May 14, 2026
· 0 min read

Scaling a Laravel MCP Tool Registry to 46 Tools

The May 12 post covered the MCP server architecture. This one goes inside the tool registry: class-per-tool structure, Sanctum ability scoping, JSON Schema validation before dispatch, tag-based auto-discovery, and the three things that broke when the registry hit 46 tools.

May 14, 2026
· 0 min read

Stop Using Fixed-Size Chunks for Technical Documentation

Why naive fixed-size chunking breaks on code-heavy documentation, and the heading-aware approach with paragraph fallback and overlap that replaced it in this portfolio's RAG pipeline.

May 14, 2026
· 0 min read

Voyage AI vs OpenAI Embeddings for Technical RAG in PHP

Why I switched from OpenAI text-embedding-3-small to Voyage AI voyage-code-3 for this portfolio's RAG layer — model comparison, the input_type asymmetry Voyage requires, measured retrieval improvement, and the migration path that keeps search live throughout.

May 14, 2026
· 0 min read

A Claude Tool-Calling Loop in Laravel: From First Request to Final Answer

The exact pattern for registering tools, dispatching multi-turn Claude conversations, and processing tool results in a Laravel service — including the three production failure modes the happy path does not cover.

May 14, 2026
· 0 min read

Embedding LLMs in a WordPress Plugin: EasyCommerce's Async Architecture

How EasyCommerce wires LLM calls — product description generation, fraud detection, and inventory forecasting — into a WordPress plugin using async dispatch and a provider abstraction that survives outages and scales across catalogues.

May 12, 2026
· 0 min read

MCP Servers for Laravel: A Production Pattern

How I built the MCP server that powers AI access to this portfolio — JSON-RPC over Streamable HTTP, Sanctum bearer auth, an ability-scoped tool registry, and the gotchas worth knowing before you ship.

Apr 25, 2026
· 9 min read

RAG on SQLite: Pure-PHP Cosine Similarity Without pgvector

How I built RAG retrieval against a SQLite database using pure-PHP cosine similarity — when it scales, when it doesn't, and the migration path to pgvector.

Mar 10, 2026
· 10 min read

A Multi-Provider AI Driver Pattern for Laravel

A clean driver pattern for switching between Claude, OpenAI, Groq, and Ollama in a Laravel app — with failover, testing, and a single calling interface.