Writing

Engineering, AI experiments, and software craft.

Posts tagged "Performance"

Essays, field notes, and engineering write-ups by Al Amin Ahamed on performance: practical patterns, gotchas from production code, and lessons learned shipping real systems. Topics include architecture decisions, performance tuning, debugging stories, and integration notes you can apply directly to your own work.

May 25, 2026

RAG Without a Framework: A Minimal pgvector Pipeline with Claude

How to build a complete RAG pipeline in Python without LangChain or LlamaIndex — chunking, embedding with text-embedding-3-large, pgvector retrieval, and grounded generation with Claude — with the tradeoffs that matter at production scale.

May 14, 2026

Stop Using Fixed-Size Chunks for Technical Documentation

Why naive fixed-size chunking breaks on code-heavy documentation, and the heading-aware approach with paragraph fallback and overlap that replaced it in this portfolio's RAG pipeline.

May 14, 2026

Voyage AI vs OpenAI Embeddings for Technical RAG in PHP

Why I switched from OpenAI text-embedding-3-small to Voyage AI voyage-code-3 for this portfolio's RAG layer — model comparison, the input_type asymmetry Voyage requires, measured retrieval improvement, and the migration path that keeps search live throughout.

May 14, 2026

Embedding LLMs in a WordPress Plugin: EasyCommerce's Async Architecture

How EasyCommerce wires LLM calls — product description generation, fraud detection, and inventory forecasting — into a WordPress plugin using async dispatch and a provider abstraction that survives outages and scales across catalogues.

Apr 25, 2026
· 9 min read

RAG on SQLite: Pure-PHP Cosine Similarity Without pgvector

How I built RAG retrieval against a SQLite database using pure-PHP cosine similarity — when it scales, when it doesn't, and the migration path to pgvector.

Feb 5, 2026
· 8 min read

Cutting Claude API Costs by 89% with Prompt Caching

A practical Laravel integration of Anthropic prompt caching — what to cache, the 1024-token minimum, multi-breakpoint strategies, and real cost numbers.

Aug 12, 2025
· 8 min read

Tailwind v4 Migration: From v3 Config to CSS-First Tokens

A practical guide to migrating a real Laravel app from Tailwind v3 to v4 — what breaks, the new @theme syntax, plugin migration, and the 5× build speedup.

Jul 28, 2025
· 9 min read

React 19 in Production: useTransition, useOptimistic, and the End of forwardRef

How I use the new React 19 hooks in this portfolio's admin — useTransition for non-blocking input, useOptimistic for instant UI, and refs as props.

Jul 5, 2025
· 7 min read

Splitting Laravel into Three Vite Bundles for Smaller Pages

How I split this portfolio into three independent Vite bundles (frontend, auth, dashboard) — separate dependencies, separate CSS, working HMR across all three.

Mar 20, 2025
· 10 min read

Laravel Queues at Scale: Lessons from 10 Million Jobs

Patterns and pitfalls I've learned running Laravel queues in production — from connection pooling and job chunking to dead-letter handling and zero-downtime deployments.

Jan 30, 2025
· 8 min read

Hunting N+1 Queries in Laravel: Tools, Strict Mode, and Fixes

How to find N+1 queries with Telescope, Debugbar, and preventLazyLoading() — and the eager-loading patterns that fix them, including the polymorphic gotcha.

Jan 12, 2025
· 9 min read

10 Eloquent Query Optimizations, in Order of Effectiveness

A systematic optimization checklist for Eloquent — select(), chunk(), chunkById(), withCount(), pluck(), subquery selects, and when to bypass the model layer entirely.