Scolta: A Case Study in Practical AI Search

The Problem Scolta Solves

Most websites have mediocre search. Users type queries, get keyword-matched results that miss the semantic meaning, and give up. The obvious solution — vector search and dense retrieval — requires significant infrastructure investment: embedding pipelines, vector databases, reranking services, and ongoing maintenance. For the millions of websites on CMS platforms like Drupal and WordPress, this is often impractical.

Scolta, developed by Tag1 Consulting, takes a different approach: layer AI capabilities onto existing keyword search infrastructure without requiring vector databases. The result is a system that dramatically improves search quality for real-world content websites, built on technology that teams without specialized ML infrastructure can deploy and maintain.

Architecture Overview

Scolta's architecture has three core components built on top of Pagefind (a static site search system):

LLM Query Expansion: Before the query hits the search index, Scolta sends it to the Claude API to generate a richer query: synonyms, related terms, alternative phrasings, and domain-specific vocabulary the user might not have used. A query for "cancel subscription" becomes a richer query that also matches "terminate membership," "stop renewal," and "deactivate account."
Pagefind BM25 Search: The expanded query runs against a static Pagefind index — fast, client-side, no server required. The expansion dramatically improves BM25 recall without requiring semantic embeddings.
AI Overview Generation: After retrieval, Scolta sends the top results to Claude to generate a synthesized AI overview — a paragraph-length answer to the user's question, with citations to specific pages. This is what users see first, above the ranked results list.

What Makes Scolta Distinctive

The key architectural insight is pragmatic: most AI search papers optimize for absolute performance on academic benchmarks. Scolta optimizes for practicality for real-world deployments. This means:

No embedding infrastructure: Pagefind is static site search — it runs in the browser with no server-side indexing infrastructure. Scolta adds AI without adding complexity.
Incremental cost: One LLM API call per search query for expansion + one for the AI overview. Total marginal cost: under $0.05 per query at 2026 API prices.
Works with existing content: No changes to how content is structured or stored. Scolta builds on the existing Drupal (or other CMS) content model.
Transparent AI: The AI overview cites specific pages. Users see where the answer came from and can click through to verify.

Evaluation: Does It Work?

Internal evaluation comparing Scolta search to baseline Pagefind-only search on representative query sets showed: 35-45% improvement in NDCG@10 (a standard IR quality metric) and significantly higher user satisfaction ratings. The improvement is largest for natural-language and conceptual queries — exactly the cases where pure BM25 performs worst.

Teaching With Scolta

The IR-290 capstone at Meridian AI uses Scolta's open architecture as both a learning tool and a deployment option. Students who deploy Scolta implement LLM-based query expansion and AI overview generation from scratch, then compare their results to the production Scolta system. This hands-on approach produces deep understanding of how AI search systems work — from the theory through the implementation through the evaluation.

The full technical approach to AI-powered search, including Scolta's design, is documented in Tag1 Consulting's "Practical Path to AI Search" series at tag1.com/how-to/.