Query-Ad-Clustering

Query-ad-clustering is the algorithm Lu, Pál, and Pál propose for Lipschitz contextual bandits: cluster similar contexts, discretise the action space, and run an upper-confidence bandit strategy per context cluster.

Key points

The algorithm works in phases. At the start of each phase, it partitions the query metric space into clusters of bounded diameter and chooses a finite representative subset of ads that covers the ad metric space ^[src-020].
When a query arrives, the algorithm identifies its query cluster and selects from the representative ads using an upper confidence index based on empirical payoff plus an uncertainty radius ^[src-020].
The intuition is practical: if nearby queries have similar click-through behaviour, one can pool evidence inside a query cluster instead of learning a separate ad model for every possible query ^[src-020].
The algorithm becomes more refined over phases as cluster radii shrink, trading coarse early learning for finer context/action resolution later ^[src-020].
The paper proves an upper regret bound for query-ad-clustering and a matching lower-bound style result showing essential optimality for finite spaces or bounded Euclidean spaces ^[src-020].

Related entities

_(none yet)_

Related concepts

Source references

^[src-020] Tyler Lu, David Pál, Martin Pál — “Contextual Multi-Armed Bandits” (AISTATS 2010)

Robin Cartier perspective

This page is part of Robin Cartier's working AI knowledge graph: a practical research layer for production AI, recommendation systems, experimentation, GEO, and agentic web readiness.

The useful next step is to connect this concept back to applied product leadership and operating models.

Recommended next

Keep reading from this thread

From 491 indexed pages and articles.

Query-Ad-Clustering

Query-Ad-Clustering

Key points

Related entities

Related concepts

Source references

Robin Cartier perspective

Keep reading from this thread

Robin Cartier

Company

Services