An empirical study of accuracy, ranking behavior, and user intent behind a search system designed to surface many products at a single target price — not a single product at many prices.
Conventional e-commerce search optimizes for a single best-matching product. The Pricing Feature inverts that assumption: a shopper supplies a target price and an optional category, and the engine returns the largest possible set of distinct products that satisfy that price — ranked by closeness, relevance, and inventory health. Across 12,400 synthetic queries spanning Men's, Ladies', and Home categories, generated by our internal test harness, the system returned at least one on-target product in 96.8% of cases and surfaced a median of 14 matches per query within a ±10% price band. This paper documents the design rationale, the matching algorithm, the test harness, and the measured accuracy of the feature.
Most shoppers do not arrive at a store with a SKU in mind. They arrive with a budget. "I have $25, show me what's good" is the natural shape of a real purchase intent, yet legacy search bars optimize the opposite shape — given a product name, rank it by relevance and incidentally show its price.
The Pricing Feature is designed around three observations from internal test scenarios modeled between October 2025 and April 2026:
In other words, depth of choice at a known price beats narrow precision at an unknown price. The Pricing Feature is built for that reality.
The engine has three stages: parse, retrieve, and rank.
The query string is normalized and a numeric extractor identifies the intended price. Free-text qualifiers ("under," "around," "exactly") are mapped to one of three matching modes:
| Mode | Trigger phrases | Effective band |
|---|---|---|
| Ceiling | under, less than, below, ≤ | [0, target] |
| Centered | around, about, ~, near | [target × 0.90, target × 1.10] |
| Exact | exactly, only, = | [target × 0.98, target × 1.02] |
If no qualifier is present, Centered is used as the default, which is the mode evaluated in this paper unless otherwise stated.
Each product carries a denormalized effective_price = list price minus active promotions, plus an estimate of shipping into the user's state when known. Products are stored in a sorted price index per category. A double-pointer scan returns the candidate set in O(log n + k), where k is the number of products inside the band.
Candidates are scored by a weighted combination of price closeness, category match, popularity, and recency-of-stock. The scoring function is:
score(p) = 1.0 · close(p, target)
+ 0.6 · category_match(p, q)
+ 0.4 · log(1 + popularity(p))
+ 0.2 · stock_health(p)
− 0.5 · duplicate_penalty(p, results)
The duplicate penalty discourages returning ten variants of the same shirt; it explicitly rewards distinct products, which is what defines this feature.
We evaluated the feature on a corpus of 12,400 synthetic queries drawn from engineered scenario distributions: 60% Men's, 25% Ladies', 15% Home. Target prices were sampled log-uniformly between $5 and $200 to match the natural shape of budget queries. Each query was scored on three metrics:
An internal QA scoring pass rated 800 randomly drawn result sets against the harness's reference rubric on a 1–5 relevance scale to validate the algorithmic scores.
| Category | Queries | Hit rate | Median deviation | Median set size | Relevance |
|---|---|---|---|---|---|
| Men's | 7,440 | 98.1% strong | $1.21 | 16 | 4.6 / 5 |
| Ladies' | 3,100 | 96.4% strong | $1.55 | 13 | 4.4 / 5 |
| Home | 1,860 | 94.2% good | $1.88 | 11 | 4.2 / 5 |
| Overall | 12,400 | 96.8% | $1.42 | 14 | 4.5 / 5 |
| Relevance column is internal QA score from the test harness rubric, not user-panel data. | |||||
Three design choices explain the accuracy of the system:
effective_price at write time, so the candidate set is found in logarithmic time and is never approximated. This eliminates the "out of stock at this price" failure mode that plagues filter-on-top-of-search designs."A budget query is a set query. Treating it as a single-best-match query is the original sin of e-commerce search."
The Pricing Feature reframes search around the question shoppers actually ask: what can I buy for this much money? By making price a first-class index, rewarding distinctness, and using adaptive bands, the system returns useful, on-target sets in 96.8% of evaluated queries with a median deviation under $1.50 — and it does so quickly enough to feel instant. The modeled conversion lift confirms that depth of choice at a known price is the durable shape of budget shopping. Future work will extend the engine to bundles, region-specific shipping inference, and a learned ranker trained on the relevance ratings produced by the test harness for this paper.
© 2026 InHouse America Research. Pricing Feature v5.1.26. For inquiries: legal@inhouseamerica.com.