Ozon Search Working Principles

Table of Content

Search on Ozon is not only a system for ranking products by their textual relevance to customers' search queries, but also a system for distributing traffic and determining products' quality offers. Both customers and sellers have an influence upon search results.

Product position in the search affects its commercial indicators: conversions, traffic, and sales. The way the search is set up allows sellers to influence the position of their products in the search results in a direct, transparent and predictable way. Since quality offers from verified sellers are ranked higher in the search results, they get more traffic and therefore more sales.

Ozon also determines what a quality offer is based on the product price, its quality, and the delivery speed. For those sellers who can provide such products, Ozon offers free tools for promotion in the search results.

Needs that Ozon search addresses #

  • Motivate sellers to make high-quality offers.
  • Help sellers demonstrate their products to customers.
  • Help customers find and choose the necessary product.
  • Take into account the customers' choice when distributing traffic between sellers' offers.
  • Inevitability. Search results always react to the actions of sellers or customers.
  • Non-discrimination. The reaction of search results to the same actions of different sellers in similar situations is the same.
  • Speed. Ozon is actively working to make the search engine response of search results to sellers' and customers' actions fastest possible.
  • Transparency. All the details of the search operation on Ozon are open. Ozon does not affect the ranking of products in the search results and does not hide what factors lie behind it.

Ozon search scheme #

The process begins as a customer performs a search on the Ozon website. It consists of several stages.

  1. Request generation and candidate selection stage

    Every day customers make millions of search queries on the Ozon search engine. To make sure that everyone finds the necessary product among millions of others, at the first stage the search engine generates a large list of candidate products, whose names and various attributes contain words from the search query. For each request, this list can contain up to 500,000 products.

  2. Base ranking layer

    The search engine sorts the received candidates to sort out 1,000 products with the highest text relevance. If necessary, the range can be expanded, and the engine will load extra 1,000 products.

  3. Middle ranking layer (machine-trained formula)

    For the resulting list of 1,000 products we now need to estimate the probability with which customers may buy them. The ranking of these products depends on the product features that have proven to be most important to customers in the past. To figure it out, Ozon compares the product features and customer behavior. The result is a score for each product on a scale from 0 to 1.

  4. Boosters and deboosters

    Pre-determined increasing or decreasing coefficients are applied to 1,000 products that are ranked from 0 to 1 after the previous stage. For example, a multiplying coefficient can be applied to premium sellers' products, and a decreasing coefficient can be applied to products with a low rating.

  5. Displaying products

    1,000 products with a final score passed on to the search results. Customers see pages with the products in the specified order. When they have browsed through 1,000 items, the system will load the next 1,000 items.

Let’s take a closer look at the search stages.

1st stage: forming a request and selecting candidates #

Search for words by product attributes #

When a search query arrives, the system starts searching for words from this query in all fields and attributes of products that are stored in the search:

  • product name,
  • product category,
  • brand,
  • color, size, material, type and other attributes.

We check several hundred attributes, the final list of them depends on the product category. The more attributes a product has filled in, the easier it is to find it via search.

This list does not include a text description of the product. Therefore, it is very important for the seller to fill in namely the product attributes: the more fully the product is described, the more likely customers will see it in the search results.

Converting and normalizing a query #

Before selecting candidates, the search engine performs a chain of query transformations to make sure that the suitable candidates won’t be missed:

  1. Enrichment with synonyms. This step is added to make sure that the query covers more products.For example, so that the “phone” query would return smartphones as a result.
  2. Request normalization. All the words of the query are brought to a unified form so that, for example, both the “builder Johnson’s emerald ship” and “builder Johnson’s emerald ships” queries would have the same output. At this stage the system also excludes words that do not carry a semantic load, for example, conjunctions.
  3. Selection of candidates. All products containing all query words are extracted from the search index for all the remaining words. If the number of such products is too small, all products that contain at least one query word are also retrieved.

At this stage, the search results contain up to 500,000 candidate products. Their list is passed on to the second stage.

2nd stage: basic ranking layer #

Sorting candidates by relevance #

At the second stage, the selected candidates are sorted by text relevance. Here the search engine focuses on:

  • The fact of the product’s textual correspondence to the query: at first the number of matches between the words from the search query in the fields and the attributes of the product for each individual word in the query is determined by the usual linear formula. Then the number of matches for all the words from the query is summed up.
  • Product rating. At this stage the rating is used as a multiplying coefficient. The higher the product rating, the higher the coefficient. It ranges from 0% for the minimum rating to 25% for the maximum.

As a result, all candidate products receive an assessment of basic relevance as a combination of text compliance and product rating.

At this stage, the search engine sorts the candidates and passes a list of 1,000 most relevant candidates on to the third stage. If necessary, the range can be expanded, and the engine will load extra 1,000 products.

3rd stage: middle ranking layer #

How machine-trained ranking works #

At this stage, the search engine needs to evaluate which of the 1,000 products customers are most likely to purchase. Since the probability of buying each product is unknown, you need to predict it to be able to rank the products.

The prediction is made in several stages:

  1. All the features of each individual product, that may affect the fact of sale, are calculated. Here each feature is a real number. For example, the product rating or its price can be called a feature in this context.
  2. A machine-trained model is applied to a set of features. It predicts the probability of a sale based on the value of the features. Ozon uses gradient boosting of decision trees as a model.
  3. The model outputs a number that is used to estimate the product purchase probability.
  4. The evaluation of the purchase probability of a product is put into the context of the query. The same product will have a different purchase probability for different requests.

How a machine-trained model works #

The purpose of the model is to most accurately predict the product purchase probability based on the product features. Product features contain information about the characteristics of products purchased by customers in the past, and those of them that the ranked product has now.

Based on this data from the past, the model predicts future purchasing behavior. For example, there is a feature of the average conversion of a product from an impression to a purchase over the past 60 days. For many products it is possible to state that the product conversion does not change over time. Consequently, the model uses this feature to predict purchase probability, and it is possible to say that the higher the conversion rate over the past 60 days, the higher the model prediction will be.

The same thing happens with other features — for example, other things being equal, customers prefer products with a large number of reviews, and therefore the more reviews, the higher the prediction of the model will be. At the same time, different features have different predictive power and a different effect on the purchase probability.

The model training is based on using the dependency data between product features and final purchases via a special algorithm. Since many of the features are based on customer behavior, we can say that Ozon customers influence the products' position in search results by their actions. Primarily, the results are influenced by the product’s purchase, as the impact of feature weight on purchase prediction is based on the past sales of the product.

What is taken into account in machine-trained ranking #

This stage takes into account all the features of the product that affect its sale. These include the product characteristics, its rating, customer behavior, such as placing an order, viewing the product or adding it to the cart. In total, the Ozon search engine takes into account about 100 different features.

Sellers can influence the product position in the search results as they directly control the values of some features.

The main conceptual blocks of the features taken into account:

  • Popularity. This block combines characteristics that describe customer interactions with a product: product impressions, adding it to cart, and to favorites. A product impression is defined as opening the product description card, and viewing the product in the search results. In the current search model, popularity has a weight of 49%.
  • Sales. This block combines characteristics derived from the past product sales: the product conversion into purchases, and the number of purchases of that product. Sales weight is 19%.
  • Price. These characteristics are derived from the product price: the product price change in the price of this product in relation to its own price history and the size of the current discount. Its weight is 9%.
  • Rating. These are characteristics that are derived from the product rating: the product rating itself, and the number of reviews for the product. Its weight is 8%.
  • Delivery. This block contains characteristics describing the speed of product delivery: the speed of delivery itself and whether the product belongs to the express category or not. Its weight is 6%.
  • Text relevance. These characteristics describe the product’s textual relevance to the query. Its weight is 6%.

The predictive power of features is fluctuating since both the formula gets updated regularly and new features get added.

At this stage, the search engine evaluates each product from 1,000 candidates on a scale from 0 to 1. The closer the score is to one, the higher position the product will take in the search results and the higher the probability of its sale. All scores are passed on to the fourth stage.

4th stage: applying boosters and deboosters #

At this stage, predetermined increasing or decreasing coefficients are applied to candidate products that are ranked from 0 to 1. For example, a multiplying coefficient can be applied to products from premium sellers, and a decreasing one—to products with a low rating.

How boosting works #

For example, let’s take a closer look at the “red socks” query. Let’s say that after passing through three stages of search, machine-trained ranking included, a product named “red socks” gets a score of 0.7. If this is a product from a premium seller, a booster will be applied to it—for example, × 1,3. In this case, the final product rating will be 0.91 instead of 0.7.

Deboosting is an inverse operation, where the product score is multiplied by a decreasing coefficient after passing the search stages. For example, a score of 0.7 obtained due to a bad product price (in relation to the previous variants of its price) is multiplied by a deboosting coefficient of 0.8 and gives a resulting score of 0.56.

The boosting and deboosting coefficients change quite often, as Ozon is constantly looking for new coefficients that may help increase the total number of sales on the site.

All boosters, if they are not mutually exclusive, are cumulative. For example, let’s consider a product placed on Ozon, that is not only a product from a premium seller, but also has a guaranteed next day delivery, and a good price. Let’s assume that for each of these boosters, the boosting coefficient is 10%. In this case, a product with a machine training based relevance score of 0.7 will get a final relevance score of 0,7 × 1,1 × 1,1 × 1,1 = 0,9317.

Types of boosters and deboosters #

Boosters #

The following boosters are applied after the model ranking and are currently enabled:

  • Promotional boost stands for the purchase of a boosting coefficient for a price. Learn more in your seller profile: navigate to the Advertising campaigns → Search engine promotion section.
  • Logistics related boosters. Only one boosting from this category can be applied to one product. The application sequence is as follows:
    • Ozon Express products. Such products get a multiplying coefficient due to the delivery speed. The coefficient here is 7.5%.
    • Products marked with the rocket icon. Those products for which the seller guarantees same or next day delivery from the local warehouse get a multiplying coefficient of 7.5%.
    • Warehouse nearby. Products that are stored in the FBO warehouse closest to the customer get +5% in relevancy.
  • Premium sellers. Products from a premium seller receive a free booster of 5%.
  • Good price. Products with a good price in relation to their own price history get a multiplying coefficient: +10% in relevancy for reducing the price by 50% or more. This dependency is linear: 5% will be added for reducing the price by 25%. Within 30 days, the multiplying coefficient drops to 0. If the price increases, it will trigger the reverse mechanics of deboosting.

Deboosters #

  • Bad price. Products with a bad price in relation to their own price history receive a decreasing coefficient: -10% in relevancy for increasing the price by 50% or more. This dependency is linear: 5% will be deducted for increasing the price by 25%. Within 30 days, the decreasing coefficient drops to 0. If the price reduces, it will trigger the reverse mechanics of boosting.

Other deboosters are currently under development. They are expected to be released in the first quarter of 2022. The following deboosters are expected to be added:

  • Blatant violations of the marketplace rules. Products from sellers who have committed blatant violations of the marketplace rules get a decreasing coefficient applied to them.
  • Low product rating. Products with a low rating per a large number of reviews get a decreasing coefficient applied to them.
  • Low seller rating. Products from sellers with a low rating get a decreasing coefficient applied to them.

Resulting score and the effect of boosting and deboosting #

Boosting may not be sufficient to supersede the effect of customer voting.

For example, let’s consider two products, one of which after the third search stage has a score of 0.9, and the other has a score of 0.6. In this case, the first product should be ranked first in the search results, and the other should be ranked second. If the second product is boosted × 1.25, its resulting score will be 0.6 × 1.25 = 0.75. Since the initial score of the second product was rather low, boosting still won’t help it rise above the first product in the search results.

It is important to understand that for one query the final score of 0.9 can get the product to the first position, and for another it will bring it to the tenth one only. It depends on the score of other products included in the search results.

5th stage: displaying products #

How the product display works #

  1. The search engine gets a list of 1,000 products with final scores.
  2. The products are displayed to customers based on these scores: the ones with the highest scores appear at the top of the search results list, and the ones with the smallestshow up at the bottom.
    • In both desktop and mobile versions of Ozon website the search results page where products are displayed consists of 36 tiles.
    • In the mobile app, the first 36 products are shown on the first screen, and the next ones are loaded when scrolling further.
  3. The products in the search results are loaded along with their most up-to-date characteristics and images.

The product’s place in the search results affects its commercial indicators: conversions, traffic, and sales.

How to raise the product in the search results #

  1. Use the search promotion. You can affect your products position on the search page or in the catalog by placing bids for them for the necessary search queries. To place a bid, navigate to the Advertising campaigns → Search engine promotion section of your seller profile.

  2. Join the Premium loyalty program for sellers. Program participants have an opportunity to offer their customers an increased cashback. To join in the loyalty program, navigate to the More → Promotion → Premium for sellers section in your seller profile.

  3. Reduce the product price. Products with a lower price in relation to their average price in the past 30 days appear higher in the search results. Keep in mind that the average product price value gradually decreases after the changes are made, so the effect will weaken over time. To reduce the product price, navigate to the Prices → Product prices section of your seller profile.

  4. Collect customer reviews. The higher the product rating and the more reviews it has, the higher the product appears in the search results. You can offer customers points for submitting reviews of your product. In your seller profile, navigate to the More → Promotion → Reviews for points section to enable this service.

  5. Fill in the product attributes completely. The more information about the product you add, the easier it is for your customers to find it via search. To edit product details, in your seller profile navigate to the Manage Inventory section and choose the Edit product option on the necessary product. You can also edit information for several products at once using an XLS template.

Frequently Asked Questions #

New products do not get into the search results immediately—as a rule, it takes several minutes. Besides, the product may not be visible in the search results for 20 minutes after the changes are made due to the peculiarities of search results caching. If the product is still is not shown in the search results after this time has elapsed, please check whether all the key product details are filled in: namely its price, name, and description. If the product does not appear in the search results within 48 hours, contact the support service.

My product does not appear in the search results for the queries where I expect it to appear. #

If the product is not visible by the specific query, please make sure that:

  • The product really does not show up in the search results: the search results can contain up to several thousand products, and your product can appear anywhere on this list. To make certain that the product is not in the search results, use the corresponding filters, for example, by brand or price. This will reduce the selection.
  • Whether the product will be located if you remove the category prediction. If the category prediction is triggered, the Search everywhere button appears next to the title under the search bar. In this case, please check whether the product belongs to the predicted category and whether it should belong to it.
  • The product is not merged into a single product tile with other products in the search results. Check if there are different attribute options on the product tile, such as color, size, or number of pieces.
  • The product is not found when searching for synonyms of the search phrase.

If you have performed all the actions above, and the product still does not appear in the search result, please contact Ozon support team.

Is it possible to artificially increase the indicators to raise my product in the search results? #

Many of the above indicators can really be increased artificially. It can be done either by fair means, for example, by purchasing high-quality external traffic, or by unfair ones, for example, by purchasing bots.

It is important to understand that after all, the ranking is still determined by customer behavior, so buying up bots will not help you raise the product in the search results. For example, you can drive up impressions via bots, however, a large number of impressions can affect the product ranking negatively, because if the product has a lot of impressions, but the number of its sales is low, this would mean that customers do not like this product much. Thus, by increasing the impressions artificially you can even omit your product in the search results.

Clicks do not affect the ranking, and there is an anti-fraud system that controls adding products to cart and to favorites. It tracks the artificially increased indicators and excludes them from the ranking. At the same time, we do not penalize sellers for cheating to eliminate the possibility of omitting competitor’s products in the search results by artificially increasing its indicators.