top of page

Yandex Ranking Factors Leaked 2022: Source Code & Parameters

Updated: Apr 29

Yandex ranking source code was leaked in 2022. Around 1922 ranking factors we leaked publicly in 45 GB source files. We have taken the levarge to decrypt those files and to get the ranking factors used by yandex. The complete file of all ranking parameters are avaliable here on WebMarketingSchool. Codes showed the Yandex search engines ranking factors.

Yandex Leaked Ranking Factors
Yandex Leaked Ranking Factors

We have made the list of leaked yandex ranking parameter and categorized them in three parts.


  1. On-Page SEO Ranking factors

  2. Off-Page SEO Ranking factors

  3. Technical SEO Ranking factors.

The complete list of all the factors starts from below.


On-Page Ranking factors (Yandex).

On-page Optimization means changes which make on our webpage in order to increase the search rankings in the search engines. Here are few ON page ranking factors of Yandex Search Engine.

  1. Text relevance (Maxfreq is the frequency of the most frequent word that makes sense of the length of the document).

  2. Binar factor, has a value of 0 for all monosyllabic queries, and the value of 1 almost all two or more words, except for a very small number of answers for which there is not a single link that has passed quorum

  3. There are all the words of the request somewhere in the document

  4. There are all the words of the request in a row in the document.

  5. There are all the words of the request in one link.

  6. (Phras) There are all the words of a request in a row in one line.

  7. The presence of an accurate phrase (text of the request) in the header (more precisely, in the first sentence of the document). Contextual restrictions and feet are taken into account exactly as in TRP2, i.e. Factor [8] minors factor [5]

  8. There was a plot that has passed the quorum in which all the word positions are designated as having the relevance of Best_relev (title or Meta Keywords).

  9. Long document (the longer the document, the greater the value of the factor).

  10. The sum of the IDF words of the request. The name does not reflect the essence: for example, for the request of 'Gadyach' this factor will be more than for the request 'Moscow Peter Yekaterinburg Samara.

  11. Long text without links.

  12. Means the coincidence of the user and the site at the level of countries.

  13. Coincidence of the thematic spectra of the request and document. Request theme-the result of work ((http://wiki.yandex-team.ru/evgenijkroxalev/subquery Rules of the sorcerer SubquerySearch)) The subject of the document is taken from Yandex-Catalog

  14. The complex Static Rank is assembled from static components in a separate formula

  15. The factor about the number of refines. The queries in the language have a photo of User Refines ('The Word, Before which there is a percentage sign'). As an idea, it means something like

  16. Lemma coincides in text relevance.

  17. DSSM model, trained for reformulations, in the documentary part uses relevant proposals

  18. The document LR> 20 (Load Runner) The number of words of the words of the request in the Links> 16, the factor about lr.

  19. URL High Lr. (Load Runner)

  20. The Popularity of the Request

  21. TR divided by a cube of the number of words in a request and transformed by a standard REMAPTR.

  22. Document language - Russian. (for google it would english for worldwide and religional domains are promoted for local levels)

  23. The time of adding a page, more - a more old document; the root is placed from time displayed for the interval [0.1] so that 3+ years gives 1.

  24. If the main page of the owner (most often a second -level domain, for example, xxxx.ru), then the factor is 1. For bums, hosting, personal blogs, etc. (for example, Lifejornal, people, etc.) - Third -level domains (such as xxxxx.narod.ru) will also have a factor of equal 1.

  25. The time of adding the main page of the owner (host?), Remapes the same as Addtime.

  26. How often do they click in this URL for this request - CTR blasting for the correction factor

  27. There is advertising on the site. negative ranking factor

  28. The presence of pairs of words in the exact form

  29. The number of sentences in which there are many words in the exact form

  30. The presence of words in the title in the exact form

  31. The presence of pairs of words based on synonyms (> = txtpair)

  32. The presence of words in the title based on synonyms

  33. How often do they click in the URLs of this Domainid for this request - Ctr Domainid downstream for the correction factor

  34. The owner's clickness regardless of the request

  35. There are all the words of the request in the links

  36. Link relevance from Gulin.

  37. There is an exact form of all the words of the request in the text/links

  38. There is a lemma of all the words of the request in the text/links

  39. Quality of the text (classifier Alekseev)

  40. The core of the audience of the owners according to Yandex.Browszing

  41. Spam Karma named after Antispamer is the likelihood that the host is spam; based on Whois information

  42. The length of the document in sentences

  43. The length of the URL, divided by 5

  44. Document type - HTML

  45. Link relevance taking into account the quality of each link

  46. Fast document

  47. Page from ru.wikipedia.org

  48. Commercial page (Classifier of Savina)

  49. The document does not have all the words of the request (with the exact synonym)

  50. The percentage of all the words of the request in the text (with the accuracy of the form)

  51. The degree of centralization of the points from which the request is set

  52. Does the request of blog vocabulary contains

  53. The content of content is not used.

  54. The ratio of the number of clicks on this URL to all clicks on request

  55. How often they click in this URL for this request - CTR dormant for the correction factor

  56. How often do they click in this URL for this request - CTR blasting for the correction factor, by small regions from Relev_regions.web.txt

  57. TR of the best passage - how high -quality snippet can do

  58. TR with a discount for the offer number

  59. Clickable domain by words

  60. There is an ancient date in the URL. Ancient news is recognized. Factor 1 if there is a year in Url <= 2007.

  61. The weight of the maximum coincidence of forms in the text and request

  62. On the page there is a about 'payment SMS'.

  63. Three levels of coincidence of the region of links and request

  64. Geographical proximity

  65. The average for users Active continuous time of the user is (in second) on the host pages after the transition on request from the search engine (the factor depends on the pair (request, Domattr)).

  66. average for users, the number of active actions (clicks, clicks) with the continuous finding of the user on the host pages after the transition by request from the search engine (the factor depends on the pair (request, Domattr)). In the intra core, the Yandex.Bara/Elements meter/elements /Browser

  67. The number of unique visitors from search engines for a specific request

  68. Average for users, the number of active actions (clicks, keystrokes) on the page after the transition on request from the search engine (the factor depends on the pair (request, URL))

  69. Geographical proximity of the user and site

  70. The number of unique visitors, remembrance exponentially

  71. The share of traffic from search engines

  72. The share of suits to the site is not by links (set with hands or from bookmarks)

  73. The number of unique visitors to URL

  74. The average for users time is the user on the page. It is read as the difference between neighboring transitions.

  75. Link factor about the availability of video on the page.

  76. Geographical distribution of request

  77. The request is set mainly at night

  78. The request is given mainly in the morning

  79. The request is given mainly during the day

  80. The request is set mainly in the evening

  81. The degree of severity of the queries at different times of the day

  82. The weight of the words of the request that is in the text and links

  83. Entropy - distribution of clicks

  84. Entropy - distribution of impressions

  85. Entropy - distribution of clique/shows

  86. The ratio of the amount of IDFs met in a sentence+Title to all words.

  87. Request about the video

  88. The owner's clickness, regardless of the request, separately in the regions

  89. Domain in the .com zone

  90. Poemality of the Document

  91. Document language - English

  92. Requestful factors - the result of work

  93. The document has textual relevance

  94. An indicator of the unnaturalness of the text from the point of view of the Russian language. The number of bad pairs of words in the text, transferred to the segment [0.1] by formula z/(z+10)

  95. The number of words in the text (the word is what the lemmeter selected) is displayed in [0.1] according to the formula x/(x+a)

  96. The average length of the word

  97. The percentage of the number of words inside the tag <a> .. </a> from the number of all words

  98. The percentage of the number of words outside the tags (outside the brackets <>) from the number of all wordsThe percentage of the number of words, which are 200 of the most frequent words of the language, from the number of all words of the text.

  99. The number used in the text 500 of the most popular language words, divided by 500

  100. The number used in the text 500 of the most popular language words, divided by 500

  101. Logarithm of the average geometric probabilities of the trigrams in the text. (The probability of a trigram - the number of its meetings in the text, divided by the number of all trigrams) is displayed in [0.1] by formula -x (x+a)

  102. There is a big picture on the page

  103. The share of pronoun adjectives

  104. The share of pronoun nouns

  105. The share of verbs

  106. The share of words that can be both masculine nouns and nouns of the feminine, but not of the middle kind, among all nouns (examples:

  107. The number of different internal links to the page

  108. Frequency of links to the site

  109. The number of almost periodic links

  110. The number ofOwner shows on request, normalization x/(100 + x).

  111. The number of URL shows on request, normalization x/(100 + x).

  112. The popularity of Owner in query

  113. The factor about that is a good snippet.

  114. The language of the document corresponds to the query language

  115. Popularity of the request within the country

  116. The degree of centralization of the points from which the request (within the country) is set

  117. Geographical distribution of the request within the country

  118. The hour at which this request is given the most

  119. The degree of severity of the task of requests at different times of the day (inside the country)

  120. The country of the document (domain) and the country of the user coincide.

  121. Country classifier of localization - how much the request implies the context of the country

  122. The service factor that was needed to search the site, and in the future it will still be needed.

  123. Probably model built in texts of incoming links

  124. The number of clicks on the owner and the number of clicks on request is more than 5

  125. The factor evaluates the differences between the positions of words in the title from the position of words in the request

  126. URL's length with an accuracy of a symbol.

  127. The degree of borenness of the page title.

  128. The number of those in-line links between hosts

  129. The average dwelltime, with DwellTime, is cut from the session if more than 180 seconds

  130. The probability of a click on URL will be more than 120 seconds

  131. The probability is that they do not click on the URL if they click at least one URL lower.

  132. The average dwelltime, and DwellTime from the session is trimmed if more than 3600 seconds.

  133. The average dwelltime, and DwellTime from the session is trimmed if more than 180 seconds.

  134. The core of the page of the pages on which there is a metric counter

  135. The share of clicks on this URL among all clicks on similar requests

  136. The clickableness of the host according to the latest request

  137. URL contains a token that coincides with the short name of the user country. The factor is considered only on the EU stream.

  138. URL is an offer in the latest version of the market base.

  139. The average length of the logical session in which there was a request

  140. The document contains a name from a request.

  141. The document has a direct link to the file

  142. Regional attendance from search engines for a specific request

  143. Clicks on the URLs shown in the issuance for requests, by which they went to look for other search engines

  144. URLs shows in the issuance for requests, by which they went to look for other search engines

  145. Classifier of the commerciality of the site

  146. The document contains user review/comment

  147. The share of clicks on this URL among all clicks on similar queries, the country version

  148. The average number of found on request

  149. The average position of URL on a normalized request

  150. The average position of URL for all requests

  151. The average position of the host for all requests

  152. Number of URL queries

  153. Number of requests for host

  154. The share of the words of the document from segments with Score> 2.

  155. The weight of the document on a monosyllabic dictionary of commercial vocabulary

  156. The number of requests in the group of frequency requests similar to a given

  157. The share of clicks on this URL among all clicks on similar requests calculated according to Popular Search Engine

  158. The length of the Depth Nodes petal calculated for hosts

  159. The dispersion of the angle in the space of Nodes Time, calculated for hosts

  160. Average, according to the request, the probability of download the file from the host after click.

  161. Factor of content of content.

  162. CTR according to click data, the request is normalized according to Sinsets

  163. Regional CTR according to click data, the request is normalized according to Sinsets

  164. The number of chains on request / (the number of chains in which URL + the number of chains on request participated).

  165. The number of chains in which URL was the last, normalized for the total number of chains in which this URL was.

  166. Number of transitions to URL from Wikipedia

  167. The page indicator is like a hub (how many pages are the bar users go from it).

  168. The probability of URL is the last on request in the Hopes chain.

  169. DSSM Prediction of the probability of URL + Title,that the page has one product.

  170. DSSM Prediction of the probability of URL + Title, that the page has a lot of goods.

  171. Factor by name from the original request is considered according to the contents of the document.

  172. Owner has an E Com purchase.

  173. User return on URL

  174. Rank of hacking sites

  175. The share of users who returned within a month

  176. The number of users returned within a month

  177. The share of the capital letters in Title

  178. The share of incoming traffic from search engines among all incoming traffic

  179. The share of direct visits among all incoming traffic

  180. Neural model of content quality for medical subjects

  181. Neural model of content quality for SOS topics

  182. The ratio of the total area of ​​all Flash blocks to the screen area

  183. URL is a channel/post from a verified account of a social network

  184. The distance from the city, from where a request was set

  185. Logarithm of the number of shows.

  186. The probability that URL is clicking if they do not click at least one URL higher.

  187. The probability that URL is not clicks if they click at least one URL below.

  188. Ordinary CTR. Localization to the level of countries.

  189. The average dwelltime, and DwellTime from the session is cut if more than 3600 seconds. Localization to the level of countries.

  190. The probability that the URL click will be more than 120 seconds. Localization to the level of countries.

  191. The percentage of traffic from social networks in all traffic from other hosts and search.

  192. The average number of direct descendants from the host on which more than 90 seconds were spent. Only if there is a link from our page for the descendant and crossed it.

  193. The average maximum depth of word with the root in the current URL is visited from other hosts.

  194. The number of times when the Delena went to the page to the total number of pages to which they switched from a sickle. The closer to 1, the more often the page was opened in the session.

  195. The average length of search sessions when they passed from a sickle

  196. Static url factor in browser logs for the maximum period. The probability that the user will spend on the page> 120 seconds.

  197. The average time spent on the page and in all descendants of the page (URLS to which they crossed) from the host. WHOLE, if the total DT is more than 10 minutes

  198. Probability of jumping from the page

  199. The likelihood of image surges from the page

  200. The document has a turbo page for mobile platforms.

  201. BM25FDPR with standardization on the average length of the document, depending on the language of the document. Only texts hits are used.

  202. The document has a protocol HTTPS

  203. Domain in the international zone

  204. The request was recognized as having intent to copyright objects protected by the anti-pirate memorandum.

  205. The host contains pirate videos protected by anti-pirate memorandum.

  206. The host contains a video protected by anti-pirate memorandum.

  207. Average surplus of freshness of the host in 30 days

  208. The share of documents with a positive freshness from the host for 30 days

  209. The average position of the owner at the request for the last week.

  210. The average attitude of punctuation to all dividers in the documents of the owner.

  211. The value of the freshness detector, calculated in the hippo. Always 0 with the value of the detector less than the threshold.

  212. The host contains a video protected by anti-pirate memorandum.

OFF - Page Ranking Factors (Yandex). (Link Building)

Yandex has given a great importance to those pages which get backlinks from other webpages. Here are few ranking factor relating to it.

  1. Link relevance. The factor is remaped.

  2. Quality of incoming links (classifier of the bream) - broken, cm

  3. The number of incoming links. Remap.

  4. TFIDF ordinary TF*IDF by links. The frequency of a word in the links is multiplied by the reverse document frequency and summarized in all words, then it is normalized to the length of the document.

  5. Link relevance, taking into account the non -profitness of each link and quality of each link

  6. The percentage of incoming links with the words of the request

  7. The number of links that match the text of the request (other Remap)

  8. The average age of links that brought something to LR Linkage = Min (LOG (average age of the link)/7, 1), 3 years are accepted for 1

  9. The share of incoming corrupt links. The algorithm for recognition of commercial links is implemented. The factor is remaped [0.1] if the share of such links is 50%, otherwise 0. ((http://wiki.yandex-team.ru/svetlanashorina/topseolinks Sample of wound sites))

  10. It characterizes the frequency of words in the links. The factor is large if the word that played in a lingon relevance is rare for links.

  11. The presence in the links of the pairs of words taking into account synonyms

  12. The number of links passed the threshold

  13. Link relevance with pessimization for greater Link age

  14. Quality of incoming links (hausel classifier) ​​corrected

  15. The quality classifier of incoming links 2 corrected

  16. Link relevance without taking into account rare words

  17. Dispersion of the number of words in the links.


Yandex Technical SEO Ranking Factors.

We did not find any as such ranking factor for technical SEO but faster the webpage loads higher it ranks on the search engine.


Conclusion

We have tried to decrpt these ranking factors so that it can help the seo community.


13 views

Comentarios


bottom of page