Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Cracking The Code: Secrets Of Technical SEO

Cracking The Code: Secrets Of Technical SEO

For years links were the bedrock of a successful SEO campaign but the truth is modern-day SEO in 2018 is now a lot more complex. The rules of the game have shifted and over-reliance on high authority links just won’t cut it.

Technical SEO is now becoming THE MOST important ranking ingredient and if you consider yourself a top SEO you need to have your technical game nailed. Without technical SEO in your weaponry, those links you’ve worked hard for just won’t rank you like they should do. It’s time to get your foundations in check!

In this talk we will investigate how to get those Google crawlers loving your website and coming back for more. We’ll deep dive into topics such:

● Rank Brain’s impact on crawling
● Mobile First indexation
● Algorithmic content analysis
● Page Speed optimisation
● Ontological page structure
● SEOperational server log analysis
● Plus lots more

As the Director of SEO, I see A LOT of technical SEO issues on a daily basis and I will be exclusively sharing some of the incredible results that subtle technical tweaks can bring to your campaigns. Get ready for real-life case studies and live analysis. No waffle, or broad-strokes. Strictly actionable, high-level tips.

Rad Paluszak

October 26, 2018
Tweet

More Decks by Rad Paluszak

Other Decks in Marketing & SEO

Transcript

  1. The Search Initiative Rad Paluszak: Cracking The Code Contents •

    Machine Learning From The SEO Perspective • Hummingbird • RankBrain • Natural Language Processing • Algorithmic Content Analysis • Link Assessment • Super-Technical Goodies ☺ • Ontological Page Structure • Log Analysis • How to Recover From The “Medic Update”? I N T R O D U C T I O N
  2. The Search Initiative Rad Paluszak: Cracking The Code 3 Rad

    Paluszak • “SEO” birthday - 2010 (Caffeine update) • Web developer “at heart” • Algorithms <3 • Machine Learning <3 • Data Mining <3 • “Technical SEO Artist” 📧 [email protected] D i r e c t o r o f S E O
  3. The Search Initiative Rad Paluszak: Cracking The Code 4 Machine

    Learning Machine learning is a method of data analysis that automates analytical model building. It is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns and make decisions with minimal human intervention. M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e
  4. The Search Initiative Rad Paluszak: Cracking The Code 5 Deep

    Learning Deep learning is part of a broader family of machine learning methods based on learning data representations (e.g. ontology), as opposed to task- specific algorithms. A deep neural network (DNN) is an artificial neural network (ANN) with multiple layers between the input and output layers. The DNN finds the correct mathematical manipulation to turn the input into the output, whether it be a linear relationship or a non-linear relationship. The network moves through the layers calculating the probability of each output. M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e By Glosser.ca [CC BY-SA 3.0 (https://creativecommons.org/licenses/by-sa/3.0)], from Wikimedia Commons https://commons.wikimedia.org/wiki/File:Colored_neural_network.svg
  5. The Search Initiative Rad Paluszak: Cracking The Code 6 Hummingbird

    Codename given to a significant algorithm change in Google Search in 2013. Its name was derived from the speed and accuracy of the hummingbird M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e Conversational Search Natural Language Processing Query Intent Semantic Model Analysis New Ranking Signals Importance of Authority Long-tail Focused (RLY?)
  6. The Search Initiative Rad Paluszak: Cracking The Code Get. Challenge

    Google. Assistant. to understand…. Ahrefs CEO
  7. The Search Initiative Rad Paluszak: Cracking The Code 15 RankBrain

    RankBrain is an algorithm learning artificial intelligence system, the use of which was confirmed by Google on 26 October 2015. It helps Google to process search results and provide more relevant search results for users. M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e If RankBrain sees a word or phrase it isn’t familiar with, the machine can make a guess as to what words or phrases might have a similar meaning and filter the result accordingly, making it more effective at handling never-before-seen search queries or keywords.
  8. The Search Initiative Rad Paluszak: Cracking The Code 16 RankBrain

    Understands similarity of the queries based on multidimensional vector space analysis & the proximity of one query to the other. M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e
  9. The Search Initiative Rad Paluszak: Cracking The Code 17 Results

    Post process ing Relevance Matching Intent Analysis NLP Query Parsing User Query Results Satisfaction Analysis (CTR, BR) User-context-based search engine https://patents.google.com/patent/US9449105B1/en System and method for providing search query refinements https://patents.google.com/patent/US20050055341A1/en Predicting Site Quality https://patents.google.com/patent/US9767157B2/en
  10. The Search Initiative Rad Paluszak: Cracking The Code 18 Language

    Use Analysis Improved NLP & Understanding Sentiment & Relevance Learning Intent Distinction Engagement Prediction Trend Awareness Purpose Discovery Intelligent Classification Deep Learning
  11. The Search Initiative Rad Paluszak: Cracking The Code 19 Intent

    Interpretation M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e
  12. The Search Initiative Rad Paluszak: Cracking The Code 20 Query

    Matching Purpose Instant Answer Latent Intent Interpretation Semantic Structure Freshness Engagement Related Topics User Intent Content Depth Keyword Verticals
  13. The Search Initiative Rad Paluszak: Cracking The Code 21 Natural

    Language Processing M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e Natural Language Processing Machine Translation Information Retrieval Sentiment Analysis Information Extraction Question Answering
  14. The Search Initiative Rad Paluszak: Cracking The Code 22 Natural

    Language Processing M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e CC BY-SA 3.0, https://en.wikipedia.org/w/index.php?curid=11556338 Apache OpenNLP • S Simple declarative phrase • NP Noun phrase • VP Verb phrase • DT Determiner • JJ Adjective • NN Noun, singular or mass • VBZ Verb, 3rd person singular present • VBN Verb, past participle • TO to http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.9.8216&rep=rep1&type=pdf
  15. The Search Initiative Rad Paluszak: Cracking The Code 23 Algorithmic

    Content Analysis M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e https://natural-language-understanding-demo.ng.bluemix.net/
  16. The Search Initiative Rad Paluszak: Cracking The Code KNOWLEDGE BOMB

    “Google is assessing the quality of your content, UX and purpose. (Remember Page Layout Algorithm?)”
  17. The Search Initiative Rad Paluszak: Cracking The Code 26 TF-IDF,

    WDF*IDF, WTF Mathematically, TF-IDF is the product of how often a keyword appears on a page (TF) and how often it is expected to appear on an average web page, based on a larger set of documents (IDF). WDF*IDF (Within Document Frequency*Inverse Document Frequency) - an analysis method to determine keywords and terms that sustainably increase the relevance of published texts. The result is the relative term frequency and weighting of a document, relative to all other web documents including analysed keyword. WTF – I have no idea what I’m doing ☺ M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e SEO PowerSuite
  18. The Search Initiative Rad Paluszak: Cracking The Code KNOWLEDGE BOMB

    “Simple TF-IDF tweaks can work very well for your 80/20 rule!”
  19. The Search Initiative Rad Paluszak: Cracking The Code 28 TF-IDF,

    WDF*IDF, WTF M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e BEFORE AFTER
  20. The Search Initiative Rad Paluszak: Cracking The Code 29 TF-IDF,

    WDF*IDF, WTF M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e
  21. The Search Initiative Rad Paluszak: Cracking The Code KNOWLEDGE BOMB

    “Google always tries to predict your site structure and assess what’s worth crawling and what is not.”
  22. The Search Initiative Rad Paluszak: Cracking The Code 31 ML

    In Crawling & Crawl Budget Management M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e
  23. The Search Initiative Rad Paluszak: Cracking The Code 32 ML

    In Crawling & Crawl Budget Management M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e I didn’t learn f*ck-all about the site structure. Googlebot Image Credits: http://www.thesempost.com/blocking-googlebot-with-bad-bot-scripts-wordfence/ https://www.seroundtable.com/google-crawl-report-problem-19894.html Scheduler for search engine crawler https://patents.google.com/patent/US7725452B1/en
  24. The Search Initiative Rad Paluszak: Cracking The Code KNOWLEDGE BOMB

    “Manual Penalty process starts with a pre-selection of suspicious candidates detected by machine learning algorithms.”
  25. The Search Initiative Rad Paluszak: Cracking The Code 34 ML

    In Link Assessment M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e Manual Penalty Manual Verificati on Classific ation Pre- Screenin g Machine Learning
  26. The Search Initiative Rad Paluszak: Cracking The Code 35 ML

    In Link Assessment M a c h i n e L e a r n i n g F r o m T h e S E O P e r s p e c t i v e
  27. Chaos Theory • Branch of mathematics that deals with systems

    that appear to be orderly but, in fact, harbor chaotic behaviors. • Chaos theory applies to complex, dynamic systems highly sensitive to initial conditions. • The deterministic nature of these systems does not make them predictable. • Chaos paradoxically leads to formal structure and order. 1,653 Launches 9,800 live traffic experiments 18,015 side- by-side experiments 130,336 Search quality tests
  28. The Search Initiative Rad Paluszak: Cracking The Code 37 Ontology

    A model for describing the world that consists of a set of types, properties, and relationship types. There is also generally an expectation that the features of the model in an ontology should closely resemble the real world (related to the object). • Relational Classification • Logical Categorisation • Semantic Domain AKA Universe Ontological Page Structure S U P E R - T E C H N I C A L G O O D I E S https://blog.grakn.ai/what-is-an-ontology-c5baac4a2f6c
  29. The Search Initiative Rad Paluszak: Cracking The Code KNOWLEDGE BOMB

    “RankBrain uses all sorts of ontologies (multidimensional vector spaces) to learn the Web. = Google MUST understand your site structure.”
  30. Flat, dead, WTF? Not Good ☺ . 0 1 Visible

    clusters, some core pages hard to crawl. Looks OK, but… . 0 2 All (trust me) core pages within 1 click from … EVERYWHERE. Great structure. The “hanging” pages are only sponsored posts. Looks weird, but… . 0 3 Yes ☺ Looks Good (?) . 0 4 Ontological Page Structure S U P E R - T E C H N I C A L G O O D I E S
  31. The Search Initiative Rad Paluszak: Cracking The Code (OBVIOUS) KNOWLEDGE

    BOMB “The easiest way to show Google your site’s ontological nature is to use silo-based structure. DO NOT SPAM IN THE URLs!!!!”
  32. Month-on-month comparison once the whole process finished Increased MOM Traffic

    Google activity in terms of monthly events logged. Improved Crawlability +11.52% +8.7% Silo Structure Results • In this case we used Canonicals, not 301s to limit fluctuations (In Google We Trust…) • ~60 pages - entire cluster size • 6 days to index 8 core pages under new URLs • ~2 weeks to transfer the 8 core pages to the new URLs • ~5.5 weeks to move the whole cluster over • ~3% traffic fluctuation in the meantime S U P E R - T E C H N I C A L G O O D I E S
  33. The Search Initiative Rad Paluszak: Cracking The Code 42 Bad

    Examples • /business/business-management/ • /air-sprayers/handheld-air-sprayers/ • /air-sprayers/handheld-air-sprayers/graco- 16Y385-paint-sprayer/ • /trecks/machu-picchu-trecks/ausangate-trek/ • /ebike-reviews/best-electric-bikes/ebikes-under- 1000/brompton-electric-review/ S U P E R - T E C H N I C A L G O O D I E S • /business/management/ • /air-sprayers/handheld/ • /air-sprayers/handheld/graco-16Y385-review/ • /trecks/machu-picchu/ausangate-trail/ • /reviews/under-1000/brompton-electric/ Good Examples
  34. The Search Initiative Rad Paluszak: Cracking The Code 43 Server

    Log Analysis Screaming Frog Log Analyser is your friend ☺ • Understanding Crawl Budget • Confirming Devaluation • Finding undiscoverable issues • Analysing Google Crawl Habits S U P E R - T E C H N I C A L G O O D I E S Googlebot Image Credits: http://www.thesempost.com/blocking-googlebot-with-bad-bot-scripts-wordfence/ https://www.seroundtable.com/google-crawl-report-problem-19894.html
  35. The Search Initiative Rad Paluszak: Cracking The Code 44 

    Healthy Site Devalued Site → Server Log Analysis S U P E R - T E C H N I C A L G O O D I E S Screaming Frog Log Analyser is your friend ☺ • Understanding Crawl Budget • Confirming Devaluation • Finding undiscoverable issues • Analysing Google Crawl Habits
  36. The Search Initiative Rad Paluszak: Cracking The Code 45 Server

    Log Analysis Screaming Frog Log Analyser is your friend ☺ • Understanding Crawl Budget • Confirming Devaluation • Finding undiscoverable issues • Analysing Google Crawl Habits S U P E R - T E C H N I C A L G O O D I E S Direct URL 11% Query String 89%
  37. The Search Initiative Rad Paluszak: Cracking The Code KNOWLEDGE BOMB

    “Google was still crawling URLs from a version of the site redesigned 4 years ago.”
  38. The Search Initiative Rad Paluszak: Cracking The Code 47 Server

    Log Analysis Screaming Frog Log Analyser is your friend ☺ • Understanding Crawl Budget • Confirming Devaluation • Finding undiscoverable issues • Analysing Google Crawl Habits S U P E R - T E C H N I C A L G O O D I E S Googlebot Image Credits: http://www.thesempost.com/blocking-googlebot-with-bad-bot-scripts-wordfence/ https://www.seroundtable.com/google-crawl-report-problem-19894.html
  39. The Search Initiative Rad Paluszak: Cracking The Code 48 Server

    Log Analysis Screaming Frog Log Analyser is your friend ☺ • Understanding Crawl Budget • Confirming Devaluation • Finding undiscoverable issues • Analysing Google Crawl Habits S U P E R - T E C H N I C A L G O O D I E S
  40. The Search Initiative Rad Paluszak: Cracking The Code “Medic” Case

    Study #1 Patient: Ambitious Affiliate Site Issue: -794 positions down Diagnosis: Algorithmic Content Devaluation Recovery Time: From 31.07-08.08 to 22.09-08.10 Final Result: +1,097 positions up S U P E R - T E C H N I C A L G O O D I E S
  41. The Search Initiative Rad Paluszak: Cracking The Code KNOWLEDGE BOMB

    “Write CONCISE content where user intent requires short answers! Do not blindly go by 1k+ words per page.”
  42. The Search Initiative Rad Paluszak: Cracking The Code “Medic” Case

    Study #2 Patient: Whitelabel Adventure Travel Site Issue: -187 positions down (mainly top 5) Diagnosis: Content (NOT KEYWORD!) Cannibalisation Recovery Time: From 31.07-11.08 to 28.09-07.10 Final Result: +382 positions up S U P E R - T E C H N I C A L G O O D I E S
  43. The Search Initiative Rad Paluszak: Cracking The Code “Medic” Case

    Study #2 Patient: Whitelabel Adventure Travel Site Issue: -187 positions down (mainly top 5) Diagnosis: Content (NOT KEYWORD!) Cannibalisation Recovery Time: From 31.07-11.08 to 28.09-07.10 Final Result: +382 positions up S U P E R - T E C H N I C A L G O O D I E S
  44. The Search Initiative Rad Paluszak: Cracking The Code KNOWLEDGE BOMBS

    1. “Ontological Content Structure is just as important as the site structure.” 2. “Content Cannibalisation is usually more dangerous than keyword cannibalisation.”
  45. The Search Initiative Rad Paluszak: Cracking The Code Ranking in

    post “Medic” world. S U P E R - T E C H N I C A L G O O D I E S 1. Build Personas that are not easy to track down. Why the f*ck would you call it Jimmy Hendrix? 2. Ensure good, tree-like navigation. Menu (drop-downs, mega-menu, etc.) is your friend. 3. E-A-T is not just author bio and social media. Google also needs to “confirm” whatever you’re saying in external sources! 4. Do not write fluff content. Google understands the context, intent, etc. 5. Don’t stuff ads and affiliate links in every possible whole. Medic and following updates are strictly focused on quality measures. 6. Plan for latent intent. Cover your back whenever possible. 7. Avoid content cannibalisation. No need to have the same sh*t in every article. 8. Get rid of unnecessary pages. I’ve recently ranked more sites removing than creating content. 9. Look at your index management. Housekeeping is always a good thing! 10. Save crawl budget. Google ain’t gonna waste it on yo stinky site!
  46. The Search Initiative Rad Paluszak: Cracking The Code KNOWLEDGE BOMBS

    “With SICK onsite foundations you need less links.”
  47. The Search Initiative Rad Paluszak: Cracking The Code Matt Diggity

    Founder & Head Of Insights Rafid Nassir Commercial Director Will Bagnall Co-founder & CEO Rad Paluszak Director of SEO