Noah Learner
October 21, 2024

AI and Me: how we built a GSC Bulk Export Data Pipeline

Summary of the conference talk:

Noah presented his vision of merging Google Search Console (GSC) Bulk Data Export with his Branch Explorer product to create a powerful SEO analytics tool. The GSC Bulk Export provides comprehensive search data directly to BigQuery without row limits, offering detailed insights including anonymized queries and search appearances.

Noah built the tool entirely using AI, including ChatGPT and Claude. He shared lessons learned from this process, including AI's capabilities and limitations, the importance of iterative development, and effective tools for AI-assisted coding.

Key points about the GSC Bulk Export data pipeline include:
1. No historical data backfill
2. Potential costs associated with BigQuery usage
3. Advanced segmentation and custom pipeline possibilities

Noah discussed AI development pitfalls, such as natural language misunderstandings and error resolution challenges. He created custom GPTs to assist with specific coding tasks.

The final product, Branch Explorer with Bulk Data Integration, is a Looker Studio-based visualization tool that allows users to explore GSC data in novel ways, including segmentation by subdirectories, UTM parameters, and device types. Noah made the tool available to conference attendees.

Transcript

  1. Sterling Sky noahlearner
    "I promise you, within 6 months anyone will be able to build anything with an LLM." -Dave Sottimano
  2. GSC Bulk Export is 3 tables
    • ExportLog
    • searchdata_url_impression
    • searchdata_site_impression
  3. GSC Bulk Export is 3 tables
    • ExportLog
    • searchdata_url_impression
    • searchdata_site_impression*
    *I didn’t use the site_impression table due to time constraints.
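
The URL-level table can be queried with ordinary BigQuery SQL. Below is a minimal sketch, not the pipeline from the talk: it assumes the export landed in a dataset named `searchconsole` inside a hypothetical project `my-project`, and it uses the column names from Google's published schema for `searchdata_url_impression`. The function name is illustrative, and it only builds the query text; running it requires a BigQuery client of your choice.

```python
def build_daily_summary_sql(project: str, dataset: str = "searchconsole", days: int = 28) -> str:
    """Return BigQuery SQL for daily clicks, impressions, and average position.

    `project` and `dataset` are assumptions about where the export was configured.
    """
    if days <= 0:
        raise ValueError("days must be positive")
    table = f"`{project}.{dataset}.searchdata_url_impression`"
    return f"""
SELECT
  data_date,
  SUM(clicks) AS clicks,
  SUM(impressions) AS impressions,
  -- sum_position is zero-based, so add 1 to get the familiar average position
  SAFE_DIVIDE(SUM(sum_position), SUM(impressions)) + 1 AS avg_position
FROM {table}
WHERE data_date >= DATE_SUB(CURRENT_DATE(), INTERVAL {days} DAY)
GROUP BY data_date
ORDER BY data_date
""".strip()


if __name__ == "__main__":
    print(build_daily_summary_sql("my-project"))
```

Filtering on `data_date` keeps the scan limited to recent days, which is usually the simplest way to keep query costs down on this table.
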
  4. GSC Bulk Export includes
    • All Countries
    • All Devices
    • All Search Types
    • All Search Appearances (Booleans)
    • Anonymized Queries
    • No Row Limits
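
Because every row carries its country, device, and search type, the same data can be rolled up along any of those dimensions. The sketch below is illustrative only: the rows are made-up dictionaries shaped like `searchdata_url_impression` records, and `summarize_by` is a hypothetical helper name.

```python
from collections import defaultdict

# Made-up rows shaped like searchdata_url_impression records (not real data).
SAMPLE_ROWS = [
    {"device": "MOBILE", "country": "usa", "clicks": 3, "impressions": 10, "sum_position": 20.0},
    {"device": "MOBILE", "country": "can", "clicks": 1, "impressions": 10, "sum_position": 40.0},
    {"device": "DESKTOP", "country": "usa", "clicks": 2, "impressions": 5, "sum_position": 5.0},
]


def summarize_by(rows, dimension):
    """Aggregate clicks, impressions, CTR, and average position per value of `dimension`."""
    totals = defaultdict(lambda: {"clicks": 0, "impressions": 0, "sum_position": 0.0})
    for row in rows:
        bucket = totals[row[dimension]]
        bucket["clicks"] += row["clicks"]
        bucket["impressions"] += row["impressions"]
        bucket["sum_position"] += row["sum_position"]

    summary = {}
    for key, t in totals.items():
        imps = t["impressions"]
        summary[key] = {
            "clicks": t["clicks"],
            "impressions": imps,
            "ctr": t["clicks"] / imps if imps else 0.0,
            # sum_position is zero-based, so add 1 for the reported position
            "avg_position": t["sum_position"] / imps + 1 if imps else None,
        }
    return summary


if __name__ == "__main__":
    for device, stats in summarize_by(SAMPLE_ROWS, "device").items():
        print(device, stats)
```
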
  5. With Explorer we can see our data:
    • Brand vs Non Brand
    • Funnel stage (top, middle, bottom)
  6. With Explorer we can see our data:
    • Brand vs Non Brand
    • Funnel stage (top, middle, bottom)
    • Is GBP URL (URL has GBP-related UTMs)
  7. With Explorer we can see our data:
    • Brand vs Non Brand
    • Funnel stage (top, middle, bottom)
    • Is GBP URL (URL has GBP-related UTMs)
    • Segment our site by URL patterns
  8. With Explorer we can see:
    • Brand vs Non Brand
    • Funnel stage (top, middle, bottom)
    • Is GBP URL (URL has GBP-related UTMs)
    • Segment our site by URL patterns
    • Which directories, pages, and queries are driving change on our sites
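
Segments like these can be derived from just the `query` and `url` fields. The following is a rough sketch of the idea, not Explorer's actual logic: the brand regex, the set of GBP UTM values, and all function names are hypothetical and would need to match your own brand terms and tagging conventions. Funnel-stage rules are left out because the slides do not describe them.

```python
import re
from urllib.parse import parse_qs, urlsplit

# Hypothetical brand pattern; replace with your own brand terms.
BRAND_PATTERN = re.compile(r"sterling\s*sky", re.IGNORECASE)
# Hypothetical UTM values that mark a Google Business Profile link.
GBP_UTM_VALUES = {"gbp", "gmb", "google_business_profile"}


def is_brand_query(query):
    """True if the query mentions the brand. Anonymized (empty) queries count as non-brand."""
    return bool(query) and BRAND_PATTERN.search(query) is not None


def is_gbp_url(url):
    """True if any common UTM parameter carries a GBP-style value."""
    params = parse_qs(urlsplit(url).query)
    values = []
    for name in ("utm_source", "utm_medium", "utm_campaign"):
        values.extend(params.get(name, []))
    return any(value.lower() in GBP_UTM_VALUES for value in values)


def top_directory(url):
    """Return the first path segment as '/segment/', or '/' for root-level pages."""
    path = urlsplit(url).path
    parts = [part for part in path.split("/") if part]
    if not parts:
        return "/"
    # A single segment without a trailing slash is treated as a page, not a directory.
    is_directory = len(parts) > 1 or path.endswith("/")
    return f"/{parts[0]}/" if is_directory else "/"


if __name__ == "__main__":
    url = "https://example.com/locations/denver/?utm_source=gbp&utm_medium=organic"
    print(is_brand_query("Sterling Sky reviews"), is_gbp_url(url), top_directory(url))
```
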
  9. Tokens
    1 token ~= 4 chars in English
    1 token ~= ¾ of a word
    100 tokens ~= 75 words
    1-2 sentences ~= 30 tokens
    1 paragraph ~= 100 tokens
    1,500 words ~= 2048 tokens
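
These rules of thumb are enough for a back-of-the-envelope check on whether a prompt will fit in a model's context window. The sketch below simply applies the two ratios from the slide; the function names are illustrative, and a real tokenizer will give different counts, especially for code or non-English text.

```python
import math

CHARS_PER_TOKEN = 4     # slide: 1 token ~= 4 chars in English
WORDS_PER_TOKEN = 0.75  # slide: 1 token ~= 3/4 of a word


def estimate_tokens_from_chars(text: str) -> int:
    """Rough token estimate from character count."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def estimate_tokens_from_words(word_count: int) -> int:
    """Rough token estimate from word count."""
    return math.ceil(word_count / WORDS_PER_TOKEN)


if __name__ == "__main__":
    print(estimate_tokens_from_words(75))
    print(estimate_tokens_from_words(1500))
```

For 1,500 words the ratio gives 2,000 tokens, close to the 2048 figure quoted on the slide.
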
  10.
    1. Describe the end goal in high-level terms
    2. Ask the LLM to summarize the goal
  11.
    1. Describe the end goal in high-level terms
    2. Ask the LLM to summarize the goal
    3. Iteratively clarify the goal
  12.
    1. Describe the end goal in high-level terms
    2. Ask the LLM to summarize the goal
    3. Iteratively clarify the goal
    4. Ask the model to build the architecture
  13.
    1. Describe the end goal in high-level terms
    2. Ask the LLM to summarize the goal
    3. Iteratively clarify the goal
    4. Ask the model to build the architecture
    5. Iteratively clarify the architecture
  14.
    1. Describe the end goal in high-level terms
    2. Ask the LLM to summarize the goal
    3. Iteratively clarify the goal
    4. Ask the model to build the architecture
    5. Iteratively clarify the architecture
    6. Ask the model to build function pseudocode
  15.
    1. Describe the end goal in high-level terms
    2. Ask the LLM to summarize the goal
    3. Iteratively clarify the goal
    4. Ask the model to build the architecture
    5. Iteratively clarify the architecture
    6. Ask the model to build function pseudocode
    7. Ask the model to build actual code
  16.
    1. Describe the end goal in high-level terms
    2. Ask the LLM to summarize the goal
    3. Iteratively clarify the goal
    4. Ask the model to build the architecture
    5. Iteratively clarify the architecture
    6. Ask the model to build function pseudocode
    7. Ask the model to build actual code
    8. Iterate, and iterate, and iterate
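
The staged workflow above can be sketched as a small loop in which each stage's accepted answer becomes the context for the next. This is only one possible reading of the slides: `ask` is a hypothetical callable wrapping whichever LLM you use, `review` stands in for the human deciding whether to accept an answer or send feedback, and the prompt wording is invented for illustration.

```python
# Stage names and prompt templates are illustrative, not taken from the talk.
STAGES = [
    ("goal", "Here is my end goal in high-level terms:\n{context}\nSummarize it back to me."),
    ("architecture", "Based on this agreed goal:\n{context}\nPropose an architecture."),
    ("pseudocode", "Based on this agreed architecture:\n{context}\nWrite pseudocode for each function."),
    ("code", "Based on this agreed pseudocode:\n{context}\nWrite the actual code."),
]


def run_workflow(goal, ask, review, max_rounds=3):
    """Walk through the stages, revising each answer until `review` accepts it.

    ask(prompt) -> reply text (hypothetical LLM wrapper)
    review(stage, reply) -> feedback text, or None to accept the reply
    """
    context = goal
    history = []
    for stage, template in STAGES:
        reply = ask(template.format(context=context))
        for _ in range(max_rounds):
            feedback = review(stage, reply)
            if feedback is None:
                break
            reply = ask(
                "Revise your previous answer.\n"
                f"Previous answer:\n{reply}\nFeedback:\n{feedback}"
            )
        history.append((stage, reply))
        context = reply  # the accepted answer feeds the next stage
    return history


if __name__ == "__main__":
    # Stand-in callables so the sketch runs without any LLM.
    def echo_ask(prompt):
        return f"(model reply to {len(prompt)} chars of prompt)"

    for stage, reply in run_workflow("Build a GSC pipeline", echo_ask, lambda s, r: None):
        print(stage, "->", reply)
```

Capping the revision rounds per stage is a design choice in this sketch; in practice the final "iterate" step tends to be open-ended.
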
  17. GSC Bulk Export shows
    • The epoch_version field in the ExportLog table lets us see when Google had data quality issues.
    • Google is testing SERPs all the time.
    • Which pages are triggering anonymized queries.
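
Two of these checks are easy to express in code. The sketch below assumes rows shaped like the `ExportLog` table (`namespace`, `data_date`, `epoch_version`) and the URL table (`url`, `impressions`, `is_anonymized_query`) as described in Google's schema documentation, where `epoch_version` starts at 0 and increases when a date is re-exported. The function names and sample values are hypothetical.

```python
def reexported_dates(export_log_rows):
    """Return {(namespace, data_date): highest epoch_version} for dates exported more than once."""
    latest = {}
    for row in export_log_rows:
        key = (row["namespace"], row["data_date"])
        latest[key] = max(latest.get(key, 0), row["epoch_version"])
    # epoch_version 0 is the first export; anything higher means Google corrected that date
    return {key: version for key, version in latest.items() if version > 0}


def anonymized_share_by_url(url_rows):
    """Return {url: share of impressions that came from anonymized queries}."""
    totals = {}
    for row in url_rows:
        total, anonymized = totals.get(row["url"], (0, 0))
        impressions = row["impressions"]
        if row["is_anonymized_query"]:
            anonymized += impressions
        totals[row["url"]] = (total + impressions, anonymized)
    return {url: anon / total for url, (total, anon) in totals.items() if total}


if __name__ == "__main__":
    # Made-up rows for illustration only.
    log = [
        {"namespace": "URL", "data_date": "2024-10-01", "epoch_version": 0},
        {"namespace": "URL", "data_date": "2024-10-02", "epoch_version": 0},
        {"namespace": "URL", "data_date": "2024-10-02", "epoch_version": 1},
    ]
    print(reexported_dates(log))
```
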
  18. Costs for 2 years
    S/M sized site: $0-$5
    Enterprise site: $50-$400
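
Costs like these come from two things: data stored and data scanned by queries. Since the export has no historical backfill, storage starts at zero and grows daily. The sketch below is plain arithmetic under stated assumptions: the rates and free allowances are parameters you must fill in from the current BigQuery pricing page, a month is approximated as 30 days, and the numbers in the usage example are placeholders, not Google's prices and not the figures behind the slide.

```python
def estimate_cost(
    months,
    daily_export_gib,
    tib_scanned_per_month,
    storage_rate_per_gib_month,
    query_rate_per_tib,
    free_storage_gib=0.0,
    free_query_tib=0.0,
):
    """Estimate total cost over `months`, with storage accumulating from zero."""
    total = 0.0
    stored_gib = 0.0
    for _ in range(months):
        stored_gib += daily_export_gib * 30  # rough month of new export data
        billable_storage = max(stored_gib - free_storage_gib, 0.0)
        billable_queries = max(tib_scanned_per_month - free_query_tib, 0.0)
        total += billable_storage * storage_rate_per_gib_month
        total += billable_queries * query_rate_per_tib
    return round(total, 2)


if __name__ == "__main__":
    # Placeholder inputs for illustration; look up real rates before relying on this.
    print(estimate_cost(
        months=24,
        daily_export_gib=0.05,
        tib_scanned_per_month=0.1,
        storage_rate_per_gib_month=0.01,
        query_rate_per_tib=10.0,
    ))
```
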
  19. As an AI language model, I don't have direct access to your codebase or file system.