home.social

#sitemaps — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #sitemaps, aggregated by home.social.

  1. Today's of the day is:

    Sitemaps are an outline/map of where pages are located on a website.

    They are typically used to improve search engine indexing. 🔍

  2. Evergreen crawling with XML sitemaps (updated to run in bulk)

    The script now takes a list of tuples of XML sitemap URL, and website name, and runs the through them all, creating dynamic file names, for example

    {name}_sitemap.csv
    {name}_crawl.jl
    {name}_errors.txt

    Now you just need to add a URL and a name to add a new website to the process, instead of creating new specific files from scratch.

    bit.ly/4fnhiSO

    #advertools #crawling #sitemaps #scraping #SEO #DataScience #Python

  3. Day 67 of #100DaysOfCode

    Google Sheets tutorial: sitemaps, URL structure, and list-mode scraping with #advertools

    You can do programming with sheets if you want to.

    The recent introduction of named functions is making this easier (no app script).
    You will like it if you're into functional programming, lambda, map, reduce...

    Make a copy of the sheet to get access to all functions discussed

    Enjoy!

    bit.ly/3Ki2bvj

    #SEO #Googlesheets #scraping #crawling #sitemaps

  4. The list-mode crawler needs a list of URLs to begin with.

    One place to get those URLs is from XML sitemaps

    🔵 New sheet for extracting URLs and lastmods from XML sitemaps
    🔵 Supports regular and zipped sitemaps
    🔵 Can work with sitemap URLs or sitemapindex
    🔵 Example here shows how to interactively select the sub sitemap you want from a sitemap index
    🔵 More examples to come
    🔵 Enjoy

    bit.ly/4bODoek

    #SEO #XML #sitemaps #crawling #scraping #googlesheets

  5. 📄 New article published -> “Top 5 Sitemap Myths Demystified” 🗺️

    Whether you’re an SEO, developer, or client, sitemaps are one of the most misunderstood aspects of SEO. Read on, they probably aren’t what you think they are! #seo #sitemaps #craftcms nystudio107.com/blog/seo-myths

  6. Heads-up, Google published a post today about the sitemaps endpoint going away. And they also covered the use of lastmod in sitemaps -> "You can still submit your sitemaps through robots.txt and Search Console, but the HTTP requests ("pings") to the deprecated REST endpoint will result in a 404 error. Support for sitemaps ping and the endpoint will stop functioning in 6 months."

    developers.google.com/search/b

    #google #seo #sitemaps

  7. My #May blog post is up! 🥳

    This month, I’m sharing a method for generating custom #sitemaps with #Astro when the official sitemap integration isn’t flexible enough, for example when you want per-page `lastmod` values, instead of per-site ones, or have internationalised slugs the official integration can’t match to each other.

    Enjoy it here: amxmln.com/blog/2023/creating- 😊

    #webdev #tutorial #xml #jamstack #ssg #i18n